其他
.NET architecture arachnode.net is the most comprehensive open source C#/.NET web crawler available. Use arachnode.net from any .NET language.
Configurable Rules and Actions Implement custom pre- and post-request crawl rules and actions without source recompilation. The existing crawl rules and actions architecture easily enables crawling enhancements such as federation, partitioning and distributed caching.
Lucene.NET Integration Lucene.NET integration allows for full-text search through a familiar web interface. Easily integrate your search results into Solr or other Lucene index utilization solutions, whether they be in .NET, Java or any other language that supports Lucene.
SQL Server 2005/2008 and full-text indexing SQL Server 2005/2008 full-text indexing is configured at all appropriate content storage locations for files, images and web pages.
.DOC/.PDF/.PPT/.XLS Indexing Crawl, index and search Microsoft Word, PowerPoint and Excel and Adobe
爬虫
网络
搜索
使用
暂无评论