|
|
| webbase - C and C++ - Searching |
|
|
webbase |
|
|
5.1 |
|
|
Loic Dachary |
|
|
Free |
|
|
C and C++ / Searching |
|
|
Click to Visit |
|
|
Click to Download |
|
|
35 |
webbase is an internet web crawler written in C and later ported to C++. It uses a MySQL database to store information about crawled URLs. It is available as a command line program or as a library (shared or static). It has two main functions: crawl the WEB to get documents and build a full text database with these documents. The crawler part visits the documents and stores intersting information about them locally. It visits the document on a regular basis to make sure that it is still there and updates it if it changes. The full text database uses the local copies of the document to build a searchable index. The full text indexing functions are not included in webbase.
|
| Top C and C++ scripts |
1).
dtSearch Desktop with Spider dtSearch Desktop with Spider is a simple and an easy to use searching program that helps users to search any text on their systems instantly. This program supports all file formats.
2).
Larbin Larbin is a web crawler (also called (web) robot, spider, scooter, etc).
3).
SWISH-E SWISH-Enhanced is a fast, powerful, flexible, and easy to use system for indexing collections of Web pages or other text files.
4).
Glimpse Glimpse (which stands for GLobal IMPlicit SEarch) is a popular UNIX indexing and query system that allows you to search through a large set of files very quickly.
5).
ASPseek ASPSeek is an advanced fulltext Internet search engine,
optimized for fast search speed and high relevance.
6).
ht://Dig The ht://Dig system is a complete world wide web indexing and searching system for a small domain or intranet.
7).
DataparkSearch Engine DataparkSearch Engine is a browser based search engine software and is written in C that helps users to search keywords on their websites and organize them.
|
|
| New C and C++ scripts |
1).
SWISH-E SWISH-Enhanced is a fast, powerful, flexible, and easy to use system for indexing collections of Web pages or other text files.
2).
harvest Harvest is a system to collect information and make them searchable using a web interface.
3).
Larbin Larbin is a web crawler (also called (web) robot, spider, scooter, etc).
4).
dtSearch Publish Publishes an instantly searchable database to CD/DVD, effectively adding dtSearch "powerful Web-based engines" (eWEEK) to a CD/DVD. Has a dozen indexed & fielded data search options. Highlights hits in HTML, XML & PDF, displaying links & images.
5).
Namazu Namazu is a full-text search engine intended for easy use.
6).
dtSearch Desktop with Spider dtSearch Desktop with Spider is a simple and an easy to use searching program that helps users to search any text on their systems instantly. This program supports all file formats.
7).
Glimpse Glimpse (which stands for GLobal IMPlicit SEarch) is a popular UNIX indexing and query system that allows you to search through a large set of files very quickly.
|
|
|
|