Surf these sites: AltaVista Software Search Intranet 97 -- Software package providing roughly equivalent functionalities to AltaVista public search engine. Solaris, Digital Unix, NT. Demo version available for download. Building Task-Specific Interfaces to High Volume Conversational Data -- The philosophy behind http://www.phoaks.com/ People Helping One Another Know Stuff CCR WinOcular Information -- CCR, a software developer and integrator, specializes in document imaging, COLD report management, workflow software and Kodak scanner products for corporations, small business, all government entities, and schools. Dataware Technologies -- Dataware is a search engine vender. BRS/Search is their long time text based core product. They also have web enabled products. Excalibur Technologies International, Ltd. -- fuzzy searching and plain English meaning-based searching for text, and query-by-example searching for multimedia Findex Full Text Indexing and Retrieval Toolkit -- Findex is a highly portable and scalable toolkit (SDK) for adding full-text indexing and retrieval to applications. Findex is very fast both in indexing and retrieval, has low index overhead, and can scale to handle extremely large volumes of text. Glimpse -- Indexing and query system for personal file systems as well as organizations. Glimpse is the default search engine in Harvest. Harvest Web Indexing -- The Harvest Indexer can fetch and index data made available by HTTP, Gopher, FTP, or NNTP. It has summarisers capable of indexing data in a wide variety of file formats. Source code. IB Search Engine -- High speed, fully featured, multilingual fielded fulltext engine. Available for many platforms including Solaris, BSD, Linux and Windows-NT. ISYS -- Web Search Technology. ISearch -- Fulltext database, attempted to replace FreeWais. Not updated since 1996. Source code. Inventory of Full-Text Information Retrieval Software Vendors -- A project of the IFLA Section on Information Technology KE Software Inc. -- KE Texpress is an object/relational database that supports text as well as multimedia objects. Runs on a wide variety of platforms including Linux. LegalProNet.Com -- A secure web based service, allowing attorneys to store, manage, search and access their documents through the internet. Locus Search Engine Software -- Locus is a Linux based full text search engine. Lucene Search Engine -- Created by Doug Cutting previously of Apple computer and Xerox PARC, Lucene is a commercial java based search engine. Managing Gigabytes -- An excellent book about indexing techniques and a software package that implements the algorithms. The software was not updated since 1996 but still contains valuable functions and libraries. Source code (GPL). Mark-A-Tex -- Extraction-indexing software allows users to search, highlight, extract, and index a variety of file formats, with options to print, save, or re-search output. Megaputer Intelligence -- Megaputer provides a complete family of unique solutions for Natural Landuage Text Retrieval and Analysis, Data Mining and Knowledge Discovery in Databases. MicroISIS by UNESCO -- Non-numerical information storage and retrieval software developed to allow institutions, especially in developing countries, to streamline their information processing activities. Microsoft Index Server -- High-power commercial crawler/index/search tool. OpenText -- Supplier of information retrieval and collaborative software. PLS -- Personal Library Software is one of the major old line players in the full text search and retrieval market place. They were recently purchased by AOL. AOL is now licensing the PLS search engine free of charge to all users. There is no Linux version, however. ProIndex -- FullText indexing and retrieval development toolkit by InfoSphere. SMART for beginners -- SMART was an IR system written by Gerald Salton. Some think it is still the best information retrieval system available. I don''t know of an official SMART web site, but this tutorial gives information on where to ftp SMART from and how to start using it. SearchExpress -- Provides document scanning, optical character recognition and full-text searching. SimpleScan Software, Inc. -- Provides enterprise wide document management software. Thunderstone -- Provides SQL based relational full text retrieval, dynamic publishing, object management, and web indexing software. Free Webinator search-only version is available for up to 2,000 pages. Ultraseek Server -- The tools they use at their site for sale. Demo version available for download. Web Search Engine Software -- Create your own search engine quickly and easily with Web Search. Zebra Z39.50 Search Engine -- Zebra is a free indexing and retrieval system that conforms to ANSI standard Z39.50. It is very good for indexing and searching highly structured data such as MARC records, GILS records, etc. ZyLAB Europe -- Develops and markets ZyIMAGE, a suite of programs that allows you to efficiently and easily convert paper documents and computer-generated files into full-text searchable collections that can also be accessed from the web, distributed on optical media, or emailed after filtering according to a user''s profile. Highlights hits within the document or on the original scanned image. dtSearch -- Searches all popular file types, with features including hit highlighting, natural language, fuzzy, phonic, boolean, proximity, field, numeric range. ht://Dig -- A complete world wide web indexing and searching system for a small domain or intranet. Source code (GPL). locus -- locus lets you find words in your texts: newsgroup messages, Web page mirrors, electronic books - whatever you have. It uses word patterns (order, locality etc.) to match queries to texts, makes reasonable choices by default yet does exactly what you want when you specify it.
Help build the largest human-edited
directory on the web.