Home > Computers > Open Source > Software > Internet > Search Engines
Sites relating to search engine software with an open source license.
http://arachnode.net/
A .NET web crawler written in C# using SQL Server 2005 and Lucene. Documentation and online demonstration.
http://www.twmacinta.com/bddbot/
A web robot, search engine and web server written in Java and available under GPL. Includes related resources. [Project no longer actively updated]
http://www.atnf.csiro.au/computing/software/arch/
An open source, high-precision corporate search engine based on Apache Nutch.
http://www.datafari.com/
A packaged, Apache v2-licensed, enterprise search solution that leverages ManifoldCF for data sources, Solr for the search engine, and Cassandra for user management.
http://www.dataparksearch.org/
Open source search engine tool released under GPL and designed to organize search within a website, group of websites, intranet or local system.
http://www.egothor.org/
A cross-platform, full-featured text search engine written entirely in Java. It can be configured as a standalone engine, metasearcher, or peer-to-peer hub, or used as a class library.
http://groonga.org/
An LGPL 2.1, open-source, full-text search engine and column store written in C. Works with MySQL and PostgreSQL. Site provides online documentation and downloads.
http://sourceforge.net/projects/grub/
Open source, cross-platform distributed crawler. FAQ, documentation and a support forum.
http://www.lemurproject.org/indri.php
A cross-platform search engine written in C++ that provides text search and a rich structured query language. BSD-like license.
http://guisearch.sourceforge.net/
A tool for finding code by looking at the applications' GUI text messages (e.g., "Undo") and returning associated callbacks/slots (e.g., slotUndo()). Allows searching the KDE project CVS repository as a live demonstration.
http://sourceforge.net/projects/locust/
A search engine written in C++, designed specifically for knowledge-area or corporate search.
http://www.norconex.com/collectors/collector-http/download
A Java-based, Apache-licensed enterprise web crawler that runs on any platform and integrates with virtually any search engine (open source or commercial).
http://nutch.apache.org/
Effort to implement a prototype of an open source web-search engine.
http://www.opensearchserver.com/
A GPLv3 search engine and crawler for URLs, databases, and file systems. Comes with an XML/HTTP API, PHP/ASP client. Based on Apache Tomcat, JavaServer Faces and JBoss RichFaces.
http://www.openwebspider.org/
An open source web spider and search engine. Includes demo, source code and screenshots.
http://www.sphider.eu/
A lightweight search engine in PHP. Includes details of features, documentation, support forum, and download. [GPL]
http://sphinxsearch.com/
A search engine designed for indexing database content. It natively supports MySQL, PostgreSQL, and XML pipe interfaces. It is written in C++ and has a GPL license.
http://project-strus.net/
A collection of C++ (C++98) libraries and command line tools for building a competitive full-text search engine. Development status is pre-alpha.
http://xapian.org/
Open source search engine library written in C++, with bindings to allow use from other languages as well.
http://www.wumpus-search.org/
A C++, GPL-licensed search engine developed at the University of Waterloo. Wumpus allows control of the text unit retrieved based on structural constraints in the query.
http://www.yacy.net/
A distributed Web crawler and caching HTTP/HTTPS proxy built on the principles of peer-to-peer (P2P) networks.
http://www.seekquarry.com/
A PHP, GPLv3 search engine designed to do open web or intranet crawls.