RankingAlgorithm, for accurate and relevant search


RankingAlgorithm v1.4.2

RankingAlgorithm is a search library that uses a new scoring algorithm to rank results accurately and relevantly. It is very easy to use because it reads a standard Apache Lucene index but does its own ranking and scoring.

Two algorithms are available, SIMPLE and COMPLEX. SIMPLE is a very fast algorithm and can return queries in <50ms on a 10m wikipedia index (complete index). It can also scale to 100m docs, possibly more. COMPLEX is a more involved algorithm, so it is a little slower than SIMPLE, but it can still return queries in <100ms on a 10m wikipedia index (complete index). COMPLEX is more accurate and should give you better rankings than SIMPLE. SIMPLE is also very good and may be better suited than COMPLEX for some types of queries.

RankingAlgorithm works in two modes, document mode and product mode:

In document mode, it ranks documents such as HTML pages, Wikipedia articles, and Word/PDF docs by relevance. In product mode, where a term occurs is taken into account and scored accordingly. So titles starting with "wii console" are ranked first, and other titles rank lower as the occurrence of "wii console" shifts later in the title or the terms are reversed.
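The product-mode behavior described above can be illustrated with a toy example. This is only a sketch of the general idea of position-sensitive title scoring; it is not RankingAlgorithm's actual scoring, which is not described here. The class name `ProductModeSketch`, the methods `score` and `rank`, and the score values are all hypothetical.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.Comparator;
import java.util.List;
import java.util.Locale;

// Hypothetical illustration only: not the RankingAlgorithm API or its scoring.
class ProductModeSketch {

    // Lowercase and split on whitespace; an empty string yields no tokens.
    static List<String> tokens(String text) {
        String trimmed = text.trim().toLowerCase(Locale.ROOT);
        if (trimmed.isEmpty()) {
            return Collections.emptyList();
        }
        return Arrays.asList(trimmed.split("\\s+"));
    }

    // Toy score (assumed values):
    //   0.0        if any query term is missing from the title
    //   0.5        if all terms are present but not as an in-order phrase
    //   1 + 1/(1+p) if the phrase occurs in order at token position p,
    //              so an earlier occurrence scores higher
    static double score(String query, String title) {
        List<String> q = tokens(query);
        List<String> t = tokens(title);
        if (q.isEmpty() || !t.containsAll(q)) {
            return 0.0;
        }
        int position = Collections.indexOfSubList(t, q);
        if (position < 0) {
            return 0.5;
        }
        return 1.0 + 1.0 / (1 + position);
    }

    // Return a copy of the titles sorted by descending toy score.
    static List<String> rank(String query, List<String> titles) {
        List<String> sorted = new ArrayList<>(titles);
        sorted.sort(Comparator.comparingDouble((String t) -> score(query, t)).reversed());
        return sorted;
    }

    public static void main(String[] args) {
        List<String> titles = Arrays.asList(
                "Console wii bundle",
                "New wii console bundle",
                "wii console with controller");
        for (String title : rank("wii console", titles)) {
            System.out.println(score("wii console", title) + "  " + title);
        }
    }
}
```

In this sketch a title that starts with the query phrase scores highest, a title containing the phrase later scores lower, and a title with the terms reversed scores lowest among matches, which mirrors the ordering described for product mode.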

RankingAlgorithm is very easy to use because Apache Lucene, the very popular open source library, can be used to create the index. The results from Solr (also a very popular open source search platform) with RankingAlgorithm seem to be comparable to Google and much better than Lucene for Perl index searches, see comparison.

Features

1. Very accurate and relevant search. Comparable to Google site search and much better than Lucene, see comparison.
2. Two algorithms, SIMPLE and COMPLEX.
3. Two modes, Document and Product. Product mode enables very accurate product/retail/short Twitter text searches. Document mode enables relevant document search and can be used for product searches too.
4. AND/OR combinations by default.
5. Very easy scoring with a relevancy score.
6. Very easy to use.
7. Score boosting, supports Document, Field, Query & Query term boosts.
8. Uses the very popular Lucene index. No changes to your code or index.
9. NRT (near-real-time) search.
10. Query a 10m wikipedia index in <50 ms, scale up to 100m docs.

Search components

Autocomplete using Solr with RankingAlgorithm, similar to Google/Yahoo/Bing's autocomplete. It is free, give it a try.


Demo


Documentation

Browse the javadocs here:

RankingAlgorithm 1.4.1 docs, RankingAlgorithm 1.4.1 Javadocs.
RankingAlgorithm 1.3 docs, RankingAlgorithm 1.3 Javadocs.
RankingAlgorithm 1.2 docs, RankingAlgorithm 1.2 Javadocs.
RankingAlgorithm 1.1/1.0 docs, RankingAlgorithm 1.1 Javadocs.

WIKI, Apache Solr with RankingAlgorithm WIKI.
Browse RankingAlgorithm usage examples: Example.java, SimpleExample.java, LuceneCollectorExample.java

Downloads (give it a try, it is free)