Demo Faceting/Autcomplete - Try Faceting, Autocomplete
Demo - Try it out, Search Wikipedia/Products/Perl/Php/Python/MusicBrainZ indexes
The demo is installed on a 2 core intel based system running Fedora 12 with Solr-RA ver 1.0, RankingAlgorithm ver 1.0. Solr has multiple cores, default, perl, a store, musicbrainz, php and python.
The XML dump from wikipedia.org was downloaded (about 29GB after uncompress) and split into 10 parts. Only the first 1 million pages have been indexed for the demo for resource constraints (A complete wikipedia index was tested using shards and multiple solr instances on multiple systems ). The XML was imported using the dataimport handler. See dataimport.xml, solrconfig.xml The <title> tag was copied into the text field using the copy field mechanism, see schema.
Try the following:
The perl index was created by downloading the docs at perldoc.per.org. Next, perl.org and nntp.perl.org was crawled using httrack. The downloaded files were next uploaded to Solr to be indexed using Apache Tika/Solr Cell using the java code at, Indexing Solr With Tika And Java. A script, postfiles.sh available under the example directory was next used to submit html, pdf, word, txt, etc. documents to Solr for indexing.
Steps to recreate index:
schema.xml, solrconfig.xml for the perl core
Try the following: