SolrJ is the simple Java client interface to Solr that insulates you from the dirty details of parsing and sending messages back and forth between your application and Solr, and by default communicates using a fast binary format instead of XML. You work in the familiar world of objects like SolrQuery, QueryResponse , and SolrDocument . SolrJ is a core part of the Solr project, and typically, though not always, it is updated as soon as new features are added to Solr on the server side.
We'll demonstrate using SolrJ to index web pages downloaded from
MusicBrainz.org and stored in a binary format called
ARC using the crawler Heritrix
. If you want to run Heritrix yourself, proceed to the next section. If you want to ...