O'Reilly logo

Apache Solr 3 Enterprise Search Server by Eric Pugh, David Smiley

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Chapter 3. Indexing Data

In this chapter we're going to explore ways to get data into Solr. The process of doing this is referred to as indexing, although importing is used too. This chapter is structured as follows:

  • Communicating with Solr
  • Sending data in Solr's Update-XML format
  • Commit, optimize, rollback, and deleting
  • Sending data in CSV format
  • Direct database and XML import through Solr's DataImportHandler (the DIH)
  • Extracting text from rich documents through Solr's ExtractingRequestHandler (also known as Solr Cell)
  • Document post-processing with UpdateRequestProcessors

You will also find some related options in Chapter 9, Integrating Solr that have to do with language bindings and framework integration, including a web crawler. Most use Solr's Update-XML ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required