Instant Apache Solr for Indexing Data How-to by Alexandre Rafalovitch

Indexing multiple languages (Advanced)

Previous examples showed that Solr can handle some issues with languages other than English, for example by letting users search without accents for words that contain accents or other complex characters.

In this example, we will look at the deeper language support that Solr provides: automatically detecting the language used, applying language-specific text processing, and hiding the implementation details from users and client applications.

As Solr is quite flexible with its language support, let's consider and implement one scenario:

  • An e-mail may arrive in one of two languages: English or Russian
  • The language-specific content will be in the subject and message fields with both fields assumed ...
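
To make the idea of detection plus language-specific fields concrete, here is a minimal Python sketch of what happens conceptually to an e-mail document before it is indexed. It is not Solr's own detection algorithm or configuration: the Cyrillic-letter heuristic, the `language` field, and the `subject_en` / `subject_ru` style field names are assumptions made only for illustration.

```python
import unicodedata


def detect_language(text):
    """Crude guess: 'ru' if most letters are Cyrillic, otherwise 'en'.

    Illustrative heuristic only; a real detector uses statistical models.
    """
    letters = [ch for ch in text if ch.isalpha()]
    if not letters:
        return "en"  # assumption: default to English when there is no text
    cyrillic = sum(
        1 for ch in letters if "CYRILLIC" in unicodedata.name(ch, "")
    )
    return "ru" if cyrillic * 2 > len(letters) else "en"


def map_language_fields(doc, fields=("subject", "message")):
    """Return a copy of doc with language-specific fields renamed.

    For example, 'subject' becomes 'subject_ru' for a Russian e-mail
    (hypothetical naming scheme), and a 'language' field is added.
    """
    combined = " ".join(str(doc.get(name, "")) for name in fields)
    lang = detect_language(combined)
    mapped = {k: v for k, v in doc.items() if k not in fields}
    mapped["language"] = lang
    for name in fields:
        if name in doc:
            mapped[f"{name}_{lang}"] = doc[name]
    return mapped


if __name__ == "__main__":
    email = {"id": "1", "subject": "Привет", "message": "Как дела?"}
    print(map_language_fields(email))
```

Because the renaming happens before indexing, a client can keep sending plain `subject` and `message` fields without knowing which language-specific field each one ends up in.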