Using a lowercased analyzer

In the previous attempt, we noted that although we solved the problem partially, some issues remained. In this case, a good approach is to use the keyword tokenizer, which keeps the text intact as a single token before it reaches the filters.

Note

Unlike the not_analyzed approach, the keyword tokenizer approach still lets us apply filters, such as lowercase, to the text.
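
Conceptually, this combination behaves like the following sketch. This is plain Python, not Elasticsearch code, and is only meant to illustrate the pipeline: the keyword tokenizer emits the whole input as one token, and the lowercase filter then lowercases each token.

```python
def flat_analyzer(text):
    """Illustrative stand-in for a custom analyzer:
    keyword tokenizer followed by a lowercase filter."""
    # Keyword tokenizer: the entire input becomes a single token.
    tokens = [text]
    # Lowercase token filter: applied to every token.
    return [token.lower() for token in tokens]

print(flat_analyzer("Breaking News: Elasticsearch"))
# A single token, lowercased: ['breaking news: elasticsearch']
```

Because the tokenizer never splits the text, searches match the whole field value, while the lowercase filter makes that match case-insensitive.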

Now, let's see how we can implement this analyzer.

First, we need to create our analyzer while creating the index:

curl -X PUT "http://localhost:9200/news" -d '{
  "settings": {
    "analysis": {
      "analyzer": {
        "flat": {
          "type": "custom",
          "tokenizer": "keyword",
          "filter": ["lowercase"]
        }
      }
    }
  }
}'
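
To check that the analyzer behaves as expected, we can run a sample string through the _analyze API. This assumes the news index was created as above and that the cluster is reachable on localhost:9200:

```shell
# Ask Elasticsearch to run the "flat" analyzer over a sample string.
curl -X GET "http://localhost:9200/news/_analyze?analyzer=flat" -d 'Breaking News'
```

The response should contain a single token, "breaking news", confirming that the text was kept whole and lowercased.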

Next, in the mapping, we should map ...
