O'Reilly logo

Scaling Big Data with Hadoop and Solr by Hrishikesh Karambelkar

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Configuring SolrCloud to work with large indexes

In order to configure SolrCloud to run with large indexes, it is important to first design the system based on the requirements. The design has to be based on the following factors:

  • Number of nodes participating in the cloud
  • Distribution of shards and their replicas over nodes
  • Replication factors and leader
  • ZooKeeper setup

Prerequisites for this would require Apache Solr, ZooKeeper, J2EE container (optional).

Setting up the ZooKeeper ensemble

First, we need to set up a ZooKeeper ensemble on all the nodes. Although Apache Solr ships with embedded ZooKeeper, for large indexes and scalability requirements, it is recommended to go ahead with a full ZooKeeper set up. You can download the latest version of Apache ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required