Understand, design, build, and optimize your big data search engine with Hadoop and Apache Solr
Together, Apache Hadoop and Apache Solr help organizations resolve the problem of information extraction from big data by providing excellent distributed faceted search capabilities.
This book will help you learn everything you need to know to build a distributed enterprise search platform as well as optimize this search to a greater extent, resulting in the maximum utilization of available resources. Starting with the basics of Apache Hadoop and Solr, the book covers advanced topics of optimizing search with some interesting real-world use cases and sample Java code.
This is a step-by-step guide that will teach you how to build a high performance enterprise search while scaling data with Hadoop and Solr in an effortless manner.
What You Will Learn
Understand Apache Hadoop, its ecosystem, and Apache Solr
Explore industry-based architectures by designing a big data enterprise search with their applicability and benefits
Integrate Apache Solr with big data technologies such as Cassandra to enable better scalability and high availability for big data
Optimize the performance of your big data search platform with scaling data
Write MapReduce tasks to index your data
Configure your Hadoop instance to handle real-world big data problems
Work with Hadoop and Solr using real-world examples to benefit from their practical usage
Use Apache Solr as a NoSQL database
Downloading the example code for this book. You can download the example code files for all Packt books you have purchased from your account at http://www.PacktPub.com. If you purchased this book elsewhere, you can visit http://www.PacktPub.com/support and register to have the files e-mailed directly to you.