Application environments

Many businesses internally follow different ways of releasing their software to production. As part of this, they follow several continuous integration methodologies, in order to have better control over the stability of the Hadoop environments. It's good to build multiple smaller Hadoop clusters with X% of the data from the main production environment and run all the applications here.

Applications can build their integration tests on these dedicated environments (QA, Staging, and so on) and can release their software to production once everything is good.

One practice that I have come across is that organizations tend to directly ship the code to production and end up facing outage of their applications because ...

Get Modern Big Data Processing with Hadoop now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.