Five Questions with Murtaza Haider, author of “Getting Started with Data Science”

Filed under Big Data, Business, career advice, Content - Highlights and Reviews, Tech.

We recently asked Murtaza Haider five questions about his new book from IBM Press, “Getting Started with Data Science: Making Sense of Data with Analytics.” Below, the author talks about the benefits of data science in today’s professional world. What are some examples of data science altering or impacting traditional…

Moving to MapReduce 2 with YARN

Filed under Content - Highlights and Reviews, Programming & Development.

A guest post by Martin ‘MC’ Brown, an author of and contributor to more than 26 books covering an array of topics, including the recently published Getting Started with CouchDB. His expertise spans myriad development languages and platforms: Perl, Python, Java, JavaScript, Basic, Pascal, Modula-2, C, C++, Rebol, Gawk, Shellscript, Windows, Solaris, Linux, BeOS, Microsoft WP, Mac…

An Introduction to Storm

Filed under Content - Highlights and Reviews, Programming & Development.

Twitter has presented Storm as a distributed, fault-tolerant stream processing system for big data, with the following key design features: Horizontal scalability: Computations and data processing are performed in parallel using multiple threads, processes and machines. Guaranteed message processing: The system guarantees that each message will be fully processed at least…

Document-Oriented Databases with MongoDB

Filed under Content - Highlights and Reviews, Programming & Development, Web Development.

SQL databases such as MySQL, Microsoft SQL Server and Oracle have served us well for decades. From small desktop applications to large enterprise resource planning systems, these databases have faithfully fulfilled their tasks. The massive growth of web applications and cloud computing, however, presents new scalability challenges for which a new breed…

Introduction to the Hadoop framework

Filed under Content - Highlights and Reviews, Programming & Development, Web Development.

Apache Hadoop is an open-source software framework that allows large sets of data to be processed using commodity hardware. Hadoop is designed to run on a large cluster of nodes connected to form a distributed system. Hadoop implements a computational paradigm known as MapReduce, which was inspired by an…