Learning Hadoop 2

Main
Computers - Organization and Data Processing
Learning Hadoop 2

Learning Hadoop 2

Garry Turkington & Gabriele Modena

0 / 5.0

0 comments

Quanto ti piace questo libro?

Qual è la qualità del file?

Scarica il libro per la valutazione della qualità

Qual è la qualità dei file scaricati?

This book is primarily aimed at application and system developers interested in learning how to solve practical problems using the Hadoop framework and related components. Although we show examples in a few programming languages, a strong foundation in Java is the main prerequisite. Data engineers and architects might also find the material concerning data life cycle, file formats, and computational models useful.

Google started the change that would eventually be known as Hadoop, when in 2003, and in 2004, they released two academic papers describing the Google File System (GFS) and MapReduce. The two together provided a platform for very large-scale data processing in a highly efficient manner.

At the same time, Doug Cutting was working on the Nutch open source web crawler. He was working on elements within the system that resonated strongly once the Google GFS and MapReduce papers were published. Doug started work on open source implementations of these Google ideas, and Hadoop was soon born, firstly, as a subproject of Lucene, and then as its own top-level project within the Apache Software Foundation. Yahoo! hired Doug Cutting in 2006 and quickly became one of the most prominent supporters of the Hadoop project. In addition to often publicizing some of the largest Hadoop deployments in the world, Yahoo! allowed Doug and other engineers to contribute to Hadoop while employed by the company, not to mention contributing back some of its own internally developed Hadoop improvements and extensions.

Categorie:

Computers - Organization and Data Processing

Anno:

2015

Casa editrice:

Packt Publishing

Lingua:

english

ISBN:

B00TOIP6PS

File:

PDF, 2.94 MB