Hadoop 2 Essentials - An End-to-End Approach (Paperback)


This textbook adopts a unique approach to helping developers and CS students learn Hadoop MapReduce programming fast in an easy-to-setup, virtual 4-node Linux YARN cluster on a Windows laptop. Rather than filled with disjointed, piecemeal code snippets to show Hadoop MapReduce programming features one at a time, it is designed to place your total Hadoop MapReduce programming learning process in a common application context of mining customer spending patterns ensconced in large volumes of credit card transaction record data. Precise, end-to-end procedures are given to help you set up your Hadoop MapReduce development environment quickly on Eclipse with Maven on Windows. Step-by-step procedures are also given on how to set up a four-node Linux cluster at minimum so that you can run your MapReduce programs not only in local but also in standalone and fully distributed mode on a real cluster. In fact, all MapReduce programs presented in the book have been tested and verified on such a Linux cluster. This textbook mainly focuses on teaching Hadoop MapReduce programming in a scientific, objective, quantitative approach. Rather than heavily relying on subjective, verbose (and sometimes even pompous) textual descriptions with sparse code snippets, this textbook uses Hadoop Java APIs, Hadoop configuration parameters, complete MapReduce programs and their execution logs and outputs to demonstrate how Hadoop MapReduce framework works and how to write MapReduce programs. Specifically, this text covers the following subjects: * Introduction to Hadoop * Setting up a Linux Hadoop Cluster * The Hadoop Distributed FileSystem * MapReduce Job Orchestration and Workflows * Basic MapReduce Programming * Advanced MapReduce Programming * Hadoop Streaming * Hadoop Administration No matter what role you play on your team, this text can help you gain truly applicable Hadoop skills in a most effective and efficient manner. The book can also be used as a supplementary textbook for a distributed computing or Hadoop course offered to upper-division CS students.

R1,446
List Price R1,527
Save R81 5%

Or split into 4x interest-free payments of 25% on orders over R50
Learn more

Discovery Miles14460
Mobicred@R136pm x 12* Mobicred Info
Free Delivery
Delivery AdviceShips in 10 - 15 working days


Toggle WishListAdd to wish list
Review this Item

Product Description

This textbook adopts a unique approach to helping developers and CS students learn Hadoop MapReduce programming fast in an easy-to-setup, virtual 4-node Linux YARN cluster on a Windows laptop. Rather than filled with disjointed, piecemeal code snippets to show Hadoop MapReduce programming features one at a time, it is designed to place your total Hadoop MapReduce programming learning process in a common application context of mining customer spending patterns ensconced in large volumes of credit card transaction record data. Precise, end-to-end procedures are given to help you set up your Hadoop MapReduce development environment quickly on Eclipse with Maven on Windows. Step-by-step procedures are also given on how to set up a four-node Linux cluster at minimum so that you can run your MapReduce programs not only in local but also in standalone and fully distributed mode on a real cluster. In fact, all MapReduce programs presented in the book have been tested and verified on such a Linux cluster. This textbook mainly focuses on teaching Hadoop MapReduce programming in a scientific, objective, quantitative approach. Rather than heavily relying on subjective, verbose (and sometimes even pompous) textual descriptions with sparse code snippets, this textbook uses Hadoop Java APIs, Hadoop configuration parameters, complete MapReduce programs and their execution logs and outputs to demonstrate how Hadoop MapReduce framework works and how to write MapReduce programs. Specifically, this text covers the following subjects: * Introduction to Hadoop * Setting up a Linux Hadoop Cluster * The Hadoop Distributed FileSystem * MapReduce Job Orchestration and Workflows * Basic MapReduce Programming * Advanced MapReduce Programming * Hadoop Streaming * Hadoop Administration No matter what role you play on your team, this text can help you gain truly applicable Hadoop skills in a most effective and efficient manner. The book can also be used as a supplementary textbook for a distributed computing or Hadoop course offered to upper-division CS students.

Customer Reviews

No reviews or ratings yet - be the first to create one!

Product Details

General

Imprint

Createspace Independent Publishing Platform

Country of origin

United States

Release date

February 2014

Availability

Expected to ship within 10 - 15 working days

First published

February 2014

Authors

Dimensions

235 x 191 x 16mm (L x W x T)

Format

Paperback - Trade

Pages

308

ISBN-13

978-1-4954-9612-7

Barcode

9781495496127

Categories

LSN

1-4954-9612-0



Trending On Loot