Cover image for Hadoop MapReduce cookbook
Hadoop MapReduce cookbook
Title:
Hadoop MapReduce cookbook
Author:
Perera, Srinath.
ISBN:
9781849517294

9781621989035

9781849517287
Personal Author:
Publication Information:
Birmingham, UK : Packt Pub., 2013.
Physical Description:
1 online resource (1 volume) : illustrations
General Note:
Author names from credits page.

Includes index.
Contents:
Table of Contents; Hadoop MapReduce Cookbook; Hadoop MapReduce Cookbook; Credits; About the Authors; About the Reviewers; www.PacktPub.com; Support files, eBooks, discount offers and more; Why Subscribe?; Free Access for Packt account holders; Preface; What this book covers; What you need for this book; Who this book is for; Conventions; Reader feedback; Customer support; Downloading the example code; Errata; Piracy; Questions; 1. Getting Hadoop Up and Running in a Cluster; Introduction; Setting up Hadoop on your machine; Getting ready; How to do it ... ; How it works ...

Writing a WordCount MapReduce sample, bundling it, and running it using standalone HadoopGetting ready; How to do it ... ; How it works ... ; There's more ... ; Adding the combiner step to the WordCount MapReduce program; How to do it ... ; How it works ... ; There's more ... ; Setting up HDFS; Getting ready; How to do it ... ; How it works ... ; Using HDFS monitoring UI; Getting ready; How to do it ... ; HDFS basic command-line file operations; Getting ready; How to do it ... ; How it works ... ; There's more ... ; Setting Hadoop in a distributed cluster environment; Getting ready; How to do it ... ; How it works ...

There's more ... Running the WordCount program in a distributed cluster environment; Getting ready; How to do it ... ; How it works ... ; There's more ... ; Using MapReduce monitoring UI; How to do it ... ; How it works ... ; 2. Advanced HDFS; Introduction; Benchmarking HDFS; Getting ready; How to do it ... ; How it works ... ; There's more ... ; See also; Adding a new DataNode; Getting ready; How to do it ... ; There's more ... ; Rebalancing HDFS; See also; Decommissioning DataNodes; How to do it ... ; How it works ... ; See also; Using multiple disks/volumes and limiting HDFS disk usage; How to do it ...

Setting HDFS block sizeHow to do it ... ; There's more ... ; See also; Setting the file replication factor; How to do it ... ; How it works ... ; There's more ... ; See also; Using HDFS Java API; Getting ready; How to do it ... ; How it works ... ; There's more ... ; Configuring the FileSystem object; Retrieving the list of data blocks of a file; See also; Using HDFS C API (libhdfs); Getting ready; How to do it ... ; How it works ... ; There's more ... ; Configuring using HDFS configuration files; See also; Mounting HDFS (Fuse-DFS); Getting ready; How to do it ... ; How it works ... ; There's more ... ; Building libhdfs.

See alsoMerging files in HDFS; How to do it ... ; How it works ... ; 3. Advanced Hadoop MapReduce Administration; Introduction; Tuning Hadoop configurations for cluster deployments; Getting ready; How to do it ... ; How it works ... ; There's more ... ; Running benchmarks to verify the Hadoop installation; Getting ready; How to do it ... ; How it works ... ; There's more ... ; Reusing Java VMs to improve the performance; How to do it ... ; How it works ... ; Fault tolerance and speculative execution; How to do it ... ; How it works ... ; Debug scripts -- analyzing task failures; Getting ready; How to do it ...
Abstract:
Individual self-contained code recipes. Solve specific problems using individual recipes, or work through the book to develop your capabilities. If you are a big data enthusiast and striving to use Hadoop to solve your problems, this book is for you. Aimed at Java programmers with some knowledge of Hadoop MapReduce, this is also a comprehensive reference for developers and system admins who want to get up to speed using Hadoop.
Title Subject:

Added Author:
Holds: Copies: