Cover image for Pentaho for Big Data Analytics.
Pentaho for Big Data Analytics.
Title:
Pentaho for Big Data Analytics.
Author:
Patil, Manoj R.
ISBN:
9781783282166
Personal Author:
Physical Description:
1 online resource (135 pages)
Contents:
Pentaho for Big Data Analytics -- Table of Contents -- Pentaho for Big Data Analytics -- Credits -- About the Authors -- About the Reviewers -- www.PacktPub.com -- Support files, eBooks, discount offers and more -- Why Subscribe? -- Free Access for Packt account holders -- Preface -- What this book covers -- What you need for this book -- Who this book is for -- Conventions -- Reader feedback -- Customer support -- Downloading the example code -- Errata -- Piracy -- Questions -- 1. The Rise of Pentaho Analytics along with Big Data -- Pentaho BI Suite - components -- Data -- Server applications -- Thin Client Tools -- Design tools -- Edge over competitors -- Summary -- 2. Setting Up the Ground -- Pentaho BI Server and the development platform -- Prerequisites/system requirements -- Obtaining Pentaho BI Server (Community Edition) -- The JAVA_HOME and JRE_HOME environment variables -- Running Pentaho BI Server -- Pentaho User Console (PUC) -- Pentaho Action Sequence and solution -- The JPivot component example -- The message template component example -- The embedded HSQLDB database server -- Pentaho Marketplace -- Saiku installation -- Pentaho Administration Console (PAC) -- Creating data connections -- Summary -- 3. Churning Big Data with Pentaho -- An overview of Big Data and Hadoop -- Big Data -- Hadoop -- The Hadoop architecture -- The Hadoop ecosystem -- Hortonworks Sandbox -- Pentaho Data Integration (PDI) -- The Pentaho Big Data plugin configuration -- Importing data to Hive -- Putting a data file into HDFS -- Loading data from HDFS into Hive (job orchestration) -- Summary -- 4. Pentaho Business Analytics Tools -- The business analytics life cycle -- Preparing data -- Preparing BI Server to work with Hive -- Executing and monitoring a Hive MapReduce job -- Pentaho Reporting -- Data visualization and dashboard building.

Creating a layout using a predefined template -- Creating a data source -- Creating a component -- Summary -- 5. Visualization of Big Data -- Data visualization -- Data source preparation -- Repopulating the nyse_stocks Hive table -- Pentaho's data source integration -- Consuming PDI as a CDA data source -- Visualizing data using CTools -- Visualizing trends using a line chart -- Interactivity using a parameter -- Multiple pie charts -- Waterfall charts -- CSS styling -- Summary -- A. Big Data Sets -- Freebase -- U.S. airline on-time performance -- Amazon public data sets -- B. Hadoop Setup -- Hortonworks Sandbox -- Setting up the Hortonworks Sandbox -- Hortonworks Sandbox web administration -- Transferring a file using secure FTP -- Preparing Hive data -- The nyse_stocks sample data -- Index.
Abstract:
The book is a practical guide, full of step-by-step examples that are easy to follow and implement.This book is for developers, system administrators, and business intelligence professionals looking to learn how to get more out of their data through Pentaho. In order to best engage with the examples, some knowledge of Java will be required.
Local Note:
Electronic reproduction. Ann Arbor, Michigan : ProQuest Ebook Central, 2017. Available via World Wide Web. Access may be limited to ProQuest Ebook Central affiliated libraries.
Added Author:
Electronic Access:
Click to View
Holds: Copies: