Professional Spark : Big Data Cluster Computing in Production.
Title:
Professional Spark : Big Data Cluster Computing in Production.
Author:
Iancuta, Ema.
ISBN:
9781119254041
Edition:
1st ed.
Physical Description:
1 online resource (199 pages)
Contents:
Title Page -- Introduction -- Who This Book Is For -- What This Book Covers -- How This Book Is Structured -- What You Need to Use This Book -- Conventions -- Source Code -- Chapter 1: Finishing Your Spark Job -- Installation of the Necessary Components -- The History of Distributed Computing That Led to Spark -- Using Various Formats for Storage -- Making Sense of Monitoring and Instrumentation -- Summary -- Chapter 2: Cluster Management -- Background -- Spark Components -- Spark Standalone -- YARN -- Mesos -- Comparison -- Summary -- Chapter 3: Performance Tuning -- Spark Execution Model -- Partitioning -- Shuffling Data -- Serialization -- Spark Cache -- Memory Management -- Shared Variables -- Data Locality -- Summary -- Chapter 4: Security -- Architecture -- ACL -- Network Security -- Encryption -- Event Logging -- Kerberos -- Apache Sentry -- Summary -- Chapter 5: Fault Tolerance or Job Execution -- Lifecycle of a Spark Job -- Job Scheduling -- Fault Tolerance -- Summary -- Chapter 6: Beyond Spark -- Data Warehousing -- Machine Learning -- External Frameworks -- Future Works -- Enterprise Usage -- Summary -- Copyright -- Credits -- Acknowledgments -- About the Authors -- About the Technical Editors -- EULA.
Local Note:
Electronic reproduction. Ann Arbor, Michigan : ProQuest Ebook Central, 2017. Available via World Wide Web. Access may be limited to ProQuest Ebook Central affiliated libraries.