Guide to High Performance Distributed Computing Case Studies with Hadoop, Scalding and Spark /
This timely text/reference describes the development and implementation of large-scale distributed processing systems using open source tools and technologies such as Hadoop, Scalding and Spark. Comprehensive in scope, the book presents state-of-the-art material on building high performance distribu...
Saved in:
| Main Authors: | , |
|---|---|
| Corporate Author: | |
| Format: | Electronic eBook |
| Language: | English |
| Published: |
Cham :
Springer International Publishing : Imprint: Springer,
2015.
|
| Edition: | 1st ed. 2015. |
| Series: | Computer Communications and Networks,
|
| Subjects: | |
| Online Access: | https://doi.org/10.1007/978-3-319-13497-0 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Table of Contents:
- Part I: Programming Fundamentals of High Performance Distributed Computing
- Introduction
- Getting Started with Hadoop
- Getting Started with Spark
- Programming Internals of Scalding and Spark
- Part II: Case studies using Hadoop, Scalding and Spark
- Case Study I: Data Clustering using Scalding and Spark
- Case Study II: Data Classification using Scalding and Spark
- Case Study III: Regression Analysis using Scalding and Spark
- Case Study IV: Recommender System using Scalding and Spark.



