Learn By Example: Hadoop, MapReduce for Big Data problems – Loonycorn

Salepage link: At HERE. Archive: http://archive.is/wip/rUSuS

Learn By Example: Hadoop, MapReduce for Big Data problems

A hands-on workout in Hadoop, MapReduce and the art of thinking “parallel”

df1875c8e10f0aca2d8f892d6efbb6751725a624 - WSO.lib

This course is a zoom-in, zoom-out, hands-on workout involving Hadoop, MapReduce and the art of thinking parallel.

Let’s parse that.

Zoom-in, Zoom-Out: This course is both broad and deep. It covers the individual components of Hadoop in great detail, and also gives you a higher level picture of how they interact with each other.

Hands-on workout involving Hadoop, MapReduce : This course will get you hands-on with Hadoop very early on. You’ll learn how to set up your own cluster using both VMs and the Cloud. All the major features of MapReduce are covered – including advanced topics like Total Sort and Secondary Sort.

The art of thinking parallel: MapReduce completely changed the way people thought about processing Big Data. Breaking down any problem into parallelizable units is an art. The examples in this course will train you to “think parallel”.

What’s Covered:

Lot’s of cool stuff ..

.. and of course all the basics:

Course Curriculum

Introduction

Why is Big Data a Big Deal

Installing Hadoop in a Local Environment

The MapReduce “Hello World”

Run a MapReduce Job

Juicing your MapReduce – Combiners, Shuffle and Sort and The Streaming API

HDFS and Yarn

MapReduce Customizations For Finer Grained Control

The Inverted Index, Custom Data Types for Keys, Bigram Counts and Unit Tests!

Input and Output Formats and Customized Partitioning

Recommendation Systems using Collaborative Filtering

Hadoop as a Database

K-Means Clustering

Setting up a Hadoop Cluster

Appendix

Original Content
WSO.lib
Logo
Compare items
  • Total (0)
Compare
0
Shopping cart