Map Reduce tutorial

  • WatchEMR Training: Intro to EMR (1 of 11)
  • WatchEMR Training: Getting Started with EMR (2 of 11)
  • WatchEMR Training: Running Jobs (3 of 11)
  • WatchEMR Training: Cluster of Servers (4 of 11)
  • WatchEMR Training: Dealing with Data (5 of 11)
  • WatchEMR Training: MapReduce Lab (6 of 11)
  • WatchEMR Training: Command Line Tools (7 of 11)
  • WatchEMR Training: Debugging Tips (8 of 11)
  • WatchEMR Training: Hive & Pig (9 of 11)
  • WatchEMR Training: Hive Job Flow (10 of 11)
  • WatchEMR Training: Advanced EMR (11 of 11)

Overview. Hadoop MapReduce is a software framework for easily writing applications which process vast amounts of data (multi-terabyte data-sets) in-parallel on large clusters (thousands of nodes) of commodity hardware in a reliable, fault-tolerant manner. A MapReduce job usually splits the input data-set into …

You may also like...

Leave a Reply