Apache Spark Tutorials


  • Apache Spark - Setup Environment - Introduction
  • Apache Spark - Setup Environment on Mac
  • Apache Spark - Concepts and Architecture - Introduction
  • Apache Spark - Architecture Overview
  • Apache Spark - Cluster Modes
  • Apache Spark - Modules Overview
  • Apache Spark - Documentation Overview
  • Apache Spark - Launch Spark Shell (Scala)
  • Scala - Setup Scala and sbt introduction
  • Scala - Install and Validate
  • Scala - Run simple application
  • Scala - Install sbt and run Scala application
  • Scala - Setup Scala IDE for Eclipse - Introduction
  • Scala - Setup Scala IDE and run simple application
  • Scala - Integrate sbt with Scala IDE for Eclipse
  • Apache Spark - Develop applications using Scala IDE and sbt - Introduction
  • Apache Spark - Develop applications using Scala IDE and sbt
  • Apache Spark - Run Spark application on cluster
  • Apache Spark - Scala - SparkConf and SparkContext Introduction
  • Apache Spark - Scala - SparkConf
  • Apache Spark - Scala - SparkContext
  • Apache Spark - Scala - Externalizing Parameters
  • Apache Spark - Scala - Develop word count using Scala IDE for Eclipse
  • Apache Spark - Create RDD for Parallelized Collections
  • Apache Spark - Create RDD for external data sets on OS/local file system
  • Apache Spark - Create RDD for external data sets on HDFS files
  • Apache Spark - Understanding data model
  • Apache Spark - Transformations and Actions Overview
  • Apache Spark - Define problem statement and design
  • Apache Spark - Read Data Using Spark - Introduction
  • Apache Spark - Reading orders and order_items data
  • Apache Spark - Simple Transformations - Introduction
  • Apache Spark - Extract required fields - map
  • Apache Spark - Get revenue per order - reduceByKey
  • Apache Spark - Joins Introduction
  • Apache Spark - Join order and order_items - join transformation
  • Apache Spark - Compute average - Introduction
  • Apache Spark - Compute total revenue and total number of orders - aggregateByKey
  • Apache Spark - Compute average revenue
  • Apache Spark - Sort the data - sortByKey
  • Apache Spark - Save the output
  • Apache Spark - Running Application - Introduction
  • Apache Spark - Run Application Using Scala IDE
  • Apache Spark - Run Application On Cluster
  • Apache Spark - Accumulators Implementation - Scala
  • Apache Spark - Broadcast Variables - Scala - Introduction
  • Apache Spark - Broadcast Variables - Scala - Design
  • Apache Spark - Broadcast Variables - Scala - Implementation
  • Apache Spark - Scala - Data Frames - Introduction
  • Apache Spark - Scala - Data Frames and Operations

Apache Spark Tutorial

Apache Spark is a cluster computing framework designed for fast computation. It extends the MapReduce model popularized by Hadoop to efficiently support more types of computation, including interactive queries and stream processing. This brief tutorial explains the basics of Spark Core programming.

Audience

This tutorial is for professionals who want to learn the basics of big data analytics with the Spark framework and work as Spark developers. It is also useful for analytics professionals and ETL developers.

Prerequisites

This tutorial assumes that you have prior exposure to Scala programming, database concepts, and at least one flavor of the Linux operating system.

