SPARK DEVELOPMENT TRAINING

Training

Online

£ 503.05 VAT inc.

*Indicative price

Original amount in INR:

₹ 52,714

Description

  • Type

    Training

  • Level

    Intermediate

  • Methodology

    Online

  • Duration

    1 Month

  • Start date

    Different dates available

  • Online campus

    Yes

  • Delivery of study materials

    Yes

  • Support service

    Yes

  • Virtual classes

    Yes

Workshop style coaching
Interactive approach
Course material
POC Implementation
Hands on practice exercises for each topic
Quiz at the end of each major topic
Tips and techniques on Cloudera Certification Examination
Linux concepts and basic commands
On Demand Services
Mock interviews for each individual will be conducted on need basis
SQL basics on need basis
Resume preparation and guidance
Interview questions

Facilities

Location

Start date

Online

Start date

Different dates availableEnrolment now open

About this course

SCALA for Spark

Questions & Answers

Add your question

Our advisors and other users will be able to reply to you

Who would you like to address this question to?

Fill in your details to get a reply

We will only publish your name and question

Reviews

Subjects

  • SQL
  • Scala
  • Spark
  • Scala REPL
  • Java to Scala
  • Java
  • REPL
  • Scala IDE
  • Operations
  • WHILE
  • Loops

Course programme

COURSE DETAILS & CURRICULUM

Scala Basics
  • What is Scala?
  • Why Scala for Spark?
  • Intro to Scala REPL : Journey from Java to Scala
  • Installing Scala IDE
  • Basic Operations
  • Defining Functions
Scala Essentials
  • Control Structures in Scala
  • loops – ForEach, While, Do-While
  • Collections – Array, ArrayBuffer, Map, Tuples, Lists
  • If Statements
  • Conditional Operators
  • Enumerations
OOP and FP
  • Class and Object Basics
  • Scala Constructors
  • Nested Classes
  • Visibility Rules
  • Overriding Methods
  • Functional Programming
  • Higher Order Functions
  • Traits
  • Interfaces
  • Layered Traits
Prerequisite: BigData and Hadoop Framework
  • Introduction to BigData
  • Challenges with Bigdata
  • Batch Vs. Realtime processing
  • Overview- Hadoop Ecosystem
  • HDFS
  • Review of MapReduce
  • Hive
  • Sqoop
  • Flume
APACHE SPARKIntroduction to Spark
  • What is Spark?
  • Spark Overview
  • Setting up environment
  • Using Spark Shell
  • Spark Web UI
Spark Basics
  • RDD's
  • Spark Context
  • Spark Ecosystem
  • In-Memory data – Spark
Working with RDD's
  • Creating, Loading and Saving RDD
  • Transformations in RDD
  • Actions in RDD
  • Key-Value Pair RDD
  • MapReduce and Pair RDD operations
  • RDD Partitions
Writing and Deploying Spark Applications
  • Spark Applications vs. Spark Shell
  • Creating Spark Context
  • Building a Spark Application
  • Running a Spark Application
  • Spark and Hadoop Integration-HDFS
  • Handling Sequence Files
Spark RDD
  • RDD Lineage
  • RDD Persistence Overview
  • Distributed Persistence
Spark Streaming
  • Spark Streaming Architecture
  • First Spark Streaming Programming
  • Transformations in Spark Streaming
Spark MLlib

1. What is Machine Learning?
2. ML library for Spark
3. Algorithms
  • Statistics
  • Classification
  • Regression
  • Clustering
  • Collaborative Filtering
Spark SQL
  • Overview on Hive
  • Spark SQL Architecture
  • SQLContext in Spark SQL
  • Working with DataFrames
  • Example for Spark SQL
  • Integrating Hive and Spark SQL
  • DataFrames and RDD's
  • Knowing JSON and Parquet File Formats
  • Loading of data
  • Comparing Spark SQL,Impala and Hive-on-Spark
GraphX
  • Overview of GraphX
  • Data Visualisation in Spark
  • Common Spark use-cases
Performance Tuning
  • Shared Variables: Broadcast Variables
  • Shared Variables: Accumulators
  • Common Performance Issues
  • Performance tuning tips

SPARK DEVELOPMENT TRAINING

£ 503.05 VAT inc.

*Indicative price

Original amount in INR:

₹ 52,714