SPARK DEVELOPMENT TRAINING
Training
Online
*Indicative price
Original amount in INR:
₹ 52,714
Description
-
Type
Training
-
Level
Intermediate
-
Methodology
Online
-
Duration
1 Month
-
Start date
Different dates available
-
Online campus
Yes
-
Delivery of study materials
Yes
-
Support service
Yes
-
Virtual classes
Yes
Workshop style coaching
Interactive approach
Course material
POC Implementation
Hands on practice exercises for each topic
Quiz at the end of each major topic
Tips and techniques on Cloudera Certification Examination
Linux concepts and basic commands
On Demand Services
Mock interviews for each individual will be conducted on need basis
SQL basics on need basis
Resume preparation and guidance
Interview questions
Facilities
Location
Start date
Start date
About this course
SCALA for Spark
Reviews
Subjects
- SQL
- Scala
- Spark
- Scala REPL
- Java to Scala
- Java
- REPL
- Scala IDE
- Operations
- WHILE
- Loops
Course programme
Scala Basics
- What is Scala?
- Why Scala for Spark?
- Intro to Scala REPL : Journey from Java to Scala
- Installing Scala IDE
- Basic Operations
- Defining Functions
- Control Structures in Scala
- loops – ForEach, While, Do-While
- Collections – Array, ArrayBuffer, Map, Tuples, Lists
- If Statements
- Conditional Operators
- Enumerations
- Class and Object Basics
- Scala Constructors
- Nested Classes
- Visibility Rules
- Overriding Methods
- Functional Programming
- Higher Order Functions
- Traits
- Interfaces
- Layered Traits
- Introduction to BigData
- Challenges with Bigdata
- Batch Vs. Realtime processing
- Overview- Hadoop Ecosystem
- HDFS
- Review of MapReduce
- Hive
- Sqoop
- Flume
- What is Spark?
- Spark Overview
- Setting up environment
- Using Spark Shell
- Spark Web UI
- RDD's
- Spark Context
- Spark Ecosystem
- In-Memory data – Spark
- Creating, Loading and Saving RDD
- Transformations in RDD
- Actions in RDD
- Key-Value Pair RDD
- MapReduce and Pair RDD operations
- RDD Partitions
- Spark Applications vs. Spark Shell
- Creating Spark Context
- Building a Spark Application
- Running a Spark Application
- Spark and Hadoop Integration-HDFS
- Handling Sequence Files
- RDD Lineage
- RDD Persistence Overview
- Distributed Persistence
- Spark Streaming Architecture
- First Spark Streaming Programming
- Transformations in Spark Streaming
1. What is Machine Learning?
2. ML library for Spark
3. Algorithms
- Statistics
- Classification
- Regression
- Clustering
- Collaborative Filtering
- Overview on Hive
- Spark SQL Architecture
- SQLContext in Spark SQL
- Working with DataFrames
- Example for Spark SQL
- Integrating Hive and Spark SQL
- DataFrames and RDD's
- Knowing JSON and Parquet File Formats
- Loading of data
- Comparing Spark SQL,Impala and Hive-on-Spark
- Overview of GraphX
- Data Visualisation in Spark
- Common Spark use-cases
- Shared Variables: Broadcast Variables
- Shared Variables: Accumulators
- Common Performance Issues
- Performance tuning tips
SPARK DEVELOPMENT TRAINING
*Indicative price
Original amount in INR:
₹ 52,714