Course not currently available

Introduction to Big Data Hadoop Course

Short course

In London ()

Price on request

Description

  • Type

    Short course

  • Level

    Beginner

  • Duration

    1 Day

Master the concepts of the Hadoop framework

Learn the concepts of the big data Hadoop framework by completing PCWorkshops introductory course. By adding Hadoop to your skillset, you could potentially advance your career in big data or in businesses that will require your analysis skills.

About this course

We will go through the Hadoop history by looking at the ecosystem and stack, cluster architecture overview and the development environment. Then move on to the basic commands and concepts that will help consolidate the course. In no time you will acquire the skills of data engineer and many other roles.

Some background knowledge of Hadoop will help you excel

Questions & Answers

Add your question

Our advisors and other users will be able to reply to you

Fill in your details to get a reply

We will only publish your name and question

Reviews

This centre's achievements

2018

All courses are up to date

The average rating is higher than 3.7

More than 50 reviews in the last 12 months

This centre has featured on Emagister for 7 years

Subjects

  • Big Data
  • Data Analytics
  • Hadoop
  • MapReduce
  • Hadoop cluster
  • HDFS
  • Batch Processing
  • MapReduce computations
  • Analytics professional
  • Hadoop Ecosystem

Teachers and trainers (1)

Mary  Smith

Mary Smith

Tutor

How Mary Smith teaches: I am consultative, flexible and understanding. I appreciate the trust given to me. What Mary Smith teaches: Databases, MS SQL Server, Oracle 11g, MySQL, Access Database, Excel, MS Power BI, Tableau, SSRS, MS SQL Server Report Builder. Java Coding Basics.

Course programme

Course Outline


The problem space and example applications
Why don’t traditional approaches scale?

Requirements
Hadoop Background


Hadoop History

  • The ecosystem and stack: HDFS, MapReduce, Hive, Pig…
  • Cluster architecture overview
  • Development Environment

---------------------------------------------------------------------------------------------

Hadoop distribution and basic commands
Eclipse development
HDFS Introduction


The HDFS command line and web interfaces
The HDFS Java API (lab)
MapReduce Introduction


Key philosophy: move computation, not data
Core concepts: Mappers, reducers, drivers
The MapReduce Java API (lab)
Real-World MapReduce


Optimizing with Combiners and Partitioners (lab)
More common algorithms: sorting, indexing and searching (lab)
Testing with MRUnit
Higher-level Tools


Patterns to abstract “thinking in MapReduce”
The Cascading library (practical)
§ The Hive database (practical)

Introduction to Big Data Hadoop Course

Price on request