Big Data with Hadoop Developer

Training

Online

£ 990 VAT inc.

Description

  • Type

    Training

  • Level

    Intermediate

  • Methodology

    Online

  • Class hours

    30h

  • Duration

    5 Days

  • Online campus

    Yes

  • Delivery of study materials

    Yes

The Big data is nothing more than large volume of data. It has been around for around two decades. Big data is large amount of data owned by a company, obtained and manipulated through some new techniques in order to produce valuable format in the best way possible.

In order to store and process big data an open-source framework Hadoop 3.0 can be used. Hadoop can process big data in a distributed environment across clusters of computers using simple programming models. Hadoop can scale up from single servers to hundreads of machines, each will offer local computation and storage. Our training in Big Data with Hadoop will teach you the concept of Big Data and will practically show how you can use Hadoop framework in distributed environment.

About this course

At the end of the training the candidates will be able to develop applications in Big Data

Questions & Answers

Add your question

Our advisors and other users will be able to reply to you

Who would you like to address this question to?

Fill in your details to get a reply

We will only publish your name and question

Reviews

Subjects

  • Big Data
  • Hadoop
  • HIVE
  • Pig
  • Analytics
  • Oracle

Teachers and trainers (1)

Aras Arasilango

Aras Arasilango

Director

I am a member of International Society of Business Leaders and a pioneer in Meta-computing space. I am a senior IT Scientist and CTO at Testenium Limited (testenium.co.uk) with unique skills acquired during 38 years of IT experience. I have invented the most innovative meta-programming language called TAMIL, to develop secure applications within a second. I was a director of CBIT Certifications of Bridgeport University and Syracuse University. Developed a world's first voice activated programming code generated cloud platform approid.co.uk.

Course programme

INTRODUCTION TO HDFS AND HADOOP 3.0

  • Explain Hadoop 3.0 and YARN
  • Explain how HDFS Federation works in Hadoop 3.0
  • Explain the various tools and frameworks in the Hadoop 3.0 ecosystem
  • Use the Hadoop cbrent to input data into HDFS
  • Using HDFS commands
  • Distributed systemsPreview
  • Big Data Use Cases

MAPREDUCE AND YARN

  • Explain the architecture of MapReduce
  • Run a MapReduce job on Hadoop
  • Monitor a MapReduce job

PIG

  • Write a Pig script to explore and transform data in HDFS
  • Define advanced Pig relations
  • Use Pig to apply structure to unstructured Big Data
  • Invoke a Pig User-Defined Function
  • Compute Quantiles with Pig
  • Explore data with Pig
  • Spbrt a dataset with Pig
  • Join datasets with Pig
  • UsePig to prepare data for Hive

HIVE AND HCATALOG

  • Write a Hive query
  • Understand how Hive tables are defined and implemented
  • Use Hive to run SQL-brke queries to perform data analysis
  • Perform a multi-table select in Hive
  • Design a proper schema for Hive
  • Explain the uses and purpose of HCatalog
  • Use HCatalog with Pig and Hive
  • Computing ngrams with Hive
  • Analyzing Big Data with Hive
  • Understanding MapReduce in Hive
  • Joining datasets with Hive
  • Streaming data with Hive and Python

HADOOP TOOLS

  • Use Sqoop to transfer data between Hadoop and a relational database
  • Using Sqoop to transfer data between HDFS and a RDBMS
  • Using HCatalog with Pig
  • Define a workflow using Oozie

BIG DATA ANALYTICS

  • Hive – Data ETL
  • Importing data to Excel

EXPORT BIG DATA TO ORACLE USING ORACLE OBIEE TO ANALYSE BIG DATA RISK FACTOR ANALYSIS

  • Using Spark to Analyse the Risk Factor
  • Using Pig to Analyse Risk Factor
  • Compute Driver Risk Factor

Big Data with Hadoop Developer

£ 990 VAT inc.