Solving 10 Hadoop'able Problems

Course

Online

£ 150 + VAT

Description

  • Type

    Course

  • Methodology

    Online

  • Start date

    Different dates available

Need solutions to your big data problems? Here are 10 real-world projects demonstrating problems solved using Hadoop.The Apache Hadoop ecosystem is a popular and powerful tool to solve big data problems. With so many competing tools to process data, many users want to know which particular problems are well suited to Hadoop, and how to implement those solutions. To know what types of problems are Hadoop-able it is good to start with a basic understanding of the core components of Hadoop. You will learn about the ecosystem designed to run on top of Hadoop as well as software that is deployed alongside it. These tools give us the building blocks to build data processing applications. This course covers the core parts of the Hadoop ecosystem, helping to give a broad understanding and get you up-and-running fast. Next, it describes a number of common problems as case-study projects Hadoop is able to solve. These sections are broken down into sections by different projects, each serving as a specific use case for solving big data problems. By the end of this course, you will have been exposed to a wide variety of Hadoop software and examples of how it is used to solve common big data problems.About The AuthorTomasz Lelek is a Software Engineer and Co-Founder of InitLearn. He mostly does programming in Java and Scala. He dedicates his time and efforts to get better at everything. He is currently delving into big data technologies. Tomasz is very passionate about everything associated with software development. He has been a speaker at a few conferences in Poland-Confitura and JDD, and at the Krakow Scala User Group. He has also conducted a live coding session at Geecon Conference. He was also a speaker at an international event in Dhaka. He is very enthusiastic and loves to share his knowledge.

Facilities

Location

Start date

Online

Start date

Different dates availableEnrolment now open

About this course

Explore the Hadoop big data Ecosystem in a nutshell
Process payment data from an event stream using the streaming API: Payment Analyzer
Detect BOT traffic using Spark Streaming, make log data queryable, and investigate customer data
Supply Chain analysis - find top-seller items in a streaming way, enhance top-seller items
Analyze Customer churn amounts quantitatively with DataFrame queries
Perform IoT sensor data analysis with device response to system failures and data streams
High-performance computation with neighborhood aggregations
Page ranking using Spark GraphX
Threat Analysis - Analyzing weblogs for suspicious activity and anomalies in network traffic
Extract information from unstructured text via Spark DataFrames
Perform sentiment analysis of posts using Logistic Regression, and find the author of a post
Find what product users want to buy using Cloudera Sandbox Toolkit
Use movie history to suggest content, and test and experiment with Recommendation Engine

Questions & Answers

Add your question

Our advisors and other users will be able to reply to you

Who would you like to address this question to?

Fill in your details to get a reply

We will only publish your name and question

Reviews

This centre's achievements

2021

All courses are up to date

The average rating is higher than 3.7

More than 50 reviews in the last 12 months

This centre has featured on Emagister for 4 years

Subjects

  • Financial Training
  • Trade
  • Financial
  • Database training
  • SQL
  • Database
  • Apache
  • Surveillance
  • Information Systems management
  • IT

Course programme

Core Components 3 lectures 14:39 The Course Overview This video gives an overview of the entire course. Hadoop Distributed File System (HDFS) In this video, we will see what a HDFS is. • What a Hadoop is • What the Hadoop Distributed File System is • Explain HDFS architecture Distributed Compute Capability YARN In this video, we will learn about YARN. • what the YARN is • How it is used with Spark Core Components- Quiz Core Components 3 lectures 14:39 The Course Overview This video gives an overview of the entire course. Hadoop Distributed File System (HDFS) In this video, we will see what a HDFS is. • What a Hadoop is • What the Hadoop Distributed File System is • Explain HDFS architecture Distributed Compute Capability YARN In this video, we will learn about YARN. • what the YARN is • How it is used with Spark Core Components- Quiz The Course Overview This video gives an overview of the entire course. The Course Overview This video gives an overview of the entire course. The Course Overview This video gives an overview of the entire course. The Course Overview This video gives an overview of the entire course. This video gives an overview of the entire course. This video gives an overview of the entire course. Hadoop Distributed File System (HDFS) In this video, we will see what a HDFS is. • What a Hadoop is • What the Hadoop Distributed File System is • Explain HDFS architecture Hadoop Distributed File System (HDFS) In this video, we will see what a HDFS is. • What a Hadoop is • What the Hadoop Distributed File System is • Explain HDFS architecture Hadoop Distributed File System (HDFS) In this video, we will see what a HDFS is. • What a Hadoop is • What the Hadoop Distributed File System is • Explain HDFS architecture Hadoop Distributed File System (HDFS) In this video, we will see what a HDFS is. • What a Hadoop is • What the Hadoop Distributed File System is • Explain HDFS architecture In this video, we will see what a HDFS is. • What a Hadoop is • What the Hadoop Distributed File System is • Explain HDFS architecture In this video, we will see what a HDFS is. • What a Hadoop is • What the Hadoop Distributed File System is • Explain HDFS architecture Distributed Compute Capability YARN In this video, we will learn about YARN. • what the YARN is • How it is used with Spark Distributed Compute Capability YARN In this video, we will learn about YARN. • what the YARN is • How it is used with Spark Distributed Compute Capability YARN In this video, we will learn about YARN. • what the YARN is • How it is used with Spark Distributed Compute Capability YARN In this video, we will learn about YARN. • what the YARN is • How it is used with Spark In this video, we will learn about YARN. • what the YARN is • How it is used with Spark In this video, we will learn about YARN. • what the YARN is • How it is used with Spark Core Components- Quiz Core Components- Quiz Core Components- Quiz Core Components- Quiz Downstream Ecosystem 5 lectures 28:09 Apache Hive for ETL and SQL Like In this video, we will see what the Hive is. • When to use Hive • How Hive is using HDFS • What is a Metastore Message Queuing and Data Ingestion Kafka In this video, we will see what a pub-sub is. • What the topic is • How Kafka topics scale • What is a topic offset NoSQL Datastores – Hadoop HBase, Accumulo In this video, we will see some column-oriented database concepts. • Explain HBASE architecture • Explain HBASE data structure • Explain Accumulo architecture Machine Learning – Spark and Spark MLlib In this video, we will see Spark architecture. • Explain RDD • Explain partitioning • Explain Spark MLlib Stream Processing – Spark Streaming In this video, we will explain Spark Streaming architecture. • Explain some micro batches • See the difference between latency versus throughput • Explain failure recovery and Check pointing Downstream Ecosystem- Quiz Downstream Ecosystem 5 lectures 28:09 Apache Hive for ETL and SQL Like In this video, we will see what the Hive is. • When to use Hive • How Hive is using HDFS • What is a Metastore Message Queuing and Data Ingestion Kafka In this video, we will see what a pub-sub is. • What the topic is • How Kafka topics scale • What is a topic offset NoSQL Datastores – Hadoop HBase, Accumulo In this video, we will see some column-oriented database concepts. • Explain HBASE architecture • Explain HBASE data structure • Explain Accumulo architecture Machine Learning – Spark and Spark MLlib In this video, we will see Spark architecture. • Explain RDD • Explain partitioning • Explain Spark MLlib Stream Processing – Spark Streaming In this video, we will explain Spark Streaming architecture. • Explain some micro batches • See the difference between latency versus throughput • Explain failure recovery and Check pointing Downstream Ecosystem- Quiz Apache Hive for ETL and SQL Like In this video, we will see what the Hive is. • When to use Hive • How Hive is using HDFS • What is a Metastore Apache Hive for ETL and SQL Like In this video, we will see what the Hive is. • When to use Hive • How Hive is using HDFS • What is a Metastore Apache Hive for ETL and SQL Like In this video, we will see what the Hive is. • When to use Hive • How Hive is using HDFS • What is a Metastore Apache Hive for ETL and SQL Like In this video, we will see what the Hive is. • When to use Hive • How Hive is using HDFS • What is a Metastore In this video, we will see what the Hive is. • When to use Hive • How Hive is using HDFS • What is a Metastore In this video, we will see what the Hive is. • When to use Hive • How Hive is using HDFS • What is a Metastore Message Queuing and Data Ingestion Kafka In this video, we will see what a pub-sub is. • What the topic is • How Kafka topics scale • What is a topic offset Message Queuing and Data Ingestion Kafka In this video, we will see what a pub-sub is. • What the topic is • How Kafka topics scale • What is a topic offset Message Queuing and Data Ingestion Kafka In this video, we will see what a pub-sub is. • What the topic is • How Kafka topics scale • What is a topic offset Message Queuing and Data Ingestion Kafka In this video, we will see what a pub-sub is. • What the topic is • How Kafka topics scale • What is a topic offset In this video, we will see what a pub-sub is. • What the topic is • How Kafka topics scale • What is a topic offset In this video, we will see what a pub-sub is. • What the topic is • How Kafka topics scale • What is a topic offset NoSQL Datastores – Hadoop HBase, Accumulo In this video, we will see some column-oriented database concepts. • Explain HBASE architecture • Explain HBASE data structure • Explain Accumulo architecture NoSQL Datastores – Hadoop HBase, Accumulo In this video, we will see some column-oriented database concepts. • Explain HBASE architecture • Explain HBASE data structure • Explain Accumulo architecture NoSQL Datastores – Hadoop HBase, Accumulo In this video, we will see some column-oriented database concepts. • Explain HBASE architecture • Explain HBASE data structure • Explain Accumulo architecture NoSQL Datastores – Hadoop HBase, Accumulo In this video, we will see some column-oriented database concepts. • Explain HBASE architecture • Explain HBASE data structure • Explain Accumulo architecture In this video, we will see some column-oriented database concepts. • Explain HBASE architecture • Explain HBASE data structure • Explain Accumulo architecture In this video, we will see some column-oriented database concepts. • Explain HBASE architecture • Explain HBASE data structure • Explain Accumulo architecture Machine Learning – Spark and Spark MLlib In this video, we will see Spark architecture. • Explain RDD • Explain partitioning • Explain Spark MLlib Machine Learning – Spark and Spark MLlib In this video, we will see Spark architecture. • Explain RDD • Explain partitioning • Explain Spark MLlib Machine Learning – Spark and Spark MLlib In this video, we will see Spark architecture. • Explain RDD • Explain partitioning • Explain Spark MLlib Machine Learning – Spark and Spark MLlib In this video, we will see Spark architecture. • Explain RDD • Explain partitioning • Explain Spark MLlib In this video, we will see Spark architecture. • Explain RDD • Explain partitioning • Explain Spark MLlib In this video, we will see Spark architecture. • Explain RDD • Explain partitioning • Explain Spark MLlib Stream Processing – Spark Streaming In this video, we will explain Spark Streaming architecture. • Explain some micro batches • See the difference between latency versus throughput • Explain failure recovery and Check pointing Stream Processing – Spark Streaming In this video, we will explain Spark Streaming architecture. • Explain some micro batches • See the difference between latency versus throughput • Explain failure recovery and Check pointing Stream Processing – Spark Streaming In this video, we will explain Spark Streaming architecture. • Explain some micro batches • See the difference between latency versus throughput • Explain failure recovery and Check pointing Stream Processing – Spark Streaming In this video, we will explain Spark Streaming architecture. • Explain some micro batches • See the difference between latency versus throughput • Explain failure recovery and Check pointing In this video, we will explain Spark Streaming architecture. • Explain some micro batches • See the difference between latency versus throughput • Explain failure recovery and Check pointing In this video, we will explain Spark Streaming architecture. • Explain some micro batches • See the difference between latency versus throughput • Explain failure recovery and Check pointing Downstream Ecosystem- Quiz Downstream Ecosystem- Quiz Downstream Ecosystem- Quiz Downstream Ecosystem- Quiz Financial, Trade, and Time Series Applications – Trade Surveillance 3 lectures 16:17 Processing Payment Data from an Event Stream In this video, we will process payment data. • Create DStream provider • Explain stream of payment Advanced Aggregations Using Streaming API – PaymentAnalyzer In this video, we will implement real-time logic on stream of events. • Implement PaymentAnalyzer • Save results to sink • Test the final result Storing Time Series Data in HBase In this video, we will save data to HBase. • Implement HBase connector • Save data into HBase Financial, Trade, and Time Series Applications – Trade Surveillance- Quiz Financial, Trade, and Time Series Applications – Trade Surveillance yze the amounts of customer churn based on transactional amounts. • Calculate churn based on the...

Additional information

Apache Hadoop ecosystem and solving data problems

Solving 10 Hadoop'able Problems

£ 150 + VAT