Description

Type

Short course
Level

Beginner
Methodology

Inhouse

Duration

2 Days
Start date

Different dates available

The NoSQL (Not Only SQL) persistence systems space offers a great variety of solutions that may be overwhelming. This class aims at helping the attendees understand the challenges of the emerging world of Big Data as well as identify suitable use cases for a variety of NoSQL systems such as Pig, Hive, HBase, Cassandra and MongoDB.

Facilities

Inhouse

Start date

Different dates availableEnrolment now open

About this course

Course objectives

Upon completion of this course, you will be able to:Introduce students to the core concepts of Big DataProvide a general overview of the most common NoSQL storesExplain how to choose the correct NoSQL database for specific use casesGeneral overview of the architecture of Hadoop and MongoDB

Who is it intended for?

Engineers, Programmers, Networking specialists, Managers, Executives

Requirements

None

What marks this course apart?

Very useful course for Engineers, Programmers, Networking specialists, Managers, Executives.

Questions & Answers

Add your question

Our advisors and other users will be able to reply to you

Who would you like to address this question to?

All
Students
Centre

Fill in your details to get a reply

We will only publish your name and question

Reviews

5.0

100%

4.8

excellent

Course rating

Recommended

Centre rating

Mercy Kahn

5.0

24/03/2019

About the course: The course is good and starting it after the work is really a good idea as one doesn't get time for, nothing to say further.

Would you recommend this course?: Yes

Margaret B. Johnson

5.0

22/03/2019

About the course: For accommodating and giving an example the second day class were extended by the tutor, he really covers a lot in just three days.

Would you recommend this course?: Yes

*All reviews collected by Emagister & iAgora have been verified

This centre's achievements

2018

How do you get the CUM LAUDE seal?

All courses are up to date

The average rating is higher than 3.7

More than 50 reviews in the last 12 months

This centre has featured on Emagister for 6 years

Subjects

Latin
Word
Java
Apache
Database
SQL
Database training
Design
Systems
Ms Word

Teachers and trainers (1)

Bright Solutions

Trainer

Course programme

#text-block-10 { margin-bottom:0px; text-align:left; }

Chapter 1. Defining Big Data

Transforming Data into Business Information
Quality of Data
Gartner’s Definition of Big Data
More Definitions of Big Data
Processing Big Data
Challenges Posed by Big Data
The Cloud and Big Data
The Business Value of Big Data
Big Data: Hype or Reality?
Big Data Quiz
Big Data Quiz Answers
Summary

Chapter 2. NoSQL and Big Data Systems Overview

Limitations of Relational Databases
Limitations of Relational Databases (Con’t)
What are NoSQL (Not Only SQL) Databases?
The Past and Present of the NoSQL World
NoSQL Database Properties
NoSQL Benefits
NoSQL Database Storage Types
The CAP Theorem
Mechanisms to Guarantee a Single CAP Property
NoSQL Systems CAP Triangle
Limitations of NoSQL Databases
Big Data Sharding
Sharding Example
Amazon S3
Amazon Storage SLAs
Amazon Glacier
Amazon S3 Security
Data Lifecycle Management with Amazon S3
Amazon S3 Cost Monitoring
OpenStack
Object Store (Swift)
Components of Swift
Google BigTable
BigTable-based Applications
BigTable Design
Google Cloud Storage
Summary

Chapter 3. Adopting NoSQL

Hype Cycle and Technology Adoption Model
Barriers to Adoption
Dismantling Barriers to Adoption
Use Cases for NoSQL Database Systems
Example Applications
Industry trends
Enterprise Hadoop Solutions Offerings
Enterprise Big Data / NoSQL Offerings
IBM InfoSphere Platform
Oracle Big Data Appliance
NoSQL Technology Adoption Action Plan
Summary

Chapter 4. MapReduce Overview

MapReduce Defined
Google’s MapReduce
MapReduce Explained
MapReduce Word Count Job
MapReduce Shared-Nothing Architecture
Similarity with SQL Aggregation Operations
Example of Map & Reduce Operations using JavaScript
Problems Suitable for Solving with MapReduce
Typical MapReduce Jobs
Fault-tolerance of MapReduce
Distributed Computing Economics
MapReduce Systems
Summary

Chapter 5. Introduction to MongoDB

MongoDB
MongoDB Features (Cont’d)
MongoDB’s Logo
Positioning of MongoDB
Sharding in MongoDB
Data Replication
A Sample Sharded Cluster Diagram
MongoDB Security
Authentication
Data and Network Encryption
MongoDB Limitations
MongoDB Operational Intelligence
MongoDB Use Cases
MongoDB Data Model
The _id Primary Key Filed Considerations
Terminology
MongoDB Data Model
Data Modeling in RDBMS
Data Modeling in MongoDB
MongoDB Data Modeling
A Sample JSON Document Matching the Schema
Data Lifecycle Management
Data Lifecycle Management: TTL
Data Lifecycle Management: Capped Collections
MongoDB Query Language (QL)
The
find
and
findOne
Methods
The
find
and
findOne
Methods
A MongoDB QL Example
Data Inserts
Creating an Index
MongoDB vs Apache CouchDB
Summary

Chapter 6. Hadoop Overview

Apache Hadoop
Apache Hadoop Logo
Typical Hadoop Applications
Hadoop Clusters
Hadoop Design Principles
Hadoop’s Core Components
Hadoop Simple Definition
High-Level Hadoop Architecture
Hadoop-based Systems for Data Analysis
Hadoop Caveats
Summary

Chapter 7. Hadoop Distributed File System Overview

Hadoop Distributed File System
Data Blocks
Data Block Replication Example
HDFS NameNode Directory Diagram
Accessing HDFS
Examples of HDFS Commands
Client Interactions with HDFS for the Read Operation
Read Operation Sequence Diagram
Client Interactions with HDFS for the Write Operation
Communication inside HDFS
Summary

#text-block-11 { margin-bottom:0px; text-align:left; }

Chapter 8. MapReduce with Hadoop

Hadoop’s MapReduce
MapReduce v1 (“Classic MapReduce”)
JobTracker and TaskTracker
YARN (MapReduce v2)
MapReduce Programming Options
Java MapReduce API
The Structure of a Java MapReduce Program
The Mapper Class
The Reducer Class
The Driver Class
Compiling Classes
Running the MapReduce Job
The Structure of a Single MapReduce Program
Combiner Pass (Optional)
Hadoop’s Streaming MapReduce
Python Word Count Mapper Program Example
Python Word Count Reducer Program Example
Setting up Java Classpath for Streaming Support
Streaming Use Cases
The Streaming API vs Java MapReduce API
Amazon Elastic MapReduce
Summary

Chapter 9. Apache Pig Scripting Platform

What is Pig?
Pig Latin
Apache Pig Logo
Pig Execution Modes
Local Execution Mode
MapReduce Execution Mode
Running Pig
Running Pig in Batch Mode
What is Grunt?
Pig Latin Statements
Pig Programs
Pig Latin Script Example
SQL Equivalent
Differences between Pig and SQL
Statement Processing in Pig
Comments in Pig
Supported Simple Data Types
Supported Complex Data Types
Arrays
Defining Relation’s Schema
The bytearray Generic Type
Using Field Delimiters
Referencing Fields in Relations
Summary

Chapter 10. Apache Pig HDFS Interface

The HDFS Interface
FSShell Commands (Short List)
Grunt’s Old File System Commands
Summary

Chapter 11. Apache Pig Relational and Eval Operators

Pig Relational Operators
Example of Using the JOIN Operator
Example of Using the Order By Operator
Caveats of Using Relational Operators
Pig Eval Functions
Caveats of Using Eval Functions (Operators)
Example of Using Single-column Eval Operations
Example of Using Eval Operators For Global Operations
Summary

Chapter 12. Hive

What is Hive?
Apache Hive Logo
Hive’s Value Proposition
Who uses Hive?
Hive’s Main Systems
Hive Features
Hive Architecture
HiveQL
Where are the Hive Tables Located?
Hive Command-line Interface (CLI)
Summary

Chapter 13. Hive Command-line Interface

Hive Command-line Interface (CLI)
The Hive Interactive Shell
Running Host OS Commands from the Hive Shell
Interfacing with HDFS from the Hive Shell
The Hive in Unattended Mode
The Hive CLI Integration with the OS Shell
Executing HiveQL Scripts
Comments in Hive Scripts
Variables and Properties in Hive CLI
Setting Properties in CLI
Example of Setting Properties in CLI
Hive Namespaces
Using the SET Command
Setting Properties in the Shell
Setting Properties for the New Shell Session
Summary

Chapter 14. Hive Data Definition Language

Hive Data Definition Language
Creating Databases in Hive
Using Databases
Creating Tables in Hive
Supported Data Type Categories
Common Primitive Types
Example of the CREATE TABLE Statement
The STRUCT Type
Table Partitioning
Table Partitioning
Table Partitioning on Multiple Columns
Viewing Table Partitions
Row Format
Data Serializers / Deserializers
File Format Storage
More on File Formats
The EXTERNAL DDL Parameter
Example of Using EXTERNAL
Creating an Empty Table
Dropping a Table
Table / Partition(s) Truncation
Alter Table/Partition/Column
Views
Create View Statement
Why Use Views?
Restricting Amount of Viewable Data
Examples of Restricting Amount of Viewable Data
Creating and Dropping Indexes
Describing Data
Summary

Chapter 15. Apache HBase

What is HBase?
HBase Design
HBase Features
The Write-Ahead Log (WAL) and MemStore
HBase vs RDBS
HBase vs Apache Cassandra
Interfacing with HBase
HBase Thrift And REST Gateway
HBase Table Design
Column Families
A Cell’s Value Versioning
Timestamps
Accessing Cells
HBase Table Design Digest
Table Horizontal Partitioning with Regions
HBase Compaction
Loading Data in HBase
HBase Shell
HBase Shell Command Groups
Creating and Populating a Table in HBase Shell
Getting a Cell’s Value
Counting Rows in an HBase Table
Summary

See related categories

Contact us

NoSQL Architecture Comparison Training – Big Data

Questions & Answers

Reviews

Reviews on other courses at this centre

This centre's achievements

Subjects

Course programme

Add similar courses
and compare them to help you choose.

NoSQL Architecture Comparison Training – Big Data

Questions & Answers

Reviews

Reviews on other courses at this centre

This centre's achievements

Subjects

Course programme

Add similar coursesand compare them to help you choose.

Add similar courses
and compare them to help you choose.