InfoSphere BigMatch for Hadoop v11.4 - SPVC

Course

In London

Price on request

Description

  • Type

    Course

  • Location

    London

  • Duration

    2 Days

  • Start date

    Different dates available

Contains: PDF course guide, as well as a lab environment where students can work through demonstrations and exercises at their own pace.

The IBM InfoSphere Big Match on Hadoop course will introduce students to the Probabilistic Matching Engine (PME) and how it can be used to resolve and discover entities across multiple data sets in Hadoop.  

Students will learn the basics of a PME algorithm including data model configuration, standardization, comparison and bucketing functions, weight generation, and threshold.

During the exercises, the student will work on a large use case, where they will apply their knowledge of Big Match to discover relationships be two data sets that can be used to understand the full view of the member data.

If you are enrolling in a Self Paced Virtual Classroom or Web Based Training course, before you enroll, please review the Self-Paced Virtual Classes and Web-Based Training Classes on our Terms and Conditions page, as well as the system requirements, to ensure that your system meets the minimum requirements for this course.

Facilities

Location

Start date

London
See map
Arrow Ecs Training, 56433

Start date

Different dates availableEnrolment now open

About this course



Understand the capabilities of the Probabilistic Matching Engine
Understand how the Probabilistic Matching engine is used with Big Insights to solve certain use cases.
Understand the technical framework of the Big Match solution and how member data is derived, bucketed and compared to produce a complete entity from multiple data sets.
Create a project and data model using the Big Match Console
Configure the HBase tables that will be used in a Big Match solution
Configure an algorithm using he Big Match console that includes Standardization, Comparison and Bucketing functions.
Set up Strings for Anonymous value, Equivalency values, Frequency values, and character maps using the Big Match console
Set up and run the Weight Generation process
Evaluate and set thresholds for the algorithm
Deploy a new algorithm to Big Match
Evaluate Entity results and reconfigure algorithm based on evaluation.  E.g. Large Buckets, Large Entities, Member not belonging to any buckets, etc


The course is designed for a technical audience that will be setting up a custom algorithm for the Probabilistic Matching Engine to use Big Match on Apache Hadoop to compare, match and/or search member records across multiple data sets.

This course has no pre-requisites.

Questions & Answers

Add your question

Our advisors and other users will be able to reply to you

Who would you like to address this question to?

Fill in your details to get a reply

We will only publish your name and question

Emagister S.L. (data controller) will process your data to carry out promotional activities (via email and/or phone), publish reviews, or manage incidents. You can learn about your rights and manage your preferences in the privacy policy.

Reviews

Subjects

  • Web

Course programme

1. Introduction to Big Match for Apache Hadoop
- What is Big Match
- How Big Match Works
- Big Match Components
- Big Match Architecture
2. Big Match Data Model Definition
- Members
- Attribute Types
- Member Attributes
- Sources
- Information Sources
3. PME Algorithm
- Standardization
- Bucketing
- Comparison Functions
4. Bucket Analysis
- Bucket Optimization
- Bucket Concerns
5. Weights
- String Weights
- Numeric Weights
- Multi-dimensional Weights
- Troubleshooting Weights
6. HBase Tables
- HBase concepts
- Big Match commands
- Big Match Tables (.pmebktidx, .pmemdmidx, .pmeentidx)
- Best Practices
7. BigMatch Applications
- PME Derive
- PME Compare
- PME Link
- PME Analysis

InfoSphere BigMatch for Hadoop v11.4 - SPVC

Price on request