Advanced Distributed Machine Learning with Spark - University of California

edX

Course

Online

Free

Description

  • Type

    Course

  • Methodology

    Online

  • Duration

    4 Weeks

  • Start date

    Different dates available

Learn how to develop and deploy distributed machine leaning pipelines and gain the expertise to write efficient, scalable code in Spark. With this course you earn while you learn, you gain recognized qualifications, job specific skills and knowledge and this helps you stand out in the job market.

Facilities

Location

Start date

Online

Start date

Different dates availableEnrolment now open

About this course

Python programming background; experience with PySpark and Machine Learning equivalent to CS120x: Distributed Machine Learning with Spark; comfort with mathematical and algorithmic reasoning; familiarity with basic machine learning concepts; exposure to algorithms, probability, linear algebra and calculus.

Questions & Answers

Add your question

Our advisors and other users will be able to reply to you

Who would you like to address this question to?

Fill in your details to get a reply

We will only publish your name and question

Reviews

This centre's achievements

2017

All courses are up to date

The average rating is higher than 3.7

More than 50 reviews in the last 12 months

This centre has featured on Emagister for 8 years

Subjects

  • Spark
  • Machine Learning
  • Data analysis
  • Statistics
  • Databricks

Course programme

Building on the core ideas presented in Distributed Machine Learning with Spark, this course covers advanced topics for training and deploying large-scale learning pipelines. You will study state-of-the-art distributed algorithms for collaborative filtering, ensemble methods (e.g., random forests), clustering and topic modeling, with a focus on model parallelism and the crucial tradeoffs between computation and communication. After completing this course, you will have a thorough understanding of the statistical and algorithmic principles required to develop and deploy distributed machine learning pipelines. You will further have the expertise to write efficient and scalable code in Spark, using MLlib and the spark.ml package in particular.

Advanced Distributed Machine Learning with Spark - University of California

Free