Exploratory Data Analysis with R

Course

Online

£ 10 + VAT

Description

  • Type

    Course

  • Methodology

    Online

  • Start date

    Different dates available

Harness the skills to analyze your data effectively with EDA and R.The greatest number of mistakes and failures in data analysis comes from not performing adequate Exploratory Data Analysis (EDA). Lack of EDA knowledge can expose you to the great risk of drawing incorrect, and potentially harmful, conclusions from your data analysis.In this course, you will learn how EDA helps you draw conclusions to make better sense of your data and implement correct techniques. We'll begin with a brief introduction to EDA, its importance, and advantages over BI tools. Using R libraries like dplyr and ggplot2, we will generate insights and formulate relevant questions for investigation and communicate the results effectively using visualizations. You will learn how to spot missing data and errors, validate assumptions, and identify the patterns for understanding the problem. Based on this, you’ll be able to select a correct ML model to use for your data.By the end of the course, you will be able to quickly get know and interpret various kinds of data sets you will be presented with, and easily understand how to handle and work with them in order to make them ready for further modeling activities.Here's the link to the GitHub repo to this course: Please note that basic knowledge of R and R Studio, together with some knowledge of descriptive statistics, are key to getting the best out of this course.About the Author
.
Andrea Cirillo is a Senior Audit Quantitative Analyst at Intesa Sanpaolo Banking Group. He works daily with copious volumes of "messy" data for the purpose of auditing credit risk models. This has prompted him to develop the key skills needed to succeed in Exploratory Data Analysis (EDA). Andrea is also an active contributor to the R community with well-received packages like updateR and paletteR

Facilities

Location

Start date

Online

Start date

Different dates availableEnrolment now open

About this course

Set up your data and code to avoid mistakes and ensure reproducibility
Really understand the structure and content of your data
Build clear plots to evaluate the distribution of your data with ggplot
Construct summaries of your variables with dplyr
Implement data cleaning and validation tasks to get your data ready for data mining activities
Test a hypothesis or check assumptions related to a specific model
Estimate parameters and figure the margins of error

Questions & Answers

Add your question

Our advisors and other users will be able to reply to you

Who would you like to address this question to?

Fill in your details to get a reply

We will only publish your name and question

Reviews

This centre's achievements

2021

All courses are up to date

The average rating is higher than 3.7

More than 50 reviews in the last 12 months

This centre has featured on Emagister for 4 years

Subjects

  • Team Training
  • IT risk
  • Project
  • Logic
  • Workflow
  • Data analysis
  • Options
  • Best Practice
  • Risk

Course programme

Setting the Stage: How to Organize Your EDA Working Area 6 lectures 42:25 The Course Overview This video will give you an overview about the course. Setting Up RStudio Project Dealing with data is a real error prone activity. It is easy to mess up thing, accidentally deleting data, or being unable to reproduce results previously obtained. A RStudio project can help you solve this issue. • Discover what reproducibility is and how RStudio projects can help • Discover the operational risks an RStudio project can help you address this • Create a RStudio project to host your analyses Organizing Project Structure (To Ensure Reproducibility) Not structuring your EDA project in a proper way, can become a big obstacle for the success of your EDA project. • Analyze the typical workflow of an EDA project, what it is made of, input, analysis, and output • Learn how a folder and files structure can better serve this workflow • Create this structure within your RStudio project, exploiting the power of system terminals. Coding Best Practice (Right Choices for Better Results) Coding without clear guidelines and handling data without clear rules can lead you to loss of information and data. • Learn how to structure your R code • Learn a set of minimal guidelines, to name R language objects, and structure your code • Learn how to protect the original input data from mess-up Using Git to Avoid Messing Up Your Analyses When working with a team on the same project, having a way to mark versions of your code and team-member contributions is of vital importance. • Learn what GIT is, and why it can solve typical problems of code versioning • Understand the logic beneath GIT • Exploit GIT within RStudio Pro Tip – Tweaking RStudio Project Options to Force Reproducibility Reproducibility is always a relevant topic. Within some business field, it is even a normative requirement. This video gives you one more help to ensure it. • Get to know RStudio project options • Discover which are the RStudio default options related to workspace saving and loading • Set the RStudio options for your project so to help you ensure reproducibility Setting the Stage: How to Organize Your EDA Working Area. 6 lectures 42:25 The Course Overview This video will give you an overview about the course. Setting Up RStudio Project Dealing with data is a real error prone activity. It is easy to mess up thing, accidentally deleting data, or being unable to reproduce results previously obtained. A RStudio project can help you solve this issue. • Discover what reproducibility is and how RStudio projects can help • Discover the operational risks an RStudio project can help you address this • Create a RStudio project to host your analyses Organizing Project Structure (To Ensure Reproducibility) Not structuring your EDA project in a proper way, can become a big obstacle for the success of your EDA project. • Analyze the typical workflow of an EDA project, what it is made of, input, analysis, and output • Learn how a folder and files structure can better serve this workflow • Create this structure within your RStudio project, exploiting the power of system terminals. Coding Best Practice (Right Choices for Better Results) Coding without clear guidelines and handling data without clear rules can lead you to loss of information and data. • Learn how to structure your R code • Learn a set of minimal guidelines, to name R language objects, and structure your code • Learn how to protect the original input data from mess-up Using Git to Avoid Messing Up Your Analyses When working with a team on the same project, having a way to mark versions of your code and team-member contributions is of vital importance. • Learn what GIT is, and why it can solve typical problems of code versioning • Understand the logic beneath GIT • Exploit GIT within RStudio Pro Tip – Tweaking RStudio Project Options to Force Reproducibility Reproducibility is always a relevant topic. Within some business field, it is even a normative requirement. This video gives you one more help to ensure it. • Get to know RStudio project options • Discover which are the RStudio default options related to workspace saving and loading • Set the RStudio options for your project so to help you ensure reproducibility The Course Overview This video will give you an overview about the course. The Course Overview This video will give you an overview about the course. The Course Overview This video will give you an overview about the course. The Course Overview This video will give you an overview about the course. This video will give you an overview about the course. This video will give you an overview about the course. Setting Up RStudio Project Dealing with data is a real error prone activity. It is easy to mess up thing, accidentally deleting data, or being unable to reproduce results previously obtained. A RStudio project can help you solve this issue. • Discover what reproducibility is and how RStudio projects can help • Discover the operational risks an RStudio project can help you address this • Create a RStudio project to host your analyses Setting Up RStudio Project Dealing with data is a real error prone activity. It is easy to mess up thing, accidentally deleting data, or being unable to reproduce results previously obtained. A RStudio project can help you solve this issue. • Discover what reproducibility is and how RStudio projects can help • Discover the operational risks an RStudio project can help you address this • Create a RStudio project to host your analyses Setting Up RStudio Project Dealing with data is a real error prone activity. It is easy to mess up thing, accidentally deleting data, or being unable to reproduce results previously obtained. A RStudio project can help you solve this issue. • Discover what reproducibility is and how RStudio projects can help • Discover the operational risks an RStudio project can help you address this • Create a RStudio project to host your analyses Setting Up RStudio Project Dealing with data is a real error prone activity. It is easy to mess up thing, accidentally deleting data, or being unable to reproduce results previously obtained. A RStudio project can help you solve this issue. • Discover what reproducibility is and how RStudio projects can help • Discover the operational risks an RStudio project can help you address this • Create a RStudio project to host your analyses Dealing with data is a real error prone activity. It is easy to mess up thing, accidentally deleting data, or being unable to reproduce results previously obtained. A RStudio project can help you solve this issue. • Discover what reproducibility is and how RStudio projects can help • Discover the operational risks an RStudio project can help you address this • Create a RStudio project to host your analyses Dealing with data is a real error prone activity. It is easy to mess up thing, accidentally deleting data, or being unable to reproduce results previously obtained. A RStudio project can help you solve this issue. • Discover what reproducibility is and how RStudio projects can help • Discover the operational risks an RStudio project can help you address this • Create a RStudio project to host your analyses Organizing Project Structure (To Ensure Reproducibility) Not structuring your EDA project in a proper way, can become a big obstacle for the success of your EDA project. • Analyze the typical workflow of an EDA project, what it is made of, input, analysis, and output • Learn how a folder and files structure can better serve this workflow • Create this structure within your RStudio project, exploiting the power of system terminals. Organizing Project Structure (To Ensure Reproducibility) Not structuring your EDA project in a proper way, can become a big obstacle for the success of your EDA project. • Analyze the typical workflow of an EDA project, what it is made of, input, analysis, and output • Learn how a folder and files structure can better serve this workflow • Create this structure within your RStudio project, exploiting the power of system terminals. Organizing Project Structure (To Ensure Reproducibility) Not structuring your EDA project in a proper way, can become a big obstacle for the success of your EDA project. • Analyze the typical workflow of an EDA project, what it is made of, input, analysis, and output • Learn how a folder and files structure can better serve this workflow • Create this structure within your RStudio project, exploiting the power of system terminals. Organizing Project Structure (To Ensure Reproducibility) Not structuring your EDA project in a proper way, can become a big obstacle for the success of your EDA project. • Analyze the typical workflow of an EDA project, what it is made of, input, analysis, and output • Learn how a folder and files structure can better serve this workflow • Create this structure within your RStudio project, exploiting the power of system terminals. Not structuring your EDA project in a proper way, can become a big obstacle for the success of your EDA project. • Analyze the typical workflow of an EDA project, what it is made of, input, analysis, and output • Learn how a folder and files structure can better serve this workflow • Create this structure within your RStudio project, exploiting the power of system terminals. Not structuring your EDA project in a proper way, can become a big obstacle for the success of your EDA project. • Analyze the typical workflow of an EDA project, what it is made of, input, analysis, and output • Learn how a folder and files structure can better serve this workflow • Create this structure within your RStudio project, exploiting the power of system terminals. Coding Best Practice (Right Choices for Better Results) Coding without clear guidelines and handling data without clear rules can lead you to loss of information and data. • Learn how to structure your R code • Learn a set of minimal guidelines, to name R language objects, and structure your code • Learn how to protect the original input data from mess-up Coding Best Practice (Right Choices for Better Results) Coding without clear guidelines and handling data without clear rules can lead you to loss of information and data. • Learn how to structure your R code • Learn a set of minimal guidelines, to name R language objects, and structure your code • Learn how to protect the original input data from mess-up Coding Best Practice (Right Choices for Better Results) Coding without clear guidelines and handling data without clear rules can lead you to loss of information and data. • Learn how to structure your R code • Learn a set of minimal guidelines, to name R language objects, and structure your code • Learn how to protect the original input data from mess-up Coding Best Practice (Right Choices for Better Results) Coding without clear guidelines and handling data without clear rules can lead you to loss of information and data. • Learn how to structure your R code • Learn a set of minimal guidelines, to name R language objects, and structure your code • Learn how to protect the original input data from mess-up Coding without clear guidelines and handling data without clear rules can lead you to loss of information and data. • Learn how to structure your R code • Learn a set of minimal guidelines, to name R language objects, and structure your code • Learn how to protect the original input data from mess-up Coding without clear guidelines and handling data without clear rules can lead you to loss of information and data. • Learn how to structure your R code • Learn a set of minimal guidelines, to name R language objects, and structure your code • Learn how to protect the original input data from mess-up Using Git to Avoid Messing Up Your Analyses When working with a team on the same project, having a way to mark versions of your code and team-member contributions is of vital importance. • Learn what GIT is, and why it can solve typical problems of code versioning • Understand the logic beneath GIT • Exploit GIT within RStudio Using Git to Avoid Messing Up Your Analyses When working with a team on the same project, having a way to mark versions of your code and team-member contributions is of vital importance. • Learn what GIT is, and why it can solve typical problems of code versioning • Understand the logic beneath GIT • Exploit GIT within RStudio Using Git to Avoid Messing Up Your Analyses When working with a team on the same project, having a way to mark versions of your code and team-member contributions is of vital importance. • Learn what GIT is, and why it can solve typical problems of code versioning • Understand the logic beneath GIT • Exploit GIT within RStudio Using Git to Avoid Messing Up Your Analyses When working with a team on the same project, having a way to mark versions of your code and team-member contributions is of vital importance. • Learn what GIT is, and why it can solve typical problems of code versioning • Understand the logic beneath GIT • Exploit GIT within RStudio When working with a team on the same project, having a way to mark versions of your code and team-member contributions is of vital importance. • Learn what GIT is, and why it can solve typical problems of code versioning • Understand the logic beneath GIT • Exploit GIT within RStudio When working with a team on the same project, having a way to mark versions of your code and team-member contributions is of vital importance. • Learn what GIT is, and why it can solve typical problems of code versioning • Understand the logic beneath GIT • Exploit GIT within RStudio Pro Tip – Tweaking RStudio Project Options to Force Reproducibility Reproducibility is always a relevant topic. Within some business field, it is even a normative requirement. This video gives you one more help to ensure it. • Get to know RStudio project options • Discover which are the RStudio default options related to workspace saving and loading • Set the RStudio options for your project so to help you ensure reproducibility Pro Tip – Tweaking RStudio Project Options to Force Reproducibility Reproducibility is always a relevant topic. Within some business field, it is even a normative requirement. This video gives you one more help to ensure it. • Get to know RStudio project options • Discover which are the RStudio default options related to workspace saving and loading • Set the RStudio options for your project so to help you ensure reproducibility Pro Tip – Tweaking RStudio Project Options to Force Reproducibility Reproducibility is always a relevant topic. Within some business field, it is even a normative requirement. This video gives you one more help to ensure it ...

Additional information

Basic knowledge of R and R Studio, together with some knowledge of descriptive statistics, are key to getting the best out of this course

Exploratory Data Analysis with R

£ 10 + VAT