Data Mining & Machine Learning in Finance
-
Now I amble to use Python and to explore data mining with the techniques I learned through this course. It was a great introduction.
← | →
-
This course has really helped me in understanding and learning Data Mining / Machine Learning in Finance - a good mix of code and theory.
← | →
-
Decent energetic method for instruction. Altogether different from my exceptionally specialized years in college.
← | →
Short course
In Singapore (Singapore), London, New York (USA) and another venue.
Learn the best techniques in order to mine financial data!
-
Type
Short course
-
Location
-
Duration
2 Days
The popularity of data science techniques such as data mining and machine learning has grown enormously in recent years. They present effective solutions to process and analyze the huge amount of data available to risk managers and financial analysts.
With the advances in computing power and distributed processing, it is now possible to process - and make sense of - the vast array of information that can be gathered from several different data sources.
This hands-on program covers key techniques - including several aspects of supervised and unsupervised machine learning - that can be used when mining financial data. The program also focuses on advanced data science techniques that are becoming widely used in financial markets for text analysis and artificial intelligence: Natural Language Processing (NLP) and Deep Learning (DL).
The program is delivered entirely through workshops and case studies. Participants will learn how to implement natural language processing techniques by building a sentiment analysis model to analyze text strings. In the deep learning section, participants will focus on the construction and testing of a neural network to solve a financial problem with the help of Python.
Most exercises and case studies are illustrated in Python, allowing you to learn how to work with this flexible programming language.
Facilities
Location
Start date
Start date
Start date
Start date
Start date
About this course
Portfolio managers
Risk managers
Professionals looking to introduce data-mining concepts in their day-to-day tasks
IT developers
Statisticians
Quant analysts
Financial Engineers
Basic notions of statistics
Good working knowledge of Excel
No prior knowledge of Python is required
Reviews
-
Now I amble to use Python and to explore data mining with the techniques I learned through this course. It was a great introduction.
← | →
-
This course has really helped me in understanding and learning Data Mining / Machine Learning in Finance - a good mix of code and theory.
← | →
-
Decent energetic method for instruction. Altogether different from my exceptionally specialized years in college.
← | →
Course rating
Recommended
Centre rating
Head of Quant Solutions
Anonymous
Anonymous
This centre's achievements
All courses are up to date
The average rating is higher than 3.7
More than 50 reviews in the last 12 months
This centre has featured on Emagister for 16 years
Subjects
- Financial
- Financial Training
- Programming
- IT risk
- Risk
- Finance
- Data Mining
- Clustering analysis
- OLS
- Sparsity
- Risk manager
- Ridge regression
- Financial Analysis
- Statisticians
- Quant analysts
- Finance Data Mining
- Data Mining in Finance
Teachers and trainers (1)
Jan De Spiegeleer
Teacher
Dr Jan De Spiegeleer is a co-Founder of RiskConcile a risk management advisory firm based in Lausanne. From 2007 till 2015 he was the head of risk management at Jabre Capital Partners, a Geneva-based hedge fund. He gained extensive knowledge of derivatives pricing, hedging and trading while working for KBC Financial Products in London, where he was managing director of the equity derivatives desk. Dr De Spiegeleer holds a Masters Degree in Civil Engineering (Royal Military Academy, Brussels - 1988), an MBA and a PhD in mathematics (KU Leuven - 1994 and 2013).
Course programme
- Association rules
- Classification vs. regression problems
- Clustering analysis
- Overview of third party solutions (Tableau, QlikeTech etc.) for visualization of large sets of data. Case studies will be worked out using matplotlib-library and plotly (open-source online data-collaboration platform)
- Graphical databases: applying network theory on portfolio analysis and introduction to graphical databases
- OLS (ordinary least squares)
- Ridge regression
- Sparsity
- Lasso
- Elastic Net
Workshop: Working out the optimal hedge of a large real-world equity portfolio using futures. The portfolio has a global nature (100+ shares) but only a limited set of futures is available
PCA- Principal component analysis of the term structure of interest rates and implied volatilities
- Principal component regression (PCR)
- Partial least squares (PLS)
Workshop: Using PCA to reduce dimensionality of a large data set of historical interest rate curves. The complex behaviour of this curve is spread over different maturities and this technique allows a risk manager to have a much better view of the dynamics of interest rate curves
Data classification – regression Kernel density estimation and classification- Kernel density estimation is an unsupervised learning procedure which leads to a simple family of procedures for non-parametric classification
Case study: Using kernels to derive probability distributions for financial data
Classification - part I- Naive Bayes classification: A straightforward and powerful technique to classify data
Case study: Working out a Bayes-predictor for a large data set containing different attributes of US banks. The Bayes classifier will be used to separate those banks that are likely to fail from those that are going to remain solvent
Classification - part II- Linear Discriminant Analysis (LDA)
- Logistic Regression
Case study: apply log-regression on a real-world dataset with high dimensionality
Day Two Data classification (cont.) Classification - part III- Classification Trees: CART-modelling leads to easy-to-use practical decision trees
- The concept of decision trees will be extended with techniques such as Random Forest and Bagging
Case study: Concepts such as cost functions, impurity levels, tree pruning and cross validation will be handled in detail
- Support Vector Machines (SVM)
- K-Nearest Neighbour learning
- Logistic Regression
Case study: The classification methods (SVM, K-Nearest and CART) are going to be put at work on different technical indicators (RSI, MACD etc.) of large sets of real-world financial data. This will illustrate how these classifiers can be used to partition stocks in different buckets according to the strength of different attributes in a fast way
Workshop: Data mining toolsAn introduction to Python - a powerful programming language. The applicability of Python in the domain of data analysis will be illustrated through practical examples with focus on machine learning using the 'scikit-learn' package. All examples will be covered in Jupyter-notebooks. Delegates will learn how to build custom reports in Python
Day Three
Natural Language Processing
Extracting real value from social media posts, images, email, PDFs and other sources of unstructured data is a big challenge for enterprises.
This section is devoted to the application of Natural Language Processing (NLP) to extract value from unstructured data. Several real-world examples of examining unstructured data in finance - including sentiment analysis of financial news - will be explored.
Workshop: Using the NLTK package of Python to:
- Explore and tokenize a text using Tf-Idf and Count Vectors
- Predict words in a text: building a word predictor starting from a text; writing a programme that can predict the word that follows a given word
- Understand the sentiment of a news item on a particular stock
Deep Learning
Deep Learning as a subfield of machine learning - Artificial Neural Networks (ANN) algorithms.
- Introduction to Deep Learning
- Forward propagation
- Word2vec approach
- Deeper networks and forward propagation
- Optimizing Neural Network with backward propagation
Case Study: Building a Deep Learning model with Python (with a focus on the Keras and Tensorflow packages)
Data Mining & Machine Learning in Finance