Data Engineering MSc
Master
In Dundee
Description
-
Type
Master
-
Location
Dundee (Scotland)
-
Duration
12 Months
-
Start date
September
This course will give you the skills you’ll need to succeed as a Data Engineer or Data Scientist, taking you from techniques to handle Big Data through to analysing and visualising this data to give meaningful insights. Subjects include:
Big Data Theory and Practice, how do we manage the volume, velocity and variety of big data.
NoSql Databases including Cassandra, Neo4j, Mongodb and a host of others.
Parallel data analysis (Hadoop, Spark)
Machine learning and data mining
Languages for Data Engineering (Python, R, Matlab etc)
Devops and Microservices for deploying Big Data solutions to the cloud.
Facilities
Location
Start date
Start date
About this course
Cloud and web based industries that handle large volumes of fast moving data that need to be stored, analysed and maintained. Examples include the publishing industry (paper, TV and internet), messaging services, data aggregators and advertising services
Internet of Things. A large amount of data is being generated by devices (robotic assembly lines, home power management, sensors etc.) all of which needs to be stored and analysed.
Health. The NHS (and others) are starting to store and analyse patient data on an unprecedented scale. The healthcare industry is also combining data sources from a large number of databases to improve patient well-being and health outcomes
Games industry. The games industry records an extraordinary amount of data about its customers' play activities, all of which needs to be stored and analysed. This course will equip students with the knowledge and skill to engage with the industry
Applicants should normally have an Honours degree at 2.1 level or above in computing or a related subject. All prospective students need to undergo a technical interview to ensure they have the necessary background knowledge, including Mathematics, to undertake the course.
Reviews
This centre's achievements
All courses are up to date
The average rating is higher than 3.7
More than 50 reviews in the last 12 months
This centre has featured on Emagister for 14 years
Subjects
- Data Mining
- Engineering
- Programming
- Professor Training
- Machine Learning
- Programming languages
- Data Engineering
- Languages
- Big Data
- Computer vision
- Devops and Microservices
Course programme
Semester 1
- Introduction to Data Mining and Machine Learning part 1
- Programming Languages for Data Engineering part 1
Options (dependent on experience)
- Big Data
- Computer Vision
- Devops and Microservices
- Secure Internet programming
- Technology Innovation Management
Semester 2
- Introduction to Data Mining and Machine Learning part 2
- Programming Languages for Data Engineering part 2
- Business Intelligence Systems
- Research Methods
Semester 3
- Research project with optional industrial collaboration
Course content
Each module on the course is designed to give the student the skills and understanding they need to succeed in the Data Engineering/ Science field. Content on the course includes (but is not limited to):
- CAP theorem
- Lamda Architecture
- Cassandra, Neo4j and other nosql databases
- The Storm distributed real time computation system
- Hadoop, HDFS, MapReduce, and other Hadoop/SQL technologies
- Spark framework
- Data Engineering languages such as Python, erlang, R, Matlab
- Continuous deployment and delivery to the cloud.
- Microservices using containers to deliver reliable infrastructure.
- Vision systems, which are becoming increasingly important in data engineering for extracting features from large quantities of images such as from traffic, medical and industrial
- RDBMS systems which will continue to play an important role in data handing and storage. You will be expected to research the history of RDMBS and delve in to the internals of modern systems
- OLAP cubes and Business Intelligence systems, which can be the best and quickest way to extract information from data stores
- Goals of machine learning and data mining
- Clustering: K-means, mixture models, hierarchical
- Dimensionality reduction and visualisation
- Inference: Bayes, MCMC
- Perceptrons, logistic regression, neural networks
- Max-margin methods (SVMs)
- Mining association rules
- Bayesian networks
The course will be taught by staff of Computing.Depending on the modules you take this will include
- Andy Cobley
- Dr Keith Edwards
- Professor Mark Whitehorn
- Professor Stephen McKenna
- Professor Manuel Trucco
- Professor Chris Reed
- Dr Jianguo Zhang.
The course is assessed through a combination of
- examinations
- coursework
- presentations
- interviews.
Each module is different: for instance the Big data module has 40% coursework, consisting of Erlang programming and a presentation on nosql databases, along with an examination worth 60%.
Additional information
Data Engineering MSc