MSc in Data Science

Course

In London

Price on request

Description

  • Type

    Course

  • Location

    London

  • Duration

    1 Year

  • Start date

    Different dates available

The MSc in Data Science will provide you with the technical and practical skills to analyse the big data that is the key to success in future business, digital media and science. The rate at which we are able to create data is rapidly accelerating. According to IBM, globally, we currently produce over 2.5 quintillion bytes of data a day. This ranges from biomedical data to social media activity and climate monitoring to retail transactions. These enormous quantities of data hold the keys to success across many domains from business and marketing to treating cancer or mitigating climate change. The pace at which we produce data is rapidly outstripping our ability to analyse and use it. Science and industry are crying out for a new generation of data scientists who combine the statistical skills of data analysis and the computational skills needed to carry out this analysis on a vast scale. The MSc in Data Science provides you with these skills. Studying this Masters, you will learn the mathematical foundations of statistics, data mining and machine learning, and apply these to practical, real world data. As well as these statistical skills, you will learn the computational techniques needed to efficiently analyse very large data sets. You will apply these skills to a range of real world data, under the guidance of experts in that domain. You will analyse trends in social media, make financial predictions and extract musical information from audio files. The degree will culminate in a final project in which you will you can apply your skills and follow your specialist interests. You will do a novel analysis of a real world data of your choice. The programme includes: A firm grounding in the theory of data mining, statistics and machine learning. Hands-on practical real world applications such as social media, biomedical data and financial data with Hadoop (used by Yahoo!, Facebook, Google, Twitter, LinkedIn, IBM,

Facilities

Location

Start date

London
See map
New Cross, SE14 6NW

Start date

Different dates availableEnrolment now open

About this course

You should have an undergraduate degree of at least upper second class standard in computing, physics and engineering, mathematical sciences or finance, and an interest in and capability for working in interdisciplinary contexts. In exceptional circumstances, outstanding practitioners or individuals with strong commercial experience may be considered. International qualifications We accept a wide range of international qualifications.

Questions & Answers

Add your question

Our advisors and other users will be able to reply to you

Who would you like to address this question to?

Fill in your details to get a reply

We will only publish your name and question

Reviews

Subjects

  • Visualisation
  • Computational
  • Biomedical
  • Climate
  • Programming
  • Financial Training
  • Industry
  • Systems
  • Planning
  • Project
  • Financial
  • Design
  • Statistics
  • Data analysis
  • Networks
  • Data Mining
  • Artificial Intelligence
  • Computing
  • IT
  • Social Media
  • Media
  • Skills and Training

Course programme

What you'll study You will study the following core modules: Module title Credits. Machine Learning & Statistical Data Mining Machine Learning & Statistical Data Mining 30 credits Introduces you to the areas of Machine Learning and Data Analysis and Mining. The module focuses on medium to advanced methods and techniques which are mostly used in data analytics. Real world applications are addressed by modelling and implementing applications encountered in business (e.g. customer analytics, credit scoring, financial forecasting), in health and medical sectors (e.g. automatic diagnosing, bioinformatics, genetic data mining), in unstructured data analysis (e.g. text and web mining applied in sentiment analysis and opinion mining), etc. Specialised software including R, IBM SPSS Statistics and RapidMiner are used to develop these applications. You are exposed also to current research topics in Data Mining or interdisciplinary research in which data analysis plays an essential role. 30 credits. Big Data Applications Big Data Applications 15 credits An in-depth study of scalable solutions to manage, process and analyse Big Data on servers, clusters of computers or on the cloud. In particular you will study Big Data computing approaches, trends and technologies as Apache Hadoop based on the MapReduce scalable computing approach, HBase database system and NoSQL technologies, Hive data warehouse system, Pig Latin for productively creating large scale data applications, Mahout for scalable machine learning, and scripting languages as Python currently used within Big Data processing. You will be exposed to and develop various types of Big Data applications including social network mining, reality mining, mobile phone large data analysis, intelligent web, etc. 15 credits. Data Programming Data Programming 15 credits This module introduces programming for Data Science, concentrating primarily on the tools and techniques that are key to achieve results quickly. The module covers current programming languages and environments commonly used in the wider Data Science community, along with ancillary tools and software systems, and gives the student the foundational skills to allow them to develop data-related software for their specific areas of interest. The knowledge and skills in this course cover the following general areas: Data representations: basic data types, comma-separated variables, Xtensible Markup Language, JavaScript Object Notation, Resource Description Framework, Relations; Data acquisition, storage, retrieval and publication: filesystems, version control, network programming, HTTP, Web servers, relational database systems; Data programming: string processing, numeric vector processing, data frames, scripting and statistical programming; Visualizations: automatically generating charts, graphs, and choropleth maps. Tutor: Dr Sorrel Harriet 15 credits. Data Science Research Topics Data Science Research Topics 15 credits The module introduces students to research topics related to Data Science and Computational Social Sciences, and to modern applications of Machine Learning, Data Mining, Computational Statistics and Data Analysis, Statistical Computing, Forecasting, Big Data Management and Analytics, and Soft Computing in industry. In particular students are given talks by Data Science and Computing industry professionals and by academics whose monodisciplinary or interdisciplinary research in areas as Computer Science, Sociology, Psychology, Bioinformatics, Biomedical Statistics and other disciplines is based or involves Data Analysis. The module will guide students' work in exploring a research theme of interest, as a preliminary phase preparing them for the final project. The module will expose students also to learning techniques of conducting research work and of writing scientific reports. Moreover students will be taught approaches for devising and optimising their strategies adapted to their skills in seeking and securing an employment, whether in research or the industry sector. The module will also expose students to a discussion of the ethical and legal issues related to the area of Data Science and its applications. 15 credits. Final Project in Data Science Final Project in Data Science 60 credits Students undertake substantial independent research projects that will allow them to demonstrate project planning and management skills, research skills, and written and oral presentation skills. You will integrate your knowledge and skills acquired in previous modules to implement a final project in Data Science or related interdisciplinary topics. The work will consist of a combination of research and highly applicative elements in various proportions. Students will have the opportunity to join research groups at Goldsmiths whose work is within the Data Science field, or is interdisciplinary and makes use of methodologies from Data Analysis or other components of Data Science as instruments of research. Project work can also be developed in industry as an internship. 60 credits. You will also choose from an anually approved list of modules which may include: Module title Credits. Artificial Intelligence Artificial Intelligence 15 credits Introduces the essential principles of artificial intelligence as part of computer science. The emphasis is on heuristic problem solving methods. Material includes: heuristic search techniques, knowledge representation, rule-based systems for deductive problem solving, search-based planning, and inductive machine learning. The heuristic techniques covered are: depth-first search, breath-first search, iterative deepening, bidirectional search, hill climbing, and adversarial search. Guidelines are provided for implementing practical expert systems, planning systems, and empirical learning systems with version spaces using the candidate elimination algorithm. 15 credits. Interaction Design Interaction Design 15 credits This module provides you with advanced skills in designing interactive systems and an in-depth understanding of emerging practico-theoretical developments in interaction design. The module is delivered as a series of workshops, lectures and seminars where you're introduced to a range of key technical skills for making interactive platforms, and develop an understanding of the role of prototyping though the embedding of technical work in the pursuit of a series of design briefs. You'll be able to then use these technologies in your projects, and develop an understanding of the roles of software and hardware development. 15 credits. Neural Networks Neural Networks 15 credits Introduces the theory and practice of neural computation. Covers the principles of neurocomputing with artificial neural networks widely used for addressing real-world problems such as classification, regression, pattern recognition, data mining, time-series prediction. We look at supervised and unsupervised learning. We study supervised learning using linear perceptrons, and non-linear models such as probabilistic neural networks, multilayer perceptrons, and radial-basis function networks. Unsupervised learning is studied using Kohonen networks. We provide contemporary training techniques for all these neural networks, and knowledge and tools for the specification, design, and practical implementation of neural networks. Tutor: Dr Nikolay Nikolaev 15 credits. Interactive Data Visualisation Interactive Data Visualisation 15 credits A large amount of data is available in electronic resources, both offline and online. This module gives a broad introduction to techniques for gathering data from electronic sources, such as databases and the internet. It covers both fundamental ideas and the use of some of the most important currently available tools. The course also presents tools and ideas for more effectively using the internet to communicate, visualise and generate news stories. 15 credits. Introduction to Natural Language Processing Introduction to Natural Language Processing 15 15. Geometric Data Analysis Geometric Data Analysis 15 credits This module focuses on the geometry and statistical processing underpinning component analysis methods, including numerical and score data and other quantitative data, as well as qualitative measurements and textual data. Classification, (hierarchical clustering, partitioning and discriminant analysis) are also relevant. The module also looks at the interpretation of results, graphical displays and other outputs, and practical implementation using the open source and universally supported R statistical and visualisation environment. 15 credits. Download the programme specification for the 2018-19 intake. If you would like an earlier version of the programme specification, please contact the Quality Office. Please note that due to staff research commitments not all of these modules may be available every year.

MSc in Data Science

Price on request