Name: Introduction to Computer Vision - Goergia Tech
Brand: Udacity

Description

Type

Course
Methodology

Online

Start date

Different dates available

This course provides an introduction to computer vision including fundamentals, methods for application and machine learning classification.

Facilities

Online

Start date

Different dates availableEnrolment now open

Questions & Answers

Add your question

Our advisors and other users will be able to reply to you

Who would you like to address this question to?

All
Students
Centre

Fill in your details to get a reply

We will only publish your name and question

Reviews

Subjects

Image
Camera
Geometry
Computer vision
Camera Models
Image Motion

Course programme

Intermediate

Approx. 4 months

Built by Join thousands of students Course Summary

This course provides an introduction to computer vision including fundamentals of image formation, camera imaging geometry, feature detection and matching, multiview geometry including stereo, motion estimation and tracking, and classification. We’ll develop basic methods for applications that include finding known models in images, depth recovery from stereo, camera calibration, image stabilization, automated alignment (e.g. panoramas), tracking, and action recognition. We focus less on the machine learning aspect of CV as that is really classification theory best learned in an ML course.

The focus of the course is to develop the intuitions and mathematics of the methods in lecture, and then to learn about the difference between theory and practice in the problem sets. All algorithms work perfectly in the slides. But remember what Yogi Berra said: In theory there is no difference between theory and practice. In practice there is. (Einstein said something similar but who knows more about real life?) In this course you do not, for the most part, apply high-level library functions but use low to mid level algorithms to analyze images and extract structural information.

Why Take This Course?

Images have become ubiquitous in computing. Sometimes we forget that images often capture the light reflected from a physical scene. This course gives you both insight into the fundamentals of image formation and analysis, as well as the ability to extract information much above the pixel level. These skills are useful for anyone interested in operating on images in a context-aware manner or where images from multiple scenarios need to be combined or organized in an appropriate way.

Prerequisites and Requirements

Data structures: You'll be writing code that builds representations of images, features, and geometric constructions.
A good working knowledge of Matlab and/or Python with NumPy. The lecture videos use Matlab for occasional demonstration because the instructor is too old to change. Problem sets will be done in Matlab or Python. As mentioned in the resources note below, you can use either Matlab or the open source version Octave.
This course has more math than many CS courses: Linear algebra, vector calculus, and linear algebra (that is not a typo).
No prior knowledge of vision is assumed though any experience with Signal Processing is helpful.

See the Technology Requirements for using Udacity.

Syllabus

A brief outline of units is given below, grouped into 10 parts:

1 Introduction

1A Introduction

2 Image Processing for Computer Vision

2A Linear image processing
2B Model fitting
2C Frequency domain analysis

3 Camera Models and Views

3A Camera models
3B Stereo geometry
3C Camera calibration
3D Multiple views

4 Image Features

4A Feature detection
4B Feature descriptors
4C Model fitting

5 Lighting

5A Photometry
5B Lightness
5C Shape from shading

6 Image Motion

6A Overview
6B Optical flow

7 Tracking

7A Introduction to tracking
7B Parametric models
7C Non-parametric models
7D Tracking considerations

8 Classification and Recognition

8A Introduction to recognition
8B Classification: Generative models
8C Classification: Discriminative models
8D Action recognition

9 Useful Methods

9A Color spaces and segmentation
9B Binary morphology
9C 3D perception

10 Human Visual System

10A The retina
10B Vision in the brain

GT OMSCS Students

Note: Please refer to your course website/schedule for further details, assignments, etc.

Spring 2015 resources (old):

Schedule: Suggested pace, assignments, deadlines, references.
Course website: Course information, problem sets, academic policies, grading scheme.
Piazza forum: Discussions, announcements, clarifications.
T-Square site: Problem set submissions.

Note: This course was previously offered as CS 4495.

See related categories

Contact us

Introduction to Computer Vision - Goergia Tech

Questions & Answers

Reviews

Subjects

Course programme

Add similar courses
and compare them to help you choose.

Introduction to Computer Vision - Goergia Tech

Questions & Answers

Reviews

Subjects

Course programme

Add similar coursesand compare them to help you choose.

Add similar courses
and compare them to help you choose.