Deep learning has given a huge boost to the already rapidly developing field of computer vision. With deep learning, many new applications of computer vision techniques have been introduced and are now becoming part of our everyday lives. These include face recognition and indexing, photo stylization, and machine vision in self-driving cars.
The goal of this course is to introduce students to computer vision, starting from the basics and then turning to more modern deep learning models. We will cover both image and video recognition, including image classification and annotation, object recognition and image search, various object detection techniques, motion estimation, object tracking in video, human action recognition, and finally image stylization, editing, and new image generation. In the course project, students will learn how to build a face recognition and manipulation system to understand the internal mechanics of this technology, probably the best-known example of computer vision and AI and the one most often shown in movies and TV shows.

From the lesson

Object detection

This week, we focus on the object detection task, one of the central problems in vision. We start by recalling the conventional sliding window + classifier approach, culminating in the Viola-Jones detector. Tracing the development of deep convolutional detectors up to the present, we consider R-CNN and single-shot detector models. Practice includes training a face detection model using a deep convolutional neural network.
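
To make the sliding window + classifier idea concrete, here is a minimal sketch in plain Python. It is not taken from the course materials. The names `sliding_windows`, `crop`, `detect`, and `mean_brightness` are hypothetical, and the "classifier" is a toy brightness score standing in for a trained model such as a boosted cascade or a CNN.

```python
from typing import Callable, Iterator, List, Tuple

Box = Tuple[int, int, int, int]  # (x, y, width, height)
Image = List[List[float]]        # grayscale image as rows of pixel values


def sliding_windows(img_w: int, img_h: int, win: int, stride: int) -> Iterator[Box]:
    """Yield every win x win box that fits inside the image, row by row."""
    for y in range(0, img_h - win + 1, stride):
        for x in range(0, img_w - win + 1, stride):
            yield (x, y, win, win)


def crop(image: Image, box: Box) -> Image:
    """Cut the patch covered by the box out of the image."""
    x, y, w, h = box
    return [row[x:x + w] for row in image[y:y + h]]


def detect(
    image: Image,
    classifier: Callable[[Image], float],
    win: int,
    stride: int,
    threshold: float,
) -> List[Tuple[Box, float]]:
    """Score every window and keep those at or above the threshold."""
    img_h, img_w = len(image), len(image[0])
    detections = []
    for box in sliding_windows(img_w, img_h, win, stride):
        score = classifier(crop(image, box))
        if score >= threshold:
            detections.append((box, score))
    return detections


def mean_brightness(patch: Image) -> float:
    """Toy stand-in for a trained classifier (assumption): mean pixel value."""
    total = sum(sum(row) for row in patch)
    return total / (len(patch) * len(patch[0]))


# A 6x6 dark image with a bright 2x2 square whose top-left corner is (2, 2).
toy_image = [[0.0] * 6 for _ in range(6)]
for yy in (2, 3):
    for xx in (2, 3):
        toy_image[yy][xx] = 1.0

hits = detect(toy_image, mean_brightness, win=2, stride=2, threshold=0.9)
print(hits)
```

A practical detector would also scan the image at several scales and merge overlapping detections, for example with non-maximum suppression. Both steps are omitted here to keep the sketch short.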