Announcements
The final exam will take place on Monday, December 14, at 12pm, in the usual classroom.Homework 5 has been released, and is due at 11:59pm on December 11.
[top]
Overview
Course description: In this class, students will learn the basics of modern computer vision. The first major part of the course will cover fundamental concepts such as image formation, color perception, image filtering, edge detection, texture description, feature extraction and matching, and grouping and fitting. A crash course in Machine Learning will follow, in preparation for the second course chapter on visual recognition. We will study state of the art approaches in object and scene recognition, activity recognition and first-person video, attribute-based description, image retrieval, unsupervised learning, and learning from big data. Finally, we will discuss a few newly introduced topics from the most recent computer vision conferences. The course format will include lectures, homework and exams.Prerequisites: CS1501, CS1502
Textbooks:
- Computer Vision: Algorithms and Applications by Richard Szeliski (available for free on author's page)
- Visual Object Recognition by Kristen Grauman and Bastian Leibe (accessible for free from campus)
Policies
Grading
Grading will be based on the following components:- Homework (5 problem sets) (50%)
- Midterm exam (20%)
- Final exam (20%)
- Participation (10%)
Homework Submission Mechanics
You will submit your homework using CourseWeb. Navigate to the CourseWeb page for CS1699, then click on "Assignments" (on the left) and the corresponding homework number. Attach, in a single zip file, your written responses and code. Name the file as YourFirstName_YourLastName.zip or YourFirstName_YourLastName.tar. Homework is due at 11:59pm on the due date.Exams
There will be one in-class midterm exam, and a final exam (December 14 at 12pm, in the regular classroom) which will focus on material from the latter part of the course.Attendance, in-class participation and discussion
Students are expected to regularly attend the class lectures, and should actively engage in in-class discussions. Your participation grade will be based on what fraction of lectures you attended, and how actively you participated in class. You have two "free" absences that you will not affect your participation grade. If you are absent beyond these, please let me know and explain. You can actively participate by, for example, responding to the instructor's or others' questions, asking questions or making meaningful remarks and comments about the lecture. You are also encouraged to bring in relevant articles you saw in the news.Late Policy
You get 3 "free" late days, i.e., you can submit homework a total of 3 days late. For example, you can submit one problem set 12 hours late, and another 60 hours late. Once you've used up your free late days, you will incur a penalty of 25% from the total project credit possible for each late day. A late day is anything from 1 minute to 24 hours.Collaboration Policy and Academic Honesty
You will do your work (exams and homework) individually. The work you turn in must be your own work. You are allowed to discuss the problem sets with your classmates, but do not look at code they might have written for the problem sets. You are also not allowed to search for code on the internet, use solutions posted online unless you are explicitly allowed to look at those, or to use Matlab's implementation if you are asked to write your own code. When in doubt about what you can or cannot use, ask the instructor! Plagiarism will cause you to fail the class and receive disciplinary penalty. Please consult the University Guidelines on Academic Integrity.Note on Disabilities
If you have a disability for which you are or may be requesting an accommodation, you are encouraged to contact both your instructor and Disability Resources and Services (DRS), 140 William Pitt Union, (412) 648-7890, drsrecep@pitt.edu, (412) 228-5347 for P3 ASL users, as early as possible in the term. DRS will verify your disability and determine reasonable accommodations for this course.Note on Medical Conditions
If you have a medical condition which will prevent you from doing a certain assignment or coming to class, you must inform the instructor of this before the deadline. You must then submit documentation of your condition within a week of the assignment deadline.Statement on Classroom Recording
To ensure the free and open discussion of ideas, students may not record classroom lectures, discussion and/or activities without the advance written permission of the instructor, and any such recording properly approved in advance can be used solely for the student's own private use.[top]
Schedule
Date | Topic | Readings | Lecture slides | Due |
9/01 | Introduction | [pptx] [pdf] | ||
9/03 | Matlab tutorial; Filters | Szeliski Sec. 1.1-1.2 | [pptx] [pdf] | HW1 out |
9/08 | Filters (cont'd) and texture | Szeliski Sec. 3.1.1, 3.2, 10.5 | [pptx] [pdf] | |
9/10 | Texture and other uses of filters | Szeliski Sec. 3.1.1, 3.2, 10.5 | [pptx] [pdf] | |
9/15 | Edges and binary images | Szeliski Sec. 3.3.2-4, 4.2 | [pptx] [pdf] | |
9/17 | Interest points and feature description | Szeliski Sec. 4.1, Grauman/Leibe Ch. 1-3; SIFT paper by David Lowe | [pptx] [pdf] | HW2 out |
9/22 | Color | Szeliski Sec. 2.3.2 | [pptx] [pdf] | HW1 due |
9/24 | Segmentation and grouping | Szeliski Sec. 5.3-4 | [pptx] [pdf] | |
9/29 | Fitting models | Szeliski Sec. 2.1.1-2, 4.3.2; Grauman/Leibe Ch. 5 |
[pptx] [pdf] | |
10/01 | Matching features; Indexing and retrieval | Szeliski Sec. 14.3; Grauman/Leibe Ch. 4, 6 |
[pptx] [pdf] | |
10/06 | Projective transformations and image stitching | Szeliski Sec 2.1.5, 3.6.1, 7.1, 7.2 | [pptx] [pdf] | HW3 out |
10/08 | Epipolar geometry and stereo vision | (see above) | [pptx] [pdf] | HW2 due |
10/13 | Midterm review | [pptx] [pdf] | ||
10/15 | Midterm (in class) | |||
10/20 | No class due to Fall Break | |||
10/22 | Intro to machine learning and visual recognition | Grauman/Leibe Ch. 7, Sec. 8.1 | [pptx] [pdf] | HW4 out |
10/27 | Intro to machine learning and visual recognition (cont'd) | Chapter 1 from Bishop's PRML | [pptx] [pdf] | |
10/29 | Support vector machines | Section 7.1 from Bishop's PRML | [pptx] [pdf] | |
11/03 | Bias-variance trade-off; Other models and problems | [pptx] [pdf] | HW3 due | |
11/05 | Interactive image search with attributes | WhittleSearch: Image Search with Relative Attribute Feedback. (A. Kovashka et al., CVPR 2012) | [pptx] [pdf] | |
11/10 | Detection I: Faces | Szeliski Sec. 14.1, Grauman/Leibe Sec. 11.1 | [pptx] [pdf] | |
11/12 | Detection II: Deformable part models | Szeliski Sec. 14.4, Grauman/Leibe Sec. 8.2, 9, 10.3.3, 11.2,3,5 | [pptx] [pdf] | |
11/17 | Detection III: Understanding detection failures | [pptx] [pdf] | ||
11/19 | Human factors (crowdsourcing and saliency) | Grauman/Leibe Sec. 10.1, Peekaboom: A Game for Locating Objects in Images. (L. von Ahn et al., CHI 2006) | [pptx] [pdf] | |
11/24 | Active learning Unsupervised learning (extra slides, not discussed in class) |
Visual Recognition with Humans in the Loop. (S. Branson et al., ECCV 2010) | [pptx] [pdf] | HW4 due; HW5 out |
11/26 | Thanksgiving break | |||
12/01 | Deep learning | Andrej Karpathy's class notes (also see Module 1 here) |
[pptx] [pdf] | |
12/03 | Tracking | Szeliski Sec. 4.1.4, 8.4, 12.6.4 | [pptx] [pdf] | |
12/08 | Human pose and actions | Recognizing Activities of Daily Living in First-Person Camera Views. (H. Pirsiavash and D. Ramanan, CVPR 2012) | [pptx] [pdf] | |
12/10 | Final review | [pptx] [pdf] | HW5 due | |
12/14 | Final exam (12pm, SenSq 5502) |
[top]
Resources
This course was inspired by the following courses:- Computer Vision by Kristen Grauman, UT Austin, Spring 2011
- Computer Vision by Derek Hoiem, UIUC, Spring 2015
- Matlab tutorial
- Linear algebra review by Fei-Fei Li
- Brief machine learning intro by Aditya Khosla and Joseph Lim
- Resources list (including code and data, tutorials, and other related courses) compiled by Devi Parikh
- Microsoft COCO (Common Objects in Context)
- ImageNet
- SUN Database
- Animals with Attributes
- Caltech-UCSD Birds 200
- INRIA Movie Actions
- Recognition datasets list compiled by Kristen Grauman
- Human activity datasets list compiled by Chao-Yeh Chen