CS1678: Intro to Deep Learning, Spring 2021

Location: Virtual (Zoom link on Canvas)
Time: Tuesday and Thursday, 2:50pm-4:05pm
Instructor: Adriana Kovashka (email: kovashka AT cs DOT pitt DOT edu; use "CS1678" at the beginning of the subject line)
Office: Virtual (same Zoom link)
Office hours: Tuesday/Thursday, 9am-11am, 1-2pm
TA: Ahmad Diab (email: AHD23 AT DOT pitt DOT edu; use "CS1678" at the beginning of the subject line)
TA's office hours: Monday 10am-11am, Wednesday 5pm-6pm, Friday 5pm-6pm
TA's office: Virtual (Zoom link on Canvas)

Overview

Course description: This course will cover the basics of modern deep neural networks. The first part of the course will introduce neural network architectures, activation functions, and operations. It will present different loss functions and describe how training is performed via backpropagation. In the second part, the course will describe specific types of neural networks, e.g. convolutional, recurrent, graph networks and transformers, as well as their applications in computer vision and natural language processing. The course will also briefly discuss reinforcement learning and unsupervised learning, in the context of neural networks. In addition to attending lectures and completing bi-weekly homework assignments, students will also carry out and present a project.

Prerequisites: Math 220 (Calculus I), Math 280 or 1180 (Linear Algebra), CS 1501 (Algorithm Implementation)

Piazza: Sign up for it here. Please use Piazza rather than email so everyone can benefit from the discussion-- you can post in such a way that only the instructor sees your name. Please try to answer each others' questions whenever possible. The best time to ask the instructor or TA questions is during office hours.

Programming language/framework: We will use Python, NumPy/SciPy, and PyTorch.

Textbooks: We will have required readings from the following textbook. There will also be additional readings from published papers. The following textbooks are useful as additional reference: [top]

Policies

Grading

Grading will be based on the following components:

Assignment Submission Mechanics

Homework, project reports and presentations slides are due at 11:59pm on the due date. You will submit your homework using Canvas, under "Assignments" and the corresponding homework ID. You should submit a single zip file with source/results/report/slides, as requested. Name the file YourFirstName_YourLastName.zip. Please comment your code! Grades will be posted on Canvas.

Note that Canvas will also contain an automatically computed running average column that you can use to gauge how you're doing in the class based on grades that are already available. Generally, Overall scores over 90% map to some type of A, over 80% to B, and over 70% to C.

Exams and Quizzes

There will be no exams. However, we will have almost daily low-stakes quizzes to be completed after class (open-book).

Participation

Students are expected to regularly attend the class lectures, and should actively engage in in-class discussions. Attendance will not be taken, but keep in mind that if you don't attend, you cannot participate. You can actively participate by, for example, responding to the instructor's or others' questions, asking questions or making meaningful remarks and comments about the lecture, answering others' questions on Piazza, or bringing in relevant articles you saw in the news. The grading rubric will be as follows: 1 = you attended infrequently, 2 = you attended frequently but did not speak in class, 3 = you attended frequently and spoke a few times, 4 = you participated frequently, 5 = you participated every other week or more.

Homework Late Policy

For homework assignments only (not for project components), you get 3 "free" late days counted in minutes, i.e., you can submit a total of 72 hours late. For example, you can submit one homework 12 hours late, and another 60 hours late. The 72-hour "budget" is total for all assignments, NOT per assignment. Once you've used up your free late days, you will incur a penalty of 25% from the total assignment credit possible for each late day. A late day is anything from 1 minute to 24 hours.

Collaboration Policy and Academic Honesty

You will do your work (homework and quizzes) individually. The work you turn in must be your own work. You are allowed to discuss the assignments with your classmates, but do not look at code they might have written for the assignments, or at their written answers. You are also not allowed to search for code on the internet, use solutions posted online unless you are explicitly allowed to look at those. When in doubt about what you can or cannot use, ask the instructor! Plagiarism will cause you to fail the class and receive disciplinary penalty. Please consult the University Guidelines on Academic Integrity. All project components involve group work.

Note on Disabilities

If you have a disability for which you are or may be requesting an accommodation, you are encouraged to contact both your instructor and Disability Resources and Services (DRS), 140 William Pitt Union, (412) 648-7890, drsrecep@pitt.edu, (412) 228-5347 for P3 ASL users, as early as possible in the term. DRS will verify your disability and determine reasonable accommodations for this course.

Note on Medical Conditions

If you have a medical condition which will prevent you from doing a certain assignment, you must inform the instructor of this before the deadline. You must then submit documentation of your condition within a week of the assignment deadline.

Statement on Classroom Recording

To ensure the free and open discussion of ideas, students may not record classroom lectures, discussion and/or activities without the advance written permission of the instructor, and any such recording properly approved in advance can be used solely for the student's own private use.

[top]

Project

Please enter your team information and topic here when you submit the project proposal.

The project is expected to be a new method for an existing problem or an application of techniques we studied in class (or another method) to a new problem that we have not discussed in class. Below are some tips: Rules and logistics: Rubric for project proposal (10 points):
  1. What do you propose to do?
  2. What have others attempted in this space, i.e. what is the relevant literature?
  3. Why is what you are proposing interesting?
  4. Why is it challenging?
  5. Why is it important?
  6. What data do you plan to use?
  7. What is your high-level idea of how your method will work?
  8. In what ways is this method novel?
  9. How will you evaluate the method, i.e. what metrics are you going to use, and what baselines are you going to compare to?
  10. Give a (1) conservative and (2) an ambitious schedule of milestones for your project.
Rubric for first report (40 points):
  1. What is the prior work in this space? Describe in detail (2-3 pages, in your own words) (1-10 points).
  2. How is your work novel, in the context of this prior work? (1-5)
  3. What is your proposed approach? Describe in detail (3-4 pages). (1-10)
  4. Why should your approach work well for this task? (1-5)
  5. In what ways is your proposed approach ambitious? (1-5)
  6. How do you plan to evaluate your method, against what baselines? (It's possible these will have changed after the proposal stage.) (1-5)
Rubric for the second report (30 points):
  1. What progress have you made since the first report? Describe in detail (1-2 pages). (1-10)
  2. What challenges have you encountered along the way? (1-5)
  3. What are your next steps? Describe in detail (1-2 pages). (1-10)
  4. What metrics are you going to use to evaluate your method? What datasets are you going to use? What are your experimental results so far, if any? (1-5)
Rubric for presentation (50 points):
  1. How well did the authors (presenters) explain what problem they are trying to solve? (1-5 points)
  2. How well did they explain why this problem is important and/or challenging? (1-5)
  3. How clearly was prior work described? How well did the authors explain how their proposed work is different than prior work? (1-5)
  4. How clearly did the authors describe their proposed approach? (1-10)
  5. How novel and ambitious is the proposed approach? (1-5)
  6. How well did the authors describe their experimental validation? How informative were the figures used? (1-5)
  7. Were all/most relevant experimental settings (e.g. datasets, tasks) and baselines (competitor methods) included in the experimental validation? (1-5)
  8. To what extent is the performance of the proposed method satisfactory? (1-5)
  9. How informative were the conclusions the authors drew about their method’s performance relative to other methods? How sensible was the discussion of limitations? How interesting was the discussion of future work? (1-5)
Some potential sources of ideas: [top]

Schedule

aaaa
Date Topic Readings Lecture slides Due
1/19 Introduction, administrivia, ML overview/review DL Ch 1-5,
Bishop PRML Sec. 1.1,
Bishop PRML Sec. 7.1
pptx pdf
1/21
1/26
1/28 Neural network basics (tasks, operations, training) DL Ch 4.2, 4.3, 6,
Bishop Ch 5
pptx pdf exercise answers
2/2
2/4 proposal due
2/9 PyTorch tutorial (TA) [links on Canvas]
2/11
2/16 Training part 2 (alternative notation, optimization, properties, convergence) DL Ch 7, 8, 11 pptx pdf
2/18 HW1 due
2/25
3/2 Convolutional neural networks
(architectures, visualization, applications)
DL Ch 9 pptx pdf report 1 due
3/4
3/9
3/11 Recurrent neural networks (architectures, training, applications) DL Ch 10, RNN blog 1, RNN blog 2, RNN vis pptx pdf
3/16
3/18 HW2 due
3/23 Transformers (self-attention in language and beyond) Mikolov NIPS 2013, Vaswani NIPS 2017, Devlin NAACL 2019, Illustrated Transformer and BERT, and Dosovitskiy ICLR 2021, Lu NeurIPS 2019 pptx pdf
3/25
3/30 report 2 due
4/1 Advanced topics (graph networks, unsupervised and reinforcement learning, generation, ethics) Kipf GCNs, Doersch ICCV 2015, Karpathy RL blog, Mnih NIPS-W 2013, Goodfellow NIPS 2014, Bolukbasi NIPS 2016, Hendricks ECCV 2018 pptx pdf
4/6
4/8 HW3 due
4/13
4/15 Project presentations
4/20 participation notes
4/22 slides due

[top]

Resources

This course was inspired by the following courses: Some computer vision datasets: Some code and frameworks of interest: [top]