|
CS 2731 / ISSP 2230 / INFSCI 2420: Introduction to Natural Language Processing (Fall 2019)
|
|
| Professor |
Dr. Diane Litman (5105 Sensq) |
| TA |
Ravneet Singh (5501 Sensq) |
| When & Where | Tuesday and Thursdays 1:00-2:15, SENSQ 5313 |
| Office Hours | Litman: Tu 4:30-5:30, Th 2:15-3:30, by appointment; Singh: M-Th 9-10:30 |
| Description | This course provides an introduction to the field of natural
language processing - the creation of programs
that can understand, generate, and learn languages used by
humans. It will expose students to applications such as
chatbots by means of
computational techniques including search algorithms, dynamic
programming, hidden markov models, probalistic context free
grammars, and machine learning algorithms.
Prerequisites: CS 1501 (algorithms) OR consent of the instructor Textbook: Speech and Language Processing (3rd edition online draft - free!) |
| Required Work (tentative) | Homeworks (35%): written and programming Exams (35%): midterm and final Group Course Project (25%): presentation and written report Supplemental Research Papers (5%): leading discussion and class participation Late Penalty: For assignments that may be accepted late, the penalty is 10% per day up to 5 days including Saturday, Sunday, and holidays. Assignments are due by 11:59pm. |
|
Date/Topic
|
Textbook Readings
|
Assignments and Other Readings |
|
| August 27 Introduction (pdf) |
Ch 1 | xkcd Humor (credits to E. Riloff for this and links below) |
|
|
I. Words
|
|||
| August 29 Text Normalization (pdf) |
Ch 2 (2.1-2.4) | Unix for Poets, pages 1-19
regular-expressions.info Humor: plurals, sentence tokenization |
|
| September 3, 5, 10 Language Modeling with N-Grams (pdf) |
Ch 3 (3.1-3.4) | Humor
Do at home: come prepared to discuss 9/5 HW1: assigned 9/5, due 9/24 9/4 NY Times: A breakthrough for AI (really NLP) technology |
|
| September 10, 12, 17 Part-of-Speech Tagging (pdf) |
Ch 8 (8.1-8.4.6) |
Humor
Schoolhouse Rock for Conjunctions Do at home: come prepared to decode "Fish sleep" using Viterbi on 9/17 |
|
|
II. Syntax (NOTE: The syntax chapter numbers are all off by one as they changed in the Octobe 2 textbook update)
|
|||
| September 17, 19 Constituency Grammars (pdf) |
Ch 11 (through 11.5) | Humor | |
| September 19, 24 Constituency Parsing (pdf) |
Ch 12 | Humor
Do at home: come prepared to show the result of parsing "The old dog the footsteps of the young" using the provided grammar on 9/24 |
|
| September 24, 26 Statistical Constituency Parsing (pdf) |
Ch 13 (13.1-13.5, 13.8) | HW2: assigned 9/26, due 10/10
Do at home: come prepared to derive parse trees for the provided sentence and grammar using CKY | |
|
III. Machine Learning
|
|||
| October 1, 3 Naive Bayes Classification (and Sentiment) (pdf) |
Ch 4 (through 4.8) | Humor
Do at home: come prepared to answer | |
| October 3, 8 Logistic Regression (pdf) |
Ch 5 (5.1-5.2; concepts in 5.3-5.7) | 9/26 NY Times: At Tech's Leading Edge, Worry about a Concentration of Power (evaluation of ML) | |
| October 8, 10, 15 Representation Learning (and Vector Semantics) (sparse vectors) (pdf) |
Ch 6 |
10/14 New Yorker: The Next Word: Where will predictive text take us? Midterm Review (pdf) |
|
| October 17 | Midterm Exam (closed book) | Through 9/26 class (statistical parsing). No makeups. | |
| October 22 Representation Learning (and Vector Semantics) (dense vectors) (pdf) |
Ch 6 |
HW3: assigned 10/22, due 11/5 Word2Vec Tutorial |
|
| October 24 Neural Nets (and Language Models) Guest Lecturer: Ravneet Singh |
Ch 7 | Introduction to Story Cloze Project (Corpus Paper, Shared Task Paper) | |
|
IV. Semantics
|
|||
| October 29, 31 Information Extraction (pdf) |
Ch 18 (18.1-2) | From word2vec to
BERT
(Bidirectional Encoder Representations from Transformers)
DRAFT project description |
|
| October 31, November 5 Word Senses and WordNet (pdf) |
Ch 19 | humor1,
humor2 |
|
| November 7, 12 Word Sense Disambiguation (pdf) |
Ch 19 | Midterm discussion Contextual Word Representations: A Contextual Introduction |
|
| November 14, 19 Semantic Role Labeling (pdf) |
Ch 20 | 11/11 NY Times: A.I. Systems Echo Biases They're Fed, Putting Scientists on Guard | |
|
V. Discourse and Applications
|
|||
| November 19, 21 Coreference Resolution (pdf) |
Ch 22 | ||
| December 3 Dialog Systems and Chatbots (pdf) |
Ch 26 | Final Exam Review (pdf) | |
| December 5 | Final Exam (closed book) | Everything since midterm (i.e., not cumulative). No makeups. | |
| December 10 cancelled due to double class on December 12 |
|||
| December 12 (from 1-3:50!!) | Project Presentations | Attendance required. | |