CS 3730: Advanced Topics in Natural Language Processing

CS 3730/ISSP 3120 Advanced Topics in Natural Language Processing (Spring 2013)

Schedule

Class#	Date	Lead	Materials/Deadlines	Topics and Readings
1	1/08 (Tue)	Jan Wiebe	Course webpage	Welcome and Administration
2	1/10 (Thu)	Jan Wiebe	slides poster	Subjectivity and Sentiment Analysis: From Words to Discourse ... to Pragmatics Cem Akkaya, Janyce Wiebe, Alexander Conrad, and Rada Mihalcea (2011). Improving the Impact of Subjectivity Word Sense Disambiguation on Contextual Opinion Analysis. CoNLL 2011 Swapna Somasundaran and Janyce Wiebe (2009). Recognizing Stances in Online Debates. ACL-IJCNLP 2009.
3	1/15 (Tue)	Jan Wiebe	slides updated slides exercises exercise answers The MPQA corpus, subjectivity lexicon, and word-sense annotations can be downloaded from my website once CSSD goes live with our new page.	Subjectivity Annotation Sections 1.1-1.2, 2.0, 3.1, 7.1 of Theresa Wilson's Dissertation Sections 1 & 2 of Janyce Wiebe, Theresa Wilson, and Claire Cardie (2005). Annotating expressions of opinions and emotions in language. Language Resources and Evaluation, volume 39, issue 2-3, pp. 165-210
4	1/17 (Thu)	Lingjia	Commentaries start today; enter into NB by 2pm the day before. Bing Liu makes several datasets available at his sentiment page.	Product Review Mining Minqing Hu and Bing Liu. Mining and Summarizing Customer Reviews. KDD-2004. Kim, Soo-Min and Hovy, Eduard. Automatic Identification of Pro and Con Reasons in Online Reviews. COLING-ACL-2006.
5	1/22 (Tue)	Huy		Product Review Mining, continued Oscar Tackstrom and Ryan McDonald (2011). Discovering Fine-Grained Sentiment with Latent Variable Structured Prediction Models ECIR-2011. Ana-Maria Popescu and Oren Etzioni (2005). Extracting Product Features and Opinions from Reviews EMNLP-2005. Also, joint extraction. Optional: Yoonjung Barbosa, Luciano and Kumar, Ravi and Pang, Bo and Tomkins, Andrew (2009). For a few dollars less: Identifying review pages sans human labels. NAACL-2009.
6	1/24 (Thu)	Jan Wiebe	Introductory slides Measuring annotator agreement: a presentation by Ron Artstein	Annotation Studies Ron Artstein and Massimo Poesio. Inter-coder agreement for computational linguistics (survey article). Computational Linguistics 34(4): 555-596, 2008
7	1/29 (Tue)	Wencan	The Toprak et al. data may be downloaded from this page	Aspects and Review Objects Mukherjee, Arjun and Liu, Bing Aspect Extraction through Semi-Supervised Modeling. ACL-2012. Toprak, Cigdem and Jakob, Niklas and Gurevych, Iryna (2010) Sentence and Expression Level Annotation of Opinions in User-Generated Discourse. ACL-2010. Aspect-based annotations. Optional: Huy Dalvi, Nilesh and Kumar, Ravi and Pang, Bo and Tomkins, Andrew Matching Reviews to Objects using a Language Model EMNLP-2009.
8	1/31 (Thu)	Phuong		Aspects, continued Yu, Jianxing and Zha, Zheng-Jun and Wang, Meng and Chua, Tat-Seng (2011). Aspect Ranking: Identifying Important Product Aspects from Online Consumer Reviews. ACL-2011. Wei, Wei and Gulla, Jon Atle (2010). Sentiment Learning on Product Reviews via Sentiment Ontology Tree. ACL-2010. Optional: Jesse Brody, Samuel and Elhadad, Noemie (2010). An Unsupervised Aspect-Sentiment Model for Online Reviews NAACL-2010.
9	2/05 (Tue)	Fan		Identifying Holders and Targets/Topics Stoyanov, Veselin and Cardie, Claire (2008). Topic Identification for Fine-Grained Opinion Analysis COLING-2008. Choi, Yejin and Cardie, Claire and Riloff, Ellen and Patwardhan, Siddharth (2005). Identifying Sources of Opinions with Conditional Random Fields and Extraction Patterns. EMNLP-2005. Optional: Phuong Jakob, Niklas and Gurevych, Iryna (2010). Extracting Opinion Targets in a Single and Cross-Domain Setting with Conditional Random Fields EMNLP-2010.
10	2/07 (Thu)	Huy	Project meetings completed	Joint Extraction/Recognition Ivan Titov and Ryan McDonald (2008). A Joint Model of Text and Aspect Ratings for Sentiment Summarization ACL-2008. Zhao, Xin and Jiang, Jing and Yan, Hongfei and Li, Xiaoming (2010). Jointly Modeling Aspects and Opinions with a MaxEnt-LDA Hybrid EMNLP-2010. Optional: Huichao Ivan Titov and Ryan McDonald (2008). Modeling online reviews with multi-grain topic models. WWW-2008.
11	2/12 (Tue)	Yoonjung	The data from the Somasundaran et al. paper is available.	Joint Extraction/Recognition, continued Choi, Yejin and Breck, Eric and Cardie, Claire (2006). Joint Extraction of Entities and Relations for Opinion Recognition EMNLP-2006. Somasundaran, Swapna and Namata, Galileo and Wiebe, Janyce and Getoor, Lise (2009). Supervised and Unsupervised Methods in Employing Discourse Relations for Improving Opinion Polarity Classification EMNLP 2009. Optional: Huichao Du, Weifu and Tan, Songbo (2009). An Iterative Reinforcement Approach for Fine-Grained Opinion Mining ACL-2009.
12	2/14 (Thu)	Jesse	The data for Murray and Carenini is available here	Applications Carenini, Giuseppe and Ng, Raymond T. and Zhou, Xiaodong (2008). Summarizing Emails with Conversational Cohesion and Subjectivity ACL-2008. Murray, Gabriel and Carenini, Giuseppe (2009). Predicting Subjectivity in Multimodal Conversations. EMNLP-2009. Optional: Fan Minqing Hu and Bing Liu (2006). Opinion Extraction and Summarization on the Web. AAAI-2006.
13	2/19 (Tue)		Project proposals due. TAC 2008 Opinion Question Answering Track. For the Opinion Question Answering track, we would need a user agreement for the Blog06 data.	Applications, continued Oh, Jong-Hoon and Torisawa, Kentaro and Hashimoto, Chikara and Kawada, Takuya and De Saeger, Stijn and Kazama, Jun'ichi and Wang, Yiou (2012). Why Question Answering using Sentiment Analysis and Word Classes EMNLP-2012. This is also related to connotation. Paul, Michael and Zhai, ChengXiang and Girju, Roxana (2010). Summarizing Contrastive Viewpoints in Opinionated Text. EMNLP-2010. Optional: Lingjia Lerman, Kevin and McDonald, Ryan (2009). Sentiment Summarization: Evaluating and Learning User Preferences. EACL-2009.
14	2/21 (Thu)			Lab Class Subjectivity annotation, continued from Day 3, 1/15.
15	2/26 (Tue)	Huichao		Domain Adaptation Bollegala, Danushka and Weir, David and Carroll, John (2011). Using Multiple Sources to Construct a Sentiment Sensitive Thesaurus for Cross-Domain Sentiment Classification. ACL 2011. Blitzer, John and Dredze, Mark and Pereira, Fernando, Biographies, Bollywood, Boom-boxes and Blenders: Domain Adaptation for Sentiment Classification ACL-2007. Optional: Yoonjung He, Yulan and Lin, Chenghua and Alani, Harith (2011). Automatically Extracting Polarity-Bearing Topics for Cross-Domain Sentiment Classification ACL-2011.
16	2/28 (Thu)	Alex		Recognizing Stances, Arguments, and Viewpoints Swapna Somasundaran and Janyce Wiebe (2009). Recognizing Stances in Online Debates. ACL-IJCNLP 2009. Walker, M. and Anand, P. and Abbott, R. and Tree, J.E.F. and Martell, C. and King, J. (2011). That's your evidence?: Classifying Stance in Online Political and Social Debate. Decision Support Systems. 1-30. Elsevier. 2011 (based on their WASSA 2011 paper that won the best paper award). Optional: Fan Y. Lu and H. Wang and C. Zhai and D. Roth (2012). Unsupervised Discovery of Opposing Opinion Networks From Forum Discussions. CIKM-2012.
17	3/05 (Tue)	Alex		Recognizing Stances, Arguments, Viewpoints, continued. Park, Souneil and Lee, Kyung Soon and Song, Junehwa (2011). Contrasting Opposing Views of News Articles on Contentious Issues. ACL-2011. Abu-Jbara, Amjad and Dasigi, Pradeep and Diab, Mona and Radev, Dragomir (2011). Subgroup Detection in Ideological Discussions. ACL-2011. Optional: Wencan Lin, Wei-Hao and Hauptmann, Alexander (2006). Are These Documents Written from Different Perspectives? A Test of Different Perspectives Based on Statistical Distribution Divergence. ACL-2006.
18	3/07 (Thu)			Lab Class Argument data: Death Penalty (Baumgartner, Boef, Boydstun); ConvinceMe.net (Conrad).
19	3/19 (Tue)	Lingjia	See the bullet of "Data related to sentiment analysis" on Lillian Lee's data site	Graph-Based Approaches Bo Pang and Lillian Lee (2004). A Sentimental Education: Sentiment Analysis Using Subjectivity Summarization Based on Minimum Cuts ACL-04. Feng, Song and Bose, Ritwik and Choi, Yejin (2011). Learning General Connotation of Words using Graph-based Algorithms EMNLP-2011. This is also related to connotation. Optional: Wencan Li, Binyang and Zhou, Lanjun and Feng, Shi and Wong, Kam-Fai (2010). A Unified Graph Model for Sentence-Based Opinion Retrieval. ACL-2010.
20	3/21 (Thu)	Huichao	Project progress reports due	Phrase-Level Contextual Polarity Theresa Wilson, Janyce Wiebe, and Paul Hoffmann (2005). Recognizing Contextual Polarity in Phrase-Level Sentiment Analysis. EMNLP-2005. Agarwal, Apoorv and Biadsy, Fadi and Mckeown, Kathleen (2009). Contextual Phrase-Level Polarity Analysis Using Lexical Affect Scoring and Syntactic N-Grams. EACL 2009. Optional: Lingjia Socher, Richard, Huval, Brody, Manning, Christopher D. and Ng, Andrew Y. (2012). Semantic Compositionality through Recursive Matrix-Vector Spaces. EMNLP 2012.
21	3/26 (Tue)	Yoonjung		Phrase-Level Contextual Polarity, continued Yejin Choi and Claire Cardie (2008). Learning with Compositional Semantics as Structural Inference for Subsentential Sentiment Analysis. EMNLP-2008. Karo Moilanen and Stephen Pulman (2007). Sentiment Composition. RANLP 2007. Optional: Phuong Yessenalina, Ainur and Cardie, Claire (2011). Compositional Matrix-Space Models for Sentiment Analysis. EMNLP-2011.
22	3/28 (Thu)	Wencan		Conceptual Dependency Theory and Scripts Chapters 1-3 of Roger Schank and Robert Abelson (1977). Scripts, Plans, Goals and Understanding. Lawrence Erlbaum.
23	4/02 (Tue)	Phuong	Data created by Chambers and Jurafsky is available here. Slides about scripts (thanks to Ellen Riloff). Slides about Conceptual Dependency (thanks to Ellen Riloff).	Learning Narrative and Script Schemas Chambers, Nathanael and Jurafsky, Dan (2008). Unsupervised Learning of Narrative Event Chains ACL-2008. Regneri, Michaela and Koller, Alexander and Pinkal, Manfred (2010). Learning Script Knowledge with Web Experiments ACL-2010. Optional: Alex Chambers, Nathanael and Jurafsky, Dan (2009). Unsupervised Learning of Narrative Schemas and their Participants ACL-2009. Chambers, Nathanael and Jurafsky, Dan (2010). A Database of Narrative Schemas LREC-2010.
24	4/04 (Thu)	Fan		Plot Units Wendy Lehnert. Plot Units and Narrative Summarization. Cognitive Science 4 (1981). Goyal, Amit and Riloff, Ellen and Daume III, Hal (2012). A Computational Model for Plot Units. Computational Intelligence, 2012.
25	4/09 (Tue)			Lab Class Resurrecting "Old AI"
26	4/11 (Thu)	Jesse	I've requested the data annotated in Ganter and Strube. The CoNLL-2010 shared task was on detecting hedges and their scope. The data is available here.	Hedges Ganter, Viola and Strube, Michael (2009). Finding Hedges by Chasing Weasels: Hedge Detection Using Wikipedia Tags and Shallow Linguistic Features. ACL-IJCNLP-2009 short paper. Medlock, Ben and Briscoe, Ted (2007). Weakly Supervised Learning for Hedge Classification in Scientific Literature. ACL-2007. Optional: Huy Richard Farkas (2010). The CoNLL-2010 Shared Task (the overview paper at the beginning). CoNLL-2010.
27	4/16 (Tue)	Jan Wiebe	Project report draft due. Webpage for FactBank and the PragBank Extension. Data: the Stanford PragBank distribution. Access a subset of the original documents using the Browser here (otherwise, FactBank must be obtained from the LDC).	Factuality/Veridicality de Marneffe, Marie-Catherine; Christopher D. Manning; and Christopher Potts. 2012. Did it happen? The pragmatic complexity of veridicality assessment. Computational Linguistics 38(2): 301-333. Optional: Alex Prabhakaran, Vinodkumar and Rambow, Owen and Diab, Mona (2010). Automatic Committed Belief Tagging. COLING-2010.
28	4/18 (Thu)			Class is cancelled.
29	4/23 (Tue)		Final project reports due.	Project presentations: Jesse, Lingjia and Yoonjung, Fan and Wencan. 25 minutes each, including questions.
30	4/25 (Thu)			Project presentations: Alex and Phuong; Huy and Huichao. 25 minutes each, including questions.