CS 3730/ISSP 3120 Advanced Topics in Natural Language Processing (Spring 2013)



Class# Date Lead Materials/Deadlines Topics and Readings
1 1/08 (Tue) Jan Wiebe Course webpage Welcome and Administration
2 1/10 (Thu) Jan Wiebe slides
Subjectivity and Sentiment Analysis: From Words to Discourse
... to Pragmatics

Cem Akkaya, Janyce Wiebe, Alexander Conrad, and Rada Mihalcea (2011). Improving the Impact of Subjectivity Word Sense Disambiguation on Contextual Opinion Analysis. CoNLL 2011.

Swapna Somasundaran and Janyce Wiebe (2009). Recognizing Stances in Online Debates. ACL-IJCNLP 2009.

3 1/15 (Tue) Jan Wiebe slides
updated slides
exercise answers

The MPQA corpus, subjectivity lexicon, and word-sense annotations can be downloaded from my website once CSSD goes live with our new page.

Subjectivity Annotation

Sections 1.1-1.2, 2.0, 3.1, 7.1 of Theresa Wilson's Dissertation

Sections 1 & 2 of Janyce Wiebe, Theresa Wilson, and Claire Cardie (2005). Annotating expressions of opinions and emotions in language. Language Resources and Evaluation, volume 39, issue 2-3, pp. 165-210

4 1/17 (Thu) Lingjia Commentaries start today;
enter into NB by 2pm the day before

Bing Liu makes several datasets available at his sentiment page.

Product Review Mining

Minqing Hu and Bing Liu (2004). Mining and Summarizing Customer Reviews. KDD-2004.

Kim, Soo-Min and Hovy, Eduard (2006). Automatic Identification of Pro and Con Reasons in Online Reviews. COLING-ACL-2006.

5 1/22 (Tue) Huy Product Review Mining, continued

Oscar Tackstrom and Ryan McDonald (2011). Discovering Fine-Grained Sentiment with Latent Variable Structured Prediction Models. ECIR-2011.

Ana-Maria Popescu and Oren Etzioni (2005). Extracting Product Features and Opinions from Reviews. EMNLP-2005. Also, joint extraction.

Optional: Yoonjung

Barbosa, Luciano and Kumar, Ravi and Pang, Bo and Tomkins, Andrew (2009). For a few dollars less: Identifying review pages sans human labels. NAACL-2009.

6 1/24 (Thu) Jan Wiebe Introductory slides

Measuring annotator agreement: a presentation by Ron Artstein

Annotation Studies

Ron Artstein and Massimo Poesio. Inter-coder agreement for computational linguistics (survey article). Computational Linguistics 34(4): 555-596, 2008

7 1/29 (Tue) Wencan

The Toprak et al. data may be downloaded from *this page*

Aspects and Review Objects

Mukherjee, Arjun and Liu, Bing (2012). Aspect Extraction through Semi-Supervised Modeling. ACL-2012.

Toprak, Cigdem and Jakob, Niklas and Gurevych, Iryna (2010). Sentence and Expression Level Annotation of Opinions in User-Generated Discourse. ACL-2010. Aspect-based annotations.

Optional: Huy

Dalvi, Nilesh and Kumar, Ravi and Pang, Bo and Tomkins, Andrew (2009). Matching Reviews to Objects using a Language Model. EMNLP-2009.

8 1/31 (Thu) Phuong Aspects, continued

Yu, Jianxing and Zha, Zheng-Jun and Wang, Meng and Chua, Tat-Seng (2011). Aspect Ranking: Identifying Important Product Aspects from Online Consumer Reviews. ACL-2011.

Wei, Wei and Gulla, Jon Atle (2010). Sentiment Learning on Product Reviews via Sentiment Ontology Tree. ACL-2010.

Optional: Jesse

Brody, Samuel and Elhadad, Noemie (2010). An Unsupervised Aspect-Sentiment Model for Online Reviews. NAACL-2010.

9 2/05 (Tue) Fan Identifying Holders and Targets/Topics

Stoyanov, Veselin and Cardie, Claire (2008). Topic Identification for Fine-Grained Opinion Analysis. COLING-2008.

Choi, Yejin and Cardie, Claire and Riloff, Ellen and Patwardhan, Siddharth (2005). Identifying Sources of Opinions with Conditional Random Fields and Extraction Patterns. EMNLP-2005.

Optional: Phuong

Jakob, Niklas and Gurevych, Iryna (2010). Extracting Opinion Targets in a Single and Cross-Domain Setting with Conditional Random Fields. EMNLP-2010.

10 2/07 (Thu) Huy Project meetings completed Joint Extraction/Recognition

Ivan Titov and Ryan McDonald (2008). A Joint Model of Text and Aspect Ratings for Sentiment Summarization. ACL-2008.

Zhao, Xin and Jiang, Jing and Yan, Hongfei and Li, Xiaoming (2010). Jointly Modeling Aspects and Opinions with a MaxEnt-LDA Hybrid. EMNLP-2010.

Optional: Huichao

Ivan Titov and Ryan McDonald (2008). Modeling online reviews with multi-grain topic models. WWW-2008.

11 2/12 (Tue) Yoonjung The data from the Somasundaran et al. paper is available. Joint Extraction/Recognition, continued

Choi, Yejin and Breck, Eric and Cardie, Claire (2006). Joint Extraction of Entities and Relations for Opinion Recognition. EMNLP-2006.

Somasundaran, Swapna and Namata, Galileo and Wiebe, Janyce and Getoor, Lise (2009). Supervised and Unsupervised Methods in Employing Discourse Relations for Improving Opinion Polarity Classification. EMNLP 2009.

Optional: Huichao

Du, Weifu and Tan, Songbo (2009). An Iterative Reinforcement Approach for Fine-Grained Opinion Mining. ACL-2009.

12 2/14 (Thu) Jesse The data for Murray and Carenini is available *here*. Applications

Carenini, Giuseppe and Ng, Raymond T. and Zhou, Xiaodong (2008). Summarizing Emails with Conversational Cohesion and Subjectivity. ACL-2008.

Murray, Gabriel and Carenini, Giuseppe (2009). Predicting Subjectivity in Multimodal Conversations. EMNLP-2009.

Optional: Fan

Minqing Hu and Bing Liu (2006). Opinion Extraction and Summarization on the Web. AAAI-2006.

13 2/19 (Tue) Project proposals due

TAC 2008 Opinion Question Answering Track. To use the data for this track, we would need a user agreement for the Blog06 collection.

Applications, continued

Oh, Jong-Hoon and Torisawa, Kentaro and Hashimoto, Chikara and Kawada, Takuya and De Saeger, Stijn and Kazama, Jun'ichi and Wang, Yiou (2012). Why Question Answering using Sentiment Analysis and Word Classes. EMNLP-2012. This is also related to connotation.

Paul, Michael and Zhai, ChengXiang and Girju, Roxana (2010). Summarizing Contrastive Viewpoints in Opinionated Text. EMNLP-2010.

Optional: Lingjia

Lerman, Kevin and McDonald, Ryan (2009). Sentiment Summarization: Evaluating and Learning User Preferences. EACL-2009.

14 2/21 (Thu) Lab Class

Subjectivity annotation, continued from Day 3, 1/15.

15 2/26 (Tue) Huichao Domain Adaptation

Bollegala, Danushka and Weir, David and Carroll, John (2011). Using Multiple Sources to Construct a Sentiment Sensitive Thesaurus for Cross-Domain Sentiment Classification. ACL 2011.

Blitzer, John and Dredze, Mark and Pereira, Fernando (2007). Biographies, Bollywood, Boom-boxes and Blenders: Domain Adaptation for Sentiment Classification. ACL-2007.

Optional: Yoonjung

He, Yulan and Lin, Chenghua and Alani, Harith (2011). Automatically Extracting Polarity-Bearing Topics for Cross-Domain Sentiment Classification. ACL-2011.

16 2/28 (Thu) Alex Recognizing Stances, Arguments, and Viewpoints

Swapna Somasundaran and Janyce Wiebe (2009). Recognizing Stances in Online Debates. ACL-IJCNLP 2009.

Walker, M. and Anand, P. and Abbott, R. and Tree, J.E.F. and Martell, C. and King, J. (2011). That's your evidence?: Classifying Stance in Online Political and Social Debate. Decision Support Systems, 1-30. Elsevier (based on their WASSA 2011 paper, which won the best paper award).

Optional: Fan

Y. Lu and H. Wang and C. Zhai and D. Roth (2012). Unsupervised Discovery of Opposing Opinion Networks From Forum Discussions. CIKM-2012.

17 3/05 (Tue) Alex Recognizing Stances, Arguments, Viewpoints, continued.

Park, Souneil and Lee, Kyung Soon and Song, Junehwa (2011). Contrasting Opposing Views of News Articles on Contentious Issues. ACL-2011.

Abu-Jbara, Amjad and Dasigi, Pradeep and Diab, Mona and Radev, Dragomir (2011). Subgroup Detection in Ideological Discussions. ACL-2011.


Lin, Wei-Hao and Hauptmann, Alexander (2006). Are These Documents Written from Different Perspectives? A Test of Different Perspectives Based on Statistical Distribution Divergence. ACL-2006.

18 3/07 (Thu) Lab Class

Argument data: Death Penalty (Baumgartner, Boef, Boydstun); ConvinceMe.net (Conrad).

19 3/19 (Tue) Lingjia See the "Data related to sentiment analysis" bullet on Lillian Lee's data site. Graph-Based Approaches

Bo Pang and Lillian Lee (2004). A Sentimental Education: Sentiment Analysis Using Subjectivity Summarization Based on Minimum Cuts. ACL-04.

Feng, Song and Bose, Ritwik and Choi, Yejin (2011). Learning General Connotation of Words using Graph-based Algorithms. EMNLP-2011. This is also related to connotation.

Optional: Wencan

Li, Binyang and Zhou, Lanjun and Feng, Shi and Wong, Kam-Fai (2010). A Unified Graph Model for Sentence-Based Opinion Retrieval. ACL-2010.

20 3/21 (Thu) Huichao Project progress reports due Phrase-Level Contextual Polarity

Theresa Wilson, Janyce Wiebe, and Paul Hoffmann (2005). Recognizing Contextual Polarity in Phrase-Level Sentiment Analysis. HLT-EMNLP-2005.

Agarwal, Apoorv and Biadsy, Fadi and McKeown, Kathleen (2009). Contextual Phrase-Level Polarity Analysis Using Lexical Affect Scoring and Syntactic N-Grams. EACL 2009.

Optional: Lingjia

Socher, Richard and Huval, Brody and Manning, Christopher D. and Ng, Andrew Y. (2012). Semantic Compositionality through Recursive Matrix-Vector Spaces. EMNLP 2012.

21 3/26 (Tue) Yoonjung Phrase-Level Contextual Polarity, continued

Yejin Choi and Claire Cardie (2008). Learning with Compositional Semantics as Structural Inference for Subsentential Sentiment Analysis. EMNLP-2008.

Karo Moilanen and Stephen Pulman (2007). Sentiment Composition. RANLP 2007.

Optional: Phuong

Yessenalina, Ainur and Cardie, Claire (2011). Compositional Matrix-Space Models for Sentiment Analysis. EMNLP-2011.

22 3/28 (Thu) Wencan Conceptual Dependency Theory and Scripts

Chapters 1-3 of Roger Schank and Robert Abelson (1977). Scripts, Plans, Goals and Understanding. Lawrence Erlbaum.

23 4/02 (Tue) Phuong

Data created by Chambers and Jurafsky is available *here*

Slides about scripts (thanks to Ellen Riloff)

Slides about Conceptual Dependency (thanks to Ellen Riloff)

Learning Narrative and Script Schemas

Chambers, Nathanael and Jurafsky, Dan (2008). Unsupervised Learning of Narrative Event Chains. ACL-2008.

Regneri, Michaela and Koller, Alexander and Pinkal, Manfred (2010). Learning Script Knowledge with Web Experiments. ACL-2010.

Optional: Alex

Chambers, Nathanael and Jurafsky, Dan (2009). Unsupervised Learning of Narrative Schemas and their Participants. ACL-2009.

Chambers, Nathanael and Jurafsky, Dan (2010). A Database of Narrative Schemas. LREC-2010.

24 4/04 (Thu) Fan Plot Units

Wendy Lehnert. Plot Units and Narrative Summarization. Cognitive Science 4 (1981).

Goyal, Amit and Riloff, Ellen and Daume III, Hal (2012). A Computational Model for Plot Units. Computational Intelligence, 2012.

25 4/09 (Tue) Lab Class

Resurrecting "Old AI"

26 4/11 (Thu) Jesse I've requested the data annotated in Ganter and Strube.

The CoNLL-2010 shared task was on detecting hedges and their scope. The data is available *here*.


Ganter, Viola and Strube, Michael (2009). Finding Hedges by Chasing Weasels: Hedge Detection Using Wikipedia Tags and Shallow Linguistic Features. ACL-IJCNLP-2009 short paper.

Medlock, Ben and Briscoe, Ted (2007). Weakly Supervised Learning for Hedge Classification in Scientific Literature. ACL-2007.

Optional: Huy

Richard Farkas (2010). The CoNLL-2010 Shared Task (the overview paper at the beginning). CoNLL-2010.

27 4/16 (Tue) Jan Project report draft due

Webpage for FactBank and the PragBank Extension.

Data: the Stanford PragBank distribution.

Access a subset of the original documents using the Browser *here* (otherwise, FactBank must be obtained from the LDC).


de Marneffe, Marie-Catherine; Christopher D. Manning; and Christopher Potts. 2012. Did it happen? The pragmatic complexity of veridicality assessment. Computational Linguistics 38(2): 301-333

Optional: Alex

Prabhakaran, Vinodkumar and Rambow, Owen and Diab, Mona (2010). Automatic Committed Belief Tagging. COLING-2010.

28 4/18 (Thu) Class is cancelled.
29 4/23 (Tue) Final project reports due.

Project presentations: Jesse, Lingjia and Yoonjung, Fan and Wencan. 25 minutes each, including questions.
30 4/25 (Thu) Project presentations. Alex and Phuong; Huy and Huichao. 25 minutes each, including questions.