Description: In this exercise, we will evaluate the output of a part-of-speech tagger on a set of documents. We will:
Credits: This exercise was developed for Johanna Moore's class at the University of Edinburgh.
The tagger is the Brill tagger described in Chapter 8 of the textbook, which can be downloaded from Eric Brill's Home Page. This paper describes the tagger in more detail. The tagger has already been downloaded (/afs/cs.pitt.edu/usr0/mrotaru/public/cs2731/hw2/RULE_BASED_TAGGER_V1.14) and installed for you. Here are some instructions that you might need.
You must "tokenize" the input to the tagger. In particular, you must perform the following substitutions:
% cd /afs/cs.pitt.edu/usr0/mrotaru/public/cs2731/RULE_BASED_TAGGER_V1.14/Bin_and_DataThen type
% ./tagger LEXICON filename BIGRAMS LEXICALRULEFILE CONTEXTUALRULEFILEHere filename is the name of the file to tag, and LEXICON, BIGRAMS, LEXICALRULEFILE, and CONTEXTUALRULEFILE are strings that you actually type. This will print the tagged file to standard output; if you want to save the output in a file called outfile, you can redirect it like this:
% ./tagger LEXICON filename BIGRAMS LEXICALRULEFILE CONTEXTUALRULEFILE > outfile
Choose five tagging errors and discuss the possible reasons for these errors.
SUBMIT: Print-outs showing the tagging errors you are discussing, and your discussion of the errors.
Quantitatively evaluate the performance of the tagger. To do this, you will use this program to compute the confusion matrices comparing the tagger's output to the gold standard and to compute Kappa.
SUBMIT: Kappa value, and answers to the above questions.
Try tagging the following texts. They have already been tokenised as specified on the tagging instructions page.
Examine the results. Do you think that having this part-of-speech information would have made the task of locating the date and time expressions on our first homework easier? Why or why not?
SUBMIT: Your answer to the above question, along with any parts of the tagged output that support your answer.