Topics for Today
=============================================
How can we evaluate AI systems?
E.g., for games
For tools, such as e-mail interfaces, scheduling tools, MT support tools
Customer satisfaction
Questionnaires
Improvement in performance
Define formal measures
$$ Savings
Less turnover
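One possible formal measure for "improvement in performance" is relative improvement over a baseline. The sketch below is only an illustration: the function name `relative_improvement` and the sample numbers are hypothetical and do not come from the lecture.

```python
def relative_improvement(baseline, with_system):
    """Fractional change of the with-system score relative to the baseline."""
    if baseline == 0:
        raise ValueError("baseline must be non-zero")
    return (with_system - baseline) / baseline


# Hypothetical numbers: tasks completed per hour without and with the tool.
print(relative_improvement(20.0, 25.0))  # 0.25, i.e. a 25% improvement
```

A positive value means users did better with the system; a negative value means they did worse.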
============================================
word senses, parts of speech, syntax, …
(sometimes already exists)
The Turing Test
Room 1                    Room 2
Person 1 ---------------- Computer responding
Person 2 ---------------- Person responding
Can the computer fool Person 1 into thinking
it's a person?
The Turing Test: What do you think?
internal algorithm used
(Also not well founded: computer could act like a crazy person, person could act like a computer)
Should we use humans as our models?
People tried to build machines with
flapping wings
Wright brothers: ignored flapping wings,
and solved the problem in a different way
Maybe machines must do things differently
than people (animals) do them
Current practice: Both and Neither
Equivalent to in-depth analysis and observation of human behavior