MinneBar Notes - Natural Language Processing
Saturday, April 21st, 2007
http://en.wikipedia.org/wiki/Natural_language_processing
Computer language
specific
well-defined
Natural language
ambiguous
context sensitive
John saw the man in the park with the telescope.
Who has the telescope?
Processing natural language
stemming
breaking apart sentences
tokenizer
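The three processing steps listed above can be sketched in plain Python. This is only a rough illustration, not what the session demonstrated: the function names (split_sentences, tokenize, stem) and the suffix list are assumptions made up for this example, and the stemmer is a toy suffix stripper rather than a real algorithm such as Porter's.

```python
import re

def split_sentences(text):
    """Naive sentence splitter: break after . ! or ? when followed by
    whitespace and a capital letter."""
    return re.split(r"(?<=[.!?])\s+(?=[A-Z])", text.strip())

def tokenize(sentence):
    """Naive tokenizer: runs of word characters, or single punctuation marks."""
    return re.findall(r"\w+|[^\w\s]", sentence)

def stem(word):
    """Toy stemmer: lowercase the word and strip one common suffix,
    keeping at least three characters of the stem."""
    word = word.lower()
    for suffix in ("ing", "ed", "es", "s"):
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word

if __name__ == "__main__":
    text = "John saw the man in the park. He waved."
    for sentence in split_sentences(text):
        tokens = tokenize(sentence)
        print(tokens, [stem(t) for t in tokens])
```

Even this small sketch shows why the steps are harder than they look: the stemmer turns "shopping" into "shopp", and the splitter would break on abbreviations such as "Mr." followed by a name.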
Named Entity Detection
recognizing chunks in text
person
date
money
Mr Jones went shopping on 31st May. He spent $3.15.
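A minimal sketch of named entity detection on the example sentence, using hand-written regular expressions. The patterns, the PATTERNS table, and the find_entities helper are assumptions for illustration only; real systems typically rely on statistical models or much larger rule sets.

```python
import re

# Hypothetical, deliberately narrow patterns for the three chunk types above.
PATTERNS = {
    "person": re.compile(r"\b(?:Mr|Mrs|Ms|Dr)\.? [A-Z][a-z]+"),
    "date": re.compile(
        r"\b\d{1,2}(?:st|nd|rd|th)? "
        r"(?:January|February|March|April|May|June|July|August|"
        r"September|October|November|December)\b"
    ),
    "money": re.compile(r"\$\d+(?:\.\d{2})?"),
}

def find_entities(text):
    """Return (label, matched text) pairs in order of appearance."""
    hits = []
    for label, pattern in PATTERNS.items():
        for match in pattern.finditer(text):
            hits.append((match.start(), label, match.group()))
    return [(label, chunk) for _, label, chunk in sorted(hits)]

if __name__ == "__main__":
    sentence = "Mr Jones went shopping on 31st May. He spent $3.15."
    for label, chunk in find_entities(sentence):
        print(label, "->", chunk)
```

On the example sentence this finds "Mr Jones" as a person, "31st May" as a date, and "$3.15" as money. It would miss a person written without a title, or a date written as "May 31", which is the kind of gap that keeps accuracy below 100%.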
You should probably never expect 100% accuracy in NLP.
Problem areas include companies named after people
Robert Half, Inc.
Thomson West
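To see why company names like these are a problem, here is a rough sketch of a naive detector that treats any two capitalized words as a person unless a corporate suffix follows. The NAME and COMPANY_SUFFIX patterns and the classify function are hypothetical, written only to illustrate the failure mode.

```python
import re

# Assumption: a "name" is two consecutive capitalized words.
NAME = re.compile(r"\b[A-Z][a-z]+ [A-Z][a-z]+\b")
# Assumption: a short, incomplete list of corporate suffixes.
COMPANY_SUFFIX = re.compile(r",? (?:Inc|Corp|Ltd|LLC)\b\.?")

def classify(text):
    """Label each two-word capitalized name as 'company' if a corporate
    suffix follows it directly, otherwise as 'person'."""
    results = []
    for match in NAME.finditer(text):
        has_suffix = COMPANY_SUFFIX.match(text, match.end()) is not None
        results.append((match.group(), "company" if has_suffix else "person"))
    return results

if __name__ == "__main__":
    print(classify("Robert Half, Inc. hired Mary Smith."))
    print(classify("Robert Half called today."))
```

The suffix check handles "Robert Half, Inc.", but as soon as the suffix is dropped the same name is labeled a person, and nothing in the text alone says whether that is right. A company name with no suffix at all would be mislabeled every time.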