| |||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||
|
|||||||||||||||||||||||||||||||
|
[ Next page | Previous Page ] What is Natural Language?Natural Languages are languages used in human culture such as chinese, english or bulgarian. They can be either spoken or written.
What is Natural Language Processing? In particular, NLP research also deals with speech recognition. Currently, programs that convert spoken speech into text have been widely used and are fairly dependable. Another field of study in NLP are story understanders, as well as chatterboxes. During the 1960s Joseph Weizenbaum created ELIZA. ELIZA created a storm of public interest in AI, as it helped thousands overcome their personal problems. ELIZA was a psychiatrist, particularly one that posed analytical questions for every answer the user gave it. Though sometimes they may have seemed ambiguous, people actually felt ELIZA could take care of their needs just as well as any other therapist. They became emotionally involved with ELIZA, even Weizenbaum's secretary demanded to be left alone with the program. The following is a transcript of ELIZA chatting with another program PARRY:
Parry:I don't understand your motives. When people had started calling ELIZA intelligent, Joseph Weizenbaum went into an uproar. Technically, ELIZA was actually unable to understand people's personal problems to the depth of any other human being. ELIZA could only manipulate syntax (grammar), and check for some key words. Certainly, if someone had no knowledge of ELIZA being a program, one could easily conclude that it behaved like a human conversing, although it never really neccessary understood everything to the detail that humans do. Coincidentally, ELIZA creates questions to help people's personal problems, while IQATS (Intelligent Question and Answer Test Summarizer), a program written by Sam Hsiung (Generation 5 staff member), creates questions for test-making purposes. Unlike ELIZA, IQATS is able to learn how to ask new questions, if it is given a sample question and answer. Yet, like ELIZA, it knows and will learn only how to manipulate syntax. It will be able to ask a question about what the capital or Saudi Arabia is, however if it were given something a bit more complex, such as Martin Luther King's I have a dream speech, it would not be able to come up with questions that force people to draw inferences (Ex.: Under what context was this speech given in?); neither does it really understand what it is asking. Many researchers realized this limitation, and as a result conceptual dependency (CD) (pioneered by Roger Schank) theory was created. CR systems such as SAM (Script Applier Mechanism) are story understanders. When SAM is given a story, and later asked questions about it, it will answer many of those questions accurately. (Thus showing that it "understands") It can even infer. It accomplishes this through use of scripts. The scripts designate a sequence of actions that are to be performed in chronological fashion for a certain situation. A restaurant script would say that you would need to sit down by a table before you are served dinner. The following is a small example of SAM (Script Applier Mechanism) paraphrasing a story (notice the inferences):
Input: John went to a restaurant. He sat down. He got mad. He left.
Paraphrase: JOHN WAS HUNGRY. HE DECIDED TO GO TO A RESTAURANT. HE WENT TO ONE. HE SAT DOWN IN A CHAIR. A WAITER DID NOT GO TO THE TABLE. JOHN BECAME UPSET. HE DECIDED HE WAS GOING TO LEAVE THE RESTAURANT. HE LEFT IT.
Scripts allow CD systems to draw links and inferences between things. They are also able to classify and distinguish
primitive actions. Kicking someone, for example could be a physical action that institutes 'hurt', while loving
could be an emotional expressiong that implies 'affection'.
Machine Translation
Translators must first take into consideration the context of the things being spoken, before actual translation. Machines translation programs likewise, must also do the same, or else they are very likely to fail. While this problem has been partially dealt with by Conceptual Dependency/Representation Theory (Schank's scripts), many programs have difficulty translating slang, or idioms, all of which are usually unique to each individual language & culture. Grammar and punctuation mistakes that users may make also presents another consideration. SYSTRAN is a machine translation program available on the world wide web. This program was used to translated Generation 5's openning page from english to french, spanish and german. This program provides an excellent look at where machine translation is today. You can test out the program's accuracy by converting text from one language to another, and then converting it back. Here are some interesting translations done by SYSTRAN.
Note that with each conversion, the quality of the translated text degrades. With just one conversion there are twice as less errors than the examples above. As you can see, the quality of texts differ from language to language and paragraph to paragraph. Probably the most impressive translation was the English to Portuguese to English translation. The German translation fared much better than its Russian counterpart. Although these translations are far from perfect, they are clearly very useful (and far better than nothing).
Submitted: 10/12/1999 |
|
||||||||||||||||||||||||||||||
All content copyright © 1998-2007, Generation5 unless otherwise noted.
- Privacy Policy - Legal - Terms of Use -