JTAGGER
Part-of-speech tagging, also called grammatical tagging, is the process of assigning the words in a text with their corresponding parts of speech like noun, verb, pronoun, or other lexical class markers to each word in a sentence. Part-of-speech tagging is an important step in natural language pr...
| Main Author: | YAACOB, NORHANA |
|---|---|
| Format: | Final Year Project |
| Language: | English |
| Institution: | Universiti Teknologi Petronas |
| Record Id / ISBN-0: | utp-utpedia.7056 / |
| Published: |
Universiti Teknologi Petronas
2006
|
| Subjects: | |
| Online Access: |
http://utpedia.utp.edu.my/7056/1/2006%20-%20JTAGGER.pdf http://utpedia.utp.edu.my/7056/ |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| id |
utp-utpedia.7056 |
|---|---|
| recordtype |
eprints |
| spelling |
utp-utpedia.70562021-07-27T15:46:00Z http://utpedia.utp.edu.my/7056/ JTAGGER YAACOB, NORHANA ZA Information resources Part-of-speech tagging, also called grammatical tagging, is the process of assigning the words in a text with their corresponding parts of speech like noun, verb, pronoun, or other lexical class markers to each word in a sentence. Part-of-speech tagging is an important step in natural language processing. Part-of-speech tagging is an ambiguous process because a word can represent morethan one part of speech at different times. Most difficult task is because it deals with ambiguities of the word. A word, phrase, or sentence is ambiguous if it has more than one meaning. The word 'light', for example, can mean not very heavy or not very dark. There are two types of ambiguity which are lexical and structural. When a word has more than one meaning, it is said to be lexically ambiguous. When a phrase or sentence can have more than one structure it is said to be structurally ambiguous. The part-of-speech tagging algorithms fall into three classes which are rule-based taggers, stochastic taggers, and transformation-based taggers. In this project, rule-based tagging algorithm is used as the mechanism to develop the system which named JTagger. The tagger initially tags by assigning each word its most likely tag, estimated by examining a corpus that consists of Penn Treebank Tagsets. JTagger is automatically performed the tagging process giving reasonable accuracy thus eliminate the difficulties of hand tagging task for the reader to manually tag a sentence. Part-of-speech tagging is important since it could help people to understand English better. The programming language used in this system is Java because it is an independent source that can run in any platform including Microsoft or UNIX. Universiti Teknologi Petronas 2006-01 Final Year Project NonPeerReviewed application/pdf en http://utpedia.utp.edu.my/7056/1/2006%20-%20JTAGGER.pdf YAACOB, NORHANA (2006) JTAGGER. Universiti Teknologi Petronas. (Unpublished) |
| institution |
Universiti Teknologi Petronas |
| collection |
UTPedia |
| language |
English |
| topic |
ZA Information resources |
| spellingShingle |
ZA Information resources YAACOB, NORHANA JTAGGER |
| description |
Part-of-speech tagging, also called grammatical tagging, is the process of assigning the
words in a text with their corresponding parts of speech like noun, verb, pronoun, or
other lexical class markers to each word in a sentence. Part-of-speech tagging is an
important step in natural language processing.
Part-of-speech tagging is an ambiguous process because a word can represent morethan
one part of speech at different times. Most difficult task is because it deals with
ambiguities of the word. A word, phrase, or sentence is ambiguous if it has more than
one meaning. The word 'light', for example, can mean not very heavy or not very dark.
There are two types of ambiguity which are lexical and structural. When a word has
more than one meaning, it is said to be lexically ambiguous. When a phrase or sentence
can have more than one structure it is said to be structurally ambiguous.
The part-of-speech tagging algorithms fall into three classes which are rule-based
taggers, stochastic taggers, and transformation-based taggers. In this project, rule-based
tagging algorithm is used as the mechanism to develop the system which named
JTagger. The tagger initially tags by assigning each word its most likely tag, estimated
by examining a corpus that consists of Penn Treebank Tagsets.
JTagger is automatically performed the tagging process giving reasonable accuracy thus
eliminate the difficulties of hand tagging task for the reader to manually tag a sentence.
Part-of-speech tagging is important since it could help people to understand English
better.
The programming language used in this system is Java because it is an independent
source that can run in any platform including Microsoft or UNIX. |
| format |
Final Year Project |
| author |
YAACOB, NORHANA |
| author_sort |
YAACOB, NORHANA |
| title |
JTAGGER |
| title_short |
JTAGGER |
| title_full |
JTAGGER |
| title_fullStr |
JTAGGER |
| title_full_unstemmed |
JTAGGER |
| title_sort |
jtagger |
| publisher |
Universiti Teknologi Petronas |
| publishDate |
2006 |
| url |
http://utpedia.utp.edu.my/7056/1/2006%20-%20JTAGGER.pdf http://utpedia.utp.edu.my/7056/ |
| _version_ |
1741194932216922112 |
| score |
11.62408 |