Comparative Study of Vietnamese Part-of-Speech Tagging Tools

Vietnamese part-of-speech tagging is one of the most fundamental practices in Vietnamese language processing. Unfortunately, no attempt has been made to empirically compare different Vietnamese part-of-speech tagging software. Therefore, in this paper, the authors experiment upon several Vietnamese...

Full description

Main Authors: Quach, L.-D., Do Thanh, D., Tran, D.C., Hassan, M.F.
Format: Conference or Workshop Item
Institution: Universiti Teknologi Petronas
Record Id / ISBN-0: utp-eprints.30115 /
Published: Institute of Electrical and Electronics Engineers Inc. 2020
Online Access: https://www.scopus.com/inward/record.uri?eid=2-s2.0-85096933078&doi=10.1109%2fICSGRC49013.2020.9232564&partnerID=40&md5=7b4f98653c96712391c87ba08ced7cc7
http://eprints.utp.edu.my/30115/
Tags: Add Tag
No Tags, Be the first to tag this record!
id utp-eprints.30115
recordtype eprints
spelling utp-eprints.301152022-03-25T06:34:33Z Comparative Study of Vietnamese Part-of-Speech Tagging Tools Quach, L.-D. Do Thanh, D. Tran, D.C. Hassan, M.F. Vietnamese part-of-speech tagging is one of the most fundamental practices in Vietnamese language processing. Unfortunately, no attempt has been made to empirically compare different Vietnamese part-of-speech tagging software. Therefore, in this paper, the authors experiment upon several Vietnamese part-of-speech tagging software such as VnTagger, RDRPOSTagger (Java Version), JvnTextPro, VNCoreNLP in terms of accuracy, consistency and computational time. In addition, the brief descriptions of the models are discussed in detail. The results help researchers comprehend the models' strengths and weaknesses. The tools are tested on 4 different data sets of number of sentences and different word types such as date, number, special characters, connected characters, double words, compound words, proper names, etc� The results show that the accuracy of the JvnTextPro tool is high and stable with an accuracy of 80.08 to 97.84, and the RDPRPOSTagger tool has faster processing time and relatively good accuracy from 88.41 to 96.84. © 2020 IEEE. Institute of Electrical and Electronics Engineers Inc. 2020 Conference or Workshop Item NonPeerReviewed https://www.scopus.com/inward/record.uri?eid=2-s2.0-85096933078&doi=10.1109%2fICSGRC49013.2020.9232564&partnerID=40&md5=7b4f98653c96712391c87ba08ced7cc7 Quach, L.-D. and Do Thanh, D. and Tran, D.C. and Hassan, M.F. (2020) Comparative Study of Vietnamese Part-of-Speech Tagging Tools. In: UNSPECIFIED. http://eprints.utp.edu.my/30115/
institution Universiti Teknologi Petronas
collection UTP Institutional Repository
description Vietnamese part-of-speech tagging is one of the most fundamental practices in Vietnamese language processing. Unfortunately, no attempt has been made to empirically compare different Vietnamese part-of-speech tagging software. Therefore, in this paper, the authors experiment upon several Vietnamese part-of-speech tagging software such as VnTagger, RDRPOSTagger (Java Version), JvnTextPro, VNCoreNLP in terms of accuracy, consistency and computational time. In addition, the brief descriptions of the models are discussed in detail. The results help researchers comprehend the models' strengths and weaknesses. The tools are tested on 4 different data sets of number of sentences and different word types such as date, number, special characters, connected characters, double words, compound words, proper names, etc� The results show that the accuracy of the JvnTextPro tool is high and stable with an accuracy of 80.08 to 97.84, and the RDPRPOSTagger tool has faster processing time and relatively good accuracy from 88.41 to 96.84. © 2020 IEEE.
format Conference or Workshop Item
author Quach, L.-D.
Do Thanh, D.
Tran, D.C.
Hassan, M.F.
spellingShingle Quach, L.-D.
Do Thanh, D.
Tran, D.C.
Hassan, M.F.
Comparative Study of Vietnamese Part-of-Speech Tagging Tools
author_sort Quach, L.-D.
title Comparative Study of Vietnamese Part-of-Speech Tagging Tools
title_short Comparative Study of Vietnamese Part-of-Speech Tagging Tools
title_full Comparative Study of Vietnamese Part-of-Speech Tagging Tools
title_fullStr Comparative Study of Vietnamese Part-of-Speech Tagging Tools
title_full_unstemmed Comparative Study of Vietnamese Part-of-Speech Tagging Tools
title_sort comparative study of vietnamese part-of-speech tagging tools
publisher Institute of Electrical and Electronics Engineers Inc.
publishDate 2020
url https://www.scopus.com/inward/record.uri?eid=2-s2.0-85096933078&doi=10.1109%2fICSGRC49013.2020.9232564&partnerID=40&md5=7b4f98653c96712391c87ba08ced7cc7
http://eprints.utp.edu.my/30115/
_version_ 1741197352299921408
score 11.62408