http://scholars.ntou.edu.tw/handle/123456789/17906
DC 欄位 | 值 | 語言 |
---|---|---|
dc.contributor.author | Chuan-Jie Lin | en_US |
dc.contributor.author | Jia-Cheng Zhan | en_US |
dc.contributor.author | Yen-Heng Chen | en_US |
dc.contributor.author | Chien-Wei Pao | en_US |
dc.date.accessioned | 2021-10-21T06:44:37Z | - |
dc.date.available | 2021-10-21T06:44:37Z | - |
dc.date.issued | 2012-09 | - |
dc.identifier.uri | http://scholars.ntou.edu.tw/handle/123456789/17906 | - |
dc.description.abstract | This paper proposes an approach to identify word candidates that are not Traditional Chinese, including Japanese names (written in Japanese Kanji or Traditional Chinese characters) and word variants, when doing word segmentation on Traditional Chinese text. When handling personal names, a probability model concerning formats of names is introduced. We also propose a method to map Japanese Kanji into the corresponding Traditional Chinese characters. The same method can also be used to detect words written in character variants. After integrating generation rules for various types of special words, as well as their probability models, the F-measure of our word segmentation system rises from 94.16% to 96.06%. Another experiment shows that 83.18% of the 862 Japanese names in a set of 109 human-annotated documents can be successfully detected. | en_US |
dc.language.iso | en | en_US |
dc.publisher | Computational Linguistics | en_US |
dc.relation.ispartof | Computational Linguistics and Chinese Language Processing | en_US |
dc.subject | Semantic Chinese Word Segmentation | en_US |
dc.subject | Japanese Name Identification | en_US |
dc.subject | Character Variants. | en_US |
dc.title | Strategies of Processing Japanese Names and Character Variants in Traditional Chinese Text | en_US |
dc.type | journal article | en_US |
dc.relation.journalvolume | 17 | en_US |
dc.relation.journalissue | 3 | en_US |
dc.relation.pages | 87-108 | en_US |
item.cerifentitytype | Publications | - |
item.openairetype | journal article | - |
item.openairecristype | http://purl.org/coar/resource_type/c_6501 | - |
item.fulltext | no fulltext | - |
item.grantfulltext | none | - |
item.languageiso639-1 | en | - |
crisitem.author.dept | College of Electrical Engineering and Computer Science | - |
crisitem.author.dept | Department of Computer Science and Engineering | - |
crisitem.author.dept | National Taiwan Ocean University,NTOU | - |
crisitem.author.parentorg | National Taiwan Ocean University,NTOU | - |
crisitem.author.parentorg | College of Electrical Engineering and Computer Science | - |
顯示於: | 資訊工程學系 |
在 IR 系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。