Skip navigation
  • 中文
  • English

DSpace CRIS

  • DSpace logo
  • 首頁
  • 研究成果檢索
  • 研究人員
  • 單位
  • 計畫
  • 分類瀏覽
    • 研究成果檢索
    • 研究人員
    • 單位
    • 計畫
  • 機構典藏
  • SDGs
  • 登入
  • 中文
  • English
  1. National Taiwan Ocean University Research Hub
  2. 電機資訊學院
  3. 資訊工程學系
請用此 Handle URI 來引用此文件: http://scholars.ntou.edu.tw/handle/123456789/17906
標題: Strategies of Processing Japanese Names and Character Variants in Traditional Chinese Text
作者: Chuan-Jie Lin 
Jia-Cheng Zhan
Yen-Heng Chen
Chien-Wei Pao
關鍵字: Semantic Chinese Word Segmentation;Japanese Name Identification;Character Variants.
公開日期: 九月-2012
出版社: Computational Linguistics
卷: 17
期: 3
起(迄)頁: 87-108
來源出版物: Computational Linguistics and Chinese Language Processing
摘要: 
This paper proposes an approach to identify word candidates that are not
Traditional Chinese, including Japanese names (written in Japanese Kanji or
Traditional Chinese characters) and word variants, when doing word segmentation
on Traditional Chinese text. When handling personal names, a probability model
concerning formats of names is introduced. We also propose a method to map
Japanese Kanji into the corresponding Traditional Chinese characters. The same
method can also be used to detect words written in character variants. After
integrating generation rules for various types of special words, as well as their
probability models, the F-measure of our word segmentation system rises from
94.16% to 96.06%. Another experiment shows that 83.18% of the 862 Japanese
names in a set of 109 human-annotated documents can be successfully detected.
URI: http://scholars.ntou.edu.tw/handle/123456789/17906
顯示於:資訊工程學系

顯示文件完整紀錄

Page view(s)

118
上周
0
上個月
0
checked on 2025/6/30

Google ScholarTM

檢查

TAIR相關文章


在 IR 系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。

瀏覽
  • 機構典藏
  • 研究成果檢索
  • 研究人員
  • 單位
  • 計畫
DSpace-CRIS Software Copyright © 2002-  Duraspace   4science - Extension maintained and optimized by NTU Library Logo 4SCIENCE 回饋