http://scholars.ntou.edu.tw/handle/123456789/17887
DC 欄位 | 值 | 語言 |
---|---|---|
dc.contributor.author | Chuan-Jie Lin | en_US |
dc.contributor.author | Wei-Cheng Chu | en_US |
dc.date.accessioned | 2021-10-21T05:38:38Z | - |
dc.date.available | 2021-10-21T05:38:38Z | - |
dc.date.issued | 2015-06-01 | - |
dc.identifier.uri | http://scholars.ntou.edu.tw/handle/123456789/17887 | - |
dc.description.abstract | This paper proposes an automatic method to build a Chinese spelling check system. Confusion sets were expanded by using two language resources, Shuowen Jiezi and the Four-Corner codes, which improved the coverages of the confusion sets. Nine scoring functions which utilize the frequency data in the Google Ngram Datasets were proposed, where the idea of smoothing was also adopted. Thresholds were also decided in an automatic way. The final system achieved far better than our baseline system in CSC 2013 Evaluation Task. | en_US |
dc.language.iso | en | en_US |
dc.subject | Chinese Spelling Check | en_US |
dc.subject | Confusion Set Expansion | en_US |
dc.subject | Google Ngram Scoring Function. | en_US |
dc.title | A Study on Chinese Spelling Check Using Confusion Sets and N-gram Statistics | en_US |
dc.type | journal article | en_US |
dc.relation.journalvolume | 20 | en_US |
dc.relation.journalissue | 1 | en_US |
dc.relation.pages | 23-48 | en_US |
item.cerifentitytype | Publications | - |
item.openairetype | journal article | - |
item.openairecristype | http://purl.org/coar/resource_type/c_6501 | - |
item.fulltext | no fulltext | - |
item.grantfulltext | none | - |
item.languageiso639-1 | en | - |
crisitem.author.dept | College of Electrical Engineering and Computer Science | - |
crisitem.author.dept | Department of Computer Science and Engineering | - |
crisitem.author.dept | National Taiwan Ocean University,NTOU | - |
crisitem.author.parentorg | National Taiwan Ocean University,NTOU | - |
crisitem.author.parentorg | College of Electrical Engineering and Computer Science | - |
顯示於: | 資訊工程學系 |
在 IR 系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。