http://scholars.ntou.edu.tw/handle/123456789/17907
標題: | Evaluating and enhancing cross-domain rank predictability of textual entailment datasets | 作者: | Cheng-Wei Lee Chuan-Jie Lin Hideki Shima Wen-Lian Hsu |
關鍵字: | Guidelines;Humans;Correlation;Text recognition;Accuracy;Educational institutions;Standards | 公開日期: | 8-八月-2012 | 出版社: | IEEE | 摘要: | Textual Entailment (TE) is the task of recognizing entailment, paraphrase, and contradiction relations between a given text pair. The goal of textual entailment research is to develop a core inference component that can be applied to various domains, such as IR or NLP. Since the domain that a TE system applies to may be different from its source domain, it is crucial to develop proper datasets for measuring the cross-domain ability of a TE system. We propose using Kendall's tau to measure a dataset's cross-domain rank predictability. Our analysis shows that incorporating “artificial pairs” into a dataset helps enhance its rank predictability. We also find that the completeness of guidelines has no obvious effect on the rank predictability of a dataset. To validate these findings, more investigation is needed; however these findings suggest some new directions for the creation of TE datasets in the future. |
URI: | http://scholars.ntou.edu.tw/handle/123456789/17907 | ISBN: | 978-1-4673-2284-3 | DOI: | 10.1109/IRI.2012.6302990 |
顯示於: | 資訊工程學系 |
在 IR 系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。