http://scholars.ntou.edu.tw/handle/123456789/17907
Title: | Evaluating and enhancing cross-domain rank predictability of textual entailment datasets | Authors: | Cheng-Wei Lee Chuan-Jie Lin Hideki Shima Wen-Lian Hsu |
Keywords: | Guidelines;Humans;Correlation;Text recognition;Accuracy;Educational institutions;Standards | Issue Date: | 8-Aug-2012 | Publisher: | IEEE | Abstract: | Textual Entailment (TE) is the task of recognizing entailment, paraphrase, and contradiction relations between a given text pair. The goal of textual entailment research is to develop a core inference component that can be applied to various domains, such as IR or NLP. Since the domain that a TE system applies to may be different from its source domain, it is crucial to develop proper datasets for measuring the cross-domain ability of a TE system. We propose using Kendall's tau to measure a dataset's cross-domain rank predictability. Our analysis shows that incorporating “artificial pairs” into a dataset helps enhance its rank predictability. We also find that the completeness of guidelines has no obvious effect on the rank predictability of a dataset. To validate these findings, more investigation is needed; however these findings suggest some new directions for the creation of TE datasets in the future. |
URI: | http://scholars.ntou.edu.tw/handle/123456789/17907 | ISBN: | 978-1-4673-2284-3 | DOI: | 10.1109/IRI.2012.6302990 |
Appears in Collections: | 資訊工程學系 |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.