http://scholars.ntou.edu.tw/handle/123456789/23604
DC 欄位 | 值 | 語言 |
---|---|---|
dc.contributor.author | Warikoo, Neha | en_US |
dc.contributor.author | Chang, Yung-Chun | en_US |
dc.contributor.author | Ma, Shang-Pin | en_US |
dc.date.accessioned | 2023-02-15T01:17:32Z | - |
dc.date.available | 2023-02-15T01:17:32Z | - |
dc.date.issued | 2022-10-01 | - |
dc.identifier.uri | http://scholars.ntou.edu.tw/handle/123456789/23604 | - |
dc.description.abstract | Protein-based studies contribute significantly to gathering functional information about biological systems; therefore, the protein-protein interaction detection task is one of the most researched topics in the biomedical literature. To this end, many state-of-the-art systems using syntactic tree kernels (TK) and deep learning have been developed. However, these models are computationally complex and have limited learning interpretability. In this paper, we introduce a linguistic-pattern-representation-based Gradient-Tree Boosting model, i.e., LpGBoost. It uses linguistic patterns to optimize and generate semantically relevant representation vectors for learning over the gradient-tree boosting. The patterns are learned via unsupervised modeling by clustering invariant semantic features. These linguistic representations are semi-interpretable with rich semantic knowledge, and owing to their shallow representation, they are also computationally less expensive. Our experiments with six protein-protein interaction (PPI) corpora demonstrate that LpGBoost outperforms the SOTA tree-kernel models, as well as the CNN-based interaction detection studies for BioInfer and AIMed corpora. | en_US |
dc.language.iso | English | en_US |
dc.publisher | MDPI | en_US |
dc.relation.ispartof | APPLIED SCIENCES-BASEL | en_US |
dc.subject | protein-protein interaction | en_US |
dc.subject | natural language processing | en_US |
dc.subject | gradient-tree boosting | en_US |
dc.subject | linguistic patterns | en_US |
dc.subject | bioinformatics | en_US |
dc.title | Gradient Boosting over Linguistic-Pattern-Structured Trees for Learning Protein-Protein Interaction in the Biomedical Literature | en_US |
dc.type | journal article | en_US |
dc.identifier.doi | 10.3390/app122010199 | - |
dc.identifier.isi | WOS:000872240100001 | - |
dc.relation.journalvolume | 12 | en_US |
dc.relation.journalissue | 20 | en_US |
dc.identifier.eissn | 2076-3417 | - |
item.languageiso639-1 | English | - |
item.fulltext | no fulltext | - |
item.openairetype | journal article | - |
item.openairecristype | http://purl.org/coar/resource_type/c_6501 | - |
item.grantfulltext | none | - |
item.cerifentitytype | Publications | - |
crisitem.author.dept | College of Electrical Engineering and Computer Science | - |
crisitem.author.dept | Department of Computer Science and Engineering | - |
crisitem.author.dept | National Taiwan Ocean University,NTOU | - |
crisitem.author.orcid | 0000-0002-3317-5750 | - |
crisitem.author.parentorg | National Taiwan Ocean University,NTOU | - |
crisitem.author.parentorg | College of Electrical Engineering and Computer Science | - |
顯示於: | 資訊工程學系 |
在 IR 系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。