- Title
- Knowledge-based intelligent text simplification for biological relation extraction
- Creator
- Gill, Jaskaran; Chetty, Madhu; Lim, Suryani; Hallinan, Jennifer
- Date
- 2023
- Type
- Text; Journal article
- Identifier
- http://researchonline.federation.edu.au/vital/access/HandleResolver/1959.17/199119
- Identifier
- vital:19138
- Identifier
- https://doi.org/10.3390/informatics10040089
- Identifier
- ISSN:2227-9709 (ISSN)
- Abstract
- Relation extraction from biological publications plays a pivotal role in accelerating scientific discovery and advancing medical research. While vast amounts of this knowledge are stored within the published literature, extracting it manually from this continually growing volume of documents is becoming increasingly arduous. Recently, attention has focused on extracting such knowledge automatically using pre-trained Large Language Models (LLMs) and deep-learning algorithms. However, the complex syntactic structure of biological sentences, with nested entities and domain-specific terminology, together with insufficient annotated training corpora, poses major challenges in accurately capturing entity relationships from the unstructured data. To address these issues, in this paper, we propose a Knowledge-based Intelligent Text Simplification (KITS) approach focused on the accurate extraction of biological relations. KITS is able to accurately capture the relational context among the various binary relations within a sentence while preventing any changes in meaning in the sentences it simplifies. The experiments show that, measured with well-known performance metrics, the proposed technique resulted in a 21% increase in precision, with only 25% of sentences simplified in the Learning Language in Logic (LLL) dataset. Combined with BioBERT, a popular pre-trained LLM, the proposed method was able to outperform other state-of-the-art methods. © 2023 by the authors.
- Publisher
- Multidisciplinary Digital Publishing Institute (MDPI)
- Relation
- Informatics Vol. 10, no. 4 (2023), p.
- Rights
- All metadata describing materials held in, or linked to, the repository is freely available under a CC0 licence
- Rights
- https://creativecommons.org/licenses/by/4.0/
- Rights
- Copyright © 2023 by the authors.
- Rights
- Open Access
- Subject
- 4609 Information systems; BERN2; BioBERT; Named entity recognition; Relation extraction; Sentence simplification
- Full Text
- Reviewed
- Funder
- The first author acknowledges the support of a tuition fee waiver scholarship from Federation University and a stipend scholarship from the Health Innovative and Transformation Centre (HITC), Federation University Australia.
File | Description | Size | Format
---|---|---|---
SOURCE1 | Published version | 3 MB | Adobe Acrobat PDF