Literature-Based Discovery (LBD): Towards Hypothesis Generation and Knowledge Discovery in Biomedical Text Mining
DOI:
https://doi.org/10.47852/bonviewMEDIN52025348Keywords:
biomedical text mining, literature-based discovery, ABC principle, open and closed discovery, novel associations, concept profilesAbstract
Biomedical knowledge is growing at an astounding pace with a majority of this knowledge represented as scientific publications. Text mining tools and methods represent automatic approaches for extracting hidden patterns and trends from this semi-structured and unstructured data. In biomedical text mining, literature-based discovery (LBD) is the process of automatically discovering novel associations between medical terms otherwise mentioned in disjoint literature sets. LBD approaches have proven to successfully reduce the discovery time of potential associations that are hidden in the vast amount of scientific literature. The process focuses on creating concept profiles for medical terms such as a disease or symptom and connecting them with a drug and treatment based on the statistical significance of the shared profiles. This knowledge discovery approach introduced in 1989 remains a core task in text mining. Currently, the ABC principle-based two approaches namely open discovery and closed discovery are mostly explored in the LBD process. This review starts with a general introduction about text mining followed by biomedical text mining followed by a brief introduction of the core ABC principle and its associated two approaches open discovery and closed discovery in the LBD process. This review discusses the deep learning applications in LBD by reviewing the role of transformer models and neural networks-based LBD models and their future aspects. Additionally, the potential of Large Language Models in enriching the LBD process is discussed with challenges and solutions and finally reviews the key biomedical discoveries generated through LBD approaches in biomedicine and concludes with the current limitations and future directions of LBD.
Received: 1 February 2025 | Revised: 12 May 2025 | Accepted: 30 May 2025
Conflicts of Interest
The authors declare that they have no conflicts of interest to this work.
Data Availability Statement
Data sharing is not applicable to this article as no new data were created or analyzed in this study.
Author Contribution Statement
Balu Bhasuran: Conceptualization, Methodology, Validation, Formal analysis, Investigation, Resources, Data curation, Writing – original draft, Writing – review & editing, Visualization. Gurusamy Murugesan: Methodology, Validation, Formal analysis, Investigation, Resources, Data curation, Writing – original draft, Writing – review & editing, Visualization. Jeyakumar Natarajan: Conceptualization, Methodology, Validation, Formal analysis, Investigation, Resources, Data curation, Writing – original draft, Writing – review & editing, Visualization, Supervision, Project administration.
Downloads
Published
Issue
Section
License
Copyright (c) 2025 Authors

This work is licensed under a Creative Commons Attribution 4.0 International License.