Literature-Based Discovery (LBD): Towards Hypothesis Generation and Knowledge Discovery in Biomedical Text Mining

Authors

  • Balu Bhasuran DRDO-BU Center for Life Sciences, Bharathiar University Campus, India and School of Information, Florida State University, United States https://orcid.org/0000-0002-9890-4627
  • Gurusamy Murugesan Department of Bioinformatics, Bharathiar University, India
  • Jeyakumar Natarajan DRDO-BU Center for Life Sciences and Department of Bioinformatics, Bharathiar University, India

DOI:

https://doi.org/10.47852/bonviewMEDIN52025348

Keywords:

biomedical text mining, literature-based discovery, ABC principle, open and closed discovery, novel associations, concept profiles

Abstract

Biomedical knowledge is growing at an astounding pace with a majority of this knowledge represented as scientific publications. Text mining tools and methods represent automatic approaches for extracting hidden patterns and trends from this semi-structured and unstructured data. In biomedical text mining, literature-based discovery (LBD) is the process of automatically discovering novel associations between medical terms otherwise mentioned in disjoint literature sets. LBD approaches have proven to successfully reduce the discovery time of potential associations that are hidden in the vast amount of scientific literature. The process focuses on creating concept profiles for medical terms such as a disease or symptom and connecting them with a drug and treatment based on the statistical significance of the shared profiles. This knowledge discovery approach introduced in 1989 remains a core task in text mining. Currently, the ABC principle-based two approaches namely open discovery and closed discovery are mostly explored in the LBD process. This review starts with a general introduction about text mining followed by biomedical text mining followed by a brief introduction of the core ABC principle and its associated two approaches open discovery and closed discovery in the LBD process. This review discusses the deep learning applications in LBD by reviewing the role of transformer models and neural networks-based LBD models and their future aspects. Additionally, the potential of Large Language Models in enriching the LBD process is discussed with challenges and solutions and finally reviews the key biomedical discoveries generated through LBD approaches in biomedicine and concludes with the current limitations and future directions of LBD.

 

Received: 1 February 2025 | Revised: 12 May 2025 | Accepted: 30 May 2025

 

Conflicts of Interest

The authors declare that they have no conflicts of interest to this work.

 

Data Availability Statement

Data sharing is not applicable to this article as no new data were created or analyzed in this study.

 

Author Contribution Statement

Balu Bhasuran: Conceptualization, Methodology, Validation, Formal analysis, Investigation, Resources, Data curation, Writing – original draft, Writing – review & editing, Visualization. Gurusamy Murugesan: Methodology, Validation, Formal analysis, Investigation, Resources, Data curation, Writing – original draft, Writing – review & editing, Visualization. Jeyakumar Natarajan: Conceptualization, Methodology, Validation, Formal analysis, Investigation, Resources, Data curation, Writing – original draft, Writing – review & editing, Visualization, Supervision, Project administration.


Downloads

Published

2025-06-19

Issue

Section

Review

How to Cite

Bhasuran, B., Murugesan, G., & Natarajan, J. (2025). Literature-Based Discovery (LBD): Towards Hypothesis Generation and Knowledge Discovery in Biomedical Text Mining. Medinformatics. https://doi.org/10.47852/bonviewMEDIN52025348