Past and future uses of text mining in ecology and evolution

Farrell, M. J. , Brierley, L., Willoughby, A., Yates, A. and Mideo, N. (2022) Past and future uses of text mining in ecology and evolution. Proceedings of the Royal Society of London Series B: Biological Sciences, 289(1975), 20212721. (doi: 10.1098/rspb.2021.2721) (PMID:35582795) (PMCID:PMC9114983)

[img] Text
293808.pdf - Published Version
Available under License Creative Commons Attribution.

655kB

Abstract

Ecology and evolutionary biology, like other scientific fields, are experiencing an exponential growth of academic manuscripts. As domain knowledge accumulates, scientists will need new computational approaches for identifying relevant literature to read and include in formal literature reviews and meta-analyses. Importantly, these approaches can also facilitate automated, large-scale data synthesis tasks and build structured databases from the information in the texts of primary journal articles, books, grey literature, and websites. The increasing availability of digital text, computational resources, and machine-learning based language models have led to a revolution in text analysis and natural language processing (NLP) in recent years. NLP has been widely adopted across the biomedical sciences but is rarely used in ecology and evolutionary biology. Applying computational tools from text mining and NLP will increase the efficiency of data synthesis, improve the reproducibility of literature reviews, formalize analyses of research biases and knowledge gaps, and promote data-driven discovery of patterns across ecology and evolutionary biology. Here we present recent use cases from ecology and evolution, and discuss future applications, limitations and ethical issues.

Item Type:Articles
Status:Published
Refereed:Yes
Glasgow Author(s) Enlighten ID:Farrell, Dr Maxwell
Creator Roles:
Farrell, M. J.Conceptualization, Visualization, Writing – original draft, Writing – review and editing
Authors: Farrell, M. J., Brierley, L., Willoughby, A., Yates, A., and Mideo, N.
College/School:College of Medical Veterinary and Life Sciences > School of Biodiversity, One Health & Veterinary Medicine
Journal Name:Proceedings of the Royal Society of London Series B: Biological Sciences
Publisher:The Royal Society
ISSN:0962-8452
ISSN (Online):1471-2954
Published Online:18 May 2022
Copyright Holders:Copyright © 2022 The Authors
First Published:First published in Proceedings of the Royal Society of London Series B: Biological Sciences 289(1975): 20212721
Publisher Policy:Reproduced under a Creative Commons License

University Staff: Request a correction | Enlighten Editors: Update this record