Extracting scientific articles from a large digital archive: BioStor and the Biodiversity Heritage Library

Page, R.D.M. (2011) Extracting scientific articles from a large digital archive: BioStor and the Biodiversity Heritage Library. BMC Bioinformatics, 12, p. 187. (doi:10.1186/1471-2105-12-187)

Full text not currently available from Enlighten.

Abstract

Background: The Biodiversity Heritage Library (BHL) is a large digital archive of legacy biological literature, comprising over 31 million pages scanned from books, monographs, and journals. During the digitisation process basic metadata about the scanned items is recorded, but not article-level metadata. Given that the article is the standard unit of citation, this makes it difficult to locate cited literature in BHL. Adding the ability to easily find articles in BHL would greatly enhance the value of the archive.

Description: A service was developed to locate articles in BHL based on matching article metadata to BHL metadata using approximate string matching, regular expressions, and string alignment. This article locating service is exposed as a standard OpenURL resolver on the BioStor web site http://biostor.org/openurl/. This resolver can be used on the web, or called by bibliographic tools that support OpenURL.

Conclusions: BioStor provides tools for extracting, annotating, and visualising articles from the Biodiversity Heritage Library. BioStor is available from http://biostor.org/
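As a rough illustration of the Description above, the sketch below builds a query URL for the OpenURL resolver and scores two journal titles with approximate string matching. It is not BioStor's implementation. The query key names (genre, title, volume, spage, date) are common OpenURL-style keys and are assumed here rather than confirmed for this resolver. The helper names and the sample journal are hypothetical, and Python's difflib stands in for whatever matching method the service actually uses.

```python
from difflib import SequenceMatcher
from urllib.parse import urlencode

# Resolver endpoint as given in the abstract.
BIOSTOR_OPENURL = "http://biostor.org/openurl/"


def build_openurl(journal, volume, start_page, year):
    """Build a resolver query URL from article-level metadata.

    Assumption: the key names below follow common OpenURL usage;
    the exact keys BioStor accepts are not stated in this record.
    """
    params = {
        "genre": "article",
        "title": journal,
        "volume": volume,
        "spage": start_page,
        "date": year,
    }
    return BIOSTOR_OPENURL + "?" + urlencode(params)


def title_similarity(a, b):
    """Return a similarity score in [0, 1] for two titles.

    Case and repeated whitespace are normalised before comparison.
    difflib is a stand-in for the service's own approximate matching.
    """
    def normalise(s):
        return " ".join(s.lower().split())

    return SequenceMatcher(None, normalise(a), normalise(b)).ratio()


if __name__ == "__main__":
    # Hypothetical article metadata, for illustration only.
    print(build_openurl("Example Journal of Natural History", 12, 187, 1911))
    print(title_similarity("Example J. Nat. Hist.",
                           "Example Journal of Natural History"))
```

A bibliographic tool that supports OpenURL would build a URL of this kind and pass it to the resolver. A similarity score like the one above is one plausible way to tolerate abbreviated or inconsistently formatted journal titles when comparing article metadata with archive metadata.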

Item Type: Articles
Status: Published
Refereed: Yes
Glasgow Author(s) Enlighten ID: Page, Professor Roderic
Authors: Page, R.D.M.
College/School: College of Medical, Veterinary and Life Sciences > Institute of Biodiversity, Animal Health and Comparative Medicine
Journal Name: BMC Bioinformatics
ISSN: 1471-2105
ISSN (Online): 1471-2105
Published Online: 23 May 2011
