Investigations into a Global Digitisation Dataset

Lewis, S., Gooding, P. and Furlough, M. (2021) Investigations into a Global Digitisation Dataset. Digital Archive and Kanon, 10 Mar 2021.

[img] Text
290385.pdf - Presentation



During 2019 a network of libraries from the UK and US were funded by the UK’s Arts and Humanities Research Council under their Collaborations in Digital Scholarship programme to investigate the creation of a global dataset of all digitised texts via prototyping and community engagement. This work was led by the University of Glasgow, with HathiTrust, the National Library of Scotland, the National Library of Wales, the British Library, and Research Libraries UK. The main use cases for the dataset are: 1. Digital scholars seeking corpora of texts could search and compile links to items across many sources 2. Readers wishing to find a digitised text would be able to search efficiently across many sources 3. Libraries undertaking digitisation programmes would be able to avoid duplication in their own digitisation efforts The project investigated these use cases and experimental work to understand the challenges of aggregating data from libraries with differing cataloguing standards and approaches. A trial dataset of over 17 million records was created and published openly along with a final report . During the Covid-19 lockdown this concept was taken further with the development of a prototype search service ‘OpenTexts.World’ that contains over 8 million records. This presentation will explore the findings of the project and discuss possible next steps in making the vision of a global dataset of all digitised texts a reality.

Item Type:Conference or Workshop Item
Glasgow Author(s) Enlighten ID:Gooding, Professor Paul
Authors: Lewis, S., Gooding, P., and Furlough, M.
Subjects:Z Bibliography. Library Science. Information Resources > ZA Information resources
Z Bibliography. Library Science. Information Resources > ZA Information resources > ZA4050 Electronic information resources
College/School:College of Arts & Humanities > School of Humanities > Information Studies
Copyright Holders:Copyright © 2021 The Authors
Publisher Policy:Reproduced with the permission of the Author
Related URLs:

University Staff: Request a correction | Enlighten Editors: Update this record

Project CodeAward NoProject NamePrincipal InvestigatorFunder's NameFunder RefLead Dept
306265Central Register of Digitisation - creating a collaborationPaul GoodingArts and Humanities Research Council (AHRC)AH/S012397/1Arts - Information Studies