Towards a grapho-phonologically parsed corpus of medieval Scots: Database design and technical solutions

Kopaczyk, J., Molineaux Ress, B., Karaiskos, V., Alcorn, R., Los, B. and Maguire, W. (2018) Towards a grapho-phonologically parsed corpus of medieval Scots: Database design and technical solutions. Corpora, 13(2), pp. 255-269. (doi:10.3366/cor.2018.0146)

146431.pdf - Accepted Version


Abstract

This paper presents a newly constructed corpus of sound-to-spelling mappings in medieval Scots, which stems from the work of the From Inglis to Scots (FITS) project. We have developed a systematic approach to the relationships between individual spellings and proposed sound values, and recorded these mutual links in a relational database. In this paper, we introduce the theoretical underpinnings of sound-to-spelling and spelling-to-sound mappings, and show how a Scots root morpheme undergoes grapho-phonological parsing, the analytical procedure that is employed to break down spelling sequences into sound units. We explain the data collection and annotation for the FITS Corpus (Alcorn et al., forthcoming), drawing attention to the extensive meta-data which accompany each analysed unit of spelling and sound. The database records grammatical and lexical information about the root, the positional arrangement of segments within the root, labels for the nuclei, vowels and consonants, the morphological context, and extra-linguistic detail of the text a given root was taken from (date, place and text type). With this wealth of information, the FITS corpus is capable of answering complex queries about the sound and spelling systems of medieval Scots. We also suggest how our methodology can be transferred to other non-standardised spelling systems.
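As a rough illustration of how such mappings might sit in a relational database, the sketch below builds a tiny in-memory SQLite database. The table names, column names and the helper functions `build_toy_db` and `spellings_for_sound` are all hypothetical, and the rows are invented placeholders (`S1`, `PLACE_X` and so on), not FITS data. The abstract does not describe the actual FITS schema, so this is only one possible layout under those assumptions.

```python
import sqlite3


def build_toy_db():
    """Create an in-memory database with a hypothetical three-table layout."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        -- Extra-linguistic detail of the source text (date, place, text type).
        CREATE TABLE text_source (
            id INTEGER PRIMARY KEY, year INTEGER, place TEXT, text_type TEXT
        );
        -- Lexical and grammatical information about each root.
        CREATE TABLE root (
            id INTEGER PRIMARY KEY,
            text_id INTEGER REFERENCES text_source(id),
            form TEXT, gloss TEXT, word_class TEXT
        );
        -- One row per spelling unit linked to a proposed sound value.
        CREATE TABLE mapping (
            id INTEGER PRIMARY KEY,
            root_id INTEGER REFERENCES root(id),
            position INTEGER, spelling TEXT, sound TEXT, segment_type TEXT
        );
        """
    )
    # Invented placeholder rows, for illustration only.
    conn.executemany(
        "INSERT INTO text_source VALUES (?, ?, ?, ?)",
        [(1, 1400, "PLACE_X", "TYPE_X"), (2, 1500, "PLACE_Y", "TYPE_Y")],
    )
    conn.executemany(
        "INSERT INTO root VALUES (?, ?, ?, ?, ?)",
        [(1, 1, "abc", "GLOSS_1", "noun"), (2, 2, "aab", "GLOSS_2", "noun")],
    )
    conn.executemany(
        "INSERT INTO mapping (root_id, position, spelling, sound, segment_type) "
        "VALUES (?, ?, ?, ?, ?)",
        [
            (1, 1, "a", "S1", "vowel"),
            (1, 2, "bc", "S2", "consonant"),
            (2, 1, "aa", "S1", "vowel"),
            (2, 2, "b", "S2", "consonant"),
        ],
    )
    return conn


def spellings_for_sound(conn, sound, latest_year):
    """Count the spellings recorded for a sound in texts dated up to latest_year."""
    return conn.execute(
        """
        SELECT m.spelling, COUNT(*)
        FROM mapping AS m
        JOIN root AS r ON r.id = m.root_id
        JOIN text_source AS t ON t.id = r.text_id
        WHERE m.sound = ? AND t.year <= ?
        GROUP BY m.spelling
        ORDER BY m.spelling
        """,
        (sound, latest_year),
    ).fetchall()
```

Calling `spellings_for_sound(build_toy_db(), "S1", 1450)` returns only the spellings attested for the placeholder sound `S1` in texts dated 1450 or earlier. Keeping text metadata, roots and mappings in separate tables lets a single join combine linguistic and extra-linguistic conditions, which is the kind of complex query the abstract has in mind.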

Item Type:Articles
Status:Published
Refereed:Yes
Glasgow Author(s) Enlighten ID:Kopaczyk, Dr Joanna
Authors: Kopaczyk, J., Molineaux Ress, B., Karaiskos, V., Alcorn, R., Los, B., and Maguire, W.
College/School:College of Arts > School of Critical Studies > English Language and Linguistics
Journal Name:Corpora
Publisher:Edinburgh University Press
ISSN:1749-5032
ISSN (Online):1755-1676
Copyright Holders:Copyright © Edinburgh University Press
First Published:First published in Corpora 13(2):255-269
Publisher Policy:Reproduced in accordance with the copyright policy of the publisher
