Spaeth, D.A. (2004) Representing text as data: the analysis of historical sources in XML. Historical Methods: A Journal of Quantitative and Interdisciplinary History, 37(2), pp. 73-85. (doi: 10.3200/HMTS.37.2.73-86)
Full text not currently available from Enlighten.
Publisher's URL: http://dx.doi.org/10.3200/HMTS.37.2.73-86
Abstract
In conventional approaches to computer analysis of historical sources, one must represent the data in structured formats in which content and context are discarded. Extensible markup language (XML), developed for data transfer on the Internet, permits preservation of the full text of irregular historical sources without sacrificing the ability to conduct systematic analysis. Related querying tools offer most functions of a relational database management system, including data transformation facilities for coding, standardizing, and aggregating nominal data. An XML database permits multiple interpretations of the data because the unit of analysis and the coding schemes are not defined at entry. In a case study of probate inventories, the author demonstrates how structured analysis of domestic interiors can be performed and introduces approaches to studying semistructured data.
Item Type: | Articles |
---|---|
Status: | Published |
Refereed: | Yes |
Glasgow Author(s) Enlighten ID: | Spaeth, Dr Donald |
Authors: | Spaeth, D.A. |
Subjects: | D History General and Old World > D History (General) Q Science > QA Mathematics > QA76 Computer software |
College/School: | College of Arts & Humanities > School of Humanities > History |
Journal Name: | Historical Methods: A Journal of Quantitative and Interdisciplinary History |
ISSN: | 1940-1906 |
University Staff: Request a correction | Enlighten Editors: Update this record