Index-driven XML data integration to support functional genomics

Hunt, E., Pafilis, E., Tulloch, I. and Wilson, J. (2004) Index-driven XML data integration to support functional genomics. Lecture Notes in Computer Science, 2994, pp. 95-109. (doi: 10.1007/b96666)

[img]
Preview
Text
index_driven_XML.pdf

341kB

Publisher's URL: http://dx.doi.org/10.1007/b96666

Abstract

We identify a new type of data integration problem that arises in functional genomics research in the context of large-scale experiments involving arrays, 2-dimensional protein gels and mass-spectrometry. We explore the current practice of data analysis that involves repeated web queries iterating over long lists of gene or protein names. We postulate a new approach to solve this problem, applicable to data sets stored in XML format. We propose to discover data redundancies using an XML index we construct and to remove them from the results returned by the query. We combine XML indexing with queries carried out on top of relational tables. We believe our approach could support semi-automated data integration such as that required in the interpretation of large-scale biological experiments.

Item Type:Articles
Status:Published
Refereed:Yes
Glasgow Author(s) Enlighten ID:UNSPECIFIED
Authors: Hunt, E., Pafilis, E., Tulloch, I., and Wilson, J.
Subjects:Q Science > QA Mathematics > QA75 Electronic computers. Computer science
College/School:College of Science and Engineering > School of Computing Science
Journal Name:Lecture Notes in Computer Science
Publisher:Springer
ISSN:0302-9743
Copyright Holders:Copyright © 2004 Springer
First Published:First published in Lecture Notes in Computer Science 2994:95-109
Publisher Policy:Reproduced in accordance with the copyright policy of the publisher

University Staff: Request a correction | Enlighten Editors: Update this record