Hunt, E., Pafilis, E., Tulloch, I. and Wilson, J. (2004) Index-driven XML data integration to support functional genomics. Lecture Notes in Computer Science, 2994, pp. 95-109. (doi: 10.1007/b96666)
|
Text
index_driven_XML.pdf 341kB |
Publisher's URL: http://dx.doi.org/10.1007/b96666
Abstract
We identify a new type of data integration problem that arises in functional genomics research in the context of large-scale experiments involving arrays, 2-dimensional protein gels and mass-spectrometry. We explore the current practice of data analysis that involves repeated web queries iterating over long lists of gene or protein names. We postulate a new approach to solve this problem, applicable to data sets stored in XML format. We propose to discover data redundancies using an XML index we construct and to remove them from the results returned by the query. We combine XML indexing with queries carried out on top of relational tables. We believe our approach could support semi-automated data integration such as that required in the interpretation of large-scale biological experiments.
Item Type: | Articles |
---|---|
Status: | Published |
Refereed: | Yes |
Glasgow Author(s) Enlighten ID: | UNSPECIFIED |
Authors: | Hunt, E., Pafilis, E., Tulloch, I., and Wilson, J. |
Subjects: | Q Science > QA Mathematics > QA75 Electronic computers. Computer science |
College/School: | College of Science and Engineering > School of Computing Science |
Journal Name: | Lecture Notes in Computer Science |
Publisher: | Springer |
ISSN: | 0302-9743 |
Copyright Holders: | Copyright © 2004 Springer |
First Published: | First published in Lecture Notes in Computer Science 2994:95-109 |
Publisher Policy: | Reproduced in accordance with the copyright policy of the publisher |
University Staff: Request a correction | Enlighten Editors: Update this record