Crypt4GH: a file format standard enabling native access to encrypted data

Senf, A., Davies, R., Haziza, F., Marshall, J. , Troncoso-Pastoriza, J., Hofmann, O. and Keane, T. M. (2021) Crypt4GH: a file format standard enabling native access to encrypted data. Bioinformatics, 37(17), pp. 2753-2754. (doi: 10.1093/bioinformatics/btab087) (PMID:33543751) (PMCID:PMC8522443)

[img] Text
234245.pdf - Published Version
Available under License Creative Commons Attribution.

156kB

Abstract

Motivation: The majority of genome analysis tools and pipelines require data to be decrypted for access. This potentially leaves sensitive genetic data exposed, either because the unencrypted data is not removed after analysis, or because the data leaves traces on the permanent storage medium. Results: We defined a file container specification enabling direct byte-level compatible random access to encrypted genetic data stored in community standards such as SAM/BAM/CRAM/VCF/BCF. By standardizing this format, we show how it can be added as a native file format to genomic libraries, enabling direct analysis of encrypted data without the need to create a decrypted copy. Availability and implementation: The Crypt4GH specification can be found at: http://samtools.github.io/hts-specs/crypt4gh.pdf.

Item Type:Articles
Additional Information:This work was supported by the following grants: Wellcome (100956/Z/13/Z, 201535/Z/16/Z, 206194) [TMK, AS, RD], Strategic Focal Area “Personalized Health and Related Technologies (PHRT)” of the ETH Domain #2017-201 [JTP], European Joint Programme on Rare Diseases (EJP-RD) #825575 [FH], and NHMRC grant #1113531 and the Medical Research Future Fund [OH].
Status:Published
Refereed:Yes
Glasgow Author(s) Enlighten ID:Marshall, Mr John
Authors: Senf, A., Davies, R., Haziza, F., Marshall, J., Troncoso-Pastoriza, J., Hofmann, O., and Keane, T. M.
College/School:College of Medical Veterinary and Life Sciences > School of Cancer Sciences
Journal Name:Bioinformatics
Publisher:Oxford University Press
ISSN:1367-4803
ISSN (Online):1460-2059
Published Online:05 February 2021
Copyright Holders:Copyright © 2021 The Authors
First Published:First published in Bioinformatics 37(17): 2753-2754
Publisher Policy:Reproduced under a Creative Commons License

University Staff: Request a correction | Enlighten Editors: Update this record