Population genomic SNPs from epigenetic RADs: gaining genetic and epigenetic data from a single established next-generation sequencing approach

Crotti, M., Adams, C. E. and Elmer, K. (2020) Population genomic SNPs from epigenetic RADs: gaining genetic and epigenetic data from a single established next-generation sequencing approach. Methods in Ecology and Evolution, 11(7), pp. 839-849. (doi: 10.1111/2041-210X.13395)

[img]
Preview
Text
213237.pdf - Published Version
Available under License Creative Commons Attribution.

1MB

Abstract

Epigenetics is increasingly recognised as an important molecular mechanism underlying phenotypic variation. To study DNA methylation in ecological and evolutionary contexts, epiRADseq is a cost‐effective next‐generation sequencing technique based on reduced representation sequencing of genomic regions surrounding non‐/methylated sites. EpiRADseq for genome‐wide methylation abundance and ddRADseq for genome‐wide SNP genotyping follow very similar library and sequencing protocols, but to date these two types of dataset have been handled separately. Here we test the performance of using epiRADseq data to generate SNPs for population genomic analyses. We tested the robustness of using epiRADseq data for population genomics with two independent datasets: a newly generated single‐end dataset for the European whitefish Coregonus lavaretus, and a re‐analysis of publicly available, previously published paired‐end data on corals. Using standard bioinformatic pipelines with a reference genome and without (i.e. de novo catalogue loci), we compared the number of SNPs retained, population genetic summary statistics, and population genetic structure between data drawn from ddRADseq and epiRADseq library preparations. We find that SNPs drawn from epiRADseq are similar in number to those drawn from ddRADseq, with 55‐83% of SNPs being identified by both methods. Genotyping error rate was <5% in both approaches. EpiRADseq‐specific allele dropout was low (~1%). For summary statistics such as heterozygosity and nucleotide diversity, there is a strong correlation between methods (Spearman’s rho > 0.88). Furthermore, identical patterns of population genetic structure were recovered using SNPs from epiRADseq and ddRADseq approaches. We show that SNPs obtained from epiRADseq are highly similar to those from ddRADseq and are equivalent for estimating genetic diversity and population structure. This finding is particularly relevant to researchers interested in genetics and epigenetics on the same individuals because using a single epigenomic approach to generate two datasets greatly reduces the time and financial costs compared to using these techniques separately. It also efficiently enables correction of epigenetic estimates with population genetic data. Many studies will benefit from a combinatorial approach with genetic and epigenetic markers and this demonstrates a single, efficient method to do so.

Item Type:Articles
Status:Published
Refereed:Yes
Glasgow Author(s) Enlighten ID:Crotti, Mr Marco and Elmer, Professor Kathryn and Adams, Professor Colin
Authors: Crotti, M., Adams, C. E., and Elmer, K.
College/School:College of Medical Veterinary and Life Sciences > School of Biodiversity, One Health & Veterinary Medicine
Journal Name:Methods in Ecology and Evolution
Publisher:Wiley
ISSN:2041-210X
ISSN (Online):2041-210X
Published Online:09 April 2020
Copyright Holders:Copyright © 2020 The Authors
First Published:First published in Methods in Ecology and Evolution 11(7):839-849
Publisher Policy:Reproduced under a Creative Commons License

University Staff: Request a correction | Enlighten Editors: Update this record