Deep enzymology data related to Adam et al.: Flanking sequences influence the activity of TET1 and TET2 methylcytosine dioxygenases and affect genomic 5hmC patterns

Dataset

DOI

Experimental procedures for deep enzymology reactions with randomized substrates: For analysis of flanking sequence preferences of the TET enzymes, a similar approach as described for DNMTs (Emperle et al., 2019; Gao et al., 2020; Adam et al., 2020; Dukatz et al., 2020) was used. Briefly, the following single-stranded oligonucleotides containing a methylated or hydroxymethylated CpG or CpH site flanked by 10 randomized nucleotides on either side were obtained from IDT and primer extension was performed to obtain the double stranded DNA substrates. A CpN substrate was prepared as a mixture of CpG and CpH in a 1:3 ratio. For the randomized hydroxymethylated substrate, the single-stranded oligo was purchased coupled to Desthiobiotin-TEG. Primer extension was conducted and the substrate was purified via Streptavidin beads (Dynabeads M-280, ThermoFisher Scientific) and eluted with a biotin solution.

HM rand. GAGTGTGACTAGGCTCTCACTGCCNNNNNNNNNN mC GNNNNNNNNNNGAGAGGAGACCTAGTGAGAAG OH rand. GAGTGTGACTAGGCTCTCACTGCCNNNNNNNNNN hmC GNNNNNNNNNNGAGAGGAGACCTAGTGAGAAG CH rand. GAGTGTGACTAGGCTCTCACTGCCNNNNNNNNNN mC HNNNNNNNNNNGAGAGGAGACCTAGTGAGAAG

The randomized double stranded substrates were incubated with the TET enzyme at 37 °C for 45 min (CN context) or 1 h (CG context) using mixtures containing 1x reaction buffer (50 mM HEPES pH 6.8, 100 mM NaCl, 1 mM DTT, 1 mM alpha-ketoglutarate and 2 mM ascorbic acid), 100 µM ammonium iron(II) sulfate, using different enzyme concentrations and variable amounts of dialysis buffer to keep a fixed salt and glycerol concentration. Reactions were stopped by freezing in liquid nitrogen. Afterwards, Proteinase K (NEB) treatment was used for enzyme inactivation for 1 h at 50 °C, followed by purification with a PCR clean-up kit (MACHEREY-NAGEL). Hairpin ligation and bisulfite conversion was performed using EZ DNA Methylation-Lightning kit (ZYMO).

Library preparation for Illumina Next Generation Sequencing was conducted using a two-step PCR approach as described in (Gao et al., 2020). Unique combinations of barcode and index sequences were introduced to distinguish different samples and experiments. For bioinformatic analysis of the NGS datasets, a local instance of a Galaxy server (Afgan et al., 2018) was used. Sequence reads were trimmed with Trim Galore! (Galaxy Version 0.4.3.1, https://www.bioinformatics.babraham.ac.uk/projects/trim_galore/) keeping only the sequences with a quality score above 20 for further analysis, and filtered according to the expected DNA size using the Filter FASTQ tool (Blankenberg et al., 2010).

The data in this entry contain the Fastq sequence files and extracted DNA sequences obtained with the hemimethylated CpG substrate (HM CG), hemimethylated CpN substrate mixture (HM CN) and hemihydroxymethylated CpG substrate (OH CG). Enzyme kinetics were conducted with TET1 and two versions of TET2 (V1 and V2) as described in the accompanying paper. Individual repeats of experiments are indicated with R1-R5 as appropriate. Control reaction refer to samples treated identically but without enzyme.

The cited references are listed in the accompanying publication to this dataset.

PMID: 35075236

Identifier
DOI	https://doi.org/10.18419/darus-2114
Related Identifier	IsCitedBy https://doi.org/10.1038/s42003-022-03033-4
Metadata Access	https://darus.uni-stuttgart.de/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=doi:10.18419/darus-2114

Provenance
Creator	Jeltsch, Albert ; Bashtrykov, Pavel ; Adam, Sabrina
Publisher	DaRUS
Contributor	Jeltsch, Albert
Publication Year	2021
Funding Reference	DFG JE 252/36 - 403074082 ; DFG RA 1840/2 ; DFG EXC 2075 - 390740016
Rights	CC BY 4.0; info:eu-repo/semantics/openAccess; http://creativecommons.org/licenses/by/4.0
OpenAccess	true
Contact	Jeltsch, Albert (Universität Stuttgart)

Representation
Resource Type	Raw DNA sequences extracted from Fastq NGS files; Bisulfite-seq of 5mC and 5hmC oxidation analysis; Dataset
Format	application/octet-stream; text/plain; application/pdf
Size	44685538; 19577740; 11873736; 4677690; 67449441; 26484975; 228036; 228866; 228051; 86736212; 8658424; 37925462; 9785708; 9774576; 3891228; 4442952; 1877964; 6421435; 3347752; 108377381; 13773052; 96350382; 14610672; 107232650; 12304520; 106041864; 13837656; 106353075; 13958804; 13949460; 5550498; 3860953; 1953117; 3899001; 2041554; 13282284; 6652600; 14976809; 7570076; 48006826; 12669080; 28998342; 7228456; 17172522; 14014480; 16706934; 13173884; 18966315; 8968548; 18595203; 8636228; 12498884; 5915668; 29727672; 6396416; 47104903; 10153740
Version	2.1
Discipline	Basic Biological and Medical Research; Biochemistry; Biology; Life Sciences; Medicine