be4311c07e14feb728abc6425ee606ffaa611a58 markd Fri Jan 22 06:46:58 2021 -0800 merge with master diff --git src/hg/makeDb/trackDb/human/exomeProbesets.html src/hg/makeDb/trackDb/human/exomeProbesets.html new file mode 100755 index 0000000..3331019 --- /dev/null +++ src/hg/makeDb/trackDb/human/exomeProbesets.html @@ -0,0 +1,275 @@ +

Description

+

+This set of tracks shows the genomic positions of probes and targets from a full +suite of in-solution-capture target enrichment exome kits for Next Generation Sequencing (NGS) +applications. Also known as exome sequencing or whole exome sequencing (WES), +this technique allows high-throughput parallel sequencing of all exons (e.g. coding region of genes +which affect protein function), constituting about 1% of the human genome, or approximately 30 +million base pairs. +

+

+The tracks are intended to show the major differences in target genomic regions between the +different exome capture kits from the major players in the NGS sequencing market: +Illumina Inc., +Roche NimbleGen Inc., +Agilent Technologies Inc., +MGI Tech, +Twist Bioscience, and +Integrated DNA Technologies Inc.. +

+ +

Display Conventions and Configuration

+ +

+Items are shaded according to manufacturing company: +

+

+ +

+Tracks labeled as Probes (P) indicate the footprint of the oligonucleotide probes +mapped to the human genome. This is the technically relevant targeted region by the assay. However, +the sequenced region will be bigger than this since flanking sequences are sequenced as well. +Tracks labeled as Target Regions (T) indicate the genomic regions targeted by the +assay. This is the biologically relevant target region. It's not granted that all targeted regions +will be sequenced perfectly, it might be some capture bias on certain locations. The Target +Regions are those normally used for coverage analysis. +

+ +

Methods

+ +

+The capture of the genomic regions of interest using in-solution capture, is achieved +through the hybridization of a set of probes (oligonucleotides) with a sample of fragmented genomic +DNA in a solution environment. The probes hybridize selectively to the genomic regions of interest +which, after a process of exclusion of the non-selective DNA material, can be pulled down and +sequenced enabling selective DNA sequencing of the genomic regions (e.g. exons) of interest. +In-solution capture sequencing is a sensitive method to detect single nucleotide variants, +insertions and deletions, and copy number variations. +

+ + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +
KitTargeted RegionDatabases Used for DesignYear of Release
IDT - xGen Exome Research Panel V1.039 MbCoding sequences from RefSeq (19,396 genes)2015
IDT - xGen Exome Research Panel V2.034 MbCoding sequences from RefSeq 109 (19,433 genes)2020
Twist - RefSeq Exome Panel3.6 MbCurated subset of protein coding genes from CCDSN/A
Twist - Core Exome Panel33 MbProtein coding genes from CCDSN/A
Twist - Comprehensive Exome Panel36.8 MbProtein coding genes from RefSeq, CCDS, and GENCODE 2020
MGI - Easy Exome Capture V459 MbCCDS, GENCODE, RefSeq, and miRBaseN/A
MGI - Easy Exome Capture V569 MbCCDS, GENCODE, RefSeq, miRBase, and MGI Clinical DatabaseN/A
Agilent - SureSelect Clinical Research Exome54 MbDisease-associated regions from OMIM, HGMD, and ClinVar2014
Agilent - SureSelect Clinical Research Exome V263.7 MbDisease-associated regions from OMIM, HGMD, ClinVar, and ACMG2017
Agilent - SureSelect Focused Exome12 MbDisease-associated regions from HGMD, OMIM and ClinVar2016
Agilent - SureSelect All Exon V451 MbCoding regions from CCDS, RefSeq, and GENCODE v6, miRBase v17, TCGA v6, and UCSC known genes2011
Agilent - SureSelect All Exon V4 + UTRs71 MbCoding regions and 5' and 3' UTR sequences from CCDS, RefSeq, and GENCODE v6, regions from miRBase v17, TCGA v6, and UCSC known genes2011
Agilent - SureSelect All Exon V5 50 MbCoding regions from Refseq, GENCODE, UCSC, TCGA, CCDS, and miRBase (21.522 genes)2012
Agilent - SureSelect All Exon V5 + UTRs74 MbCoding regions and 5' and 3' UTR sequences from Refseq, GENCODE, UCSC, TCGA, CCDS, and miRBase (21.522 genes)2012
Agilent - SureSelect All Exon V6 r260 MbCoding regions from RefSeq, CCDS, GENCODE, HGMD, and OMIM2016
Agilent - SureSelect All Exon V6 + COSMIC r266 MbCoding regions from RefSeq, CCDS, GENCODE, HGMD, and OMIM, and targets from both TCGA and COSMIC2016
Agilent - SureSelect All Exon V6 + UTR r275 MbCoding regions and 5' and 3' UTR sequences from RefSeq, GENCODE, CCDS, and UCSC known genes,and miRNAs and lncRNA sequences2016
Agilent - SureSelect All Exon V735.7 MbCoding regions from RefSeq, CCDS, GENCODE, and UCSC known genes2018
Roche - KAPA HyperExome43Mb Coding regions from CCDS, RefSeq, Ensembl, GENCODE,and variants from ClinVar2020
Roche - SeqCap EZ Exome V364 MbCoding regions from RefSeq RefGene CDS, CCDS, and miRBase v14 databases, plus coverage of 97% Vega, 97% Gencode, and 99% Ensembl2018
Roche - SeqCap EZ Exome V3 + UTR92 MbCoding sequences from RefSeq RefGene, CCDS, and miRBase v14, plus coverage of 97% Vega, 97% Gencode, and 99% Ensembl and UTRs from RefSeq RefGene table from UCSC GRCh37/hg19 March 2012 and Ensembl (GRCh37 v64)2018
Roche - SeqCap EZ MedExome47 MbCoding sequences from CCDS 17, RefSeq, Ensembl 76, VEGA 56, GENCODE 20, miRBase 21, and disease-associated regions from GeneTests, ClinVar, and based on customer input2014
Roche - SeqCap EZ MedExome + Mito47 MbCoding sequences and mitochondrial genes from CCDS 17, RefSeq, Ensembl 76, VEGA 56, GENCODE 20 and miRBase 21, disease-associated regions from GeneTests, ClinVar, and based on customer input2014
Illumina - Nextera DNA Exome V1.245 MbCoding regions from RefSeq, CCDS, Ensembl, and GENCODE v192015
Illumina - Nextera Rapid Capture Exome37 Mb212,158 targeted exonic regions with start and stop chromosome locations in GRCh37/hg192013
Illumina - Nextera Rapid Capture Exome V1.237 MbCoding regions from RefSeq, CCDS, Ensembl, and GENCODE v122014
Illumina - Nextera Rapid Capture Expanded Exome66 MbCoding regions from RefSeq, CCDS, Ensembl, and GENCODE v122013
Illumina - TruSeq DNA Exome V1.245 MbCoding regions from RefSeq, CCDS, and Ensembl2017
Illumina - TruSeq Rapid Exome V1.245 MbCoding regions from RefSeq, CCDS, Ensembl, and GENECODE v192015
Illumina - TruSight ONE V1.112 MbCoding regions of 6700 genes from HGMD, OMIM, and GeneTest2017
Illumina - TruSight Exome7 MbDisease-causing mutations as curated by HGMD2017
Illumina - AmpliSeq Exome PanelN/ACCDS coding regions2019
+ +

Credits

+ +

+Thanks to Illumina (U.S), Roche NimbleGen, Inc. (U.S.), Agilent Technologies (U.S.), MGI Tech +(Beijing Genomics Institute, China), Twist Bioscience (U.S.), and Integrated DNA Technologies (IDT), +Inc. (U.S.). for making this data available and to Tiana Pereira, Pranav Muthuraman, Began Nguy and Anna Benet-Pages for enginering this tracks. +

+ + +