7f892f8fa5f41b18d489cfdbb0255c05b1b3e3f3 jnavarr5 Mon Oct 7 10:55:15 2019 -0700 Updating http to https for hg19, uiLinks cronjob. diff --git src/hg/makeDb/trackDb/human/hiSeqDepth.html src/hg/makeDb/trackDb/human/hiSeqDepth.html index 68ff08f..c9f1f2b 100644 --- src/hg/makeDb/trackDb/human/hiSeqDepth.html +++ src/hg/makeDb/trackDb/human/hiSeqDepth.html @@ -1,20 +1,20 @@

Description

This track displays regions of the reference genome that have exceptionally high sequence depth, inferred from alignments of short-read sequences from the -1000 Genomes Project. +1000 Genomes Project. These regions may be caused by collapsed repetitive sequences in the reference genome assembly; they also have high read depth in assays such as ChIP-seq, and may trigger false positive calls from peak-calling algorithms. Excluding these regions from analysis of short-read alignments should reduce such false positive calls.

Methods

Pickrell et al. downloaded sequencing reads for 57 Yoruba individuals from the 1000 Genomes Project's low-coverage pilot data, mapped them to the Mar. 2006 human genome assembly (NCBI36/hg18), computed the read depth for every base in the genome, and compiled a distribution of read depths. They then identified contiguous regions where read depth exceeded thresholds corresponding to the top 0.001, 0.005, 0.01, 0.05 and 0.1 of the per-base