src/hg/makeDb/trackDb/human/caddSuper.html 64868846b5a38ce13f3c5bce12bc97b5d85d5424

64868846b5a38ce13f3c5bce12bc97b5d85d5424
kuhn
  Fri May 21 11:26:37 2021 -0700
made track description for revel and some changes to description for the cadd tracks

diff --git src/hg/makeDb/trackDb/human/caddSuper.html src/hg/makeDb/trackDb/human/caddSuper.html
index a3be62a..19eee75 100644
--- src/hg/makeDb/trackDb/human/caddSuper.html
+++ src/hg/makeDb/trackDb/human/caddSuper.html
@@ -16,84 +16,86 @@
 </p>
 
 <p>
 CADD scores strongly correlate with allelic diversity, pathogenicity of both
 coding and non-coding variants, experimentally measured regulatory effects,
 and also rank causal variants within individual genome sequences with a higher
 value than non-causal variants. 
 Finally, CADD scores of complex trait-associated variants from genome-wide
 association studies (GWAS) are significantly higher than matched controls and
 correlate with study sample size, likely reflecting the increased accuracy of
 larger GWAS.
 </p>
 
 <h2>Display Conventions and Configuration</h2>
 <p>
-There are six subtracks of this track: four for every possible single nucleotide mutation, 
+There are six subtracks of this track: four for single-nucleotide mutations,
+one for each base, showing all possible substitutions, 
 one for insertions and one for deletions. All subtracks show the CADD Phred
 score on mouseover.</p>
 <p>
 PHRED-scaled scores are normalized to all potential &#126;9 billion SNVs, and
 thereby provide an externally comparable unit for analysis. For example, a
 scaled score of 10 or greater indicates a raw score in the top 10% of all
 possible reference genome SNVs, and a score of 20 or greater indicates a raw
 score in the top 1%, regardless of the details of the annotation set, model
 parameters, etc.
 </p>
 
 <p>
 <b>Single nucleotide variants (SNV):</b> For SNVs, at every
 genome position, there are three values per position, one for every possible
-nucleotide mutation. The fourth value, &quot;no mutation&quot;, e.g. A to A, is always
-set to zero.<br>
-When using this track, please zoom in until you can see every basepair at the
-top of the display. Otherwise, there are several nucleotides under your mouse
-cursor per pixel and instead of an actual score, the tooltip text can only show
-the average score of all nucleotides under the cursor, which is indicated by
-the prefix &quot;~&quot; in the mouseover and averages of scores are not useful for any
+nucleotide mutation. The fourth value, &quot;no mutation&quot;, representing 
+the reerence allele, e.g. A to A, is always set to zero.
+<br>
+When using this track, zoom in until you can see every basepair at the
+top of the display. Otherwise, there are several nucleotides per pixel under 
+your mouse cursor and instead of an actual score, the tooltip text will show
+the average score of all nucleotides under the cursor. This is indicated by
+the prefix &quot;~&quot; in the mouseover. Averages of scores are not useful for any
 application of CADD.
 </p>
 
 <p><b>Insertions and deletions:</b> Scores are also shown on mouseover for a
 set of insertions and deletions. On hg38, the set has been obtained from
 Gnomad3. On hg19, the set of indels has been obtained from various sources
 (gnomAD2, ExAC, 1000 Genomes, ESP). If your insertion or deleletion of interest
 is not in the track, you will need to use CADD's
 <a target="_blank" href="https://cadd.gs.washington.edu/score">online scoring tool</a>
 to obtain them.</p>
 
 <h2>Data access</h2>
 <p>
 CADD scores are freely available for all non-commercial applications from
 <a target="_blank" href="https://cadd.gs.washington.edu/download">the CADD website</a>.
 For commercial applications, see
 <a target="_blank" href="https://cadd.gs.washington.edu/contact">the license instructions</a> there.
 </p>
 
 <p>
 The CADD data on the UCSC Genome Browser can be explored interactively with the
 <a href="../cgi-bin/hgTables">Table Browser</a> or the
 <a href="../cgi-bin/hgIntegrator">Data Integrator</a>.
 For automated download and analysis, the genome annotation is stored at UCSC in bigWig and bigBed
 files that can be downloaded from
 <a href="http://hgdownload.soe.ucsc.edu/gbdb/$db/cadd/" target="_blank">our download server</a>.
 The files for this track are called <tt>a.bw, c.bw, g.bw, t.bw, ins.bb and del.bb</tt>. Individual
-regions or the whole genome annotation can be obtained using our tool <tt>bigWigToWig</tt>
+regions or the whole genome annotation can be obtained using our tools <tt>bigWigToWig</tt>
 or <tt>bigBedToBed</tt> which can be compiled from the source code or downloaded as a precompiled
 binary for your system. Instructions for downloading source code and binaries can be found
 <a href="http://hgdownload.soe.ucsc.edu/downloads.html#utilities_downloads">here</a>.
-The tools can also be used to obtain only features within a given range, e.g.
+The tools can also be used to obtain features confined to a given range, e.g.
 <br>
 <tt>bigWigToBedGraph -chrom=chr1 -start=100000 -end=100500 http://hgdownload.soe.ucsc.edu/gbdb/$db/cadd/a.bw stdout</tt>
 <br>
 or
 <br>
 <tt>bigBedToBed -chrom=chr1 -start=100000 -end=100500 http://hgdownload.soe.ucsc.edu/gbdb/$db/cadd/ins.bb stdout</tt></p>
 
 <h2>Methods</h2>
 
 <p>
 Data were converted from the files provided on
 <a href="https://cadd.gs.washington.edu/download" target="_blank">the CADD Downloads website</a>,
 provided by the Kircher lab, using
 <a href="https://github.com/ucscGenomeBrowser/kent/tree/master/src/hg/makeDb/cadd" target="_blank">
 custom Python scripts</a>,