src/hg/makeDb/trackDb/human/geneHancer.html 60e2f1cf879afe3838a24e9bc0448b8c6ebf842e

60e2f1cf879afe3838a24e9bc0448b8c6ebf842e
mspeir
  Fri Sep 24 14:30:08 2021 -0700
Fixing formatting for quote char encoding, no redmine

diff --git src/hg/makeDb/trackDb/human/geneHancer.html src/hg/makeDb/trackDb/human/geneHancer.html
index ad30598..b4db480 100644
--- src/hg/makeDb/trackDb/human/geneHancer.html
+++ src/hg/makeDb/trackDb/human/geneHancer.html
@@ -1,163 +1,163 @@
 <h2>Description</h2>
 <p>
 GeneHancer is a database of human regulatory elements (enhancers and promoters) 
 and their inferred target genes, which is embedded 
 in <a href="https://www.genecards.org/" target="_blank">GeneCards</a>, a human gene 
 compendium.
 The GeneHancer database was created by integrating &gt;1 million regulatory elements 
 from multiple genome-wide databases. 
 Associations between the regulatory elements and target genes
 were based on multiple sources of linking molecular data, along with distance,
 as described in Methods below.
 </p>
 <p>
 The GeneHancer track set contains tracks representing:
 <ul>
 <li>Regulatory elements (GeneHancers)</li>
 <li>Gene transcription start sites</li>
 <li>Interactions (associations) between regulatory elements and genes</li>
 <li>Clustered interactions, by gene target or GeneHancer</li>
 </ul>
 The full set of elements and interactions is included, along with a highly filtered 
-&quot;double elite&quot' subset.
+&quot;double elite&quot; subset.
 
 <h2>Display Conventions</h2>
 <p>
 Each GeneHancer regulatory element is identified by a GeneHancer id. 
 For example: GH0XJ101383 is located on chromosome X, with starting position of 101,383 kb
 (GRCh38/hg38 reference).
 Based on the id, one can obtain full GeneHancer information, as displayed in the Genomics 
 section within the gene-centric web pages of GeneCards. Links to the GeneCards information pages
 are provided on the track details pages.</p>
 
 <h3>Regulatory elements</h3>
 <p>
 Colors are used to distinguish promoters and enhancers and to indicate the GeneHancer element confidence score:</p>
 <p>
 Promoters:&nbsp;
 <span style="border: 2px inset #000000; background-color:#cc0000;">&emsp;&emsp;</span>&nbsp;High
 <span style="border: 2px inset #000000; background-color:#ff3333;">&emsp;&emsp;</span>&nbsp;Medium
 <span style="border: 2px inset #000000; background-color:#ff6666;">&emsp;&emsp;</span>&nbsp;Low
 </p>
 <p>
 Enhancers:&nbsp;
 <span style="border: 2px inset #000000; background-color:#5F5F5F;">&emsp;&emsp;</span>&nbsp;High
 <span style="border: 2px inset #000000; background-color:#A0A0A0;">&emsp;&emsp;</span>&nbsp;Medium
 <span style="border: 2px inset #000000; background-color:#C0C0C0;">&emsp;&emsp;</span>&nbsp;Low
 </p>
 
 <h3>Gene TSS</h3>
 <p>
 Colors are used to improve gene and interactions visibility. 
 Successive genes are colored in different colors, and interactions of a gene have the same color.</p>
 
 <h3>Interactions</h3>
 <p>
 The Interactions view in Full mode shows GeneHancers and target genes connected by curves or 
 half-rectangles (when one of the connected regions is off-screen).  
 Configuration options are available to change the drawing style, and to limit the view to
 interactions with one or both connected items in the region.
 Interactions are identified on mouseover or clicked on for details at the end regions, or at 
 the curve peak, which is marked with a gray ring shape. Interactions in the reverse direction
 (Gene TSS precedes GeneHancer on the genome) are drawn with a dashed line.
 
 <h3>Clusters</h3>
 <p>
 The Clusters view groups interactions by target gene; the target gene and all GeneHancers 
 associated with it are displayed in a single browser item. The gene TSS and associated GeneHancers 
 are shown as blocks linked together, with the TSS drawn as a &quot;tall&quot; item, and the 
 GeneHancers drawn &quot;short&quot;. 
 A user configuration option is provided to change the view to group by GeneHancer 
 (with tall GeneHancer and short TSS's).  
 Clusters composed of interactions with a single gene are colored to correspond to the gene, 
 and those composed of interactions with multiple genes are colored dark gray.</p>
 
 <h2>Methods</h2> 
 <p>
 GeneHancer identifications were created from &gt;1 million regulatory elements 
 obtained from seven genome-wide databases:
 <ol>
 <li><a href="https://www.encodeproject.org/" target="_blank">ENCODE project</a>
 <a href="http://zlab-annotations.umassmed.edu/enhancers/" target="_blank">Z-Lab Enhancer-like regions</a> (version v3)</li>
 <li><a href="http://ensembl.org/info/genome/funcgen/index.html" target="_blank">Ensembl regulatory build</a> (version 92)</li>
 <li><a href="http://fantom.gsc.riken.jp/" target="_blank">FANTOM5</a><a href="http://pressto.binf.ku.dk/"
 
                                                                                  target="_blank"> atlas of active enhancers</a></li>
 <li><a href="https://enhancer.lbl.gov/" target="_blank">VISTA Enhancer Browser</a></li>
 <li><a href="http://asntech.org/dbsuper/" target="_blank">dbSUPER super-enhancers</a></li>
 <li><a href="https://epd.vital-it.ch/EPDnew_database.php" target="_blank">EPDnew promoters</a></li>
 <li><a href="https://ccg.vital-it.ch/UCNEbase/" target="_blank">UCNEbase ultra-conserved noncoding elements</a></li>
 </ol>
 <p>
 Employing an integration algorithm that removes redundancy, the GeneHancer pipeline
 identified &tilde;250k integrated candidate regulatory elements (GeneHancers).
 Each GeneHancer is assigned an annotation-derived confidence score. 
 The GeneHancers that are derived from more than one information source are defined 
 as &quot;elite&quot; GeneHancers.</p>
 <p>
 Gene-GeneHancer associations, and their likelihood-based scores, were generated 
 using information that helps link regulatory elements to genes:
 <ol>
 <li>eQTLs (expression quantitative trait loci) from <a href="https://www.gtexportal.org/home/" 
 target="_blank">GTEx</a> (version v6p)</li>
 <li>Capture Hi-C promoter-enhancer long range interactions</li>
 <li>FANTOM5 eRNA-gene expression correlations</li>
 <li>Cross-tissue expression correlations between a transcription factor interacting 
 with a GeneHancer and a candidate target gene</li>
 <li>Distance-based associations, including several approaches: 
   <ol>
   <li>Nearest neighbors, where each GeneHancer is associated with its two proximal genes</li>
   <li>Overlaps with the gene territory (intragenic)</li>
   <li>Proximity to the gene TSS (&lt;2kb)</li>
   </ol>
 </ol></p>
 <p>
 Associations that are derived from more than one information source are defined 
 as &quot;elite&quot; associations, which leads to the definition of the &quot;double elite&quot;
 dataset - elite gene associations of elite GeneHancers.</p>
 <p>
 More details are provided at the GeneCards
 <a href="https://www.genecards.org/Guide/GeneCard#enhancers" target="_blank">
 information page</a>.
 For a full description of the methods used, refer to the GeneHancer manuscript<sup>1</sup>.</p>
 <p>
 Source data for the GeneHancer version 4.8 was downloaded during May 2018.</p>
 
 <H2>Data Access</H2>
 <p>
 Due to our agreement with the Weizmann Institute, we cannot allow full genome 
 queries from the Table Browser or share download files. You can still access 
 data for individual chromosomes or positional data from the 
 <a href="../cgi-bin/hgTables">Table Browser</a>.</p>
 
 <p>
 GeneHancer is the property of the Weizmann Institute of Science and 
 is not available for download or mirroring by any third party 
 without permission. Please contact the <a target="_blank"
 href="https://www.weizmann.ac.il/pages/contact">Weizmann Institute</a> directly for 
 data inquiries.</p>
 
 <h2>Credits</h2>
 <p>
 Thanks to Simon Fishilevich, Marilyn Safran, Naomi Rosen, and Tsippi Iny Stein of the GeneCards 
 group and Shifra Ben-Dor of the Bioinformatics Core group at the Weizmann Institute, 
 for providing this data and documentation, creating track hub versions of these tracks 
 as prototypes, and overall responsiveness during development of these tracks.</p>
 <p>
 Contact: <A HREF="mailto:&#115;&#105;&#109;&#111;&#110;.
 &#102;&#105;&#115;&#104;&#105;&#108;ev&#105;&#99;h&#64;&#119;&#101;&#105;&#122;m&#97;&#110;&#110;.
 a&#99;.
 &#105;l">
 &#115;&#105;&#109;&#111;&#110;.
 &#102;&#105;&#115;&#104;&#105;&#108;ev&#105;&#99;h&#64;&#119;&#101;&#105;&#122;m&#97;&#110;&#110;.
 a&#99;.
 &#105;l</A>
 <!-- above address is simon.fishilevich at weizmann.ac.il -->
 <p>
 Supported in part by a grant from LifeMap Sciences Inc.</p>
 
 <h2>References</h2>
 <p>
 Fishilevich S., Nudel R., Rappaport N., Hadar R., Plaschkes I., Iny Stein T., Rosen N., Kohn A., Twik M., Safran M., Lancet D. and Cohen D. <i>GeneHancer: genome-wide integration of enhancers and target genes in GeneCards</i>, Database (Oxford) (2017), doi:10.1093/database/bax028. <a href="https://www.genecards.org/papers/Database-2017-Fishilevich-GH.pdf" target="_blank">[PDF]</a> PMID <a href="https://www.ncbi.nlm.nih.gov/pubmed?cmd=search&term=28605766&dopt=b" target="_blank">28605766</a></p>
 <p>
 Stelzer G, Rosen R, Plaschkes I, Zimmerman S, Twik M, Fishilevich S, Iny Stein T, Nudel R, Lieder I, Mazor Y, Kaplan S, Dahary, D, Warshawsky D, Guan- Golan Y, Kohn A, Rappaport N, Safran M, and Lancet D. <i>The GeneCards Suite: From Gene Data Mining to Disease Genome Sequence Analysis</i>, Current Protocols in Bioinformatics (2016), 54:1.30.1-1.30.33. doi: 10.1002/cpbi.5. PMID <a href="https://www.ncbi.nlm.nih.gov/pubmed?cmd=search&term=27322403&dopt=b" target="_blank">27322403</a></p>