src/hg/makeDb/trackDb/wgEncodeNhgriBip.html 1.2

1.2 2009/06/24 19:55:10 tdreszer
Touch ups getting ready for the pushQ
Index: src/hg/makeDb/trackDb/wgEncodeNhgriBip.html
===================================================================
RCS file: /projects/compbio/cvsroot/kent/src/hg/makeDb/trackDb/wgEncodeNhgriBip.html,v
retrieving revision 1.1
retrieving revision 1.2
diff -b -B -U 1000000 -r1.1 -r1.2
--- src/hg/makeDb/trackDb/wgEncodeNhgriBip.html	1 Jun 2009 21:50:30 -0000	1.1
+++ src/hg/makeDb/trackDb/wgEncodeNhgriBip.html	24 Jun 2009 19:55:10 -0000	1.2
@@ -1,92 +1,102 @@
 <h2> Description </h2>
 <p>Bidirectional promoters are the regulatory regions that fall between
 pairs of genes, where the 5' ends of the genes within a pair are positioned
 in close proximity to one another. This spacing facilitates the initiation
 of transcription of both genes, creating two transcription forks that advance
 in opposite directions. The formal definition of a bidirectional promoter
 requires that the transcription initiation sites are separated by no more than
 1,000 bp from one another. Using these criteria we have comprehensively
 annotated the human and mouse genomes for the presence of bidirectional
 promoters, using in silico approaches  . The identification of these promoters
 is contingent upon the presence of adjacent, oppositely oriented pairs of
 genes, because few distinguishing features are available to uniquely identify
 bidirectional promoters de novo. Genomic annotations used for our
 identification phase include (1) curated protein-coding gene annotations,
 (2) spliced ESTs and (3) 5' "end-capped" transcript data, e.g., Cap-Analysis
 of Gene Expression Database (i.e., CAGE). The annotations for protein coding
 genes are strongly supported and therefore provide a high quality dataset for
 mapping bidirectional promoters. In contrast, bidirectional promoters supported
 by RNA evidence alone (as in (2)) have varying levels of evidence, ranging
 from one characterized transcript to hundreds of them. For this reason, dataset
 (3) - the CAGE data - provide a stringent level of validation for the start
 sites of the EST transcripts. As a large class of regulatory sequences,
 bidirectional promoters exemplify a rich source of unexplored biological
 information in the human genome. When compared to the mouse genome, these
 promoters are identifiable as truly orthologous locations, being maintained
 in regions of conserved synteny (including both genes and the intervening
 promoter region) that have undergone no rearrangements since the last common
 ancestor of mammals, and in some cases fish. We use this approach to annotate
 orthologous bidirectional promoters in nonhuman species as genomic annotations
 become available.
 </p>
 
 <h2> Methods </h2>
 
 <h3> Assigning Orthologous Regions </h3>
 <p>A multi-stage approach to mapping orthology at bidirectional promoters was
 developed. Orthology assignments are strongest in coding regions. Therefore we
 began by mapping single human genes regulated by bidirectional promoters from
 the Known Genes annotations onto the mouse genome. Orthology assignments were
 determined using the "chains and nets" data from the UCSC Human Genome Browser
 mysql tables. Chains in the Genome Browser represent sequences of gapless
 aligned blocks. Nets provide a hierarchical ordering of those chains. Level 1
 chains contain the longest, best-scoring sequence chains that span any
 selected region. Subsequent levels in the net represent the results of
 rearrangements, duplications, insertions and deletions that may have disrupted
 the presence of conserved synteny derived from an ancestral sequence.
 </p>
 
 <h3> Confirming Orthologous Genes </h3>
 <p>After determining the orthology assignments using the UCSC chains and nets
 data, we used the Known Gene annotations or spliced ESTs to search the identity
 of genes within the corresponding region. Known Genes represent protein-coding
 genes and therefore can be verified by chains and nets alignments, followed by
 confirmation of protein identity in both species. Spliced ESTs carry less
 descriptive information than protein coding genes and therefore were validated
 in the second species by their presence in an orthologous region, showing
 conserved synteny of the two genes within a pair, and meeting the criteria of
 less than 1,000 bp of intergenic distance between those transcripts. Our method
 for mapping bidirectional promoters in spliced EST datasets is described in
 more detail in a previous publication. If the program verified evidence for
 orthology and conserved-syntenic gene arrangement, then the orthologous
 bidirectional promoter was confirmed. After orthologous assignments were
 confirmed for pairs of human genes, the reciprocal assignments were analyzed
 from mouse to human.
 
 Currently orthologous bi-directional promoter regions have been mapped in
 human, chimp, macaque, mouse, rat, dog and cow genomes.
 </p>
 
 <h2> Credits </h2>
 
 <p>These data were produced by Mary Yang in the
-<a href="http://www.genome.gov/12514761" title="http://www.genome.gov/12514761" rel="nofollow">Elnitski lab</a> at NHGRI, NIH.
-(contact: <A HREF="mailto:&#101;&#108;&#110;&#105;&#116;&#115;ki&#64;&#109;&#97;&#105;l.n&#105;&#104;.g&#111;&#118;">
+<a href="http://www.genome.gov/12514761" title="http://www.genome.gov/12514761"
+rel="nofollow">Elnitski lab</a> at NHGRI, NIH. (contact:
+<A HREF="mailto:&#101;&#108;&#110;&#105;&#116;&#115;ki&#64;&#109;&#97;&#105;l.n&#105;&#104;.g&#111;&#118;">
 &#101;&#108;&#110;&#105;&#116;&#115;ki&#64;&#109;&#97;&#105;l.n&#105;&#104;.g&#111;&#118;</A>)
 <!-- above address is elnitski at mail.nih.gov -->
 </p>
 
 <h2> References </h2>
 <p>Yang MQ, Elnitski L:
- <a href="http://www.springerlink.com/content/q86486k52kr84j06/" title="http://www.springerlink.com/content/q86486k52kr84j06/" rel="nofollow">
- A computational study of bidirectional promoters in the human genome</a>. <i>Springer Lecture Series: Notes in Bioinformatics</i> 2007.
+ <a href="http://www.springerlink.com/content/q86486k52kr84j06/"
+ title="http://www.springerlink.com/content/q86486k52kr84j06/" rel="nofollow">
+ A computational study of bidirectional promoters in the human genome</a>.
+<i>Springer Lecture Series: Notes in Bioinformatics</i> 2007.
 
-</p><p>Yang MQ, Elnitski L: Orthology of Bidirectional Promoters Enables Use of a Multiple Class Predictor for Discriminating Functional Elements in the Human Genome. <i>
- BMC Genomics</i>.
+</p><p>Yang MQ, Elnitski L: Orthology of Bidirectional Promoters Enables Use of
+ a Multiple Class Predictor for Discriminating Functional Elements in the
+ Human Genome. <i>Proceedings of the 2007 International Conference on
+ Bioinformatics & Computational Biology.</i> pp. 218-228. 2007.
 </p><p>Yang MQ, Koehly L, Elnitski L:
- <a href="http://www.ploscompbiol.org/article/info:doi/10.1371/journal.pcbi.0030072" title="http://www.ploscompbiol.org/article/info:doi/10.1371/journal.pcbi.0030072" rel="nofollow">
- Comprehensive annotation of human bidirectional promoters identifies co-regulatory relationships among somatic breast and ovarian cancer genes</a>. <i>PLOS Computational Biolog</i>y 2007.
+ <a href="http://www.ploscompbiol.org/article/info:doi/10.1371/journal.pcbi.0030072"
+ title="http://www.ploscompbiol.org/article/info:doi/10.1371/journal.pcbi.0030072"
+ rel="nofollow"> Comprehensive annotation of human bidirectional promoters
+ identifies co-regulatory relationships among somatic breast and ovarian cancer
+ genes</a>. <i>PLOS Computational Biolog</i>y 2007.
 </p><p>Yang MQ, Taylor J, Elnitski L.
- <a href="http://www.biomedcentral.com/1471-2105/9/S6/S9" title="http://www.biomedcentral.com/1471-2105/9/S6/S9" rel="nofollow">
- Comparative analyses of bidirectional promoters in vertebrates</a>. <i>BMC Bioinformatics</i> May 2008.
+ <a href="http://www.biomedcentral.com/1471-2105/9/S6/S9"
+ title="http://www.biomedcentral.com/1471-2105/9/S6/S9" rel="nofollow">
+ Comparative analyses of bidirectional promoters in vertebrates</a>.
+ <i>BMC Bioinformatics</i> May 2008.
 </p>