src/hg/makeDb/trackDb/ccdsGene.html 1.9

1.9 2009/06/17 16:16:25 hartera
Added CCDS paper reference.
Index: src/hg/makeDb/trackDb/ccdsGene.html
===================================================================
RCS file: /projects/compbio/cvsroot/kent/src/hg/makeDb/trackDb/ccdsGene.html,v
retrieving revision 1.8
retrieving revision 1.9
diff -b -B -U 1000000 -r1.8 -r1.9
--- src/hg/makeDb/trackDb/ccdsGene.html	19 Feb 2009 18:33:00 -0000	1.8
+++ src/hg/makeDb/trackDb/ccdsGene.html	17 Jun 2009 16:16:25 -0000	1.9
@@ -1,69 +1,75 @@
 <H2>Description</H2>
 <P>
 This track shows $organism genome high-confidence gene annotations from the
 <A HREF="http://www.ncbi.nlm.nih.gov/CCDS/" TARGET=_blank>Consensus 
 Coding Sequence (CCDS) project</A>. This project is a collaborative effort 
 to identify a core set of 
 $organism protein-coding regions that are consistently annotated and of high 
 quality. The long-term goal is to support convergence towards a standard set 
 of gene annotations on the $organism genome.
 </P>
 <P>Collaborators include:
 <UL>
 <LI><A HREF="http://www.ebi.ac.uk/" TARGET=_blank>European Bioinformatics 
 Institute</A> (EBI)
 <LI><A HREF="http://www.ncbi.nlm.nih.gov" TARGET=_blank>National Center for 
 Biotechnology Information</A> (NCBI)
 <LI><A HREF="http://www.cbse.ucsc.edu/" TARGET=_blank>University of 
 California, Santa Cruz</A> (UCSC)
 <LI><A HREF="http://www.sanger.ac.uk/" TARGET=_blank>Wellcome Trust Sanger 
 Institute</A> (WTSI)
 </UL>
 
 <H2>Methods</H2>
 <P>
 CDS annotations of the $organism genome were obtained from two sources:
 <A HREF="http://www.ncbi.nlm.nih.gov/RefSeq/index.html" TARGET=_blank>NCBI 
 RefSeq</A> and a union of the gene annotations from 
 <A HREF="http://www.ensembl.org/" TARGET=_blank>Ensembl</A> and 
 <A HREF="http://vega.sanger.ac.uk/" TARGET=_blank>Vega</A>, collectively known 
 as <EM>Hinxton</EM>.</P>
 <P>
 Genes with identical CDS genomic coordinates in both sets become CCDS 
 candidates. The genes undergo a quality evaluation, which must be approved by 
 all collaborators. The following criteria are currently used to assess each
 gene: 
 <UL>
 <LI> an initiating ATG, a valid stop codon, and no in-frame stop codons
 <LI> ability to be translated from the genome reference sequence without frameshifts
 <LI> recognizable splicing sites
 <LI> no intersection with putative pseudogene predictions
 <LI> supporting transcripts and protein homology
 <LI> conservation evidence with other species
 </UL></P>
 <P>
 A unique CCDS ID is assigned to the CCDS, which links together all gene 
 annotations with the same CDS.  CCDS gene annotations are under continuous
 review, with periodic updates to this track.
 </P>
 
 <H2>Credits</H2>
 <P>
 This track was produced at UCSC from data downloaded from the
 <A HREF="http://www.ncbi.nlm.nih.gov/CCDS/" TARGET=_blank>CCDS project</A> 
 web site.
 </P>
 
 <H2>References</H2>
 <P>
+Pruitt KD, Harrow J, Harte RA, Wallin C, Diekhans M, Maglott DR, Searle S, 
+Farrell CM, Loveland JE, Ruef BJ <EM>et al</EM>. 
+<A HREF="http://genome.cshlp.org/content/early/2009/06/04/gr.080531.108.long"
+TARGET=_blank>The consensus coding sequence (CCDS) project: Identifying a common protein-coding gene set for the human and mouse genomes</A>. 
+<I>Genome Res.</I> 2009 Jun 4. [Epub ahead of print]
+<P>
 Hubbard T, Barker D, Birney E, Cameron G, Chen Y, Clark L, Cox T, Cuff J, Curwen V, Down T, <EM>et al</EM>.
 <A HREF="http://nar.oupjournals.org/cgi/content/abstract/30/1/38" 
 TARGET=_blank>The Ensembl genome database project</A>. 
 <I>Nucl. Acids Res.</I> 2002 Jan 1;30(1):38-41.</P>
 <P>
 Pruitt KD, Tatusova T, Maglott DR.
 <A HREF="http://nar.oupjournals.org/cgi/content/full/33/suppl_1/D501" 
 TARGET=_blank>NCBI Reference Sequence (RefSeq): a curated non-redundant 
 sequence database of genomes, transcripts and proteins</A>. 
 <I>Nucl. Acids Res.</I> 2005 Jan 1;33(Database Issue):D501-D504. 
 </P>