396c69214aa8d8b2267caa178377f24c796f579d jnavarr5 Tue Jul 30 14:15:14 2024 -0700
Staging the knownGene announcement for hg19, refs #32302

diff --git src/hg/htdocs/goldenPath/newsarch.html src/hg/htdocs/goldenPath/newsarch.html
index be2ead8..576833b 100755
--- src/hg/htdocs/goldenPath/newsarch.html
+++ src/hg/htdocs/goldenPath/newsarch.html
@@ -54,30 +54,96 @@

You can sign up to get these announcements via our Genome-announce email list. We send around one short announcement email every two weeks.

Smaller software changes are not announced here. A summary of the three-weekly release changes can be found here. For the full list of our daily code changes, head to our GitHub page. Lastly, see our credits page for acknowledgments of the data we host.


Jul. 31, 2024    GENCODE "KnownGene" v45lift37 release for human (hg19)

+We are excited to announce the release of the
+GENCODE
+"KnownGene" v45lift37 gene track for hg19. With this release, the previous 2013 UCSC
+Genes track will be frozen and made available in the
+GENCODE/UCSC Genes Archive superTrack for reproducibility.
+As new GENCODE tracks are made available, previous versions will also be available in the archive.
+Beginning with this update, the "KnownGene" track will use GENCODE v45 gene models
+lifted to hg19, which replaces the old UCSC transcript IDs with the official GENCODE IDs.

+The following are examples of GENCODE IDs that will replace the UCSC IDs in the update:

+oldId      newId
+uc003qfo.3 ENST00000341911.10_8
+uc003jsk.2 ENST00000462279.5_3
+uc003umk.1 ENST00000318238.9_6
+uc003gzi.3 ENST00000682860.1_2
+uc011dpu.2 ENST00000375023.3_6
+uc021raj.2 ENST00000258149.11_6
+uc002fxp.3 ENST00000341657.9_12
+uc010xhp.1 ENST00000429344.7_6
+uc003zze.3 ENST00000242285.11_9
+For each transcript ID, the _# suffix is part of the
+official hg19 backmap ID, so that the lifted annotation is not confused with the gene/transcript it
+is derived from in hg38. Between hg38 and hg19, the two IDs do not always refer to the same sequence, and
+the mapping may not be one-to-one.
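As a rough illustration of the ID layout described above, the following Python sketch splits a lifted ID into its parts. The pattern `<accession>.<version>_<mapping version>` is an assumption inferred from the example IDs in this announcement, not an official GENCODE specification, and the function names `split_lifted_id` and `without_backmap_suffix` are hypothetical helpers introduced only for this example.

```python
import re

# Assumed ID shape (inferred from the examples above, not an official spec):
#   <accession>.<version>_<mapping version>, e.g. ENST00000341911.10_8
LIFTED_ID = re.compile(r"^(ENST\d+)\.(\d+)_(\d+)$")


def split_lifted_id(transcript_id):
    """Split a lifted hg19 transcript ID into (accession, version, mapping_version)."""
    match = LIFTED_ID.match(transcript_id)
    if match is None:
        raise ValueError(f"not a lifted GENCODE transcript ID: {transcript_id!r}")
    accession, version, mapping_version = match.groups()
    return accession, int(version), int(mapping_version)


def without_backmap_suffix(transcript_id):
    """Drop the _# suffix, leaving the accession.version part of the ID."""
    accession, version, _ = split_lifted_id(transcript_id)
    return f"{accession}.{version}"


print(split_lifted_id("ENST00000341911.10_8"))         # ('ENST00000341911', 10, 8)
print(without_backmap_suffix("ENST00000341657.9_12"))  # ENST00000341657.9
```

Stripping the suffix only recovers the name of the source transcript. As noted above, the hg19 and hg38 annotations may differ in sequence and the mapping may not be one-to-one, so the stripped ID should not be treated as interchangeable with the hg19 item.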

+The GENCODE
+"KnownGene" v45lift37 gene track is built using a UCSC pipeline (KnownGene) and the
+GENCODE comprehensive gene set to generate high-quality manual annotations merged with
+evidence-based automated annotations. The GENCODE "KnownGene" tracks are our default gene
+tracks, which have extensive associations to external sources. This allows for additional metadata
+on every item as well as external links. The track description pages contain options for configuring
+the display, such as showing non-coding genes, splice variants, and pseudogenes.

+Below is a summary of the contents found in the GENCODE v45 release.
+For more details visit the GENCODE site.

+GENCODE v45 Release Stats
+Genes                                         Observed  Transcripts                                          Observed
+Protein-coding genes                          19,395    Protein-coding transcripts                           89,110
+Long non-coding RNA genes                     20,424    - full length protein-coding                         64,028
+Small non-coding RNA genes                    7,565     - partial length protein-coding                      25,082
+Pseudogenes                                   14,719    Nonsense mediated decay transcripts                  21,427
+Immunoglobulin/T-cell receptor gene segments  648       Long non-coding RNA loci transcripts                 59,719
+Total No of distinct translations             65,357    Genes that have more than one distinct translations  13,600
+We would like to thank the GENCODE project for providing these
+annotations. We would also like to thank Brian Raney, Mark Diekhans, and Jairo Navarro for the
+development and release of these tracks.


Jul. 25, 2024    EVA SNP release 6 for 37 assemblies

We are pleased to announce the release of the EVA SNP release 6 track for 37 assemblies. These tracks contain mappings of single nucleotide variants and small insertions and deletions (indels) — collectively Simple Nucleotide Variants (SNVs) — from the European Variation Archive (EVA) Release 6. The full list of assemblies that contain the EVA SNP release 6 track is below: