c15a59c6ddc7428025519ec671af0a7d4649d7be gperez2 Thu Oct 30 16:50:26 2025 -0700 Releasing the new GENCODE Known Gene tracks V49, v49lift37, and VM38, refs #36169 #36167 #36165 diff --git src/hg/htdocs/goldenPath/newsarch.html src/hg/htdocs/goldenPath/newsarch.html index 7fc030f2870..031ac37915a 100755 --- src/hg/htdocs/goldenPath/newsarch.html +++ src/hg/htdocs/goldenPath/newsarch.html @@ -52,30 +52,94 @@

You can sign-up to get these announcements via our Genome-announce email list. We send around one short announcement email every two weeks.

Smaller software changes are not announced here. A summary of the three-weekly release changes can be found here. For the full list of our daily code changes head to our GitHub page. Lastly, see our credits page for acknowledgments of the data we host.

+ +

Oct. 31, 2025    New GENCODE "knownGene" V49 for human (hg38/hg19) and VM38 +for mouse (mm39)

+ +

+We are happy to announce the new GENCODE gene annotation tracks, corresponding to +Ensembl release 115, along with GENCODE knownGene V49 for human +(hg38/GRCh38 +and +hg19/GRCh37) +and GENCODE knownGene VM38 for mouse +(mm39/GRCm39). +The GENCODE "knownGene" V49 and VM38 tracks were built using the UCSC knownGene pipeline and the +GENCODE comprehensive gene set to generate high-quality manual annotations merged with +evidence-based automated annotations. The GENCODE "knownGene" tracks are our default +gene tracks, which have extensive associations to external sources. This allows for additional +metadata on every item as well as external links. The track description pages contain options for +configuring the display, such as showing non-coding genes, splice variants, and pseudogenes.

+

+Below is a summary of the contents found in each release. For more details, visit the +GENCODE site.

+

+ + + + + + + + + + + + + + + +
GENCODE v49 Release Stats
GenesObservedTranscriptsObserved
Protein-coding genes19,433Protein-coding transcripts211,446
Long non-coding RNA genes35,899- full length protein-coding186,646
Small non-coding RNA genes7,563- partial length protein-coding24,800
Pseudogenes14,701Nonsense mediated decay transcripts21,949
Immunoglobulin/T-cell receptor gene segments649Long non-coding RNA loci transcripts191,079
Total No of distinct translations129,801Genes that have more than one distinct translations15,498

+

+

+ + + + + + + + + + + + + + + +
GENCODE VM38 Release Stats
GenesObservedTranscriptsObserved
Protein-coding genes21,530Protein-coding transcripts58,647
Long non-coding RNA genes36,108- full length protein-coding45,050
Small non-coding RNA genes6,105- partial length protein-coding13,597
Pseudogenes13,809Nonsense mediated decay transcripts7,250
Immunoglobulin/T-cell receptor gene segments701Long non-coding RNA loci transcripts155,914
Total No of distinct translations44,974Genes that have more than one distinct translations10,853

+

+

+We would like to thank the GENCODE project for providing these +annotations. We would also like to thank Jonathan Casper, Mark Diekhans, and Gerardo Perez for the +development and release of these tracks.

+

Oct. 16, 2025    SpliceAI Wildtype tracks for hg38

We are pleased to announce the release of the SpliceAI Wildtype tracks for hg38, available in the Splicing Impact superTrack. These tracks show the scores for the genome sequence itself, without variants, from predicted splice donor (5' intron boundaries) and splice acceptor (3' intron boundaries) sites. Predictions are strand-specific, with separate subtracks for the plus and minus strands.