41c485c8e4bda02a3de334444479d9ae92140c7c
lrnassar
Fri Mar 27 16:13:20 2026 -0700
Restore original alt text for images that already had alt attributes. The initial alt text commit incorrectly replaced 44 existing human-written descriptions with AI-generated generic text across 12 files. Feedback from CR. refs #37289

diff --git src/hg/htdocs/goldenPath/newsarch.html src/hg/htdocs/goldenPath/newsarch.html
index 8932840aa08..0f1dfa447fc 100755
--- src/hg/htdocs/goldenPath/newsarch.html
+++ src/hg/htdocs/goldenPath/newsarch.html
@@ -896,56 +896,56 @@

We are pleased to announce the release of the SpliceAI Wildtype tracks for hg38, available in the Splicing Impact superTrack. These tracks show the scores for the genome sequence itself, without variants, from predicted splice donor (5' intron boundaries) and splice acceptor (3' intron boundaries) sites. Predictions are strand-specific, with separate subtracks for the plus and minus strands.

SpliceAI Acceptor Plus – Splice acceptor sites, plus strand
SpliceAI Acceptor Minus – Splice acceptor sites, minus strand
SpliceAI Donor Plus – Splice donor sites, plus strand
SpliceAI Donor Minus – Splice donor sites, minus strand

- Browser display of SpliceAI variant pathogenicity prediction scores

+ SpliceAI Wildtype track display for the CFTR region

These tracks are useful in combination with the variants track for evaluating new transcript models. They can be used to assess potential exon boundaries or possible splice acceptor sites.

We would like to thank Illumina for making SpliceAI available, both the model and the precomputed data files. Thanks to Francois Lecoquierre from the University of Oxford, Jean-Madeleine de Sainte Agathe from Institut Pasteur Paris, and Michael Hiller from the Senckenberg Museum Frankfurt for suggesting and then creating the SpliceAI Wildtype annotations. We would also like to thank Max Haeussler and Gerardo Perez for their efforts on this release.

Sep. 25, 2025 Panmask Easy 151b Regions track for hg38

We are happy to announce the release of the Panmask Easy 151b Regions track for hg38. This new track is available in the Problematic Regions superTrack. The track contains a set of sample-agnostic easy regions where short-read variant calling reaches high accuracy. Easy regions are derived for variant filtration agnostic to individual samples. They are genomic intervals where general variant callers achieve high accuracy without sophisticated filtering.

- Browser display showing pangenome masking tracks for repetitive regions

+ Panmask Easy 151b Regions track for the BRCA1 exon 19

The pm151 regions are used to filter spurious variant calls in centromeres, long repeats, and other genomic regions where short-read mapping is often problematic. They cover 88.2% of hg38, 92.2% of coding regions, and 96.3% of ClinVar pathogenic variants. The track can be used to filter variant calls for clinical or research human samples. It shows regions that are easy to sequence, rather than those that are problematic. The data was derived from the HPRC assemblies, and this track presents the 151b-easy panmask set.
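As a rough illustration of the filtering use described above, the sketch below keeps only variant calls whose position falls inside an easy region. The function name, the interval list, and the variant tuples are all made up for illustration, and the 0-based half-open coordinates are an assumption; real use would read the track's intervals from a file rather than hard-code them.

```python
from bisect import bisect_right

def keep_calls_in_easy_regions(calls, easy_regions):
    """Return the calls whose position lies inside an easy region.

    calls: iterable of (chrom, pos) pairs, 0-based positions (assumed).
    easy_regions: dict of chrom -> sorted, non-overlapping (start, end)
                  intervals, half-open (assumed).
    """
    # Precompute interval starts per chromosome for binary search.
    starts = {chrom: [s for s, _ in regs] for chrom, regs in easy_regions.items()}
    kept = []
    for chrom, pos in calls:
        regs = easy_regions.get(chrom)
        if not regs:
            continue
        # Index of the last interval starting at or before pos.
        i = bisect_right(starts[chrom], pos) - 1
        if i >= 0 and pos < regs[i][1]:
            kept.append((chrom, pos))
    return kept

# Hypothetical intervals and calls, for illustration only.
easy = {"chr17": [(100, 200), (500, 900)]}
calls = [("chr17", 150), ("chr17", 300), ("chr17", 899), ("chr17", 900), ("chr1", 10)]
print(keep_calls_in_easy_regions(calls, easy))
```

Calls outside every interval, or on a chromosome with no listed intervals, are dropped; whether that is the right policy for a given analysis is a separate decision.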

We would like to thank Heng Li's group at Harvard Medical School for making this data available. We would also like to thank Max Haeussler and Gerardo Perez for their efforts on this release.

Sep. 24, 2025 CoLoRSdb small and structural variants for hg38 and hs1

@@ -958,31 +958,31 @@

discovered through long-read whole genome sequencing, contributed by the international Consortium of Long Read Sequencing (CoLoRS). The small variant tracks (DeepVariant + GLnexus) contain single nucleotide polymorphisms (SNPs) and short indels, while the structural variant tracks (pbsv + Jasmine) display larger events including insertions, deletions, and inversions. Long-read sequencing technology improves sensitivity in repetitive regions and provides more precise breakpoint resolution than short-read approaches, enabling accurate visualization of complex loci in the Genome Browser.

Each track includes allele frequency and sample count annotations, with additional filtering options for variant size and type. Users can click on individual variants to view detailed metadata, such as allele counts, homozygous/heterozygous call distributions, and Hardy-Weinberg equilibrium values.

- Browser display of CoLoRSdb structural variant tracks from long-read sequencing

+ CoLoRSdb track with the mouseover tooltip on the DEL-159bp item.

Genome Browser screenshot of the CoLoRSdb tracks with the mouseover tooltip on the DEL-159bp item.

We would like to thank Mike Schatz, Evan Eichler, and all CoLoRSdb investigators for generating and making the data publicly available. We would also like to thank Karen Wang and Jairo Navarro Gonzalez for the creation and release of these tracks.

Sep. 19, 2025 Developmental Disorders Gene2Phenotype (DDG2P) for hg38 and hg19

@@ -1009,59 +1009,59 @@

G2P ID: Unique identifier assigned by the Gene2Phenotype (G2P) database.
Variant Consequence: Predicted effect each allele of a variant has on a transcript.
Disease Name: Name of the disease associated with the variant.
PubMed IDs: Publications associated with the variant.
Molecular Mechanism: Description of the molecular processes and interactions causing pathogenic effects.
Allelic Requirements: Number of alleles required at a locus to produce a pathogenic phenotype (e.g., monoallelic, biallelic).
Date of Last Review: Most recent date the entry was manually reviewed.

Browser display of developmental disorder gene-to-phenotype tracks

Genome Browser screenshot of the DDG2P track with the mouseover tooltip on the BRCA1 gene.

We would like to thank the G2P project for making this data publicly available. We would also like to thank Jaidan Jenkins-Kiefer and Jairo Navarro Gonzalez for the creation and release of the Genome Browser tracks.

Aug. 21, 2025 MaveDB Experiment Heatmaps and Alignment track for hg38

We are excited to announce the release of the MaveDB Experiment Heatmaps and Alignment track for hg38. This release provides heatmaps of multiplexed assays of variant effects (MAVE) from MaveDB. Each heatmap presents the results of an experiment where many small substitutions were tested within a gene to examine their functional consequences. Accompanying tracks display alignments of each experiment sequence to the genome.

Please note that only a subset of MaveDB experiments could be displayed as heatmaps; the sequence alignments in this track only cover those experiments.

- Browser display of MaveDB BRCA1 variant effect map scores

+ MaveDB heatmap and alignment tracks for the BRCA1 exon 19

Genome Browser screenshot of BRCA1 exon 19 showing the alignment and heatmap tracks.

Hover over each item in the heatmap to see the consequence of substituting individual amino acids within the genome with alternatives. Score ranges vary among experiments, but each is presented with the highest scores in red, the lowest scores in blue, and scores at the midpoint between the two in silver. Higher scores correspond to a higher enrichment level for that variant compared to others in the experiment set.
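The color convention described above can be sketched as a small mapping function. Only the three anchor colors (blue for the lowest score, silver at the midpoint, red for the highest) come from the text; the linear interpolation between them, the exact RGB values, and the function name are assumptions made for illustration, not the Genome Browser's own rendering code.

```python
def score_to_rgb(score, lo, hi):
    """Map a score to an RGB tuple: lowest -> blue, midpoint -> silver, highest -> red.

    lo and hi are the experiment's own score range, since ranges vary
    among experiments. Linear blending between anchors is an assumption.
    """
    blue, silver, red = (0, 0, 255), (192, 192, 192), (255, 0, 0)
    if hi <= lo:
        raise ValueError("hi must be greater than lo")
    # Clamp the score into [lo, hi], then scale to the range 0..1.
    t = (min(max(score, lo), hi) - lo) / (hi - lo)
    if t <= 0.5:
        start, end, u = blue, silver, t / 0.5
    else:
        start, end, u = silver, red, (t - 0.5) / 0.5
    return tuple(round(a + (b - a) * u) for a, b in zip(start, end))

for s in (-2.0, 0.0, 2.0):
    print(s, score_to_rgb(s, lo=-2.0, hi=2.0))
```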

We would like to thank Jeremy Arbesfeld and the MaveDB team for making this data publicly available. We would also like to thank Melissa Cline, Jonathan Casper, and Jairo Navarro for the

@@ -1234,31 +1234,31 @@

July 15, 2025 ENCODE4 Long-read RNA-seq Transcripts

We are pleased to announce the release of the ENCODE4 long-read RNA-seq transcripts track for hg38 and mm10. This track annotates transcripts using numerical triplets representing the identity of the start site, exon junction chain, and transcript end site of each transcript. This is presented alongside sample enrichment information to show how promoter selection, splice pattern, and 3’ processing are deployed across human tissues.

- Browser display of ENCODE 4 long-read RNA transcript annotations

+ Screenshot showing ENCODE4 long-read transcripts track

Transcripts are labeled with triplets, e.g. [1,1,1], [1,1,3], or [2,1,3]. If transcripts share a number in any position, they share that feature. For example, two transcripts that share an 8 in the second position but differ in the other positions have the same set of exons but different start and end sites.
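The triplet comparison can be sketched in a few lines. The position names follow the track description (start site, exon junction chain, transcript end site); the function name and the second pair of triplets are hypothetical.

```python
# Meaning of each triplet position, in order.
FEATURES = ("start site", "exon junction chain", "transcript end site")

def shared_features(triplet_a, triplet_b):
    """List the features two transcripts share, judged by equal numbers per position."""
    return [name for name, a, b in zip(FEATURES, triplet_a, triplet_b) if a == b]

print(shared_features((1, 1, 1), (1, 1, 3)))
# Hypothetical triplets sharing only the second position.
print(shared_features((3, 8, 2), (5, 8, 7)))
```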

This track is part of a "Long-read Transcripts" supertrack that will consist of other datasets derived from third-generation sequencing technology, such as PacBio and Oxford Nanopore.

@@ -1713,31 +1713,31 @@

expression levels across 50 tissues from the Genotype Tissue Expression (GTEx) v10 dataset, showing a comprehensive view of the expression of exons across a gene using the proportion expression across transcripts, or pext metric, a transcript-level annotation metric which quantifies isoform expression for variants.

This is especially useful for those interested in alternative splicing and clinical assessment of variants. For more information, see the track description page and the associated publication.

- Browser display of hg38 patch extension tracks with sequence alignments

+ Image of TRDN gene showing variants primarily expressed in heart tissue.

We would like to thank the gnomAD team and the UCSC Genome Browser team members Jeltje van Baren, Max Haeussler, Lou Nassar, and Anna Benet-Pages for developing and releasing this track, as well as making the Exon Relevance RTS.

May 5, 2025 VISTA Enhancers track update for Human and Mouse

We are happy to announce an update to the VISTA Enhancers tracks for human (GRCh38/hg38 and

 highlightText.name NM*

The example above uses the highlightText setting, which will apply a highlight on the field name. Using this setting, any items that begin with NM are highlighted.

 highlightColor #ff0000

In this final example, the highlightColor is used to set the default highlight color. With this setting, all highlight stripes will use the color red, #ff0000.

- Screenshot showing the track highlight feature for emphasizing genomic regions

+ Items in the NCBI RefSeq Historical track that begin with NM are highlighted red.

Items in the NCBI RefSeq Historical track for hg38 have all items that begin with "NM" highlighted in red.
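To make the effect of the two settings concrete, the sketch below mimics the "begins with NM" behaviour with a shell-style wildcard match. It only illustrates the outcome described above and is not the Genome Browser's own matching code; the function name and the item names are made up for illustration.

```python
from fnmatch import fnmatchcase

def highlighted(names, pattern="NM*", color="#ff0000"):
    """Return a name -> color mapping for the items whose name matches the pattern."""
    return {name: color for name in names if fnmatchcase(name, pattern)}

# Illustrative item names; only those beginning with NM should be picked up.
items = ["NM_000492.3", "NR_046018.2", "XM_011515.1", "NM_007294.4"]
print(highlighted(items))
```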

We would like to thank Chris Lee and Jairo Navarro for their efforts in creating and testing the highlight feature for track hubs.

Feb. 24, 2025 enGenome VarChat track for human (hg38 and hg19)

We are happy to announce the release of the enGenome VarChat track for the hg38/GRCh38 and hg19/GRCh37 human assemblies, available in the Variants in Papers superTrack.

@@ -4941,31 +4941,31 @@

GENCODE project (https://www.gencodegenes.org/pages/gencode.html) for providing these annotations. We would also like to thank Mark Diekhans and Lou Nassar for the development and release of these tracks.

May 5, 2022 Merged Cell Expression on hg38

The Genome Browser already provided single-cell RNA-seq datasets for the human GRCh38/hg38 assembly, but those data have so far been split among a collection of tracks depending on the organ and publication source. We are happy to announce that data from 12 of those papers (and 14 organs) are now available in a combined Merged Cells track that provides normalized RNA-seq values for every cell type in those sets. All components were normalized to show expression in parts per million.

- Browser display of merged single-cell RNA-seq expression across cell types

+ Example of the Merged Cells track display for the ACE2 gene

The following tracks were incorporated into this Merged Cells track:

Blood (PBMC) Hao
Colon Wang
Cortex Velmeshev

@@ -7487,31 +7487,31 @@

Dec. 23, 2020 New ClinVar Interpretations track for human (hg19/hg38)

We are pleased to release a new track, ClinVar Interpretations, for the hg19/GRCh37 and hg38/GRCh38 human assemblies. This track can be found as part of the ClinVar Composite. It is the first track to use our bead graph display, which is a variation of our existing lollipop display.

The ClinVar Interpretations track displays the genomic positions of individual variant submissions and interpretations of the clinical significance, as well as their relationship to disease in the ClinVar database. As seen in the image below, the variants are classified into six categories, each on a separate horizontal line:

- +
- P - Pathogenic
- LP - Likely Pathogenic
- VUS - Variant of Unknown Significance
- LB - Likely Benign
- B - Benign
- OTH - Others
The size of the bead on the line represents the number of submissions at that genomic position. The color of the beads further helps distinguish the categories. Hovering over the track items shows the genomic variations which start at that position and the number of individual submissions with that classification. Additional information on the variants

@@ -8362,59 +8362,59 @@

Lastly, multiple feature options have been added to both tracks independently:
- Filtering by variant length is available on both tracks.
- Filter by variation (INS, DEL, etc.) now available on both tracks.
- Filter by clinical significance (benign, conflicting, etc.) now available on both tracks.
- Filter on allele origin (somatic, germ line, de novo, etc.) now available on both tracks.
- Filter by molecular consequence (stop lost, nonsense, intron variant, etc.) now available on short variants track.
Below is an example of the filter options available for the ClinVar SNVs track. For additional details on the updated display, see the track description page.

- +

Changes to ClinGen and new tracks

We have created a new composite track, ClinGen, and deprecated the previous ClinGen CNVs track. The ClinGen CNVs track will continue to be available; however, the data will no longer be updated. This was done by request of ClinGen, as all the data, as well as further updates, can be found in the ClinVar Copy Number Variants (ClinVar CNVs) track.

The new ClinGen composite track includes three new tracks described below:
- ClinGen Dosage Sensitivity Map - Haploinsufficiency - Shows evidence supporting or refuting haploinsufficiency (loss) as mechanisms for disease at gene-level and larger genomic regions.
- ClinGen Dosage Sensitivity Map - Triplosensitivity - Shows evidence supporting or refuting triplosensitivity (gain) as mechanisms for disease at gene-level and larger genomic regions.
- ClinGen Gene-Disease Validity Classification (ClinGen Validity) - Provides a semi-qualitative measurement for the strength of evidence of a gene-disease relationship.
- +

For more information on these tracks, including display conventions, scores, and classifications, see the track description page.

We would like to thank Erin Riggs and May Flowers as well as the rest of the ClinGen team. We would also like to thank ClinVar for making these data available. Track development and release was made possible by Anna Benet-Pages, Christopher Lee, Max Haeussler, and Lou Nassar.

Sept. 25, 2020 New data and visualization types: Covid GWAS (Lollypop) and Family Trios (VCF Trios)

Covid GWAS meta-analysis

@@ -9664,31 +9664,31 @@

Control Only SV's - gnomAD Structural Variants Controls Only
Non-neuro SV's - gnomAD Structural Variants Non-neuro Only

These data can be found as part of the gnomAD super-track. More information on this track can be found in the track description pages, as well as the gnomAD site.

- Browser display of gnomAD structural variant constraint scores
+ Example of Constraint Metrics and Structural Variants tracks

We would like to thank the Genome Aggregation Database Consortium for making these data available. We would also like to thank Christopher Lee, Maximilian Haeussler, Lou Nassar, Jairo Navarro, Robert Kuhn and Anna Benet-Pages for their effort in the creation of these tracks.

Apr. 20, 2020 New video on the Browser's YouTube channel

We have released a new video to the Browser's hg38.

We would like to thank NCBI and the RefSeq Annotation database for collecting and curating these data. We would also like to thank Hiram Clawson and Daniel Schmelter for their role creating, documenting, and reviewing these tracks.

Feb. 7, 2020 New and updated Variants in Papers tracks: Avada variants (hg19) & Mastermind variants (hg19, hg38)

We are pleased to announce a new track, Avada Variants, now available on hg19. Additionally, we have updated the Mastermind Variants track and expanded it to hg38.

- Browser display of literature-cited variant annotations from published papers
+ Example of AVADA and Mastermind tracks

Avada Variants

The Avada Variants track shows the genomic positions of variants in the AVADA database. AVADA is a database of variants built by machine learning software that analyzes full text research articles in PDF format to find genes and variants that look most relevant for genetic diagnosis.

Additional information can be found in the AVADA publication.

Mastermind Variants

The Mastermind Variants track is now available for the hg38 assembly

@@ -17391,31 +17391,31 @@

modifications suggestive of enhancer and promoter activity, DNAse clusters indicating open chromatin, regions of transcription factor binding, and transcription levels. When viewed in combination, the complementary nature of the data within these tracks has the potential to greatly facilitate our understanding of regulatory DNA.

The data comprising these tracks were generated from hundreds of experiments on multiple cell lines conducted by labs participating in the Encyclopedia of DNA Elements (ENCODE) project, and were submitted to the UCSC ENCODE Data Coordination Center for display on the Genome Browser.

Faced with the problem of how to display such a large amount of data in a manner facilitating analysis, UCSC has developed new visualization methods that cluster and overlay the data, and then display the resulting tracks on a single screen. Each of the cell lines in a track is associated with a particular color. Light, saturated colors are used to produce the best transparent overlay.

The data in the ENCODE Regulation super-track, as with all data from the production phase of the ENCODE project, have genome-wide coverage. In general, Genome Browser tracks that show ENCODE-generated data can be identified by the double-helix icon preceding the name in the track list. Currently, the ENCODE Regulation data are available only on the Mar. 2006 (NCBI Build 36, UCSC version hg18) assembly of the human genome.

For a detailed description of the datasets contained in this super-track and a discussion of how the tracks can be used synergistically to examine regions of regulatory functionality within the genome, see the track description page.

Aug. 18, 2010 Cat Genome Browser available

We have released a Genome Browser for the latest assembly of Cat (Felis catus). The GTB

@@ -17478,31 +17478,31 @@

We are pleased to announce the release of a new Conservation track based on the zebrafish (danRer6) assembly. This track shows multiple alignments of 6 vertebrate species and measurements of evolutionary conservation using phastCons from the PHAST package. The multiple alignments were generated using multiz and other tools in the UCSC/Penn State Bioinformatics comparative genomics alignment pipeline. Conserved elements identified by phastCons are displayed in the companion "Most Conserved" track.

For more details, please visit the track description page.

Jul. 7, 2010 Happy 10th birthday, Human Genome

- Graph showing Genome Browser growth in tracks and assemblies over 10 years

+ 10-year usage graph

Top graph: total traffic on the UCSC domain during June-July, 2000. Bottom graph: page hit statistics on genome.ucsc.edu in the ensuing years since the Genome Browser was released.

UCSC is pleased to celebrate the 10-year anniversary of the debut of the first assembled human genome sequence and its then-fledgling visualization tool, the UCSC Genome Browser. Released on July 7, 2000, the genome sequence instantly created unprecedented web traffic on the ucsc.edu domain as researchers around the world scrambled to download the data: 0.5 terabytes per day, a record that stood for many years.

David Haussler recounts that day: "Seeing the waterfall of As, Gs, Cs, and Ts pouring off our
