58e83e1a302e901ad4b97a529f15599f25f33e43 jeltje.van.baren Tue Jan 21 11:58:49 2025 -0800 alpha missense description diff --git src/hg/makeDb/trackDb/human/alphaMissense.html src/hg/makeDb/trackDb/human/alphaMissense.html index 462d7df5305..cafe631544c 100644 --- src/hg/makeDb/trackDb/human/alphaMissense.html +++ src/hg/makeDb/trackDb/human/alphaMissense.html @@ -1,143 +1,76 @@

Description

- -

This track collection shows Rare Exome Variant Ensemble Learner (alphaMissense) scores for predicting -the deleteriousness of each nucleotide change in the genome. +

+This track shows AlphaMissense predictions for all possible single amino acid substitutions in +the human proteome.

-

-alphaMissense is an ensemble method for predicting the pathogenicity of missense variants -based on a combination of scores from 13 individual tools: MutPred, FATHMM v2.3, -VEST 3.0, PolyPhen-2, SIFT, PROVEAN, MutationAssessor, MutationTaster, LRT, GERP++, -SiPhy, phyloP, and phastCons. alphaMissense was trained using recently discovered pathogenic -and rare neutral missense variants, excluding those previously used to train its -constituent tools. The alphaMissense score for an individual missense variant can range -from 0 to 1, with higher scores reflecting greater likelihood that the variant is -disease-causing. +AlphaMissense is a deep learning method for predicting the pathogenicity of missense variants +in human proteins. It classifies 32% of all missense variants as likely pathogenic and 57% +as likely benign using a cutoff yielding 90% precision on the ClinVar dataset

-

Most authors of deleteriousness scores argue against using fixed cutoffs in -diagnostics. But to give an idea of the meaning of the score value, the alphaMissense -authors note: "For example, 75.4% of disease mutations but only 10.9% of -neutral variants (and 12.4% of all ESVs) have a alphaMissense score above 0.5, -corresponding to a sensitivity of 0.754 and specificity of 0.891. Selecting a -more stringent alphaMissense score threshold of 0.75 would result in higher specificity -but lower sensitivity, with 52.1% of disease mutations, 3.3% of neutral -variants, and 4.1% of all ESVs being classified as pathogenic". (Figure S1 of -the reference below) -

Display Conventions and Configuration

-

-There are five subtracks for this track: -

When using this track, zoom in until you can see every basepair at the top of the display. Otherwise, there are several nucleotides per pixel under your mouse cursor and no score will be shown on the mouseover tooltip.

Track colors

-This track is colored according to Table 2 in Vikas et al. The colors represent the recommended ACMG/AMP score cutoffs. +This track is colored according to the am_class column in the AlphaMissense_$db.tsv file. - - + + - - + + - - + +
Range Classification
≥ .773Pathogenic≥ .564Likely Pathogenic
.772 - .184Neutral.565 - .340Likely Neutral
≤ .183Benign≤ .340Likely Benign
-

For hg38, note that the data was converted from the hg19 data using the UCSC -liftOver program, by the alphaMissense authors. This can lead to missing values or -duplicated values. When a hg38 position is annotated with two scores due to the -lifting, the authors removed all the scores for this position. They did the same when -the reference allele has changed from hg19 to hg38. Also, on hg38, the track has -the "lifted" icon to indicate -this. You can double-check if a nucleotide -position is possibly affected by the lifting procedure by activating the track -"Hg19 Mapping" under "Mapping and Sequencing". -

Data access

alphaMissense scores are available at the -alphaMissense website. +alphaMissense cloud storage site. The site provides precomputed alphaMissense scores for all possible human missense variants to facilitate the identification of pathogenic variants among the large number of rare variants discovered in sequencing studies.

The alphaMissense data on the UCSC Genome Browser can be explored interactively with the Table Browser or the Data Integrator. For automated download and analysis, the genome annotation is stored at UCSC in bigWig files that can be downloaded from our download server. The files for this track are called a.bw, c.bw, g.bw, t.bw. Individual regions or the whole genome annotation can be obtained using our tool bigWigToWig @@ -151,22 +84,29 @@

Methods

Data were converted from the files provided on the alphaMissense Downloads website. As with all other tracks, a full log of all commands used for the conversion is available in our source repository, for hg19 and hg38. The release used for each assembly is shown on the track description page.

Credits

-Thanks to the alphaMissense development team for providing precomputed data and fixing duplicated values in the hg38 files. +Thanks to

References

+Cheng J, Novati G, Pan J, Bycroft C, Žemgulytė A, Applebaum T, Pritzel A, Wong LH, +Zielinski M, Sargeant T et al. + +Accurate proteome-wide missense variant effect prediction with AlphaMissense. +Science. 2023 Sep 22;381(6664):eadg7492. +PMID: 37733863