9418b6ff8004c4aea32c66cfb58cb821a072f94d lrnassar Thu Jun 27 17:30:26 2024 -0700 Adding dataVersion and consistent Broad wording as per feedback from Max refs #27141 diff --git src/hg/makeDb/trackDb/human/spliceAI.html src/hg/makeDb/trackDb/human/spliceAI.html index c45f756..7859a07 100644 --- src/hg/makeDb/trackDb/human/spliceAI.html +++ src/hg/makeDb/trackDb/human/spliceAI.html @@ -1,92 +1,92 @@

Description

SpliceAI is an open-source deep learning splicing prediction algorithm that can predict splicing alterations caused by DNA variations. Such variants may activate nearby cryptic splice sites, leading to abnormal transcript isoforms. SpliceAI was developed at Illumina; a lookup tool is provided by the Broad institute.

Important: The SpliceAI data on the UCSC Genome Browser is directly from Illumina (See Data Access below). However, since SpliceAI refers to the algorithm, and not the computed dataset, -the data on the BROAD server or other sources may have some differences between them. +the data on the Broad server or other sources may have some differences between them.

Display Conventions and Interpretation

Variants are colored by their predicted effects:

Mouseover on items shows the variant, gene name, type of change (donor gain/loss, acceptor gain/loss), location of affected cryptic splice, and spliceAI score. Clicking on any item brings up a table with this information.

The scores range from 0 to 1 and can be interpreted as the probability of the variant being splice-altering. In the paper, a detailed characterization is provided for 0.2 (high recall), 0.5 (recommended), and 0.8 (high precision) cutoffs.

Data Access

These data are not available for download from the Genome Browser. The raw data can be found directly on Illumina. See below for a copy of the license restrictions pertaining to these data.

License

FOR ACADEMIC AND NOT-FOR-PROFIT RESEARCH USE ONLY. The SpliceAI scores are made available by Illumina only for academic or not-for-profit research only. By accessing the SpliceAI data, you acknowledge and agree that you may only use this data for your own personal academic or not-for-profit research only, and not for any other purposes. You may not use this data for any for-profit, clinical, or other commercial purpose without obtaining a commercial license from Illumina, Inc.

Methods

The data were downloaded from Illumina. The spliceAI scores are represented in the VCF INFO field as SpliceAI=G|OR4F5|0.01|0.00|0.00|0.00|-32|49|-40|-31

Here, the pipe-separated fields contain

Since most of the values are 0 or almost 0, we selected only those variants with a score equal to or greater than 0.02.

The complete processing of this track can be found in the makedoc.

References

Jaganathan K, Kyriazopoulou Panagiotopoulou S, McRae JF, Darbandi SF, Knowles D, Li YI, Kosmicki JA, Arbelaez J, Cui W, Schwartz GB et al. Predicting Splicing from Primary Sequence with Deep Learning. Cell. 2019 Jan 24;176(3):535-548.e24. PMID: 30661751