0151d00a4a1d73a78c35f6158c6c936ff338faeb max Fri Apr 24 10:37:34 2026 -0700 NMD Escape: MANE subtrack, Rule 1 bug fix, transcript filter. refs #33737 - Add nmdEscMane subtrack (MANE Select Plus Clinical 1.5), built from /gbdb/hg38/mane/mane.bb. Reuses nmdEscTranscripts.html. - Fix Rule 1: measure 50 bp upstream of the transcript's last splice junction (including 3'UTR introns) rather than stripping 3'UTR from the exon list first. The old logic painted the entire last CDS exon as NMD-escape whenever the transcript had only one CDS exon, even when a 3'UTR intron sat far past the stop codon (e.g. NBDY: 207 bp of CDS over-painted for a junction 2.6 kb past the stop). - Add --rule1-mode {cds,mrna} (default cds): cds counts only CDS bp on the walk-back (paints up to 50 bp of CDS matching the rule label literally); mrna counts mRNA bp and clips to CDS (tracks the 55 bp rule literature). Documented in makeDoc. - Rule 4: when a 3'UTR intron exists, the last CDS-containing exon has a downstream EJC and is now eligible for the long-exon rule. - Mouseover lists contributing transcript accessions when 1-3 items collapse into a region; falls back to a count above that. - Add filterText/filterType/filterLabel on all three escape subtracks so a user can narrow the display to one transcript. - genePredNmdEsc: --gene-sym-field (default 17 for Gencode; pass 18 for MANE, whose HGNC symbol lives in bigGenePred geneName2). - Add findShortTxLongUtrIntron.py helper for finding MANE transcripts with long UTR introns (used to pick NMD edge-case test cases). Post-fix collapsed-region counts (--rule1-mode=cds): MANE 1.5: 67,752 Gencode V49: 233,375 RefSeq Curated: 112,356 diff --git src/hg/makeDb/trackDb/human/hg38/nmdEscTranscripts.html src/hg/makeDb/trackDb/human/hg38/nmdEscTranscripts.html index b9207fc0d6f..cc74d2318c6 100644 --- src/hg/makeDb/trackDb/human/hg38/nmdEscTranscripts.html +++ src/hg/makeDb/trackDb/human/hg38/nmdEscTranscripts.html @@ -1,173 +1,180 @@
The NMD escape ruleset tracks show predicted regions where a premature termination codon (PTC) or frameshift variant is likely to cause the transcript to escape nonsense-mediated decay (NMD), leading to the production of an aberrant truncated protein rather than degradation of the mRNA.
The following rules were applied to transcript annotations to define predicted NMD escape regions (Nagy et al, Trends Biochem Sci 1998 and Lindeboom et al, Nat Genet 2016):
Non-coding transcripts (where CDS start equals CDS end) are excluded. Overlapping regions from multiple transcripts with identical coordinates and the same rule are collapsed into a single item, with the contributing transcript IDs stored as a comma-separated list.
Two versions of this track are available, based on different transcript annotation sets:
NMD escape regions were predicted based on the Exon Junction Complex (EJC)-dependent model of NMD. During normal translation, EJCs are deposited at exon-exon junctions after splicing. As the ribosome translates the mRNA, it displaces each EJC it encounters. When a PTC causes the ribosome to stall prematurely, any remaining downstream EJCs recruit surveillance factors (notably UPF1) that trigger mRNA degradation via NMD.
However, PTCs located in the last coding exon or within approximately 50 bp upstream of the last exon-exon junction are too close to the final EJC (or have no downstream EJC at all) for NMD to be triggered—the transcript escapes degradation. Conversely, PTCs located more than 50–55 bp upstream of the last exon-exon junction are predicted to elicit NMD.
Additional escape mechanisms, supported by Lindeboom et al. 2016 and other studies, are captured by three further rules:
Regions from overlapping transcripts with the same coordinates are collapsed into a single item. The gene symbol is shown as the item name. Mouseover displays the NMD escape rule and the number of transcripts. The details page lists all contributing transcript IDs.
Items are colored by the NMD escape rule that applies:
The data underlying this track can be explored interactively with the Table Browser or the Data Integrator. For automated analysis, the data may be queried from our REST API. Please refer to our mailing list archives for questions, or our Data Access FAQ for more information.
Thanks to Guido Neidhardt for suggesting this track at HUGO VEPTC 2025 and Andreas Lahner for feedback. Thanks to the Decipher Genome Browser team for introducing the idea of a track.
Kurosaki T, Popp MW, Maquat LE. Quality and quantity control of gene expression by nonsense-mediated mRNA decay. Nat Rev Mol Cell Biol. 2019 Jul;20(7):406-420. PMID: 30992545; PMC: PMC6855384
Lindeboom RGH, Supek F, Lehner B. The rules and impact of nonsense-mediated mRNA decay in human cancers. Nat Genet. 2016 Oct;48(10):1112-8. PMID: 27618451; PMC: PMC5045715
Nagy E, Maquat LE. A rule for termination-codon position within intron-containing genes: when nonsense affects RNA abundance. Trends Biochem Sci. 1998 Jun;23(6):198-9. PMID: 9644970