f93b8662afd701763c24634879d05dc08b3178de
max
Fri Jun 5 02:24:16 2026 -0700
Add exon search: jump to GENE exon N from position box
I'm comitting this thinking that the way that we implement searches
leads to duplication of code that doesn't look great to me. While this
feature looks good, the code duplication across C/JS should probably
get reduced with a different approach to the "quick jump" way of the
page. We have currently three ways to quick jump, I think:
- chr:start-end
- rsxxxxx
- gene symbol + autosuggest pick
- HGVS?
They are recognized by both the javascript and the C code with regexes.
I think all of these should be probably be only implemented in the C
code. The JS only sends the current string to the C code and then
gets back if this can be autocompleted and to which position and what
to show in the autosuggest area. For example if you type
"SOD1
To specify a genome position:
A query may have multiple results. If this is the case, a results page will appear listing
each result along with the track it is associated with. Once selected, the result will be displayed
in the Browser with a highlighted label, making it easier to identify. If you have further
questions, you can search the Genome Browser FAQ
page and find links to further resources. Also, developers of track hubs can create
searchable track hubs using the
-To quickly jump to a codon or exon of a gene transcript:SYMBOL exon N or SYMBOL:e.N[+/-offset]
+ (e.g. TP53 exon 5, BRCA2:e.10, NM_000546:e.5+2)
searchTrix setting.
-
-
SYMBOL exon N — e.g. TP53 exon 5 or NM_000546 exon 5SYMBOL:e.N — compact notation from the
+ VICC Gene Fusion Specification,
+ e.g. TP53:e.5 or NM_000546:e.5. Optionally add an intronic offset:
+ SYMBOL:e.N+offset navigates offset bases past the 3’ end of the exon
+ (into the downstream intron), and SYMBOL:e.N-offset navigates offset bases
+ before the 5’ start (into the upstream intron). Useful for splice site inspection:
+ BRCA2:e.10+2 lands 2 bp into the intron after exon 10.+Exon numbering is 1-based and follows transcript order (exon 1 is the 5′ exon). +Genes are looked up in order: MANE, GENCODE/UCSC (knownGene), +all RefSeq, then historical RefSeq. To jump to a codon instead, right-click +any transcript in the browser and select “Zoom to codon”.
Below is a list of examples that might be used to query the Genome Browser. Note that not every query listed here will produce a result in every assembly. The list serves only to illustrate the different types of queries that can be performed.
| Query | Genome Browser Response |
|---|---|
| chr7 | Displays all of chromosome 7 |
| chr3:1-1000000 | Displays the first million bases of chromosome 3, counting from the p-arm telomere |
| PRNP | Displays the region containing HUGO Gene Nomenclature Committee identifier PRNP |
| Q99697 | Displays the region containing the alignment of the UniProt/SwissProt protein sequence with accession Q99697 (PITX2) |
| RH18061;RH80175 15q11;15q13 NM_012090.5;NM_012421.4 |
Displays the region between genome landmarks, such as the STS markers RH18061 and
RH80175, or chromosome bands 15q11 to 15q13, or SNPs NM_000310.4 and NM_012090.5. This syntax may also be used for other range queries, such as between uniquely determined ESTs, mRNAs, refSeqs, SNPS, etc. |
| NR_026861.1:1-1000 | Works with any other type of accession from this page: Displays the first 1000bp of NR_026861.1 |
| TP53 exon 5 NM_000546 exon 5 |
+ Jumps to exon 5 of TP53 using the verbose notation. + A transcript ID (NM_, NR_, XM_, XR_, ENST) may be used in place of the gene symbol. |
| TP53:e.5 NM_000546.6:e.5 BRCA2:e.10+2 BRCA2:e.10-3 |
+ Compact exon notation from the
+ VICC Gene Fusion Specification.
+ Jumps to exon 5 of TP53, or to 2 bp past the end / 3 bp before the start of BRCA2 exon 10
+ (useful for splice site inspection). The +N/-N offset is optional. |
| NM_000310.4(PPT1):c.271_287del17insTT NM_007262.5(PARK7):c.-24+75_-24+92dup NM_006172.4(NPPA):c.456_*1delAA MYH11:c.503-14_503-12del NM_198576.4(AGRN):c.1057C>T NM_198056.3:c.1654G>T NP_002993.1:p.Asp92Glu NP_002993.1:p.D92E BRCA1 Ala744Cys BRCA1 A744C LRG_100t1:c.4G>A LRG_100t1:n.1 LRG_456p1:p.Ser190Leu LRG_321:g.16409_16461del ENST00000002596.6:c.-108-6848A>G ENSP00000005178.5:p.Val20Gly chrX:g.31500000_31600000del NR_111987:n.-1 NM_015102.5:n.3038-2 NM_001372044:c.1528_1530del |
Displays the region that matches the HGVS
expression, usually in the format <transcript or protein>:<position> <amino acid or nucleotide change> If a gene symbol is used, HGVS search will try all RefSeq transcripts to find the nucleotide or amino acid at the position indicated in the expression. If there are multiple matches, a disambiguation page will be shown. If the RefSeq sequence differs from the genome sequence, then currently the search will use the genome, not the transcript, for codon counting and amino acid / nucleotide comparison. Please contact us if this is inconvenient. |
| NM_198056.2:c.1A>C | An example of an HGVS search on a previous NM version that is now outdated. Support for previous NM accessions is only available on hg38. |