99db113e49bd1dd42a3040eaed1e61188a117ecf max Wed May 13 05:44:18 2026 -0700 mucins: new locus-specific mucin VNTR/non-VNTR exon track on hs1 (alpha) Adds a new 'Locus-specific' container in the genes group with two bigGenePred subtracks (mucinsVntr, mucinsNonVntr) derived from Plender et al. 2024 supplementary table 2. The .ra and HTML files live under trackDb/human/ so future locus-specific tracks can share them; hs1 picks them up via include ../mucins.ra alpha. refs #37540 diff --git src/hg/makeDb/trackDb/human/mucinsNonVntr.html src/hg/makeDb/trackDb/human/mucinsNonVntr.html new file mode 100644 index 00000000000..7255be14d5e --- /dev/null +++ src/hg/makeDb/trackDb/human/mucinsNonVntr.html @@ -0,0 +1,96 @@ +<h2>Description</h2> + +<p> +Mucins are large secreted or membrane-tethered glycoproteins that coat +the epithelial surfaces of the airway, gut, and reproductive tract and +form the protective mucus layer. Each mucin transcript is typically +made up of one long VNTR exon, which encodes the heavily glycosylated +tandem-repeat domain, flanked by many shorter, non-repetitive exons +that encode the cysteine-rich and other functional domains of the +protein. This track shows the non-VNTR (non-repetitive) exons of each +mucin transcript on the T2T-CHM13 (hs1) primary assembly. +</p> + +<h2>Display Conventions and Configuration</h2> + +<p> +Each item is a single exon, drawn in blue +(<span style="display:inline-block; background-color:#0072B2; width:18px; height:12px; vertical-align:middle;"></span>), +labeled with the gene symbol and the exon number from the source table +(e.g. <i>MUC1_exon7</i>). The transcript accession and gene symbol are +shown in the details page. The companion track +<a href="hgTrackUi?g=mucinsVntr">Mucin VNTR exons</a> shows the +tandem-repeat exons of the same transcripts. +</p> + +<h2>Methods</h2> + +<p> +Mucin gene structures with their corresponding VNTR and non-VNTR exons +were catalogued in a long-read sequencing study of the human mucin +gene family. The annotations on the T2T-CHM13 primary assembly were +curated manually from the gene models used in the supplementary tables +of that work; for each transcript, exons were labeled as VNTR or +non-VNTR based on whether they encode the tandem-repeat PTS domain. +</p> + +<p> +The starting file <tt>supp2_exons.txt</tt> was downloaded from the +publication's supplementary material and converted to bigGenePred by +the script described in the +<a href="https://github.com/ucscGenomeBrowser/kent/blob/master/src/hg/makeDb/doc/hs1/mucins.txt" +target="_blank">makeDoc</a>, which uses the converter in +<a href="https://github.com/ucscGenomeBrowser/kent/tree/master/src/hg/makeDb/scripts/mucins" +target="_blank">src/hg/makeDb/scripts/mucins</a>. Each exon row in the +source becomes a single-block bigGenePred entry; coordinates in the +source file are 0-based half-open and were passed through unchanged. +This subtrack contains the 335 exons labeled as <tt>non_VNTR_exon</tt> +across 14 mucin transcripts. +</p> + +<h2>Data Access</h2> + +<p> +The data can be explored interactively in table format with the +<a href="../cgi-bin/hgTables">Table Browser</a> or the +<a href="../cgi-bin/hgIntegrator">Data Integrator</a> and exported from +there to spreadsheet or tab-sep tables. From scripts, the data can be +accessed through our <a href="https://api.genome.ucsc.edu">API</a>, +track=<i>mucinsNonVntr</i>. +</p> + +<p> +For automated download and analysis, the annotation is stored in a +bigBed file that can be downloaded from +<a href="http://hgdownload.soe.ucsc.edu/gbdb/hs1/locusSpec/mucins/" +target="_blank">our download server</a>. The file for this track is +called <tt>mucinsNonVntr.bb</tt>. Individual regions or the whole +annotation can be obtained using the tool <tt>bigBedToBed</tt>, which +can be compiled from the source code or downloaded as a precompiled +binary for your system. Instructions for downloading source code and +binaries can be found +<a href="http://hgdownload.soe.ucsc.edu/downloads.html#utilities_downloads">here</a>. +The tool can also be used to obtain features within a given range, e.g. +<tt>bigBedToBed http://hgdownload.soe.ucsc.edu/gbdb/hs1/locusSpec/mucins/mucinsNonVntr.bb -chrom=chr1 -start=154000000 -end=154500000 stdout</tt>. +</p> + +<h2>Credits</h2> + +<p> +Thanks to the Eichler lab and the authors of the source publication +for making the curated mucin annotation available. +</p> + +<h2>References</h2> + + +<p> +Plender EG, Prodanov T, Hsieh P, Nizamis E, Harvey WT, Sulovari A, Munson KM, Kaufman EJ, +O'Neal WK, Valdmanis PN <em>et al</em>. +<a href="https://linkinghub.elsevier.com/retrieve/pii/S0002-9297(24)00213-1" target="_blank"> +Structural and genetic diversity in the secreted mucins MUC5AC and MUC5B</a>. +<em>Am J Hum Genet</em>. 2024 Aug 8;111(8):1700-1716. +PMID: <a href="https://www.ncbi.nlm.nih.gov/pubmed/38991590" target="_blank">38991590</a>; PMC: <a +href="https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11344006/" target="_blank">PMC11344006</a> +</p> +