c81011d4a8f57db347e15aa1248c501b2c8a6fea lrnassar Mon Jun 1 13:16:15 2026 -0700 QA fixes for the lrSv long-read SV supertrack: labels and description cleanups. refs #36258 Trim six subtrack longLabels to the 85-char limit (ga4kSv, hprc2Sv, hgsvc2Sv, chirmade101Sv, cpc1Sv, and lrSvAll; the lrSvAll change is also made in the lrSvMergeAll.py generator so a re-run reproduces it). Standardize the APR dataset name to "Arab Pangenome Reference (APR)" across lrSv.ra, lrSv.html, aprSv.html, and the makeDoc comment (was a mix of "Arabic" and "UAE UPR"). lrSv1kgOnt.html: state per-assembly SV counts (hg38 lifted 148,375 vs hs1 native 161,332, each with its own type breakdown) and encode non-ASCII author names as numeric entities. hgsvc3Sv.html: correct the hg38 counts to match the served bigBed (176,231 DEL+INS, 176,531 total). colorsDbSv.html: use $db in the hgdownload path so it resolves on hs1 as well as hg38. cpc1Sv.html: encode a Unicode minus sign as a numeric entity. diff --git src/hg/makeDb/trackDb/human/lrSv1kgOnt.html src/hg/makeDb/trackDb/human/lrSv1kgOnt.html index 9a379e674a6..fa8be24c6ae 100644 --- src/hg/makeDb/trackDb/human/lrSv1kgOnt.html +++ src/hg/makeDb/trackDb/human/lrSv1kgOnt.html @@ -1,30 +1,31 @@ <h2>Description</h2> <p> This track shows structural variants (SVs) identified by Oxford Nanopore long-read sequencing of 1,019 individuals from the 1000 Genomes Project, representing 26 populations across 5 continental regions: Africa (275 samples), East Asia (192), South Asia (199), Europe (189), and Americas (164). Median sequencing coverage was 16.9x per sample with a median N50 read length of 20.3 kb. </p> <p> SVs were discovered using the SAGA framework (SV Analysis by Graph Augmentation) and annotated with SVAN, which classifies insertions and deletions by their -mechanism of origin. The dataset contains 161,332 annotated SVs, -including 75,324 insertions, 66,192 deletions, and 19,816 complex rearrangements. -The original coordinates are on the T2T-CHM13 assembly (hs1); for GRCh38 (hg38), -coordinates were converted using liftOver (148,375 records mapped successfully). +mechanism of origin. The full release is native to the T2T-CHM13 assembly +(hs1) and contains 161,332 annotated SVs (75,324 insertions, 66,192 deletions, +and 19,816 complex rearrangements). For GRCh38 (hg38), coordinates were converted +using liftOver and 148,375 records mapped successfully (73,298 insertions, +58,637 deletions, and 16,440 complex rearrangements). </p> <p> The 1,019 samples sequenced here are distinct from those in the <a href="hgTrackUi?g=gustafsonSv">1KG ONT 100</a> track (Gustafson et al. 2024); the two releases were produced by separate consortia (Vienna and the 1000 Genomes ONT Sequencing Consortium, respectively) and there is no sample overlap between the two. </p> <h2>Display Conventions and Configuration</h2> <p> Items are colored by SV class: <ul> <li><span style="color: rgb(200,0,0);">Deletions (DEL)</span> - red</li> <li><span style="color: rgb(0,0,200);">Insertions (INS)</span> - blue</li> @@ -94,24 +95,24 @@ <p> Source data is available from the <a href="https://ftp.1000genomes.ebi.ac.uk/vol1/ftp/data_collections/1KG_ONT_VIENNA/" target="_blank">1000 Genomes ONT Vienna</a> data collection at IGSR. </p> <h2>Credits</h2> <p> Thanks to the 1000 Genomes ONT Vienna consortium for making their structural variant calls and SVAN annotations publicly available. </p> <h2>References</h2> <p> -Schloissnig S, Pani S, Ebler J, Hain C, Tsapalou V, Söylev A, Hüther P, Ashraf H, Prodanov T, +Schloissnig S, Pani S, Ebler J, Hain C, Tsapalou V, Söylev A, Hüther P, Ashraf H, Prodanov T, Asparuhova M <em>et al</em>. <a href="https://doi.org/10.1038/s41586-025-09290-7" target="_blank"> Structural variation in 1,019 diverse humans based on long-read sequencing</a>. <em>Nature</em>. 2025 Aug;644(8076):442-452. PMID: <a href="https://www.ncbi.nlm.nih.gov/pubmed/40702182" target="_blank">40702182</a>; PMC: <a href="https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12350158/" target="_blank">PMC12350158</a> </p>