7594507ca126d5242346787e42e13c52ea7709b1 max Fri Apr 17 08:40:31 2026 -0700 Add lrSv supertrack: long-read structural variants from 9 studies (hg38). #Preview2 week - bugs introduced now will need a build patch to fix Sub-tracks (all bigBed 9+): han945Sv - 945 Han Chinese, ONT (Gong 2025, PMID 39929826) lrSv1kgOnt - 1019 1000 Genomes, ONT, SVAN-annotated (Schloissnig 2025, PMID 40702182; lifted from hs1) tommoJpSv - 333 Japanese (111 trios), ONT (Otsuki 2022, PMID 36127505) aou1kSv - 1027 All of Us, PacBio HiFi (Garimella 2025, PMID 41256123) ga4kSv - 502 GA4K pediatric rare disease, PacBio HiFi (Cohen 2022, PMID 35305867) decodeSv - 3622 Icelanders, ONT (Beyter 2021, PMID 33972781) hgsvc3Sv - 65 HGSVC3 diverse haplotype-resolved assemblies, HiFi+ONT (Logsdon 2025, PMID 40702183; merges insdel+inv tables) kwanhoSv - 100 post-mortem brains (PD/ILBD/HC), PacBio HiFi (Kim 2026, PMID 41929179) chirmade101Sv - 101 long-read WGS GWAS SVatalog cohort (Chirmade 2026, PMID 41203876) Includes per-track conversion scripts and autoSql under scripts/lrSv/, the supertrack summary table in lrSv.html, and a consolidated makeDoc at doc/hg38/lrSv.txt. refs #36258 Co-Authored-By: Claude Opus 4.7 (1M context) diff --git src/hg/makeDb/trackDb/human/kwanhoSv.html src/hg/makeDb/trackDb/human/kwanhoSv.html new file mode 100644 index 00000000000..d888c3a6f52 --- /dev/null +++ src/hg/makeDb/trackDb/human/kwanhoSv.html @@ -0,0 +1,106 @@ +

Description

+

+This track shows structural variants (SVs) identified by PacBio HiFi long-read +whole-genome sequencing of 100 post-mortem human brain samples, split across +three diagnostic groups: Parkinson's disease (PD), incidental Lewy body +disease (ILBD) and healthy controls (HC). The high-confidence catalog +contains 74,552 SVs: 34,056 insertions, 29,545 deletions, 9,707 duplications +and 1,244 inversions. +

+

+The dataset accompanies Kim et al. (2026), which combines the long-read SV +catalog with single-nucleus RNA-seq from the same donors to identify SVs +associated with cell-type-specific gene expression, including variants +near genes nominated as causal targets of PD GWAS loci. +

+ +

Display Conventions and Configuration

+

+Items are colored by SV type: +

+

+

+Insertions are placed at the insertion site with a width of 1 bp; deletions, +duplications and inversions span the affected reference interval. Filters are +available for SV type, SV length, variant quality and allele frequencies in +each of the three cohorts (PD, HC, ILBD), as well as the case-minus-control +carrier-rate differential. +

+

+The detail page shows, for each variant: +

+

+ +

Methods

+

+Long-read whole-genome sequencing was performed on 100 post-mortem brain +samples (35 PD, 31 ILBD, 34 HC) with PacBio HiFi chemistry. Per-sample SV +calls from multiple callers were merged into a joint callset; the +high-confidence filtered catalog released in Supplementary Table 13 +(media-13.txt) of the Kim et al. 2026 preprint is used directly +here. Per-cohort allele frequencies, Hardy-Weinberg statistics and case / +control carrier rates are reported in the source table; the track exposes +the allele counts and the case-control differential as filterable fields. +The paper also integrates single-nucleus RNA-seq from two brain regions +of the same donors to test SV-expression associations in specific cell +types, but that layer is not shown in this track. +

+ +

Data Access

+

+The data can be explored interactively in table format with the +Table Browser or the +Data Integrator, and accessed +programmatically through our API, +track=kwanhoSv. +

+

+The bigBed is available from +our +download server as kwanho.bb. Example: +bigBedToBed http://hgdownload.soe.ucsc.edu/gbdb/hg38/lrSv/kwanho.bb -chrom=chr21 -start=0 -end=100000000 stdout. +

+

+The full supplementary data for the paper (including media-13.txt) +is available from the Kim et al. 2026 preprint. +

+ +

Credits

+

+Thanks to Kim, Levin and colleagues at the Aligning Science Across +Parkinson's (ASAP) Collaborative Research Network, the Broad Institute, +Yale School of Medicine, Banner Sun Health Research Institute and their +collaborators for releasing this dataset. +

+ +

References

+ + +

+Kim K, Lin Z, Simmons SK, Parker J, Kearney M, Liao Z, Haywood N, Zhang J, Cline MP, Tuncali I +et al. + +Integrating Long-Read Structural Variant Analysis with single-nucleus RNA-seq to Elucidate Gene +Expression Effects in Disease. +bioRxiv. 2026 Mar 23;. +PMID: 41929179; PMC: PMC13041997 +

+