7594507ca126d5242346787e42e13c52ea7709b1 max Fri Apr 17 08:40:31 2026 -0700 Add lrSv supertrack: long-read structural variants from 9 studies (hg38). #Preview2 week - bugs introduced now will need a build patch to fix Sub-tracks (all bigBed 9+): han945Sv - 945 Han Chinese, ONT (Gong 2025, PMID 39929826) lrSv1kgOnt - 1019 1000 Genomes, ONT, SVAN-annotated (Schloissnig 2025, PMID 40702182; lifted from hs1) tommoJpSv - 333 Japanese (111 trios), ONT (Otsuki 2022, PMID 36127505) aou1kSv - 1027 All of Us, PacBio HiFi (Garimella 2025, PMID 41256123) ga4kSv - 502 GA4K pediatric rare disease, PacBio HiFi (Cohen 2022, PMID 35305867) decodeSv - 3622 Icelanders, ONT (Beyter 2021, PMID 33972781) hgsvc3Sv - 65 HGSVC3 diverse haplotype-resolved assemblies, HiFi+ONT (Logsdon 2025, PMID 40702183; merges insdel+inv tables) kwanhoSv - 100 post-mortem brains (PD/ILBD/HC), PacBio HiFi (Kim 2026, PMID 41929179) chirmade101Sv - 101 long-read WGS GWAS SVatalog cohort (Chirmade 2026, PMID 41203876) Includes per-track conversion scripts and autoSql under scripts/lrSv/, the supertrack summary table in lrSv.html, and a consolidated makeDoc at doc/hg38/lrSv.txt. refs #36258 Co-Authored-By: Claude Opus 4.7 (1M context) diff --git src/hg/makeDb/trackDb/human/han945Sv.html src/hg/makeDb/trackDb/human/han945Sv.html new file mode 100644 index 00000000000..2cd2b28f29d --- /dev/null +++ src/hg/makeDb/trackDb/human/han945Sv.html @@ -0,0 +1,64 @@ +

Description

+

+This track shows structural variants (SVs) identified by long-read sequencing +of 945 Han Chinese individuals. The dataset contains 111,288 SVs merged across +samples using SURVIVOR, including 49,518 deletions, 42,300 insertions, +13,503 duplications, 5,595 inversions, and 372 translocations. +

+ +

Display Conventions and Configuration

+

+Items are colored by SV type: +

+

+

+Filters are available for SV type, SV length, allele frequency, and number of +supporting samples. For insertions, the item is placed at the insertion site +with a width of 1 bp. For translocations, only the first breakpoint is shown; +the second breakpoint chromosome and position are listed in the item details. +

+ +

Methods

+

+Long-read sequencing was performed on 945 Han Chinese individuals. +Structural variants were called per sample and then merged across all samples using +SURVIVOR +(v1.0.6). The merged VCF was converted to bigBed format for display. +Allele frequencies and per-sample support information were extracted from the +INFO fields of the merged VCF. The study identified two notable variants: +an ancestral deletion in GSDMD associated with bone density and kidney injury +risk, and a modern human-specific variant in WWP2 influencing height, body +composition, and facial features. +

+ +

Data Access

+

+The raw VCF data was obtained from the +OMIX repository +(accession OED00945268) at the National Genomics Data Center (NGDC), +China National Center for Bioinformation. +

+ +

Credits

+

+Thanks to Gong et al. for making their structural variant calls publicly available. +

+ +

References

+ +

+Gong J, Sun H, Wang K, Zhao Y, Huang Y, Chen Q, Qiao H, Gao Y, Zhao J, Ling Y et al. + +Long-read sequencing of 945 Han individuals identifies structural variants associated with +phenotypic diversity and disease susceptibility. +Nat Commun. 2025 Feb 10;16(1):1494. +PMID: 39929826; PMC: PMC11811171 +

+