151410cc48b9b1f8b1cb9bee89b7004eca871c61
max
Wed Apr 22 09:03:35 2026 -0700
lrSv: harmonize long-read shortLabels, add aprSv/cpc1Sv/abelSv to overview
Normalize the shortLabel text of every long-read subtrack to the pattern
"
SV length statistics (min / median / max) are computed from the svLen field of each track, in base pairs. Some tracks include sites with svLen=0 (complex events where the reference and alternate alleles differ in sequence but not in length).
-All subtracks below are long-read callsets, except the last row (1KG 3202, -Illumina short-read), which is included as a short-read comparator. +All subtracks below are long-read callsets, except the last two rows +(CCDG 17,795 and 1KG 3202, both Illumina short-read), which are +included as short-read comparators.
| Dataset | N samples | Cohort / disease | Sequencing | SVs | Min | Median | Max |
|---|---|---|---|---|---|---|---|
| CoLoRSdb | 1,427 | @@ -134,50 +135,80 @@111,746 | 50 | 168 | 57,207,414 | ||
| HGSVC3 | 65 | HGSVC3 diverse reference assemblies | PacBio HiFi + ONT | 176,531 | 50 | 154 | 30,176,500 |
| Arab APR | +53 | +UAE-resident Arabs from 8 countries (Arab Pangenome Reference) | +PacBio HiFi + ONT + Hi-C (pangenome graph) | +72,656 | +1 | +21 | +99,885 | +
| CPC | +58 | +Chinese Pangenome Consortium, 36 minority ethnic groups (HPRC-specific SVs removed) | +PacBio HiFi (pangenome graph) | +36,030 | +1 | +53 | +8,998,096 | +
| Kim PD Brain | 100 | Parkinson's disease, ILBD, controls (post-mortem brain) | PacBio HiFi | 74,552 | 50 | 160 | 190,088,222 |
| SVatalog 101 | 101 | Long-read WGS cohort for GWAS LD fine-mapping (SickKids) | long-read | 87,183 | 4 | 160 | 1,321,484 |
| CCDG 17,795 (short-read) | +17,795 | +NHGRI CCDG + PAGE + SGDP (short-read comparator) | +Illumina short-read | +737,998 | +-1 | +-1 | +217,985,413 | +
| 1KG 3202 (short-read) | 3,202 | 1000 Genomes expanded cohort (short-read comparator) | Illumina short-read | 173,366 | 1 | 314 | 154,807,729 |
Structural variants from the Consortium of Long-Read Sequencing database