151410cc48b9b1f8b1cb9bee89b7004eca871c61
max
  Wed Apr 22 09:03:35 2026 -0700
lrSv: harmonize long-read shortLabels, add aprSv/cpc1Sv/abelSv to overview

Normalize the shortLabel text of every long-read subtrack to the pattern
"<Cohort> <N> SVs" (no commas in N): CoLoRSdb 1427, AoU 1027, ToMMo 333,
GA4K 502, deCODE 3622, HPRC v2 233, Kim PD 100 prelim. Short-read
comparators (abelSv, onekg3202Sr, tommoJpCnv) are left alone per user
instruction.

Also add three rows that were missing from lrSv.html's overview table:
aprSv (Arab APR 53), cpc1Sv (CPC 58, HPRC-specific SVs removed) and
abelSv (CCDG 17,795 Illumina short-read comparator). Updates the
comparator footnote to mention both short-read rows.

refs #36258

diff --git src/hg/makeDb/trackDb/human/lrSv.html src/hg/makeDb/trackDb/human/lrSv.html
index ebaced9d96e..11b1e7ea0b0 100644
--- src/hg/makeDb/trackDb/human/lrSv.html
+++ src/hg/makeDb/trackDb/human/lrSv.html
@@ -3,32 +3,33 @@
 This track collection contains structural variant (SV) calls derived from long-read sequencing
 studies. Structural variants are genomic rearrangements larger than ~50 bp, including
 deletions, insertions, duplications, inversions, and translocations. Long-read sequencing
 technologies can span repetitive regions and resolve complex rearrangements
 that are difficult to detect with short-read methods.
 </p>
 
 <h3>Available Datasets</h3>
 <p>
 SV length statistics (min / median / max) are computed from the <tt>svLen</tt>
 field of each track, in base pairs. Some tracks include sites with
 <tt>svLen=0</tt> (complex events where the reference and alternate alleles
 differ in sequence but not in length).
 </p>
 <p>
-All subtracks below are long-read callsets, except the last row (1KG 3202,
-Illumina short-read), which is included as a short-read comparator.
+All subtracks below are long-read callsets, except the last two rows
+(CCDG 17,795 and 1KG 3202, both Illumina short-read), which are
+included as short-read comparators.
 </p>
 <table class="stdTbl">
 <tr>
   <th>Dataset</th>
   <th>N samples</th>
   <th>Cohort / disease</th>
   <th>Sequencing</th>
   <th>SVs</th>
   <th>Min</th>
   <th>Median</th>
   <th>Max</th>
 </tr>
 <tr>
   <td><a href="hgTrackUi?g=colorsDbSv">CoLoRSdb</a></td>
   <td>1,427</td>
@@ -134,50 +135,80 @@
   <td>111,746</td>
   <td>50</td>
   <td>168</td>
   <td>57,207,414</td>
 </tr>
 <tr>
   <td><a href="hgTrackUi?g=hgsvc3Sv">HGSVC3</a></td>
   <td>65</td>
   <td>HGSVC3 diverse reference assemblies</td>
   <td>PacBio HiFi + ONT</td>
   <td>176,531</td>
   <td>50</td>
   <td>154</td>
   <td>30,176,500</td>
 </tr>
+<tr>
+  <td><a href="hgTrackUi?g=aprSv">Arab APR</a></td>
+  <td>53</td>
+  <td>UAE-resident Arabs from 8 countries (Arab Pangenome Reference)</td>
+  <td>PacBio HiFi + ONT + Hi-C (pangenome graph)</td>
+  <td>72,656</td>
+  <td>1</td>
+  <td>21</td>
+  <td>99,885</td>
+</tr>
+<tr>
+  <td><a href="hgTrackUi?g=cpc1Sv">CPC</a></td>
+  <td>58</td>
+  <td>Chinese Pangenome Consortium, 36 minority ethnic groups (HPRC-specific SVs removed)</td>
+  <td>PacBio HiFi (pangenome graph)</td>
+  <td>36,030</td>
+  <td>1</td>
+  <td>53</td>
+  <td>8,998,096</td>
+</tr>
 <tr>
   <td><a href="hgTrackUi?g=kwanhoSv">Kim PD Brain</a></td>
   <td>100</td>
   <td>Parkinson's disease, ILBD, controls (post-mortem brain)</td>
   <td>PacBio HiFi</td>
   <td>74,552</td>
   <td>50</td>
   <td>160</td>
   <td>190,088,222</td>
 </tr>
 <tr>
   <td><a href="hgTrackUi?g=chirmade101Sv">SVatalog 101</a></td>
   <td>101</td>
   <td>Long-read WGS cohort for GWAS LD fine-mapping (SickKids)</td>
   <td>long-read</td>
   <td>87,183</td>
   <td>4</td>
   <td>160</td>
   <td>1,321,484</td>
 </tr>
+<tr>
+  <td><a href="hgTrackUi?g=abelSv">CCDG 17,795 (short-read)</a></td>
+  <td>17,795</td>
+  <td>NHGRI CCDG + PAGE + SGDP (<b>short-read comparator</b>)</td>
+  <td><b>Illumina short-read</b></td>
+  <td>737,998</td>
+  <td>-1</td>
+  <td>-1</td>
+  <td>217,985,413</td>
+</tr>
 <tr>
   <td><a href="hgTrackUi?g=onekg3202Sr">1KG 3202 (short-read)</a></td>
   <td>3,202</td>
   <td>1000 Genomes expanded cohort (<b>short-read comparator</b>)</td>
   <td><b>Illumina short-read</b></td>
   <td>173,366</td>
   <td>1</td>
   <td>314</td>
   <td>154,807,729</td>
 </tr>
 </table>
 
 <h3>CoLoRSdb SVs (<a href="hgTrackUi?g=colorsDbSv">colorsDbSv</a>)</h3>
 <p>
 Structural variants from the Consortium of Long-Read Sequencing database