383da828477aad2b3c6053880a64fdbfc2a00cd9 max Thu Mar 19 02:30:41 2026 -0700 Fix varFreqs HTML issues and trexplorer citation, from AI code review 2026-03-19, refs #36642 Fix broken $db download URLs to hg38 in 14 HTML files, correct "Japanese" to "Korean" in kova.html, fix "area" typo in schema.html, fix "Finnland" to "Finland" in varFreqs.ra, normalize GREGoR capitalization, fix grammar, quote all target=_blank attributes, capitalize GitHub consistently, and fix bioRxiv citation formatting in trexplorer.html. Co-Authored-By: Claude Opus 4.6 diff --git src/hg/makeDb/trackDb/human/saudi.html src/hg/makeDb/trackDb/human/saudi.html index b35cd9993ba..fdb41d9141d 100644 --- src/hg/makeDb/trackDb/human/saudi.html +++ src/hg/makeDb/trackDb/human/saudi.html @@ -2,50 +2,50 @@

Variant frequencies from 302 whole genomes at 30x coverage from the Saudi Genome Program. The genotyping data and imputations from 3,352 individuals do not seem to be available publicly.

Data Access

The data can be explored interactively with the Table Browser or the Data Integrator. For programmatic access, our REST API can be used; the track name is saudi. For bulk download, the VCF file can be obtained from -our download server. +our download server.

The original data were downloaded from Figshare and converted to VCF.

Methods

Whole-genome sequencing of 302 Saudi Arabian individuals was performed on the Illumina HiSeq X Ten platform using TruSeq Nano DNA library preparation at 30x target coverage. Sequencing and initial bioinformatics processing were carried out by deCODE Genetics (Reykjavík, Iceland). Reads were aligned to the GRCh38 reference genome using BWA 0.7.10. Per-sample variant calling was performed with GATK HaplotypeCaller, followed by joint genotyping using CombineGVCFs and GenotypeGVCFs. Variant quality score recalibration (VQSR) was applied for both SNPs and indels. The final autosomal callset contains 25.5 million variants across the 302 individuals.

The variant data were downloaded from Figshare and converted to VCF format using a custom script. -We provide documentation that indicates how all source files of the varFreqs track were converted in the makeDoc file of the track. -For some tracks, python scripts were necessary and are also available from Github. +We provide documentation that indicates how all source files of the varFreqs track were converted in the makeDoc file of the track. +For some tracks, python scripts were necessary and are also available from GitHub.

References

Malomane DK, Williams MP, Huber CD, Mangul S, Abedalthagafi M, Chiang CWK. Patterns of population structure and genetic variation within the Saudi Arabian population. bioRxiv. 2025 Jan 13;. PMID: 39868174; PMC: PMC11761371