383da828477aad2b3c6053880a64fdbfc2a00cd9
max
Thu Mar 19 02:30:41 2026 -0700
Fix varFreqs HTML issues and trexplorer citation, from AI code review 2026-03-19, refs #36642
Fix broken $db download URLs to hg38 in 14 HTML files, correct "Japanese"
to "Korean" in kova.html, fix "area" typo in schema.html, fix "Finnland"
to "Finland" in varFreqs.ra, normalize GREGoR capitalization, fix grammar,
quote all target=_blank attributes, capitalize GitHub consistently, and
fix bioRxiv citation formatting in trexplorer.html.
Co-Authored-By: Claude Opus 4.6
The data can be explored interactively with the Table Browser or the Data Integrator. For programmatic access, our REST API can be used; the track name is gasp. For bulk download, the VCF file can be obtained from
-our download server.
+our download server.
The original VCFs are also available from the GenomeAsia 100K website. Neither a license nor a login is required.
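The programmatic access mentioned above could look roughly like the following sketch. It assumes the public UCSC REST API endpoint `getData/track` at api.genome.ucsc.edu, and it assumes the track is served on hg38 (the commit message only says the download URLs were pointed at hg38). The helper names `build_track_url` and `fetch_track` and the example coordinates are hypothetical, and the shape of the JSON returned for this track is not described in the page, so the sketch only prints the top-level keys.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Assumption: the public UCSC REST API base URL.
API_BASE = "https://api.genome.ucsc.edu"


def build_track_url(genome, track, chrom, start, end):
    """Build a getData/track URL for one genomic region (hypothetical helper)."""
    params = {
        "genome": genome,
        "track": track,
        "chrom": chrom,
        "start": start,
        "end": end,
    }
    return f"{API_BASE}/getData/track?{urlencode(params)}"


def fetch_track(genome, track, chrom, start, end, timeout=30):
    """Fetch the region and decode the JSON response (needs network access)."""
    url = build_track_url(genome, track, chrom, start, end)
    with urlopen(url, timeout=timeout) as resp:
        return json.load(resp)


if __name__ == "__main__":
    # Assumption: hg38 assembly; the region is an arbitrary illustration.
    data = fetch_track("hg38", "gasp", "chr1", 100000, 101000)
    # The response layout is not documented here, so only list the keys.
    print(sorted(data.keys()))
```

Keeping URL construction separate from the network call makes the request easy to inspect or log before anything is sent.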
Samples were sequenced on Illumina HiSeq 2500, HiSeq 4000, and HiSeq X Ten instruments with 2×100 bp or 2×150 bp paired-end reads at an average depth of 36x. Reads were aligned to GRCh37 using BWA-MEM. Duplicate reads were marked with SAMBLASTER and sorted with Sambamba. Per-sample variant calling was performed with GATK HaplotypeCaller in GVCF mode, followed by joint genotyping with GenotypeGVCFs. Variant quality score recalibration (VQSR) was applied at a 99% sensitivity tranche for both SNPs and indels. Sample-level QC included contamination checks with verifyBamID and sex concordance verification. The final callset contains ∼65 million variants across 1,739 individuals from 219 populations.
-We provide documentation that indicates how all source files of the varFreqs track were converted in the makeDoc file of the track.
-For some tracks, python scripts were necessary and are also available from Github.
+We provide documentation that indicates how all source files of the varFreqs track were converted in the makeDoc file of the track.
+For some tracks, python scripts were necessary and are also available from GitHub.
GenomeAsia100K Consortium. The GenomeAsia 100K Project enables genetic discoveries across Asia. Nature. 2019 Dec;576(7785):106-111. PMID: 31802016; PMC: PMC7054211