383da828477aad2b3c6053880a64fdbfc2a00cd9
max
Thu Mar 19 02:30:41 2026 -0700
Fix varFreqs HTML issues and trexplorer citation, from AI code review 2026-03-19, refs #36642
Fix broken $db download URLs to hg38 in 14 HTML files, correct "Japanese"
to "Korean" in kova.html, fix "area" typo in schema.html, fix "Finnland"
to "Finland" in varFreqs.ra, normalize GREGoR capitalization, fix grammar,
quote all target=_blank attributes, capitalize GitHub consistently, and
fix bioRxiv citation formatting in trexplorer.html.
Co-Authored-By: Claude Opus 4.6
The data can be explored interactively with the Table Browser or the Data Integrator. For programmatic access, our REST API can be used; the track name is allofus. For bulk download, the VCF file can be obtained from -our download server. +our download server.
Variant data and individual-level data are accessible through the All of Us Researcher Workbench, which requires registration and completion of a training program. Aggregate allele frequency data is freely available.
Whole-genome sequencing was performed on the Illumina NovaSeq 6000 platform with PCR-free library preparation targeting 30x coverage. Reads were aligned to GRCh38 and variants were called using the Illumina DRAGEN (Dynamic Read Analysis for GENomics) pipeline, which performs mapping, alignment, sorting, duplicate marking, and variant calling (SNVs and indels) in a single hardware-accelerated workflow. Joint genotyping was performed across all samples. Quality control included sample-level filtering for contamination, sex discordance, and relatedness, and variant-level filtering using VQSR. Population-specific allele frequencies were determined using local ancestry inference at UCSC by the Ioannidis group. The ancestry breakdown into European, East Asian, African, Indigenous American, Oceanian, and South Asian components is part of a pending publication.
-At UCSC, we provide documentation that indicates how all source files of the varFreqs track were converted in the makeDoc file of the track. -For some tracks, python scripts were necessary and are also available from Github. +At UCSC, we provide documentation that indicates how all source files of the varFreqs track were converted in the makeDoc file of the track. +For some tracks, python scripts were necessary and are also available from GitHub.
The All of Us Research Program is supported by the National Institutes of Health. We thank the participants and the program for making frequency data available. The local ancestry inference was performed by Qudsi Aljabiri and Cole Shanks under Prof. Alexander Ioannidis, UC Santa Cruz.