3d9187d264d00ee8e681521bc2c942ee2527d4f1 max Wed May 13 07:33:38 2026 -0700 varFreqs: add WBBC (Westlake BioBank for Chinese) subtrack from the Phase I v20210103 release: 4,480 WGS samples, 78.6M variants, per-region frequencies for the 4 Han Chinese geographic subgroups (North/Central/South/Lingnan). databases.tsv + populations.tsv updated for the next varFreqsAll rebuild. refs #36642 Co-Authored-By: Claude Opus 4.7 (1M context) diff --git src/hg/makeDb/scripts/varFreqs/populations.tsv src/hg/makeDb/scripts/varFreqs/populations.tsv index b4e2d61e7db..d1c9dedb64b 100644 --- src/hg/makeDb/scripts/varFreqs/populations.tsv +++ src/hg/makeDb/scripts/varFreqs/populations.tsv @@ -1,36 +1,41 @@ # Population breakdown configuration for varFreqsAll combined track # db_key pop_key pop_name ac_field af_field # AllOfUs local ancestry populations AllOfUs AFR African AC_AFR AF_AFR AllOfUs AMR Indigenous American AC_AMR AF_AMR AllOfUs EAS East Asian AC_EAS AF_EAS AllOfUs EUR European AC_EUR AF_EUR AllOfUs OCE Oceanian AC_OCE AF_OCE AllOfUs SAS South Asian AC_SAS AF_SAS # GenomeAsia populations (7 groups in source VCF) GenomeAsia NEA Northeast Asian AC_NEA AF_NEA GenomeAsia SEA Southeast Asian AC_SEA AF_SEA GenomeAsia SAS South Asian AC_SAS AF_SAS GenomeAsia OCE Oceanian AC_OCE AF_OCE GenomeAsia AMR American AC_AMR AF_AMR GenomeAsia AFR African AC_AFR AF_AFR GenomeAsia WER Western European Ref AC_WER AF_WER # gnomAD HGDP+1kG continental groups HGDP1kG afr African gnomad_AC_afr gnomad_AF_afr HGDP1kG ami Amish gnomad_AC_ami gnomad_AF_ami HGDP1kG amr Latino gnomad_AC_amr gnomad_AF_amr HGDP1kG asj Ashkenazi Jewish gnomad_AC_asj gnomad_AF_asj HGDP1kG eas East Asian gnomad_AC_eas gnomad_AF_eas HGDP1kG fin Finnish gnomad_AC_fin gnomad_AF_fin HGDP1kG mid Middle Eastern gnomad_AC_mid gnomad_AF_mid HGDP1kG nfe Non-Finnish European gnomad_AC_nfe gnomad_AF_nfe HGDP1kG oth Other gnomad_AC_oth gnomad_AF_oth HGDP1kG sas South Asian gnomad_AC_sas gnomad_AF_sas # GREGoR affected/unaffected breakdown GREGoR AFF Affected AC_AFFECTED . GREGoR UNA Unaffected AC_UNAFFECTED . GREGoR UNK Unknown AC_UNKNOWN . # NPM Singapore (SG10K_Health) ancestry groups NPM Chinese Singapore Chinese AC_SgChinese AF_SgChinese NPM Malay Singapore Malay AC_SgMalay AF_SgMalay NPM Indian Singapore Indian AC_SgIndian AF_SgIndian +# WBBC Westlake BioBank for Chinese regional Han groups (AC not present, will be synthesized from AF*AN at build time) +WBBC North North Han . North_AF +WBBC Central Central Han . Central_AF +WBBC South South Han . South_AF +WBBC Lingnan Lingnan Han . Lingnan_AF