7fd4df639618f431e81244f1c0ad911e4dcb0bd5 max Wed Jan 28 04:56:31 2026 -0800 adding Australia Biobank and fixing up NCBI Alfa docs, refs #36642 diff --git src/hg/makeDb/trackDb/human/varFreqs.html src/hg/makeDb/trackDb/human/varFreqs.html index 39d2482b47a..8a267679c99 100644 --- src/hg/makeDb/trackDb/human/varFreqs.html +++ src/hg/makeDb/trackDb/human/varFreqs.html @@ -55,41 +55,53 @@ were sequenced, a total of 142,357 individuals with whole-exome (WES) and 12,519 with whole-genome sequencing (WGS). The data contains 32,559 trios and 8,895 quads (one sibling without autism), and 824 twins. The same frequencies shown here are also available publicly on the SFARI Genome Browser. See (SPARK et al, Neuron 2018) for details or the methods below on this page.
Most tracks only show the variant and allele frequencies on mouseover or clicks. When zoomed in, tracks display alleles with base-specific coloring. Homozygote data are shown as one letter, while heterozygotes will be displayed with both -letters. -
- --For NCBI ALFA: This track has no single VCF with INFO fields, but uses multiple subtracks -instead, one per ancestry. +letters. All VCF files are normalized, with one single allele per annotation (no multi-allele lines).
Most of the data in these tracks are not available for download from UCSC. Data can be browsed on our website. But the data can be downloaded for free from the original projects. Accessing the -data usually requires a click-through license or access request on the respective websites, links are either -provided above in the project description or with more details here: +data usually requires a click-through license or filling out an access request form on the respective websites, links are either provided above in the project description or with more details here:
MXB: Allele frequencies by geographical state and ancestry are available via the MexVar platform. Raw genotype data are available under controlled access at the EGA (Study: EGAS00001005797; Dataset: EGAD00010002361). For the VCFs, email andres.moreno@cinvestav.mx.
MCPS: VCFs with summarized allele frequencies are available from the MCPS website.
Regeneron one million exomes: VCFs with summarized allele frequencies are available from the RGC ME website.
TOPMED: VCFs with summarized allele frequencies are available from the TOPMED BRAVO website. They require a login.
SFARI SPARK: Allele frequencies can be displayed on the SFARI Genome Browser. Full CRAMs and VCFs with genotypes are available from SFARI Base. They require a data access request, which is usually reviewed quickly. More information is available in the SPARK Welcome Packet.
+ ++Australia MGRB: VCF access can be requested via a form from +Sydney Genomics. +
+GenomeAsia Pilot: VCFs are available from UCSC and also from the GenomeAsia 100K website. No license nor login.
KOVA: TSV data can be requested on the KOVA Downloads website.
Finngen: TSV data can be requested via the form at https://finngen.gitbook.io/documentation/data-download which triggers an email with the download link.
SweGen: We are not allowed to redistribute the VCF, you can request it at SweGen, alongside the VCF file.