616cfce6b21924d958892448c83b7701f715416b jnavarr5 Tue Nov 11 14:17:47 2025 -0800 Fixing other special characters. refs #36642 diff --git src/hg/makeDb/trackDb/human/varFreqs.html src/hg/makeDb/trackDb/human/varFreqs.html index 8442fcdd6a1..6bbcbbc81c1 100644 --- src/hg/makeDb/trackDb/human/varFreqs.html +++ src/hg/makeDb/trackDb/human/varFreqs.html @@ -168,124 +168,132 @@ TSV data can be requested on the <a href="https://www.kobic.re.kr/kova/downloads" target="_blank">KOVA Downloads</a> website. Coverage 100x for WES, 30x for WGS. For details see (Lee et al, Exp Mol Med 2022). </li> </ul> <h2>Display Conventions</h2> <p>Most tracks only show the variant and allele frequencies on mouseover or clicks. When zoomed in, tracks display alleles with base-specific coloring. Homozygote data are shown as one letter, while heterozygotes will be displayed with both letters. </p> <p> -Full haplotype display - only for the MXB and HGDP tracks: In "pack" mode, this track sorts the haplotypes. This can be +Full haplotype display - only for the MXB and HGDP tracks: In "pack" mode, this track +sorts the haplotypes. This can be useful for determining the similarity between the samples and inferring inheritance at a particular locus. For a full description of how the display works, please see our <a href="../goldenpath/help/hgVcfTrackHelp.html">Haplotype Display help page</a>. Briefly, each sample's phased and/or homozygous genotypes are split into haplotypes, clustered by similarity around a central variant (in pink), and sorted for display by their position in the clustering tree. Click a variant to center on it. The tree (as space allows) is drawn in the label area next to the track image. Leaf clusters, in which all haplotypes are identical (at least for the variants used in clustering), are colored purple. </p> <p> -For NCBI ALFA: This track has no single VCF with INFO fields, but uses multiple subtracks instead, one per ancestry. +For NCBI ALFA: This track has no single VCF with INFO fields, but uses multiple subtracks instead, +one per ancestry. </p> <h2>Data Access</h2> <p>Most of the data in these tracks are not available for download from UCSC. Only individual variants can be browsed on our website. But the data can be downloaded for free from the original projects. Accessing the data usually requires a click-through license on the respectice websites: </p> <p> MXB: Allele frequencies by geographical state and ancestry are available via -the <a target=_blank href="https://morenolab.shinyapps.io/mexvar/">MexVar platform</a>. +the <a target="_blank" href="https://morenolab.shinyapps.io/mexvar/">MexVar platform</a>. Raw genotype data are available under controlled access at the -EGA (Study: EGAS00001005797; Dataset: EGAD00010002361). For the VCFs, email andres.moreno@cinvestav.mx. +EGA (Study: EGAS00001005797; Dataset: EGAD00010002361). For the VCFs, email +andres.moreno@cinvestav.mx. </p> <p> MCPS: VCFs with summarized allele frequencies are available from -the <a target=_blank href="https://rgc-mcps.regeneron.com/">MCPS website</a>. +the <a target="_blank" href="https://rgc-mcps.regeneron.com/">MCPS website</a>. </p> <p> Regeneron one million exomes: VCFs with summarized allele frequencies are available from -the <a target=_blank href="https://rgc-research.regeneron.com/me/resources">RGC ME website</a>. +the <a target="_blank" href="https://rgc-research.regeneron.com/me/resources">RGC ME website</a>. </p> <p> TOPMED: VCFs with summarized allele frequencies are available from -the <a target=_blank href="https://bravo.sph.umich.edu/">TOPMED BRAVO website</a>. They require a login. +the <a target="_blank" href="https://bravo.sph.umich.edu/">TOPMED BRAVO website</a>. They require a +login. </p> <p> GenomeAsia Pilot: VCFs are available from UCSC and also from the -the <a target=_blank href="https://browser.genomeasia100k.org/#tid=download">GenomeAsia 100K website</a>. No license nor login. +the <a target="_blank" +href="https://browser.genomeasia100k.org/#tid=download">GenomeAsia 100K website</a>. +No license nor login. </p> <h2>Methods</h2> <p> MXB: Genotyping was performed with the Illumina Multi-Ethnic Global Array (MEGA, ~1.8M SNPs), optimized for admixed populations and enriched for ancestry-informative and medically relevant variants. Only autosomal, biallelic SNPs passing quality control are included. Samples were selected from 898 recruitment sites, with prioritization of indigenous language speakers. Data processing included GenomeStudio → PLINK conversion, strand alignment, removal of duplicates, update of map positions using dbSNP Build 151 and low-quality variants/individuals, and relatedness filtering. </p> <p> -SGDP: The version used was https://sharehost.hms.harvard.edu/genetics/reich_lab/sgdp/vcf_variants/, +SGDP: The version used was +<a target="_blank" href="https://sharehost.hms.harvard.edu/genetics/reich_lab/sgdp/vcf_variants/" +>https://sharehost.hms.harvard.edu/genetics/reich_lab/sgdp/vcf_variants/</a>, merged with bcftools and lifted to hg38 with CrossMap. </p> <p> KOVA: V7 of the TSV.gz was obtained from the KOVA staff and converted to VCF. If it not available for download from our site but can be requested from the KOVA website. </p> <h2>Credits</h2> <p> MXB: We thank the Center for Research and Advanced Studies (Cinvestav) of Mexico for generating and providing the frequency data, the National Institute of Medical Sciences and Nutrition (INCMNSZ) for DNA extraction, and the Ministry of Health together with the National Institute of Public Health (INSP) for the design and implementation of the National Health Survey 2000 (ENSA 2000). We also thank the ENSA-Genomics Consortium for their contributions to sample collection and data processing that made possible the construction of the MXB genomic resource. </p> <p> MCPS: Data produced by Regeneron RGC and collaborators, which are the -University of Oxford, Universidad Nacional Autónoma de México (UNAM) and +University of Oxford, Universidad Nacional Autónoma de México (UNAM) and National Institute of Genomic Medicine in Mexico. The Regeneron Genetics Center, University of Oxford, Universidad Nacional -Autónoma de México (UNAM), National Institute of Genomic Medicine in Mexico, -Abbvie Inc. and AstraZeneca UK Limited (collectively, the “Collaborators”) bear +Autónoma de México (UNAM), National Institute of Genomic Medicine in Mexico, +Abbvie Inc. and AstraZeneca UK Limited (collectively, the "Collaborators") bear no responsibility for the analyses or interpretations of the data presented here. Any opinions, insights, or conclusions presented herein are those of the authors and not of the Collaborators. </p> </p> <p> Regeneron Million Exomes: The Regeneron Genetics Center, and its collaborators -(collectively, the “Collaborators”) bear no responsibility for the analyses or +(collectively, the "Collaborators") bear no responsibility for the analyses or interpretations of the data presented here. Any opinions, insights, or conclusions presented herein are those of the authors and not of the Collaborators. This research has been conducted using the UK Biobank Resource under application number 26041. </p> <p> SGDP: This project was funded by the Simons Foundation. Thanks to David Reich and Swapan Mallick for help with importing the data. </p> <p> KOVA: Thanks to Insu Jang and the KOVA director for providing variant frequencies in TSV format. </p> <p>Thanks to Alex Ioannidis, UCSC, and Andreas Lahner, MGZ, for feedback on this track.</p> <h2>References</h2> @@ -293,41 +301,41 @@ Barberena-Jonas, C. et al. (2025). MexVar database: Clinical genetic variation beyond the Hispanic label in the Mexican Biobank. <em>Nature Medicine (in press)</em>. </p> <p> Sohail M, Moreno-Estrada A. <a href="https://journals.biologists.com/dmm/article-lookup/doi/10.1242/dmm.050522" target="_blank"> The Mexican Biobank Project promotes genetic discovery, inclusive science and local capacity building</a>. <em>Dis Model Mech</em>. 2024 Jan 1;17(1). PMID: <a href="https://www.ncbi.nlm.nih.gov/pubmed/38299665" target="_blank">38299665</a>; PMC: <a href="https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10855211/" target="_blank">PMC10855211</a> </p> <p> -Sohail M, Palma-Martínez MJ, Chong AY, Quinto-Cortés CD, Barberena-Jonas C, Medina-Muñoz SG, -Ragsdale A, Delgado-Sánchez G, Cruz-Hervert LP, Ferreyra-Reyes L <em>et al</em>. +Sohail M, Palma-Martínez MJ, Chong AY, Quinto-Corés CD, Barberena-Jonas C, Medina-Muñoz SG, +Ragsdale A, Delgado-Sánchez G, Cruz-Hervert LP, Ferreyra-Reyes L <em>et al</em>. <a href="https://doi.org/10.1038/s41586-023-06560-0" target="_blank"> Mexican Biobank advances population and medical genomics of diverse ancestries</a>. <em>Nature</em>. 2023 Oct;622(7984):775-783. PMID: <a href="https://www.ncbi.nlm.nih.gov/pubmed/37821706" target="_blank">37821706</a>; PMC: <a href="https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10600006/" target="_blank">PMC10600006</a> </p> <p> -Ziyatdinov A, Torres J, Alegre-Díaz J, Backman J, Mbatchou J, Turner M, Gaynor SM, Joseph T, Zou Y, +Ziyatdinov A, Torres J, Alegre-Díaz J, Backman J, Mbatchou J, Turner M, Gaynor SM, Joseph T, Zou Y, Liu D <em>et al</em>. <a href="https://doi.org/10.1038/s41586-023-06595-3" target="_blank"> Genotyping, sequencing and analysis of 140,000 adults from Mexico City</a>. <em>Nature</em>. 2023 Oct;622(7984):784-793. PMID: <a href="https://www.ncbi.nlm.nih.gov/pubmed/37821707" target="_blank">37821707</a>; PMC: <a href="https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10600010/" target="_blank">PMC10600010</a> </p> <p> GenomeAsia100K Consortium. <a href="https://doi.org/10.1038/s41586-019-1793-z" target="_blank"> The GenomeAsia 100K Project enables genetic discoveries across Asia</a>. <em>Nature</em>. 2019 Dec;576(7785):106-111. PMID: <a href="https://www.ncbi.nlm.nih.gov/pubmed/31802016" target="_blank">31802016</a>; PMC: <a href="https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7054211/" target="_blank">PMC7054211</a> @@ -345,33 +353,33 @@ <p> Tadaka S, Kawashima J, Hishinuma E, Saito S, Okamura Y, Otsuki A, Kojima K, Komaki S, Aoki Y, Kanno T <em>et al</em>. <a href="https://academic.oup.com/nar/article-lookup/doi/10.1093/nar/gkad978" target="_blank"> jMorp: Japanese Multi-Omics Reference Panel update report 2023</a>. <em>Nucleic Acids Res</em>. 2024 Jan 5;52(D1):D622-D632. PMID: <a href="https://www.ncbi.nlm.nih.gov/pubmed/37930845" target="_blank">37930845</a>; PMC: <a href="https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10767895/" target="_blank">PMC10767895</a> </p> <p> Naslavsky MS, Scliar MO, Yamamoto GL, Wang JYT, Zverinova S, Karp T, Nunes K, Ceroni JRM, de -Carvalho DL, da Silva Simões CE <em>et al</em>. +Carvalho DL, da Silva Simões CE <em>et al</em>. <a href="https://doi.org/10.1038/s41467-022-28648-3" target="_blank"> -Whole-genome sequencing of 1,171 elderly admixed individuals from São Paulo, Brazil</a>. +Whole-genome sequencing of 1,171 elderly admixed individuals from São Paulo, Brazil</a>. <em>Nat Commun</em>. 2022 Mar 4;13(1):1004. PMID: <a href="https://www.ncbi.nlm.nih.gov/pubmed/35246524" target="_blank">35246524</a>; PMC: <a href="https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8897431/" target="_blank">PMC8897431</a> </p> <p> Jain A, Bhoyar RC, Pandhare K, Mishra A, Sharma D, Imran M, Senthivel V, Divakar MK, Rophina M, Jolly B <em>et al</em>. <a href="https://academic.oup.com/nar/article-lookup/doi/10.1093/nar/gkaa923" target="_blank"> IndiGenomes: a comprehensive resource of genetic variants from over 1000 Indian genomes</a>. <em>Nucleic Acids Res</em>. 2021 Jan 8;49(D1):D1225-D1232. PMID: <a href="https://www.ncbi.nlm.nih.gov/pubmed/33095885" target="_blank">33095885</a>; PMC: <a href="https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7778947/" target="_blank">PMC7778947</a>