7072384d5e6f4228bec4186d3d677527be0c9bc5 mspeir Fri Jun 26 09:37:20 2026 -0700 Adding data access to hs1 pages on the RR, refs # diff --git src/hg/makeDb/trackDb/human/hs1/html/gwasSNPs2022-03-08.html src/hg/makeDb/trackDb/human/hs1/html/gwasSNPs2022-03-08.html index 95f64d14eb8..0a4548c6f54 100644 --- src/hg/makeDb/trackDb/human/hs1/html/gwasSNPs2022-03-08.html +++ src/hg/makeDb/trackDb/human/hs1/html/gwasSNPs2022-03-08.html @@ -1,14 +1,32 @@

Description

A GWAS Catalog VCF was generated by intersecting RefSeq IDs in the GWAS Catalog associations v1.0 file (ebi.ac.uk/gwas/api/search/downloads/full, accessed 2022-03-08) with all dbSNP build 155 variants on the primary contigs for Chromosomes 1-22, Chromsome X and Chromosome Y. These variants were lifted over from GRCh38 to the T2T-CHM13 assembly. This track contains GWAS Catalog variants that lifted over from GRCh38 to the T2T-CHM13v2.0 assembly. This includes variants that lifted over perfectly, as well as variants that failed initial liftover due to reference/alternative allele swaps but were recovered on subsequent liftover, with reference and alternative alleles swapped appropriately.

These two sets of variants are included together in this track. If you are interested in downloading these sets separately (i.e., variants that lifted over perfectly vs. recovered variants with ref/alt allele swaps) they can be accessed here: https://s3-us-west-2.amazonaws.com/human-pangenomics/index.html?prefix=T2T/CHM13/assemblies/annotation/liftover/.

Methods

We performed liftover using the GATK release 4.1.9 LiftoverVcf (Picard Version 2.23.3) tool with the default parameters. This successfully lifts over variants that map exactly from GRCh38 to T2T-CHM13v2.0 but does not recover variants with swapped reference and alternative alleles. To recover variants with swapped reference/alternative alleles, we ran LiftoverVCF again, with the RECOVER_SWAPPED_REF_ALT flag. Notably, this feature does not recover multiallelic variants, so to recover these variants, we first separated them into multiple biallelic variants, performed liftover using the RECOVER_SWAPPED_REF_ALT tag, and converted them back to their multiallelic representations.

+

Data Access

+

+The variant data can be explored interactively with the +Table Browser or the +Data Integrator. The data can also be +accessed from scripts through our REST +API, where the track name is gwasSNPs2022-03-08.

+

+The underlying VCF file is available from our +download server as +chm13v2.0_GWASv1.0rsids_e100_r2022-03-08.vcf.gz. +

+Please refer to our +mailing +list archives for questions, or our +Data Access FAQ for more +information.

+

Contacts

References

Van der Auwera GA & O'Connor BD. (2020). Genomics in the Cloud: Using Docker, GATK, and WDL in Terra (1st Edition). O'Reilly Media.