fb1f5e7813084d80c19ff0e394457b5ff1b59386
max
  Fri Jul 4 03:36:19 2025 -0700
adding mgap track, refs #36032

diff --git src/hg/makeDb/trackDb/rhesus/rheMac10/mgap.html src/hg/makeDb/trackDb/rhesus/rheMac10/mgap.html
new file mode 100644
index 00000000000..ce83b44ef75
--- /dev/null
+++ src/hg/makeDb/trackDb/rhesus/rheMac10/mgap.html
@@ -0,0 +1,39 @@
+<h2>Description</h2>
+
+The Macaque Genotype and Phenotype Resource (mGAP) provides reference population-level short variant data for rhesus macaques. The current dataset was generated from ~4,000 rhesus macaques, primarily derived from the NIH National Primate Research Center (NPRC) colonies, although other colonies are represented. These are outbred, genetically diverse populations.
+
+<p>
+This track contains the ~130 million high-quality short variants (SNVs and INDELs), generated from whole genome and whole exome data. Samples were jointly called to ensure each sample was genotyped for all loci. This track omits sample-level genotypes; however, the complete dataset with genotypes is available to browse or download from  <a href="http://www.mgap.ohsu.edu/" target="_blank">https://mgap.ohsu.edu</a>. Allele frequency and other annotations are provided.
+</p>
+
+<p>
+One of the unique features of the mGAP dataset is that most samples are derived from captive macaques housed within the NIH NPRC system. As such, these animals have lifelong health records and usually come from a genetically validated pedigree. If a variant of interest is identified in the mGAP dataset, it is often possible to identify specific macaques harboring that variant and interrogate other data that exists on these animals. <a href="http://www.mgap.ohsu.edu/">The mGAP website contains additional search features for these data</a>.
+</p>
+
+<p>
+Email <a href="mailto:mgap@ohsu.edu">mgap@ohsu.edu</a> with any questions.
+</p>
+
+
+<h2>Methods</h2>
+
+Sequence reads are aligned to the Mmul10 (rheMac10) genomic assembly. Variant calling is performed using a modified version of the Broad Institute <a href="https://gatk.broadinstitute.org/hc/en-us/articles/360035535932-Germline-short-variant-discovery-SNPs-Indels" target="_blank">GATK SNP and Indel Discovery Best Practices</a>, adapted for macaques. Raw variant calls are subjected to fairly strict quality filtering approach, validated using Mendelian violations and other analyses to identify regions of the macaque genome with systematic errors in short read alignment or genotype calling.
+
+<p />
+
+Annotation takes place in two phases. First, variants are annotated according to their predicted consequence on protein coding. Second, a pipeline is run that maps macaque variants to the human genome and applies a panel of annotations. <a href="https://mgap.ohsu.edu/mGAP/mgap-annotation.view">Please see the mGAP website for more detail</a> (requires user registration).
+
+
+<h2>Credits</h2>
+
+mGAP is supported by NIH R24OD021324. Please remember to cite this funding source in all publications that make use of mGAP data.
+
+<h2>References</h2>
+<p>
+Benjamin N Bimber, Melissa Y Yan, Samuel M Peterson, Betsy Ferguson.
+<a href="https://pubmed.ncbi.nlm.nih.gov/30841849/"
+target="_blank">
+mGAP: the macaque genotype and phenotype resource, a framework for accessing and interpreting macaque variant data, and identifying new models of human disease</a>.
+<em>BMC Genomics.</em>. 2019 Mar 6.
+PMID: <a href="https://pubmed.ncbi.nlm.nih.gov/30841849/" target="_blank">30841849</a>
+</p>