bbabbd5d2566d47d923d51dbe350634783455999 mspeir Sun Oct 26 12:14:52 2025 -0700 change soe to gi, refs #35031 diff --git src/hg/htdocs/goldenPath/help/bigMaf.html src/hg/htdocs/goldenPath/help/bigMaf.html index 8775d0a625f..6f62e5c5b9c 100755 --- src/hg/htdocs/goldenPath/help/bigMaf.html +++ src/hg/htdocs/goldenPath/help/bigMaf.html @@ -115,52 +115,52 @@ chr22_KI270731v1_random.gp genePred file.

Step 3. Download the autoSql file bigMaf.as needed by bedToBigBed. If you have opted to include the optional frame summary and information with your bigMaf file, you must also download the autoSql files mafSummary.as and mafFrames.as files.

Here are wget commands to obtain the above files and the hg38.chrom.sizes file mentioned below:

wget https://genome.ucsc.edu/goldenPath/help/examples/chr22_KI270731v1_random.maf
 wget https://genome.ucsc.edu/goldenPath/help/examples/chr22_KI270731v1_random.gp
 wget https://genome.ucsc.edu/goldenPath/help/examples/bigMaf.as
 wget https://genome.ucsc.edu/goldenPath/help/examples/mafSummary.as
 wget https://genome.ucsc.edu/goldenPath/help/examples/mafFrames.as
-wget http://hgdownload.soe.ucsc.edu/goldenPath/hg38/bigZips/hg38.chrom.sizes
+wget http://hgdownload.gi.ucsc.edu/goldenPath/hg38/bigZips/hg38.chrom.sizes
 

Step 4. Download the bedToBigBed and mafToBigMaf programs from the UCSC -binary utilities directory. If you have +binary utilities directory. If you have opted to generate the optional frame and summary files for your multiple alignment, you must also download the hgLoadMafSummary, genePredSingleCover, and genePredToMafFrames programs from the same -directory.

+directory.

Step 5. Use the fetchChromSizes script from the -same directory to create a +same directory to create a chrom.sizes file for the UCSC database with which you are working (e.g., hg38). Alternatively, you can download the chrom.sizes file for any assembly hosted at UCSC from our -downloads page (click on "Full +downloads page (click on "Full data set" for any assembly). For example, the hg38.chrom.sizes file for the hg38 database is located at -http://hgdownload.soe.ucsc.edu/goldenPath/hg38/bigZips/hg38.chrom.sizes.

+http://hgdownload.gi.ucsc.edu/goldenPath/hg38/bigZips/hg38.chrom.sizes.

mafToBigMaf hg38 chr22_KI270731v1_random.maf stdout | sort -k1,1 -k2,2n > bigMaf.txt
 bedToBigBed -type=bed3+1 -as=bigMaf.as -tab bigMaf.txt hg38.chrom.sizes bigMaf.bb 

Note that the hg38 in the mafToBigMaf hg38 command indicates the referenceDb and matches the expected prefix of the primary species' sequence name, for instance hg38 for the hg38.chr22_KI270731v1_random found in the input example chr22_KI270731v1_random.maf file.

Step 6. Follow the below steps to create the binary indexed mafFrames and mafSummary files to accompany your bigMaf file:

genePredSingleCover chr22_KI270731v1_random.gp single.gp
 genePredToMafFrames hg38 chr22_KI270731v1_random.maf bigMafFrames.txt hg38 single.gp
 bedToBigBed -type=bed3+8 -as=mafFrames.as -tab bigMafFrames.txt hg38.chrom.sizes bigMafFrames.bb
 
 hgLoadMafSummary -minSeqSize=1 -test hg38 bigMafSummary chr22_KI270731v1_random.maf
 cut -f2- bigMafSummary.tab | sort -k1,1 -k2,2n > bigMafSummary.bed
@@ -222,66 +222,66 @@
 similar to that which is available for a standard MAF track.

Example #2

In this example, you will create a bigMaf file from an existing bigMaf input file, bigMaf.txt, located on the UCSC Genome Browser http server.

  1. Save the bed3+1 example file, bigMaf.txt, to your computer (Step 6, above).
  2. Save the autoSql file bigMaf.as to your computer (Step 3, above).
  3. Download the - bedToBigBed utility + bedToBigBed utility (Step 4, above).
  4. Save the hg38.chrom.sizes text file to your computer. This file contains the chrom.sizes for the human (hg38) assembly (Step 5, above).
  5. Run the bedToBigBed utility to create a binary indexed MAF file (Step 6, above):
    bedToBigBed -type=bed3+1 -tab -as=bigMaf.as bigMaf.txt hg38.chrom.sizes bigMaf.bb
  6. Move the newly created bigMaf file (bigMaf.bb) to a web-accessible location (Step 7, above).
  7. Construct a track line that points to the bigMaf file (Step 8, above).
  8. Create the custom track on the human assembly hg38 (Dec. 2013), and view it in the Genome Browser (step 9, above).

Sharing your data with others

If you would like to share your bigMaf data track with a colleague, learn how to create a URL by looking at Example 6 on this page.

Extracting data from the bigMaf format

Because bigMaf files are an extension of bigBed files, which are indexed binary files, it can be difficult to extract data from them. UCSC has developed the following programs to assist in working with bigBed formats, available from the -binary utilities directory.

+binary utilities directory.

As with all UCSC Genome Browser programs, simply type the program name (with no parameters) at the command line to view the usage statement.

Troubleshooting

If you encounter an error when you run the bedToBigBed program, check your input file for data coordinates that extend past the the end of the chromosome. If these are present, run the bedClip program -(available here) to remove the problematic +(available here) to remove the problematic row(s) in your input file before running the bedToBigBed program.