bbabbd5d2566d47d923d51dbe350634783455999 mspeir Sun Oct 26 12:14:52 2025 -0700 change soe to gi, refs #35031 diff --git src/hg/htdocs/goldenPath/help/bigGenePred.html src/hg/htdocs/goldenPath/help/bigGenePred.html index 98a6290522f..603afa3a5f2 100755 --- src/hg/htdocs/goldenPath/help/bigGenePred.html +++ src/hg/htdocs/goldenPath/help/bigGenePred.html @@ -85,41 +85,41 @@ <h2>Creating a bigGenePred track from a bed12+8 file</h2> <p> <strong>Step 1.</strong> Format your pre-bigGenePred file. The first 12 fields of pre-bigGenePred files are described by the <a href="../../FAQ/FAQformat.html#format1">BED file format</a>. Your file must also contain the 8 extra fields described in the autoSql file definition shown above: <code>name2, cdsStartStat, cdsEndStat, exonFrames, type, geneName, geneName2, geneType</code>. For example, you can use this bed12+8 input file, <a href="examples/bigGenePred.txt">bigGenePred.txt</a>. Your pre-bigGenePred file must be sorted first on the <code>chrom</code> field, and secondarily on the <code>chromStart</code> field. You can use the UNIX <code>sort</code> command to do this:</p> <pre><code>sort -k1,1 -k2,2n unsorted.bed > input.bed</code></pre> <p> <strong>Step 2.</strong> Download the <code>bedToBigBed</code> program from the -<a href="http://hgdownload.soe.ucsc.edu/admin/exe/">binary utilities directory</a>.</p> +<a href="http://hgdownload.gi.ucsc.edu/admin/exe/">binary utilities directory</a>.</p> <p> <strong>Step 3.</strong> Download the <em>chrom.sizes</em> file for your assembly from -our <a href="http://hgdownload.soe.ucsc.edu/downloads.html">downloads</a> page (click on "Full +our <a href="http://hgdownload.gi.ucsc.edu/downloads.html">downloads</a> page (click on "Full data set" for your organism). For example, the <em>hg38.chrom.sizes</em> file for the hg38 database is located at -<a href="http://hgdownload.soe.ucsc.edu/goldenPath/hg38/bigZips/hg38.chrom.sizes" -target="_blank">http://hgdownload.soe.ucsc.edu/goldenPath/hg38/bigZips/hg38.chrom.sizes</a>. +<a href="http://hgdownload.gi.ucsc.edu/goldenPath/hg38/bigZips/hg38.chrom.sizes" +target="_blank">http://hgdownload.gi.ucsc.edu/goldenPath/hg38/bigZips/hg38.chrom.sizes</a>. Alternatively, you can use the <code>fetchChromSizes</code> script from the -<a href="http://hgdownload.soe.ucsc.edu/admin/exe/">utilities directory</a>. +<a href="http://hgdownload.gi.ucsc.edu/admin/exe/">utilities directory</a>. <p> <strong>Step 4.</strong> Create the bigGenePred file from your pre-bigGenePred file using the <code>bedToBigBed</code> utility command:</p> <pre><code><strong>bedToBigBed</strong> -as=bigGenePred.as -type=bed12+8 bigGenePred.txt chrom.sizes myBigGenePred.bb</code></pre> <p> <strong>Step 5.</strong> Move the newly created bigGenePred file (<em>myBigGenePred.bb</em>) to a web-accessible http, https, or ftp location. See <a href="hgTrackHubHelp.html#Hosting">hosting section</a> if necessary.</p> <p> <strong>Step 6.</strong> Construct a <a href="hgTracksHelp.html#CustomTracks">custom track</a> using a single <a href="hgTracksHelp.html#TRACK">track line</a>. Any of the track attributes will be available for use on bigBed tracks. The basic version of the track line will look something like this:</p> @@ -196,80 +196,80 @@ page</a> to view an example of bigGenePred amino acid display at the beginning of the SIRT1 gene on chromosome 10.</p> <p class='text-center'> <img class='text-center' src="../../images/codonColoring.png" alt="An image of a track with codons colored" width="999"> </p> <a name="Example3"></a> <h3>Example #3: Bed12+8 to BigGenePred</h3> <p> In this example, you will create your own bigGenePred file from an existing pre-bigGenePred input file, a bed12+8 file.</p> <ol> <li> Save the example bed12+8 input file to your computer, <em><a href="examples/bigGenePred.txt">bigGenePred.txt</a></em>.</li> <li> - Download the <a href="http://hgdownload.soe.ucsc.edu/admin/exe/"><code>bedToBigBed</code></a> + Download the <a href="http://hgdownload.gi.ucsc.edu/admin/exe/"><code>bedToBigBed</code></a> utility (<em>Step 2</em>, in the <em>Creating a bigGenePred</em> section above).</li> <li> Save the <a href="hg38.chrom.sizes"><em>hg38.chrom.sizes</em> text file</a> to your computer. This file contains the chrom.sizes for the human hg38 assembly (<em>Step 3</em>, above).</li> <li> Save the autoSql file <a href="examples/bigGenePred.as"><em>bigGenePred.as</em></a> to your computer.</li> <li> Run the <code>bedToBigBed</code> utility to create the bigGenePred output file (<em>step 4</em>, above): <pre><code><B>bedToBigBed</B> -type=bed12+8 -tab -as=bigGenePred.as bigGenePred.txt hg38.chrom.sizes bigGenePred.bb</code></pre></li> <li> Place the newly created bigGenePred file (<em>bigGenePred.bb</em>) on a web-accessible server (<em>Step 5</em>, above).</li> <li> Construct a track line that points to the bigGenePred file (<em>Step 6</em>, above).</li> <li> Create the custom track on the human assembly hg38 (Dec. 2013), and view it in the Genome Browser (<em>step 7</em>, above).</li> </ol> <a name="Example4"></a> <h3>Example #4: GTF (or GFF) to BigGenePred </h3> <p>In this example, you will convert a GTF file to bigGenePred using command line utilities. You will need <code>gtfToGenePred</code>, <code>genePredTobigGenePred</code>, and <code>bedToBigBed</code>. If you would like to convert a GFF file to bigGenePred, you will use <code>gff3ToGenePred</code> in place of the <code>gtfToGenePred</code>. You can download utilities from the -<a href="http://hgdownload.soe.ucsc.edu/admin/exe/">utilities directory</a>.</p> +<a href="http://hgdownload.gi.ucsc.edu/admin/exe/">utilities directory</a>.</p> <ol> <li> Obtain a GTF file using the <code>wget</code> command. Skip this step if you already have a GTF or GFF file. <pre><code>wget http://genome.ucsc.edu/goldenPath/help/examples/bigGenePredExample4.gtf</code></pre></li> <li> Convert the GTF file to genePred extended format using the <code>gtfToGenePred</code> command. <pre><code>gtfToGenePred -genePredExt bigGenePredExample4.gtf example4.genePred</code></pre> If you are converting a GFF file, use the <code>gff3ToGenePred</code> command. <pre><code>gff3ToGenePred yourFile.gff example4.genePred </pre></code></li> <li> Convert the genePred extended file to a pre-bigGenePred text file. <pre><code>genePredToBigGenePred example4.genePred bigGenePredEx4.txt</code></pre></li> <li> Download a helper file that specifies column names. <pre><code>wget https://genome.ucsc.edu/goldenPath/help/examples/bigGenePred.as</code></pre> </li> <li> Convert your text bigGenePred to a binary indexed format. If you are not using hg38, you will need to replace the hg38.chrom.sizes file path with your organism's file path from the - <a href="http://hgdownload.soe.ucsc.edu">downloads directory</a> under "Genome Sequence Files". - <pre><code>bedToBigBed -type=bed12+8 -tab -as=bigGenePred.as bigGenePredEx4.txt http://hgdownload.soe.ucsc.edu/goldenPath/hg38/bigZips/hg38.chrom.sizes bigGenePredEx4.bb</code></pre></li> + <a href="http://hgdownload.gi.ucsc.edu">downloads directory</a> under "Genome Sequence Files". + <pre><code>bedToBigBed -type=bed12+8 -tab -as=bigGenePred.as bigGenePredEx4.txt http://hgdownload.gi.ucsc.edu/goldenPath/hg38/bigZips/hg38.chrom.sizes bigGenePredEx4.bb</code></pre></li> <li> Put your binary indexed file, bigGenePredEx4.bb, in a web-accessible location. See the <a href="https://genome.ucsc.edu/goldenpath/help/hgTrackHubHelp.html#Hosting">hosting section</a> for more information.</li> <li> To view this example, you can click this into this Browser link. To view your own data, paste the link into your web browser and replace the URL after "bigDataUrl=" with a link to your own hosted data. <pre> <a href="http://genome.ucsc.edu/cgi-bin/hgTracks?db=hg38&position=chr19:44905790-44909388&hgct_customText=track%20type=bigGenePred%20bigDataUrl=http://genome.ucsc.edu/goldenPath/help/examples/bigGenePredEx4.bb">http://genome.ucsc.edu/cgi-bin/hgTracks?db=hg38&position=chr19:44905790-44909388&hgct_customText=track%20type=bigGenePred%20bigDataUrl=http://genome.ucsc.edu/goldenPath/help/examples/bigGenePredEx4.bb</a></pre> You can also add your data in the <a href="../../cgi-bin/hgCustom?db=hg38">custom track management page</a>. This allows you to set position, configuration options, and write a more complete desciption. If you want to see codons, you can right click, then click configure codon view or set this options using <code>baseColorDefault=genomicCodons</code> as is done below. <pre><code>browser position chr19:44905790-44909388 track type=bigGenePred baseColorDefault=genomicCodons name="bigGenePred Example Four" description="Ex4:BigGenePred Made from GTF" visibility=pack bigDataUrl=http://genome.ucsc.edu/goldenPath/help/examples/bigGenePredEx4.bb</code></pre> Once you are done, you should have a track on the Genome Browser like the one below.</li> @@ -320,41 +320,41 @@ <p class='text-center'> <img class='text-center' src="../../images/bigGenePredHub.png" alt="An image of a bigGenePred track hub on the Browser" width="999"> </p> </ol> <h2>Sharing your data with others</h2> <p> If you would like to share your bigGenePred data track with a colleague, learn how to create a URL link to your data by looking at <a href="customTrack.html#EXAMPLE6">Example #6</a> on the custom track help page.</p> <h2>Extracting data from bigBed format</h2> <p> Because the bigGenePred files are an extension of bigBed files, which are indexed binary files, it can be difficult to extract data from them. UCSC has developed the following programs to assist in working with bigBed formats, available from the -<a href="http://hgdownload.soe.ucsc.edu/admin/exe/">binary utilities directory</a>.</p> +<a href="http://hgdownload.gi.ucsc.edu/admin/exe/">binary utilities directory</a>.</p> <ul> <li> <code>bigBedToBed</code> — converts a bigBed file to ASCII BED format.</li> <li> <code>bigBedSummary</code> — extracts summary information from a bigBed file.</li> <li> <code>bigBedInfo</code> — prints out information about a bigBed file.</li> </ul> <p> As with all UCSC Genome Browser programs, simply type the program name (with no parameters) at the command line to view the usage statement.</p> <h2>Troubleshooting</h2> <p> If you encounter an error when you run the <code>bedToBigBed</code> program, check your input file for data coordinates that extend past the end of the chromosome. If these are present, run the <code>bedClip</code> program -(<a href="http://hgdownload.soe.ucsc.edu/admin/exe/">available here</a>) to remove the problematic +(<a href="http://hgdownload.gi.ucsc.edu/admin/exe/">available here</a>) to remove the problematic row(s) before running the <code>bedToBigBed</code> program. </p> <!--#include virtual="$ROOT/inc/gbPageEnd.html" --> <!DOCTYPE html> <!--#set var="TITLE" value="Genome Browser bigGenePred Track Format" --> <!--#set var="ROOT" value="../.." -->