b866b80e5a3d77797edd9c29ecf7eae04e6c6daf
markd
  Fri Mar 1 19:23:36 2024 -0800
added a bit of details to CRAM refUrl to make it a bit clearer

diff --git src/hg/htdocs/goldenPath/help/cram.html src/hg/htdocs/goldenPath/help/cram.html
index b3de4c3..23cd8d3 100755
--- src/hg/htdocs/goldenPath/help/cram.html
+++ src/hg/htdocs/goldenPath/help/cram.html
@@ -3,43 +3,43 @@
 <!--#set var="ROOT" value="../.." -->
 
 <!-- Relative paths to support mirror sites with non-standard GB docs install -->
 <!--#include virtual="$ROOT/inc/gbPageStart.html" -->
 
 <h1>CRAM Track Format</h1>
 <p>
 The UCSC Genome Browser is capable of displaying both the BAM  and CRAM file formats. While BAM 
 files contain all sequence data within a file, CRAM files are smaller by taking advantage of an 
 additional external &quot;reference sequence&quot; file. This file is needed to both compress and 
 decompress the read information.</p>
 <p>
 Since CRAM files are more dense than <a href="/goldenPath/help/bam.html">BAM</a> files, many groups
 are switching to the CRAM format to save disk space. For CRAM tracks to load there is an expectation
 that the checksum of the reference sequence used to create the CRAM will be in the CRAM header. A
-file with a matching checksum is also expected to be accessible from the EBI CRAM reference registry
+file with a matching checksum is also expected to be accessible from the EBI RefGet CRAM reference registry
 (see <a href="#refs">References</a> for CRAM resources). Otherwise, users must specify a
 <code>refUrl</code> setting that will point to a server that is offering up the reference sequences
 (see <a href="#example4">Example Four</a>).</p>
 <p>
 Since the loading of CRAM data requires the specific reference sequence used to create the CRAM 
 file, it is very important that the exact same reference sequence is used for compression and 
 decompression. When a CRAM file is first loaded on a given chromosome, a check for the preexistence 
 in a special browser &quot;cramCache&quot; directory of the specified reference checksum will take 
 place. If the reference sequence information specific for that CRAM for the currently viewed 
 chromosome region does not exist, a message will display about the file not being found along with 
 a note about downloading the reference from the EBI CRAM reference registry if it is available (or 
-from the specified <code>refUrl</code>). A refresh of the page once the download is complete will 
+from another Refget server using <code>refUrl</code>). A refresh of the page once the download is complete will 
 display the CRAM data as if it were a BAM file.</p>
 <p>
 The track lines to describe CRAM tracks are identical to track lines for BAM tracks. This includes 
 the <code>type</code> parameter, which is still <code>bam</code> even for CRAM tracks. The only 
 difference is that instead of providing the URL to a BAM file, the URL instead points to a CRAM 
 file.</p>
 <p>
 Please also note that just as a BAM file requires an associated BAM.bai index file, a CRAM file will
 require an associated CRAM.crai index file in the same location to load.</p>
 
 <h2>Example #1</h2>
 <p>
 Here is an example CRAM track that displays around the gene SOD1 on hg19 that can be cut and pasted 
 as text into the <a href="/cgi-bin/hgCustom?db=hg19" target="_blank">Custom Tracks</a> page:</p>
 <pre><code>track type=bam db=hg19 name=exampleCRAM bigDataUrl=http://genome.ucsc.edu/goldenPath/help/examples/cramExample.cram </code></pre>
@@ -82,58 +82,66 @@
 The CRAM format is also supported in track hubs. Below is an example trackDb.txt stanza that would 
 display a CRAM files from the 1000 Genomes Project. To learn more about using Track Hubs see the 
 <a href="hgTrackHubHelp.html" target="_blank">User Guide</a> and associated Quick Start Guides to 
 building hubs. Note that <code>type bam</code> is used to display CRAM files in hubs, just as 
 <code>type bam</code>  is used in custom CRAM tracks.</p>
 <pre><code>track cram61
 type bam
 shortLabel HG00361
 longLabel This CRAM file is from the 1000 Genomes Project HG00361
 visibility pack
 bigDataUrl ftp://ftp.1000genomes.ebi.ac.uk/vol1/ftp/phase3/data/HG00361/exome_alignment/HG00361.mapped.ILLUMINA.bwa.FIN.exome.20120522.bam.cram</code></pre>
 
 <a name="example4"></a>
 <h2>Example #4</h2>
 <p>
-If users do not register their reference sequence with the EBI CRAM reference registry then there is
-the option to use a <code>refUrl</code> setting to point the browser to the appropriate place to 
-find the reference sequence. The below example is a hub track stanza using the  <code>refUrl</code> 
+For genomes that are not registered in the EBI CRAM Reference Registry,
+the <code>refUrl</code> setting is used to point the browser to
+the appropriate place to find the reference sequence. The <code>refUrl</code>
+takes an argument of the URL of the reference server with a <tt>%s</tt> being 
+replaced the RefGet MD5 checksum that identifies the reference sequence.
+<p>
+</p> 
+The below example is a hub track stanza using the <code>refUrl</code> 
 option:</p> 
 <pre><code>track cramExample
 type bam
 visibility full
 shortLabel cramExRefUrl
 longLabel This CRAM file points to a reference sequence specified by refUrl
 refUrl http://university.edu/URL/cramRef/%s
 bigDataUrl ftp://ftp.1000genomes.ebi.ac.uk/vol1/ftp/phase3/data/HG00096/alignment/HG00096.mapped.ILLUMINA.bwa.GBR.low_coverage.20120522.bam.cram</code></pre>
 <p>
 The use of <code>refUrl</code> can also be employed on a custom track line:</p>
 <pre><code>track type=bam db=hg19 name=cramExRefUrl
 refUrl=http://university.edu/URL/cramRef/%s
 bigDataUrl=ftp://ftp.1000genomes.ebi.ac.uk/vol1/ftp/phase3/data/HG00096/alignment/HG00096.mapped.ILLUMINA.bwa.GBR.low_coverage.20120522.bam.cram </code></pre>
 
 <a name="refs"></a>
 <h2>References</h2>
 <p>
 Below is a collection of helpful CRAM resources:</p>
 <ul>
   <li>
   <a href="http://www.ebi.ac.uk/ena/software/cram-toolkit" target="_blank">CRAM toolkit</a></li>
   <li>
   <a href="http://www.ebi.ac.uk/ena/software/cram-reference-registry" target="_blank">CRAM reference
   registry</a></li>
   <li>
+  <a href="https://doi.org/10.1093/bioinformatics/btab524" target="_blank">
+    Refget: standardized access to reference sequences</a>
+  <li>
   <a href="http://samtools.github.io/hts-specs/CRAMv3.pdf" target="_blank">CRAM format 
   specification (version 3.0)</a></li>
   <li>
   <a href="http://www.htslib.org/workflow/cram.html" target="_blank">Using CRAM within
   Samtools</a></li>
   <li>
   <a href="ftp://ftp.1000genomes.ebi.ac.uk/vol1/ftp/README_using_1000genomes_cram.md" 
   target="_blank"> International Genome Sample Resource (IGSR) CRAM Tutorial</a></li>
   <li>
   <a href="http://davetang.org/muse/2014/09/26/bam-to-cram/" target="_blank">Dave Tang blog post 
   about  &quot;BAM to CRAM&quot</a></li>
 </ul>
 
 <h2>Sharing your data with others</h2>
 <p>