src/hg/htdocs/FAQ/FAQdownloads.html 01e223245a85e1ae6d1f2e914c64367cfd006ea9

01e223245a85e1ae6d1f2e914c64367cfd006ea9
ccpowell
  Tue Jul 9 16:00:29 2019 -0700
Changing MySQL to MariaDB in documentation, refs #23597

diff --git src/hg/htdocs/FAQ/FAQdownloads.html src/hg/htdocs/FAQ/FAQdownloads.html
index d22d10d..621592b 100755
--- src/hg/htdocs/FAQ/FAQdownloads.html
+++ src/hg/htdocs/FAQ/FAQdownloads.html
@@ -30,31 +30,31 @@
 <li><a href="#download12">Chromosome M</a></li>
 <li><a href="#download13">N characters at beginning of human chr22</a></li>
 <li><a href="#download30">Erroneous duplicated chrY_random region on Mouse Build 34 (mm6)</a></li>
 <li><a href="#download25">Mapping chimp chromosome numbers to human chromosomes numbers</a></li>
 <li><a href="#download28">Converting genome coordinates between assemblies</a></li>
 <li><a href="#download33">Linking gene name with accession number</a></li>
 <li><a href="#download31">Obtaining a list of Known Genes</a></li>
 <li><a href="#download16">Repeat-masking data</a></li>
 <li><a href="#download17">Availability of repeat-masked data</a></li>
 <li><a href="#download24">RepeatMasker version differences - UCSC vs. Repeatmasker website</a></li> 
 <li><a href="#download18">Obtaining promoter sequence</a></li>
 <li><a href="#download19">Data from Evolutionary Conservation Score tracks</a></li>
 <li><a href="#download20">Minus strand coordinates - axtNet files</a></li>
 <li><a href="#download21">Mapping UCSC STS marker IDS to those of other groups</a></li>
 <li><a href="#download22">deCODE map data</a></li>
-<li><a href="#download29">Direct MySQL access to data</a></li>
+<li><a href="#download29">Direct MariaDB (MySQL) access to data</a></li>
 <li><a href="#download34">Name of fourth column in BED output</a></li>
 <li><a href="#download36">Track data access</a></li>
 <li><a href="#download37">Known issues with Table Browser GTF output</a></li>
 <li><a href="#download38">Table Browser output file not ordered</a></li>
 <li><a href="#download39">'Permisssion denied' error when trying to use command-line utilities</a></li>
 <li><a href="#download40">Restricted Track Data</a></li>
 </ul>
 <hr>
 <p>
 <a href="index.html">Return to FAQ Table of Contents</a></p>
 
 <a name="download1"></a>
 <h2>Downloading sequence and annotation data</h2>
 <h6>How do I obtain the sequence and/or annotation data for a release?</h6>
 <p> 
@@ -141,31 +141,31 @@
     <li>sex</li>
     <li>source</li>
     <li>tissue</li>
   </ul> 
   </div>
   <div class="col-md-3">
   </div>
 </div>
 <p>
 These tables are also accessible from: </p>
 <ul>
   <li> 
   The <a href="../cgi-bin/hgTables" >Table Browser</a>, as connected tables and joined fields 
   described when clicking the &quot;describe table schema &quot; button</li>
   <li>
-  One of our two <a href="../goldenPath/help/mysql.html">public access MySQL servers</a>
+  One of our two <a href="../goldenPath/help/mysql.html">public access MariaDB servers</a>
   in the US and Europe</li>
 </ul>
 
 <a name="download32"></a>
 <h2>Extracting sequence in batch from an assembly</h2>
 <h6>I have a lot of coordinates for an assembly and want to extract the corresponding sequences.
 What is the best way to proceed? </h6>
 <p> 
 There are two ways to extract genomic sequence in batch from an assembly:</p>
 <p>
 A. Download the appropriate fasta files from our 
 <a href="ftp://hgdownload.soe.ucsc.edu/goldenPath/">ftp server</a> and extract sequence data using 
 your own tools or the tools from our source tree. This is the recommended method when you have very 
 large sequence datasets or will be extracting data frequently. Sequence data for most assemblies is 
 located in the assembly's &quot;chromosomes&quot; subdirectory on the downloads server. For example,
@@ -247,34 +247,34 @@
 <p> 
 Microsoft Word or any program that can handle large text files will do. Some of the chromosomes 
 begin with long blocks of <em>N</em>s. You may want to search for an <em>A</em> to get past
 them.</p>
 <p>
 Unless you have a particular need to view or use the raw data files, you might find it more 
 interesting to look at the data using the Genome Browser. Type the name of a gene in which you're 
 interested into the position box (or use the default position), then click the submit button. In 
 the resulting Genome Browser display, click the DNA link on the menu bar at the top of the page. 
 Select the Extended case/color options button at the bottom of the next page. Now you can color the 
 DNA sequence to display which portions are repeats, known genes, genetic markers, etc.</p>
 
 <a name="download4"></a>
 <h2>Data differences between downloaded data and browser display</h2>
 <p>
-<h6>I downloaded the genome annotations from your MySQL database tables, but the mRNA locations 
+<h6>I downloaded the genome annotations from your MariaDB database tables, but the mRNA locations 
 didn't match what was showing in the Genome Browser. Shouldn't they be in synch?</h6>
 <p> 
-Yes. The Genome Browser and Table Browser are both driven by the same underlying MySQL database. 
+Yes. The Genome Browser and Table Browser are both driven by the same underlying MariaDB database. 
 Check that your downloaded tables are from the same assembly version as the one you are viewing in 
 the Genome Browser. If the assembly dates don't match, the coordinates of the data within the 
 tables may differ. In a very rare instance, you could also be affected by the brief lag time between
 the update of the live databases underlying the Genome Browser and the time it takes for text dumps 
 of these databases to become available in the downloads directory.</p> 
 
 <a name="download5"></a>
 <h2>Strange characters in FASTA file</h2>
 <h6>I noticed several characters other than <em>A</em>, <em>C</em>, <em>G</em>, <em>T</em>, and 
 <em>N</em> in my fasta file, for example <em>y</em>, <em>k</em>, <em>s</em>, etc. Is the file 
 corrupted or are these characters valid?</h6>
 <p>
 The characters most commonly seen in sequence are <em>A</em>, <em>C</em>, <em>G</em>, <em>T</em>, 
 and <em>N</em>, but there are several other valid characters that are used in clones to indicate 
 ambiguity about the identity of certain bases in the sequence. It's not uncommon to see these 
@@ -774,40 +774,40 @@
 this ID to look it up in the stsMap table where the marker is located. For example, D10S249 has 
 UCSC ID 2880 and is located at chr10:240791-241019.</p> 
 
 <a name="download22"></a>
 <h2>deCODE map data</h2>
 <h6>Where can I get more information about the deCODE map?</h6>
 <p> 
 You can obtain this information from the combination of a couple of tables. The stsMap table 
 contains the physical position of all STS markers, including those on the deCODE map. This file 
 also contains information about the position on the genome-wide maps, including the deCODE map. A 
 second file, stsInfo2, contains additional information about each marker, including aliases, primer 
 sequence information, etc. This table is related to the first table by an ID (the identNo field in 
 both files).</p>
 
 <a name="download29"></a>
-<h2>Direct MySQL access to data</h2>
+<h2>Direct MariaDB (MySQL) access to data</h2>
 <h6>Is it possible to run SQL queries directly on the database rather than using the Table 
 Browser interface?</h6>
 <p> 
 Yes. See our documentation on <a href="../goldenPath/help/mysql.html">Downloading Data using 
-MySQL</a>.</p> 
+MariaDB</a>.</p> 
 <p>
-Connect to the US MySQL server using the command:</p>
+Connect to the US MariaDB server using the command:</p>
 <pre><code>mysql --user=genome --host=genome-mysql.soe.ucsc.edu -A </code></pre>
-<p>Or to the European MySQL server using the command:</p>
+<p>Or to the European MariaDB server using the command:</p>
 <pre><code>mysql --user=genome --host=genome-euro-mysql.soe.ucsc.edu -A </code></pre>
 
 <a name="download34"></a>
 <h2>Name of fourth column in BED output</h2>
 <h6>When using the Table Browser to extract exons from a Gene track, what does the &quot;Name&quot; 
 column (fourth BED column) refer to?</h6>
 <p> 
 The fourth column of the BED output contains a lot of information separated by underscores. For 
 example:</p>
 <pre><code>uc009vjk.2_cds_1_0_chr1_324343_f </code></pre>
 <p>
 This information is represented as follows:</p>
 <pre><code>ucscId_sequenceType_sequenceTypeNumber_basesAdded_chromosome_positionOfFirstBaseOfItem_strand</code></pre>
 <ul>
   <li>
@@ -834,53 +834,53 @@
   listed in this section of the 4th column is actually 1 based. It will be the exact coordinate the 
   feature starts on as displayed in the browser.</li>
   <li>
   Strand: forward(f) or reverse(-) strand.</li>
 </ul>
 
 <a name="download36"></a>
 <h2>Track Data Access</h2>
 <h6>How do I access the data underlying a track?</h6> 
 <p>
 The raw data underlying a track can be explored interactively with the 
 <a href="../cgi-bin/hgTables">Table Browser</a>, <a href="../cgi-bin/hgIntegrator">Data 
 Integrator</a>, or <a href="../cgi-bin/hgVai">Variant Annotation Integrator</a>. For automated 
 analysis, the genome annotation can be downloaded from the 
 <a href="http://hgdownload.soe.ucsc.edu/">downloads server</a>, one of our two
-<a href="http://genome.ucsc.edu/goldenPath/help/mysql.html">public MySQL servers</a>, or 
+<a href="http://genome.ucsc.edu/goldenPath/help/mysql.html">public MariaDB servers</a>, or 
 using our <a href='../goldenPath/help/api.html' target=_blank>JSON API</a>.</p>
 <p> 
 <strong>bigBed data:</strong> For <a href="FAQformat.html#format1.5">bigBed</a> files, individual 
 regions or the whole genome annotation can be obtained using our tool bigBedToBed which can be 
 compiled from the source code or downloaded as a precompiled binary for your system. Instructions 
 for downloading source code and binaries can be found 
 <a href="http://hgdownload.soe.ucsc.edu/downloads.html#utilities_downloads">here</a>. The tool can 
 also be used to obtain only features within a given range using one of the hgdownload servers,
 example:</p> 
 <ul>
   <li>
     North American server:
     <pre><code>bigBedToBed http://hgdownload.soe.ucsc.edu/gbdb/path/to/file/bigBedfile.bb -chrom=chr21 -start=0 -end=1000000 stdout </code></pre> 
   </li>
   <li>
     European server:
     <pre><code>bigBedToBed http://hgdownload-euro.soe.ucsc.edu/gbdb/path/to/file/bigBedfile.bb -chrom=chr21 -start=0 -end=1000000 stdout </code></pre> 
   </li>
 </ul>
 <p> 
-<strong>SNP data:</strong> If queries against the SNP table on one of our public MySQL servers or on your
-own MySQL installation are slow, then they can be sped up by using the &quot;bin&quot; field; you 
+<strong>SNP data:</strong> If queries against the SNP table on one of our public MariaDB servers or on your
+own MariaDB installation are slow, then they can be sped up by using the &quot;bin&quot; field; you 
 can <a href="../contacts.html">contact us</a> for more information.</p>
 
 <p>
 Read more in <a href="http://genome.ucsc.edu/blog/"> our blog</a> about
 <a href="http://genome.ucsc.edu/blog/?s=programmatic"> Accessing the Genome Browser Programmatically</a>
 to acquire data.
 </p>
 
 <a name="download37"></a>
 <h2>Obtaining GTF (Gene Transfer Format)</h2>
 <h6>What is the best method for obtaining GTF output?</h6>
 <p>
 Currently, the <a href="../cgi-bin/hgTables">Table Browser</a> does not have an option return data as
 <a href="../FAQ/FAQformat.html#format4">GTF</a> files. Currently, the best method to obtain 
 GTF files is to use the command-line format conversion utility, <code>genePredToGtf</code>. This can be set up 
@@ -897,31 +897,31 @@
 includes proper start and stop codons.</li>
   <li>Some tables in older genome assemblies are not supported.</li>
 </ul>
             <a href="../FAQ/FAQformat#format9">GenePred</a> (short for Gene Predictions) is a table
 format commonly used for gene tracks in the UCSC Genome Browser where each transcript has a single
 row. Tables are not stored in GTF as it would require many rows to describe a single transcript
 since each gene feature (i.e., exon) requires a separate line. The <code>genePredToGtf</code> command-line
 utility can be used to convert genePred to GTF. Download the <code>genePredToGtf</code> operating 
 system-specific command-line utility from the
 <a href="http://hgdownload.soe.ucsc.edu/admin/exe/">utilities directory</a>.</p>
 <p>
 Please see the <a href="http://genomewiki.ucsc.edu/index.php/Genes_in_gtf_or_gff_format"> Genes in GTF
 or GFF Format wiki page</a> for examples and various methods for conversion. The <code>genePredToGtf</code>
 utility can convert files from several sources, such as Table Browser output from a genePred table,
 a local downloaded gene set table like refGene.txt, or from querying
-<a href="../goldenpath/help/mysql.html">public MySQL tables.</a></p>
+<a href="../goldenpath/help/mysql.html">public MariaDB tables.</a></p>
 
 <a name="download38"></a>
 <h2>Table Browser output file order</h2>
 <h6>My table browser output file is not ordered by position, how is it ordered?</h6>
 <p>
 Most of our tables have a special first column called "bin" that helps with quickly displaying data on 
 the Genome Browser. This (chrom,bin) index causes query results to be ordered first by bin, then by 
 chromStart. This allows us to query and return results more quickly than if they were sorted by chromStart.
 </p>
 <p>
 A quick way to sort an output BED file by position is to use the following UNIX command on our
 <a href="../cgi-bin/hgTables">Table Browser</a> output BED file:
 <pre><code>sort -k1,1 -k2n,2n example.bed > example.sorted.bed</code></pre>
 </p>