96c23d22e15631da16fcf1069f0f30249910b533 max Thu Oct 21 05:45:43 2021 -0700 Updating mirrorManual and mirror.html / README.txt for Mysql support retirement, refs #28373 diff --git src/product/README.txt src/product/README.txt index afadfc6..fe00bc2 100644 --- src/product/README.txt +++ src/product/README.txt @@ -41,133 +41,134 @@ server yourself on a Unix server. * If your hub includes a high number of annotation files or HAL (multiple alignment) files and is located far from Santa Cruz, the display performance of assembly hubs may be slower than a local mirror. This issue can be resolved by using a UCSC mirror closer to the assembly hub (e.g. use genome-euro.ucsc.edu for assembly hubs located on servers in Europe, or genome-asia.ucsc.edu for those in Asia). Alternatively, you can improve performance by moving your hub data to a webspace provider closer to Santa Cruz or by using a content distribution network, where all content is mirrored and the closest location is chosen by your provider. 3. **Use Genome Browser in a Box** [Genome Browser in a Box](https://genome.ucsc.edu/goldenPath/help/gbib.html) (GBiB): is a fully configured virtual machine that includes - Apache and MySQL, and has behavior identical to the UCSC website. + Apache and MariaDB, and has behavior identical to the UCSC website. GBiB loads genome data from the UCSC download servers on the fly. Website and data updates are applied automatically every two weeks. By default, GBiB uses the VirtualBox virtualization software, so it can be run on any operating system, Windows, OSX and Linux. It does not require VirtualBox, the virtual machine image can easily be converted to e.g., VMWare or HyperV. For increased privacy, you can download the genomes and annotation tracks you need and use your GBiB off-line even on a laptop. * Pros: * Relatively simple to install: download and double-click. * By default, software and data updates are managed remotely by UCSC, via an update script run every week. * Best performance when rendering local BAM/bigWig/bigBed files. * No Unix webserver needed, runs on any OS. * For commercial users: easier click-through licensing compared to a full multi-seat, manual license. * Cons: * Requires the free VirtualBox software or a commercial Virtualization system. - * By default requires opening at least three outgoing ports to UCSC for MySQL, Rsync downloads + * By default requires opening at least three outgoing ports to UCSC for MariaDB, Rsync downloads and BLAT in your firewall. Once all data is downloaded, no open ports are needed. * For maximum browsing speed, can require up to 2-6TB to store all genome annotation tracks, like a manual local installation. 4. **Install locally with the Genome Browser installation script (GBIC)** We recommend this only if none of the above options fulfill your needs. Our [GBIC installation](https://genome.ucsc.edu/goldenpath/help/gbic.html) script will install a full local mirror of the UCSC website, for the assemblies you select. We support mirror site installations as time allows, and have many functional mirrors of the Genome Browser worldwide. For details, see the [section below](#installing-a-genome-browser-locally-with-the-gbic-installer). * Pros: * Relatively simple to install on a virtual machine or cloud instance: just run the script. * Best performance when rendering local BAM/bigWig/bigBed files. * For commercial users: easier click-through licensing compared to a full multi-seat, manual license. * Cons: * To keep up with changes in the Genome Browser, you will have to install linux packages and update the linux distribution yourself in the future and run the installation script with the 'update' command if you want to take advantage of new features in the Genome Browser. * Preferably run on a linux server that is not used otherwise. * By default requires opening at least three outgoing ports to UCSC for - MySQL and BLAT in your firewall. Once all data is downloaded and BLAT + MariaDB and BLAT in your firewall. Once all data is downloaded and BLAT setup locally, no open ports are needed anymore. * For maximum browsing speed, can require up to 2-6TB to store all genome annotation tracks. 5. **Install manually yourself, by following installation instructions** We do provide [step-by-step instructions](https://genome.ucsc.edu/goldenpath/help/mirrorManual.html) for local installation, but do not recommend this, if any other system works for you. The internet also has various pages with instructions, but they are often out of date and may cause problems later. For details on manual instructions, see the [section below](#manual-installation-instructions). * Pros: * You will understand the complete setup of the Genome Browser. * For commercial users: license agreements can be customized to your needs. * Cons: * Not easy to setup, even for experienced Unix administrators. * Will probably require some support via the [genome-mirror](mailto:genome-mirror@soe.ucsc.edu) mailing list. * To keep up with changes in the Genome Browser, you will have to install linux packages and update the linux distribution yourself in - the future and apply UCSC data updates yourself using rsync or MySQL table loads + the future and apply UCSC data updates yourself using rsync or MariaDB table loads * Configuration changes on our side may break your setup. * For maximum browsing speed, can require up to 2-6TB to store all genome annotation tracks. * For commercial users: license agreements take longer to negotiate, no click-through license. A license is required for commercial download and/or installation of the Genome Browser binaries and source code. No license is needed for academic, nonprofit, and personal use. To purchase a license, see our [license Instructions](https://genome.ucsc.edu/license/index.html) or visit the [Genome Browser store](https://genome-store.ucsc.edu/). # Installing a Genome Browser locally with the GBiC installer If you do not want to use our prepared virtual machine Genome-Browser-in-a-Box, we provide a Genome Browser in the Cloud (GBiC) [installation program](https://genome.ucsc.edu/goldenPath/help/gbic.html) that sets up a fully functional mirror on all major Linux distributions. It has been tested on Ubuntu 14/16, RedHat/CentOS 6 and 7, and Fedora 20. Preferably, the installation should be performed on a fresh Linux installation, as it deactivates the default site config file in Apache -and fills the MySQL directory with numerous databases. The easiest way to accomplish this is to +and fills the MariaDB directory with numerous databases. The easiest way to accomplish this is to run the Genome Browser in the Cloud program in a new virtual machine. The program also works on Docker and cloud computing virtual machines, and has been tested on those sold by Amazon, Microsoft and Google. Like GBiB, the mirror installed by the GBiC can load the annotation data either from UCSC or a local database copy. If you load data from UCSC and use a cloud computing provider, it is highly advisable to run your instances in the US West Coast / San Francisco Bay Area or San Jose data centers; otherwise data-loading may be very slow. To run the installation program, please see the [GBiC user guide](https://genome.ucsc.edu/goldenPath/help/gbic.html). # Manual installation instructions -If the installation program does not work on your linux distribution or you prefer to -make adaptations to your mirror, we also provide [step-by-step installation +See [mirrorManual.html](mirrorManual.html): If the installation program does +not work on your linux distribution or you prefer to make adaptations to your +mirror, we provide these [step-by-step installation instructions](mirrorManual.html) that cover the configuration of Apache, -MySQL, the Genome Browser CGIs, temporary file removal and other topics, like +MariaDB, the Genome Browser CGIs, temporary file removal and other topics, like data loading through proxies. The following external websites were not created by UCSC staff and are of varying quality, but may be helpful when installing on unusual platforms. * [Installation on Ubuntu](http://enotacoes.wordpress.com/2009/09/03/installing-a-minimal-ucsc-mirror-in-ubuntu-jaunty-64-bits/) * [Installation into a FreeBSD jail](http://www.bioinformatics.uni-muenster.de/download/ucsc), by Norbert Grundmann, Universitaet Muenster, Germany * [Compilation of the Kent source tree on OSX with fink](http://bergmanlab.ls.manchester.ac.uk/?p=32) by Casey Bergman, Manchester, UK * [Browser installation and new genome setup](http://www.oliverelliott.org/article/bioinformatics/tut_genomebrowser/), by Oliver Elliott, Columbia University, USA * [Installation notes in our wiki](http://genomewiki.ucsc.edu/index.php/Browser_Installation) # Using UDR to speed up downloads [UDR] (UDT Enabled Rsync) is a download protocol that is very efficent @@ -197,31 +198,31 @@ If you need help building the UDR binaries or have questions about how UDR functions, please read the documentation on the GitHub page and if necessary, contact the UDR authors via the GitHub page. We recommend reading the documentation on the UDR GitHub page to better understand how UDR works. UDR is written in C++. It is Open Source and is released under the Apache 2.0 License. In order for it to work, you must have rsync installed on your system. For your convenience, we offer a binary distribution of UDR for Red Hat Enterprise Linux 6.x (or variants such as CentOS 6 or Scientific Linux 6). You'll find both a 64-bit and 32-bit rpm [here]. Once you have a working UDR binary, either by building from source or by installing the rpm, you can download files from either of our our download servers in a fashion similar to rsync. For example, using -rsync, all of the MySQL tables for the hg19 +rsync, all of the MariaDB tables for the hg19 database can be downloaded using either one of the following two commands: rsync -avP rsync://hgdownload.soe.ucsc.edu/mysql/hg19/ /my/local/hg19/ rsync -avP hgdownload.soe.ucsc.edu::mysql/hg19/ /my/local/hg19/ Using UDR is very similar. The UDR syntax for downloading the same data would be: udr rsync -avP hgdownload.soe.ucsc.edu::mysql/hg19/ /my/local/hg19/ [UDR]: https://github.com/LabAdvComp/UDR [Laboratory for Advanced Computing]: http://www.labcomputing.org/ [here]: http://hgdownload.soe.ucsc.edu/admin/udr # The genome-mirror mailing list