bbabbd5d2566d47d923d51dbe350634783455999 mspeir Sun Oct 26 12:14:52 2025 -0700 change soe to gi, refs #35031 diff --git src/hg/htdocs/ENCODE/FAQ/index.html src/hg/htdocs/ENCODE/FAQ/index.html index 18e42122d9d..e9078d7946a 100755 --- src/hg/htdocs/ENCODE/FAQ/index.html +++ src/hg/htdocs/ENCODE/FAQ/index.html @@ -291,53 +291,53 @@ How do I extract information about an ENCODE experiment from the filename? <p> <b><font color="#006666">Response:</font></b><br> This is not recommended. While ENCODE filenames have some metadata embedded, the information there is not complete nor easily extracted. Rather, use the file's metadata, for example in "files.txt", or access metadata in the following places: <ol>By opening "files.txt" - the metadata file located in each track's corresponding download directory. <br/>By clicking the blue down-arrow next to each subtrack listed on a track's Track Settings page. <br/>By using <a href="/ENCODE/search.html" target="_blank">Track Search or File Search</a> to filter files by metadata. <br/>By using the <a href="../../cgi-bin/hgTables" target="_blank">Table Browser tool</a> and setting "Group" to "All Tables" and selecting the "metaDb" table. Click the "data format description" button to learn more about the metaDb table. <br/>By using the <a href="../../goldenPath/help/mysql.html" target="_blank">public MariaDB database</a> to query the metaDb table for each database. </ol> The metadata uses controlled vocabulary (cv.ra), which can be downloaded as a text file - <a href="http://hgdownload.soe.ucsc.edu/goldenPath/encodeDCC/cv.ra" target="_blank">here</a>. + <a href="http://hgdownload.gi.ucsc.edu/goldenPath/encodeDCC/cv.ra" target="_blank">here</a>. 
</div><!--end content--> </div><!--end wrapper--> <!--Content Tables-------------------------------------------------------> <a name="release7"></a> <!--outer table is for border purposes--> <div class="wrapper"> <td align="right"><a href="#FAQ"><img src="../../images/top.gif" alt="" align="right" border="0"></a> <div class="bar"><h4 class="title">ENCODE FILE FORMATS</h4></div> <div class="content"> <p> <b><font color="#006666">Question: </font></b><br> How do I learn more about different ENCODE file formats? For example what is the difference between a file.bed and a file.bed9 in the ENCODE methylation data? <p> <b><font color="#006666">Response:</font></b><br> By clicking the <a href="../../ENCODE/fileFormats.html" target="_blank">File Formats</a> link from the ENCODE portal page you can reach a list of various file formats used in ENCODE. Every ENCODE file has metadata included under a "files.txt" file in the related downloads page. - For example, from the <a href="http://hgdownload.soe.ucsc.edu/goldenPath/hg19/encodeDCC/wgEncodeHaibMethylRrbs/">HudsonAlpha DNA methylation download page</a>, - in the <a href="http://hgdownload.soe.ucsc.edu/goldenPath/hg19/encodeDCC/wgEncodeHaibMethylRrbs/files.txt">files.txt</a> + For example, from the <a href="http://hgdownload.gi.ucsc.edu/goldenPath/hg19/encodeDCC/wgEncodeHaibMethylRrbs/">HudsonAlpha DNA methylation download page</a>, + in the <a href="http://hgdownload.gi.ucsc.edu/goldenPath/hg19/encodeDCC/wgEncodeHaibMethylRrbs/files.txt">files.txt</a> file, a line after the specific bed9 file in question, wgEncodeHaibMethylRrbsAg04449UwstamgrowprotSitesRep1.bed9, reads 'objstatus=replaced'. This metadata indicates this bed9 file was preliminary data that has since been replaced. A similar note in the automatically displayed README file states: "WARNING - Revoked and replaced data files may be present in this directory." 
</div><!--end content--> </div><!--end wrapper--> <!--Content Tables-------------------------------------------------------> <a name="release8"></a> <!--outer table is for border purposes--> <div class="wrapper"> <td align="right"><a href="#FAQ"><img src="../../images/top.gif" alt="" align="right" border="0"></a> <div class="bar"><h4 class="title">ENCODE SCORE DEFINITION</h4></div> <div class="content"> <p> @@ -447,37 +447,37 @@ <div class="bar"><h4 class="title">DOWNLOAD ALL ENCODE DATA</h4></div> <div class="content"> <p> <b><font color="#006666">Question: </font></b><br> Is there a service providing ENCODE data on a hard drive? What is the total data volume? We have been trying FTP, but it takes too much bandwidth and time. <p> <b><font color="#006666">Response:</font></b><br> The total volume of ENCODE data are greater than 31 TB. Unfortunately, it is not possible for you to obtain a disk copy, however, there is a new protocol to try called UDR (UDT Enabled Rsync). UDR provides users much faster download rates.</p> <p> Here is an example using UDR, once installed, to download all the mouse mm9 ENCODE information:</p> - <pre class="code">$ udr rsync -avP hgdownload.soe.ucsc.edu::goldenPath/mm9/encodeDCC/ /my/local/mm9/</pre> + <pre class="code">$ udr rsync -avP hgdownload.gi.ucsc.edu::goldenPath/mm9/encodeDCC/ /my/local/mm9/</pre> Please read more about the new UDR method <a href="../../ENCODE/newsarch.html#091213" target="_blank">here</a>.</p> <p> For those not downloading high amounts of data, we highly recommend using rsync. For example:</p> - <pre class="code">$ rsync -a -P rsync://hgdownload.soe.ucsc.edu/goldenPath/hg19/encodeDCC/wgEncodeDir/wgEncodeFile ./</pre> + <pre class="code">$ rsync -a -P rsync://hgdownload.gi.ucsc.edu/goldenPath/hg19/encodeDCC/wgEncodeDir/wgEncodeFile ./</pre> <p> Using rsync has the advantage of starting up where it left off after a failure, when run again. 
</p> </div><!--end content--> </div><!--end wrapper--> <!--Content Tables-------------------------------------------------------> <a name="release13"></a> <!--outer table is for border purposes--> <div class="wrapper"> <td align="right"><a href="#FAQ"><img src="../../images/top.gif" alt="" align="right" border="0"></a> <div class="bar"><h4 class="title">ENCODE PAPERS</h4></div> <div class="content"> <p> <b><font color="#006666">Question: </font></b><br> @@ -539,31 +539,31 @@ For example downloadable files in the wgEncodeCaltechRnaSeq/ directory have a gene_id format like gene_id "GM12878-rep1.1045777" where the first part is the cell type. Would you know what does the last number 1045777 means? <p> <b><font color="#006666">Response:</font></b><br> At the top of the page for each of the download directories you are visiting there is a README.txt file that is automatically displayed. A link is provided that will bring you to a user interface enabling filtering of files by cell type and other parameters, as well as including additional information such as release status, restriction dates, track description, methods, and metadata that can answer such questions. </p> <p> For example in the README.txt file displayed at the top of the page in the - <a href="http://hgdownload.soe.ucsc.edu/goldenPath/hg19/encodeDCC/wgEncodeCaltechRnaSeq/">Caltech RNA-seq directory</a> + <a href="http://hgdownload.gi.ucsc.edu/goldenPath/hg19/encodeDCC/wgEncodeCaltechRnaSeq/">Caltech RNA-seq directory</a> you can find the following link: "http://genome.ucsc.edu/cgi-bin/hgFileUi?db=hg19&g=wgEncodeCaltechRnaSeq" </p> <p> By navigating to the page above, <a href="../../cgi-bin/hgFileUi?db=hg19&g=wgEncodeCaltechRnaSeq">Caltech RNA-seq Downloadable Files</a>, you can scroll to the bottom (or click the "Description" link in the top right corner) and read the track description's "Methods" section. 
In the "Data Processing and Analysis" section there is information explaining how the numbers in gene_id, "GM12878-rep1.####" represent de novo identifiers output by Cufflinks software. At the very bottom of the page is a "Credits" section where contacts are listed. You should send remaining process-specific questions about the data you are investigating to the appropriate contact listed. </p> @@ -680,31 +680,31 @@ <div class="content"> <p> <b><font color="#006666">Question: </font></b><br> What program reads ".bb" TFBS files from ENCODE? I am interested in looking at the AWG TFBS data. I downloaded the files and one is called: spp.optimal.wgEncodeBroadHistoneGm12878CtcfStdAlnRep0_VS_wgEncodeBroadHistoneGm12878ControlStdAlnRep0.bb <p> However, I do not have a program that can open this file. What is the program for this file and where can I find it? <p> <b><font color="#006666">Response:</font></b><br> Files ending in ".bb" are <a href="../../FAQ/FAQformat.html#format1.5">bigBed</a> files. Click <a href="../../goldenPath/help/bigBed.html">here</a> for extensive information on the bigBed format and how to extract data with different binary utilities located in this - <a href="http://hgdownload.soe.ucsc.edu/admin/exe/">directory</a>. + <a href="http://hgdownload.gi.ucsc.edu/admin/exe/">directory</a>. </div><!--end content--> </div><!--end wrapper--> <!--Content Tables-------------------------------------------------------> <a name="release21"></a> <!--outer table is for border purposes--> <div class="wrapper"> <td align="right"><a href="#FAQ"><img src="../../images/top.gif" alt="" align="right" border="0"></a> <div class="bar"><h4 class="title">HUB EXAMPLES</h4></div> <div class="content"> <p> <b><font color="#006666">Question: </font></b><br> I am making a public hub for my paper, is there an example html file to use for my data description? 
<p> <b><font color="#006666">Response:</font></b><br> @@ -727,29 +727,29 @@ <p> <b>Other Examples:</b><br> <p> Here are a few good examples of hub structure and configuration from the ENCODE Analysis hub: <p> <a href="http://ftp.ebi.ac.uk/pub/databases/ensembl/encode/integration_data_jan2011/hub.txt"> http://ftp.ebi.ac.uk/pub/databases/ensembl/encode/integration_data_jan2011/hub.txt</a><br> <a href="http://ftp.ebi.ac.uk/pub/databases/ensembl/encode/integration_data_jan2011/genomes.txt"> http://ftp.ebi.ac.uk/pub/databases/ensembl/encode/integration_data_jan2011/genomes.txt</a><br> <a href="http://ftp.ebi.ac.uk/pub/databases/ensembl/encode/integration_data_jan2011/hg19/trackDb.txt"> http://ftp.ebi.ac.uk/pub/databases/ensembl/encode/integration_data_jan2011/hg19/trackDb.txt</a><br> <p> Note: We recommend a minimal number of default visible tracks in your trackDb.txt to quicken hub loading time and to avoid overwhelming users. For more suggestions on hub structure, please see our - <a href="http://genomewiki.soe.ucsc.edu/index.php/Public_Hub_Guidelines".>Public Hub Guidelines</a> + <a href="http://genomewiki.gi.ucsc.edu/index.php/Public_Hub_Guidelines".>Public Hub Guidelines</a> wikipage. Also, for help defining unfamiliar terms, you may want to see the Hub Track Database Definition's <a href="http://genome.ucsc.edu/goldenPath/help/trackDb/trackDbHub.html#toc" target="_blank">table of contents</a>. </p> <p> </div><!--end content--> </div><!--end wrapper--> <!--END FAQ Content Tables-------------------------------------------------------> <p class="date">Updated 15 August 2014</p> </body> </html>
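Every changed line in this diff makes the same substitution: a `*.soe.ucsc.edu` hostname becomes `*.gi.ucsc.edu`, while `genome.ucsc.edu` links are left alone. A minimal sketch of that rewrite follows, assuming POSIX `sed`. The helper name `rewrite_host` is hypothetical and the commit does not say how the edit was actually made.

```shell
#!/bin/sh
# Hypothetical helper (not part of the commit): rewrite *.soe.ucsc.edu
# hostnames to *.gi.ucsc.edu in text read from standard input.
rewrite_host() {
    # Dots are escaped so only the literal ".soe.ucsc.edu" suffix matches;
    # hosts such as genome.ucsc.edu pass through unchanged.
    sed 's/\.soe\.ucsc\.edu/.gi.ucsc.edu/g'
}

# Example: one of the URLs touched by this diff.
printf '%s\n' 'http://hgdownload.soe.ucsc.edu/goldenPath/encodeDCC/cv.ra' | rewrite_host
```

Piping a file through the function and comparing the result with `diff` would give a quick check that no `soe` hostnames remain.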