8185d253cdfd94afeef826febda5116c29fb80d8 brianlee Mon May 13 15:13:04 2019 -0700 Adding a short new FAQ for restricted downloads and cleaning up some tag and link problems found by htmlCheck validate/checkLinks refs #23493 diff --git src/hg/htdocs/FAQ/FAQdownloads.html src/hg/htdocs/FAQ/FAQdownloads.html index 84298db..89f83a5 100755 --- src/hg/htdocs/FAQ/FAQdownloads.html +++ src/hg/htdocs/FAQ/FAQdownloads.html @@ -369,31 +369,31 @@ To find transcription direction, we use a method that relies on finding gt-ag canonical pairs in one direction more often than in the opposite direction. The calculation is:

gt/ag introns minus ct/ac introns = intronOrientation

The sign of this calculated intronOrientation field (stored in the estOrientInfo table) shows the orientation of the transcript relative to the EST. Therefore, if intronOrientation is positive, then the EST appears in the display with the arrows pointing in the same direction as the EST.

Missing RefSeq ID

Why isn't my refseq ID in your database?

It may have been added after we last downloaded data from GenBank, or it may have been replaced or removed. You can check the submission date and status of an accession on the -NCBI Entrez Nucleotide +NCBI Entrez Nucleotide site.

Finished vs. draft segments

Do chrN.fa tables contain both finished and draft segments? If so, how do you determine which segments are finished?

Yes, these tables contain both finished and draft segments. Use the corresponding chrN_gold table to look them up. The quality of the draft varies. In general, the larger the contig it is in, the better the quality. The quality of the last 500 bases on either end of a contig tends to be lower than that of the rest of the contig.

How do you determine the accuracy? The base-calling program Phred analyzes the traces from the sequencing machines and assigns a quality score to these. These quality scores are used by the 2 (part)132b 323 434 545 656 - 767 878 9119 10810 11911 @@ -611,31 +611,31 @@ YYY

Converting genome coordinates between assemblies

I've been researching a specific area of the human genome on the current assembly, and now you've just released a new version. Is there an easy way to locate my area of interest on the new assembly?

You can migrate data from one assembly to another by using the blat -alignment tool or by converting assembly coordinates. There are two conversion tools available +alignment tool or by converting assembly coordinates. There are two conversion tools available on the Genome Browser web site: the Convert utility and the LiftOver tool. The Convert utility, which is accessed from the menu on the Genome Browser annotation tracks page, supports forward, reverse, and cross-species conversions, but does not accept batch input. The LiftOver tool, accessed via the Utilities link on the Genome Browser home page, also supports forward, reverse, and cross-species conversions, as well as batch conversions.

If you wish to update a large number of coordinates to a different assembly and have access to a Linux platform, you may find it useful to try the command-line version of the LiftOver tool. The executable file for this utility can be downloaded here. LiftOver requires a UCSC-generated over.chain file as input. Pre-generated files are available for selected assemblies from the Downloads page. If the desired file is not available, send a request to the genome mailing list and @@ -857,31 +857,31 @@

SNP data: If queries against the SNP table on one of our public MySQL servers or on your own MySQL installation are slow, then they can be sped up by using the "bin" field; you can contact us for more information.

Read more in our blog about Accessing the Genome Browser Programmatically to acquire data.

Obtaining GTF (Gene Transfer Format)

-
What is the best method for obtaining GTF output?
+
What is the best method for obtaining GTF output?

Currently, the Table Browser does not have an option return data as GTF files. Currently, the best method to obtain GTF files is to use the command-line format conversion utility, genePredToGtf. This can be set up to automatically connect to the UCSC public SQL database and return GTF files in a few minutes using this short guide.

Summary of Table Browser limitations: