bbabbd5d2566d47d923d51dbe350634783455999 mspeir Sun Oct 26 12:14:52 2025 -0700 change soe to gi, refs #35031 diff --git src/hg/htdocs/FAQ/FAQreleases.html src/hg/htdocs/FAQ/FAQreleases.html index 7ebf837ec31..51f1f9223ec 100755 --- src/hg/htdocs/FAQ/FAQreleases.html +++ src/hg/htdocs/FAQ/FAQreleases.html @@ -376,31 +376,31 @@ The annotations accompanying an assembly are obtained from a variety of sources. The UCSC Genome Bioinformatics Group generates several of the tracks; the remainder are contributed by collaborators at other sites. Each track has an associated description page that credits the authors of the annotation.

For detailed information about the individuals and organizations who contributed to a specific assembly, see the Credits page.

Which UCSC assemblies are equivalent to Ensembl or NCBI assemblies?

The asmEquivalent table on the hgFixed database is available on the public MySQL server to show which assemblies versions are identical (or almost identical) to each other between UCSC, Ensembl, Genbank, and RefSeq assemblies.

-mysql --user=genome --host=genome-mysql.soe.ucsc.edu -A -e 'desc asmEquivalent;' hgFixed
+mysql --user=genome --host=genome-mysql.gi.ucsc.edu -A -e 'desc asmEquivalent;' hgFixed
 +----------------------+-------------------------------------------+
 | Field                | Type                                      |
 +----------------------+-------------------------------------------+
 | source               | varchar(255)                              |
 | destination          | varchar(255)                              |
 | sourceAuthority      | enum('ensembl','ucsc','genbank','refseq') |
 | destinationAuthority | enum('ensembl','ucsc','genbank','refseq') |
 | matchCount           | bigint(20)                                |
 | sourceCount          | bigint(20)                                |
 | destinationCount     | bigint(20)                                |
 +----------------------+-------------------------------------------+

The "Count" indications are the count of individual sequences in the assembly. When all three counts are identical, matchCount == sourceCount == destinationCount, then the match between genome assemblies is perfectly identical.

@@ -461,31 +461,31 @@

Differences between UCSC and NCBI mouse assemblies

Is the mouse genome assembly displayed in the UCSC Genome Browser the same as the one on the NCBI website?

The mouse genome assemblies featured in the UCSC Genome Browser are the same as those on the NCBI web site with one difference: the UCSC versions contain only the reference strain data (C57BL/6J). NCBI provides data for several additional strains in their builds.

Accessing older assembly versions

I need to access an older version of a genome assembly that's no longer listed in the Genome Browser menu. What should I do?

In addition to the assembly versions currently available in the Genome Browser, you can access the data for older assemblies of the browser through our -Downloads page.

+Downloads page.

Frequency of GenBank data updates

How frequently does UCSC update its databases with new data from GenBank?

GenBank updates for mRNA, RefSeq, and EST data occur on a semi-quarterly basis, following major NCBI releases. These updates are in place for most Genome Browser assemblies. Assemblies that are not on an incremental update schedule are updated whenever we load a new assembly or make a major revision to a table.

Coordinate changes between assemblies

I noticed that the chromosomal coordinates for a particular gene that I'm looking at have changed since the last time I used your browser. What happened?
@@ -575,49 +575,49 @@ being stored as a MariaDb table. For very large queries, this may cause the Table Browser to timeout before the query finishes as dbSNP has grown to include over 700 million variants. If you find that your Table Browser query timesout for your list of rsIDs, you can use the bigBedNamedItems command-line tool to extract the rsID coordinates directly from the bigBed file instead of using the Table Browser.

More information and examples using the bigBedNamedItems utility can be found on the following FAQ entry. As a reminder, you can run any Kent command-line tool without arguments to get the usage statement.

How can I extract a list of rsIDs using chrom:start-end or vice versa?

Several utilities for working with bigBed-formatted binary files can be downloaded -here. +here. Run a utility with no arguments to see a brief description of the utility and its options.

Examples:
  1. Retrieve all variants in the region chr1:200001-200400 -
    bigBedToBed http://hgdownload.soe.ucsc.edu/gbdb/hg38/snp/dbSnp155.bb -chrom=chr1 -start=200000 -end=200400 stdout
    +
    bigBedToBed http://hgdownload.gi.ucsc.edu/gbdb/hg38/snp/dbSnp155.bb -chrom=chr1 -start=200000 -end=200400 stdout
  2. Retrieve variant rs6657048
    bigBedNamedItems dbSnp155.bb rs6657048 stdout
  3. Retrieve all variants with rs# IDs in a file (myIds.txt) and output to another file (dbSnp155.myIds.bed)
    bigBedNamedItems -nameFile dbSnp155.bb myIds.txt dbSnp155.myIds.bed

Missing annotation tracks

Why is my favorite annotation track missing from your latest release?

The initial release of a new genome assembly typically contains a small subset of core annotation