ebf7c2678556d6eae1e6521f317977fd1c9cdc88 mspeir Sun Feb 16 09:56:46 2025 -0800 adding entry about running blat locally, refs #34829 diff --git src/hg/htdocs/FAQ/FAQblat.html src/hg/htdocs/FAQ/FAQblat.html index 02a517e205e..b52bb586743 100755 --- src/hg/htdocs/FAQ/FAQblat.html +++ src/hg/htdocs/FAQ/FAQblat.html @@ -16,31 +16,31 @@ expected
  • BLAT use restrictions
  • Downloading BLAT source and documentation
  • Replicating web-based BLAT parameters in command-line version
  • Using the -ooc flag
  • Replicating web-based BLAT percent identity and score calculations
  • Replicating web-based BLAT "I'm feeling lucky" search results
  • Using BLAT for short sequences with maximum sensitivity
  • BLAT ALL genomes
  • BLAT ALL genomes: No matches found
  • Approximating web-based BLAT results using gfServer/gfClient
  • Standalone or gfServer/gfClient result start positions off by one
  • Protein-translated BLAT having different results
  • Querying BLAT programmatically using URLs
  • - +
  • Running BLAT locally

  • Return to FAQ Table of Contents

    BLAT vs. BLAST

    What are the differences between BLAT and BLAST?

    BLAT is an alignment tool like BLAST, but it is structured differently. On DNA, BLAT works by keeping an index of an entire genome in memory. Thus, the target database of BLAT is not a set of GenBank sequences, but instead an index derived from the assembly of the entire genome. By default, the index consists of all non-overlapping 11-mers except for those heavily involved in repeats, and it uses less than a gigabyte of RAM. This smaller size means that BLAT is far more easily mirrored than BLAST. Blat of DNA is designed to quickly find @@ -486,17 +486,49 @@ The URL requires three variables: the sequence to blat, the type of query and the database, as follows

     https://genome.ucsc.edu/cgi-bin/hgBlat?userSeq=[seq]&type=[type]&db=[database]&output=json

    If the sequence is longer than 8kbp, the request must be sent via HTTP POST.

    Query types include DNA, protein, translated RNA and translated DNA:

     https://genome.ucsc.edu/cgi-bin/hgBlat?userSeq=GACCTCGGCGTGGCCTAGCG&type=DNA&db=hg38&output=json
     https://genome.ucsc.edu/cgi-bin/hgBlat?userSeq=IGCLPAHLLGDMWGRFWTNLYSLTVPFGQKPNIDVTDAMVDQAWDAQRIFKEAEKFFVSVGLPNM&type=protein&db=hg38&output=json
     https://genome.ucsc.edu/cgi-bin/hgBlat?userSeq=UUUCCCUUCCCCACUGUAGUGGGAGAGAAGGGAGUGGCCAUACCAUAUUUUUCUCGUGGGCCGUUGUAGUCAUAAGGCCUUCCUUUGCGGAAAAUUUUCAGGGUGGGAUA&type=translated%20RNA&db=hg38&output=json
     https://genome.ucsc.edu/cgi-bin/hgBlat?userSeq=TTTCCCTTCCCCACTGTAGTGGGAGAGAAGGGAGTGGCCATACCATATTTTTCTCGTGGGCCGTTGTAGTCATAAGGCCTTCCTTTGCGGAAAATTTTCAGGGTGGGATA&type=translated%20DNA&db=hg38&output=json
     
    + +

    Running BLAT Locally

    +
    How do I run BLAT locally if I'm running into limitations with the web-based version?
    +

    +If you have access to a Linux or Mac OSX command-line you can run blat locally. +The blat binary is available under the directory for your system type under the +utilities directory on our download server. +

    +Once you have the downloaded the blat program to your computer, ensure the 'execute' bit is +on for permissions: +

    $ ls -og blat
    +-rwxrwxr-x 1 5528712 Nov  7 04:03 blat
    +
    +

    +If the permissions indicate something like: -rw-rw-r--, then +you will need to enable the ability to execute the program: +

    $ chmod +x blat
    +

    +To confirm that it works, run the program without arguments. This will also +display the help message. +

    $ ./blat
    +

    +You can now use this program locally on your sequences. For example, given a +sample.fa, to find that sequence in the human/hg38 genome, you would run: +

    $ ./blat hg38.2bit sample.fa output.psl
    +

    +You can download the 2bit file for your assembly of interest from the +download server +under the "Genome sequence files and select annotations (2bit, GTF, GC-content, etc)" link. +For example, the hg38.2bit sequence can be found here. +