a6b50639f42d157408a1dc49512b77546b58afac lrnassar Thu Aug 25 14:18:37 2022 -0700 Moving the track description page to a new directory for tidyness, refs #29356 diff --git src/hg/htdocs/goldenPath/trackDescriptions/bigRmskTrackDesc.html src/hg/htdocs/goldenPath/trackDescriptions/bigRmskTrackDesc.html new file mode 100644 index 0000000..88f2e4d --- /dev/null +++ src/hg/htdocs/goldenPath/trackDescriptions/bigRmskTrackDesc.html @@ -0,0 +1,208 @@ +

Description

+

+Repetitive genomic elements including Transposable Element (TE) families, Satellite, Short Tandem Repeats, +and low complexity DNA as annotated by RepeatMasker. + + + +

Display Conventions and Configuration

+ + +

Context Sensitive Zooming

+

+This track employs a technique which chooses the appropriate visual representation for the data based on the +zoom scale, and or the number of annotations currently in view. The track will automatically switch from the +most detailed visualization ('Full' mode) to the denser view ('Pack' mode) when the window size is greater +than 45kb of sequence. It will further switch to the even denser single line view ('Dense' mode) if more than +500 annotations are present in the current view. +

+

Dense Mode Visualization

+

+In dense display mode, a single line is displayed denoting the coverage of repeats using a series +of colored boxes. The boxes are colored based on the classification of the repeat (see below for legend). +
+
+ +

+

Pack Mode Visualization

+

+In pack mode, repeats are represented as sets of joined features. These are color coded as above based on the +class of the repeat, and the further details such as orientation (denoted by chevrons) and a family label are provided. +This family label may be optionally turned off in the track configuration. +
+
+ +
+
+The pack display mode may also be configured to resemble the original UCSC repeat track. In this visualization +repeat features are grouped by classes (see below), and displayed on seperate track lines. The repeat ranges are +denoted as grayscale boxes, reflecting both the size of the repeat and +the amount of base mismatch, base deletion, and base insertion associated with a repeat element. +The higher the combined number of these, the lighter the shading. +
+
+ +

+

Full Mode Visualization

+

+In the most detailed visualization repeats are displayed as chevron boxes, indicating the size and orientation of +the repeat. The interior grayscale shading represents the divergence of the repeat (see above) while the outline color +represents the class of the repeat. Dotted lines above the repeat and extending left or right +indicate the length of unaligned repeat model sequence and provide context for where a repeat fragment originates in its +consensus or pHMM model. If the length of the unaligned sequence +is large, an iterruption line and bp size is indicated instead of drawing the extension to scale. +
+
+ +

+

+For example, the following repeat is a SINE element in the forward orientation with average +divergence. Only the 5' proximal fragment of the consensus sequence is aligned to the genome. +The 3' unaligned length (384bp) is not drawn to scale and is instead displayed using a set of +interruption lines along with the length of the unaligned sequence. +

+ + + +

+Repeats that have been fragmented by insertions or large internal deletions are now represented +by join lines. In the example below, a LINE element is found as two fragments. The solid +connection lines indicate that there are no unaligned consensus bases between the two fragments. +Also note these fragments form the 3' extremity of the repeat, as there is no unaligned consensus +sequence following the last fragment. +

+ + + +

+In cases where there is unaligned consensus sequence between the fragments, the repeat will look like +the following. The dotted line indicates the length of the unaligned sequence between the two +fragments. In this case the unaligned consensus is longer than the actual genomic distance between +these two fragments. +

+ + + +

+If there is consensus overlap between the two fragments, the joining lines will be drawn to indicate +how much of the left fragment is repeated in the right fragment. +

+ + + +

+The following table lists the repeat class colors: +

+ + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +
ColorRepeat Class
SINE - Short Interspersed Nuclear Element
LINE - Long Interspersed Nuclear Element
LTR - Long Terminal Repeat
DNA - DNA Transposon
Simple - Single Nucleotide Stretches and Tandem Repeats
Low_complexity - Low Complexity DNA
Satellite - Satellite Repeats
RNA - RNA Repeats (including RNA, tRNA, rRNA, snRNA, scRNA, srpRNA)
Other - Other Repeats (including class RC - Rolling Circle)
Unknown - Unknown Classification
+ + +

+A "?" at the end of the "Family" or "Class" (for example, DNA?) +signifies that the curator was unsure of the classification. At some point in the future, +either the "?" will be removed or the classification will be changed.

+ +

Methods

+ +

+The RepeatMasker (www.repeatmasker.org) tool was used to generate the datasets found on this track hub. +

+ +

References

+ +

+Smit AFA, Hubley R, Green P. RepeatMasker Open-3.0. + +http://www.repeatmasker.org. 1996-2010. +

+ +

+Dfam is described in: +

+

+Wheeler TJ, Clements J, Eddy SR, Hubley R, Jones TA, Jurka J, Smit AF, Finn RD. + +Dfam: a database of repetitive DNA based on profile hidden Markov models. +Nucleic Acids Res. 2013 Jan;41(Database issue):D70-82. +PMID: 23203985; PMC: PMC3531169 +

+ +

+Repbase Update is described in: +

+

+Jurka J. + +Repbase Update: a database and an electronic journal of repetitive elements. +Trends Genet. 2000 Sep;16(9):418-420. +PMID: 10973072 +

+ +

+For a discussion of repeats in mammalian genomes, see: +

+

+Smit AF. + +Interspersed repeats and other mementos of transposable elements in mammalian genomes. +Curr Opin Genet Dev. 1999 Dec;9(6):657-63. +PMID: 10607616 +

+ +

+Smit AF. + +The origin of interspersed repeats in the human genome. +Curr Opin Genet Dev. 1996 Dec;6(6):743-8. +PMID: 8994846 +