66604eefdbb0971ed43ee81bd6f18cc3a409e304 markd Sat Jan 10 13:18:44 2026 -0800 color-code clsLongReadRna modles (#36908) diff --git src/hg/makeDb/trackDb/mouse/mm10/clsLongReadRna.html src/hg/makeDb/trackDb/mouse/mm10/clsLongReadRna.html index c1ef1be9e84..8f75d75869d 100644 --- src/hg/makeDb/trackDb/mouse/mm10/clsLongReadRna.html +++ src/hg/makeDb/trackDb/mouse/mm10/clsLongReadRna.html @@ -1,48 +1,63 @@
These tracks represent the results of targeted long-read RNA sequencing aimed at identifying lowly expressed lncRNAs in adult and embryonic tissues. The track consists of capture target regions, mappings of pre- and post-capture reads, and transcript models built from the data.
Portions of this dataset were used to develop the lncRNA annotations - introduced in GENCODE v47. The data are a superset of the data incorporated + introduced in GENCODE VM36. The data are a superset of the data incorporated into GENCODE. The transcript models for a given RNA do not necessarily match those in GENCODE and are provided as a guide to exploring the sequencing data. + Note that GENCODE VM36 is released on GRCm39/mm39 while these tracks are on + GRCm38/mm10.
Detailed descriptions of the data are available at the GENCODE CLS Project site.
This is a multi-view composite track containing multiple data types (views). Each view includes subtracks that are displayed individually in the browser. Instructions for configuring multi-view tracks are
here.
Views:
Model Color Coding
+
+Model annotations are color-coded based on their incorporation into GENCODE VM36 +and the assigned GENCODE VM36 BioType. Note that VM36 is not on the mm10 assembly. +
+This project, led by the GENCODE consortium, employed the Capture Long-read Sequencing (CLS) protocol to enrich transcripts from targeted genomic regions. It used a large capture array with orthologous probes in human and mouse genomes, targeting non-GENCODE lncRNA annotations and regions suspected of unannotated transcription. CapTrap-Seq, a cDNA library preparation protocol, was used to enrich for full-length RNA molecules (5′ to 3′).
Matched adult and embryonic tissues from human and mouse were selected to maximize transcriptome complexity. Libraries were sequenced pre- and post-capture using PacBio and Oxford Nanopore Technologies (ONT) long-read platforms, as well as short-read technologies.
Transcript isoform models were built from reads using the LyRic analysis software. These were merged using intron chains, with transcription start and end sites anchored using CAGE and poly(A) data.