--------------------------------------------------------------- hg19.trackDb.html : Differences exist between hgwbeta and hgw2 (RR fields taken from public MySql server, not individual machine) 29081,29246d29080 < knownGeneV45lift37 | html < knownGeneV45lift37 |

Description

< knownGeneV45lift37 |

< knownGeneV45lift37 | The GENCODE Genes track (version 45, January 2024) shows high-quality manual < knownGeneV45lift37 | annotations merged with evidence-based automated annotations across the entire < knownGeneV45lift37 | human genome generated by the < knownGeneV45lift37 | GENCODE project. < knownGeneV45lift37 | By default, only the basic gene set is < knownGeneV45lift37 | displayed, which is a subset of the comprehensive gene set. The basic set represents transcripts < knownGeneV45lift37 | that GENCODE believes will be useful to the majority of users.

< knownGeneV45lift37 | < knownGeneV45lift37 |

< knownGeneV45lift37 | The track includes protein-coding genes, non-coding RNA genes, and pseudo-genes, though pseudo-genes < knownGeneV45lift37 | are not displayed by default. It contains annotations on the reference chromosomes as well as < knownGeneV45lift37 | assembly patches and alternative loci (haplotypes).

< knownGeneV45lift37 | < knownGeneV45lift37 |

< knownGeneV45lift37 | Statistics for the v45 release can be found in the < knownGeneV45lift37 | GENCODE site for this build.

< knownGeneV45lift37 | < knownGeneV45lift37 |

< knownGeneV45lift37 | For more information on the different gene tracks, see our Genes FAQ.

< knownGeneV45lift37 | < knownGeneV45lift37 |

Display Conventions and Configuration

< knownGeneV45lift37 |

< knownGeneV45lift37 | By default, this track displays only the basic GENCODE set, splice variants, and non-coding genes. < knownGeneV45lift37 | It includes options to display the entire GENCODE set and pseudogenes. To customize these < knownGeneV45lift37 | options, the respective boxes can be checked or unchecked at the top of this description page. < knownGeneV45lift37 | < knownGeneV45lift37 |

< knownGeneV45lift37 | This track also includes a variety of labels which identify the transcripts when visibility is set < knownGeneV45lift37 | to "full" or "pack". Gene symbols (e.g. NIPA1) are displayed by default, but < knownGeneV45lift37 | additional options include GENCODE Transcript ID (ENST00000561183.5), UCSC Known Gene ID < knownGeneV45lift37 | (uc001yve.4), UniProt Display ID (Q7RTP0). Additional information about gene < knownGeneV45lift37 | and transcript names can be found in our < knownGeneV45lift37 | FAQ.

< knownGeneV45lift37 | < knownGeneV45lift37 |

< knownGeneV45lift37 | This track, in general, follows the display conventions for gene prediction tracks. The exons for < knownGeneV45lift37 | putative non-coding genes and untranslated regions are represented by relatively thin blocks, while < knownGeneV45lift37 | those for coding open reading frames are thicker. < knownGeneV45lift37 |

Coloring for the gene annotations is based on the annotation type:

< knownGeneV45lift37 |

coding: protein coding transcripts, including polymorphic < knownGeneV45lift37 | pseudogenes < knownGeneV45lift37 |
non-coding: non-protein coding transcripts < knownGeneV45lift37 |
pseudogene: pseudogene transcript annotations < knownGeneV45lift37 |
problem: problem transcripts (Biotypes of < knownGeneV45lift37 | retained_intron, TEC, or disrupted_domain)

< knownGeneV45lift37 | < knownGeneV45lift37 |

< knownGeneV45lift37 | This track contains an optional codon coloring feature that allows users to < knownGeneV45lift37 | quickly validate and compare gene predictions. There is also an option to display the data as < knownGeneV45lift37 | a density graph, which < knownGeneV45lift37 | can be helpful for visualizing the distribution of items over a region.

< knownGeneV45lift37 | < knownGeneV45lift37 | < knownGeneV45lift37 |

Squishy-pack Display

< knownGeneV45lift37 |

< knownGeneV45lift37 | Within a gene using the pack display mode, transcripts below a specified rank will be < knownGeneV45lift37 | condensed into a view similar to squish mode. The transcript ranking approach is < knownGeneV45lift37 | preliminary and will change in future releases. The transcripts rankings are defined by the < knownGeneV45lift37 | following criteria for protein-coding and non-coding genes:

< knownGeneV45lift37 | Protein_coding genes < knownGeneV45lift37 |

MANE or Ensembl canonical < knownGeneV45lift37 |
- 1st: MANE Select / Ensembl canonical
- 2nd: MANE Plus Clinical
< knownGeneV45lift37 |
Coding biotypes < knownGeneV45lift37 |
- 1st: protein_coding and protein_coding_LoF
- 2nd: NMDs and NSDs
- 3rd: retained intron and protein_coding_CDS_not_defined
< knownGeneV45lift37 |
Completeness < knownGeneV45lift37 |
- 1st: full length
- 2nd: CDS start/end not found
< knownGeneV45lift37 |
CARS score (only for coding transcripts)
Transcript genomic span and length (only for non-coding transcripts)

< knownGeneV45lift37 | Non-coding genes < knownGeneV45lift37 |

Transcript biotype < knownGeneV45lift37 |
- 1st: transcript biotype identical to gene biotype
< knownGeneV45lift37 |
Ensembl canonical
GENCODE basic
Transcript genomic span
Transcript length

< knownGeneV45lift37 | < knownGeneV45lift37 | < knownGeneV45lift37 |

Methods

< knownGeneV45lift37 |

< knownGeneV45lift37 | The GENCODE v45 track was built from the GENCODE downloads file < knownGeneV45lift37 | gencode.v45.chr_patch_hapl_scaff.annotation.gff3.gz. Data from other sources < knownGeneV45lift37 | were correlated with the GENCODE data to build association tables.

< knownGeneV45lift37 | < knownGeneV45lift37 |

Related Data

< knownGeneV45lift37 |

< knownGeneV45lift37 | The GENCODE Genes transcripts are annotated in numerous tables, each of which is also available as a < knownGeneV45lift37 | downloadable < knownGeneV45lift37 | file. < knownGeneV45lift37 | < knownGeneV45lift37 |

< knownGeneV45lift37 | One can see a full list of the associated tables in the Table Browser by selecting GENCODE Genes from the track menu; this list < knownGeneV45lift37 | is then available on the table menu. < knownGeneV45lift37 | < knownGeneV45lift37 | < knownGeneV45lift37 |

Data access

< knownGeneV45lift37 |

< knownGeneV45lift37 | GENCODE Genes and its associated tables can be explored interactively using the < knownGeneV45lift37 | REST API, the < knownGeneV45lift37 | Table Browser or the < knownGeneV45lift37 | Data Integrator. < knownGeneV45lift37 | The genePred format files for hg38 are available from our < knownGeneV45lift37 | < knownGeneV45lift37 | downloads directory or in our < knownGeneV45lift37 | < knownGeneV45lift37 | GTF download directory. < knownGeneV45lift37 | All the tables can also be queried directly from our public MySQL < knownGeneV45lift37 | servers, with more information available on our < knownGeneV45lift37 | help page as well as on < knownGeneV45lift37 | our blog.

< knownGeneV45lift37 | < knownGeneV45lift37 |

Credits

< knownGeneV45lift37 |

< knownGeneV45lift37 | The GENCODE Genes track was produced at UCSC from the GENCODE comprehensive gene set using a < knownGeneV45lift37 | computational pipeline developed by Jim Kent and Brian Raney. This version of the track was < knownGeneV45lift37 | generated by Jonathan Casper.

< knownGeneV45lift37 | < knownGeneV45lift37 |

References

< knownGeneV45lift37 | < knownGeneV45lift37 |

< knownGeneV45lift37 | Frankish A, Carbonell-Sala S, Diekhans M, Jungreis I, Loveland JE, Mudge JM, Sisu C, Wright JC, < knownGeneV45lift37 | Arnan C, Barnes I et al. < knownGeneV45lift37 | < knownGeneV45lift37 | GENCODE: reference annotation for the human and mouse genomes in 2023. < knownGeneV45lift37 | Nucleic Acids Res. 2023 Jan 6;51(D1):D942-D949. < knownGeneV45lift37 | PMID: 36420896; PMC: PMC9825462 < knownGeneV45lift37 |

< knownGeneV45lift37 | < knownGeneV45lift37 |

A full list of GENCODE publications is available < knownGeneV45lift37 | at The GENCODE < knownGeneV45lift37 | Project web site. < knownGeneV45lift37 |

< knownGeneV45lift37 | < knownGeneV45lift37 |

Data Release Policy

< knownGeneV45lift37 |

GENCODE data are available for use without restrictions.

< knownGeneV45lift37 | 103435,103436c103269,103272 < wgRna | are from snoRNABase, which is maintained at the Laboratoire de Biologie < wgRna | Moléculaire Eucaryote. --- > wgRna | are from > wgRna | snoRNABase, which is > wgRna | maintained at the wgRna | target="_blank">Laboratoire de Biologie Moléculaire Eucaryote. 103459,103460c103295,103297 < wgRna | The snoRNAs and scaRNAs genomic locations from snoRNABase were aligned to the < wgRna | hg18 human genome assembly using BLAT and were then lifted to the hg19 assembly. --- > wgRna | The snoRNAs and scaRNAs genomic locations from wgRna | target="_blank">snoRNABase were aligned to the hg18 human genome assembly > wgRna | using BLAT and were then lifted to the hg19 assembly.