--------------------------------------------------------------- macFas5.trackDb.html : Differences exist between hgwbeta and hgw2 (RR fields taken from public MySql server, not individual machine) 2636,3278d2635 < evaSnp | html < evaSnp |

Description

< evaSnp |

< evaSnp | This track contains mappings of single nucleotide variants < evaSnp | and small insertions and deletions (indels) < evaSnp | from the European Variation Archive < evaSnp | (EVA) < evaSnp | Release 3 for the crab-eating macaque macFas5 genome. The dbSNP database at NCBI no longer < evaSnp | hosts non-human variants. < evaSnp |

< evaSnp | < evaSnp |

Interpreting and Configuring the Graphical Display

< evaSnp |

< evaSnp | Variants are shown as single tick marks at most zoom levels. < evaSnp | When viewing the track at or near base-level resolution, the displayed < evaSnp | width of the SNP variant corresponds to the width of the variant in the < evaSnp | reference sequence. Insertions are indicated by a single tick mark displayed < evaSnp | between two nucleotides, single nucleotide polymorphisms are displayed as the < evaSnp | width of a single base, and multiple nucleotide variants are represented by a < evaSnp | block that spans two or more bases. The display is set to automatically collapse to < evaSnp | dense visibility when there are more than 100k variants in the window. < evaSnp | When the window size is more than 250k bp, the display is switched to density graph mode. < evaSnp |

< evaSnp | < evaSnp |

Searching, details, and filtering

< evaSnp |

< evaSnp | Navigation to an individual variant can be accomplished by typing or copying < evaSnp | the variant identifier (rsID) or the genomic coordinates into the Position/Search box on the < evaSnp | Browser.

< evaSnp | < evaSnp |

< evaSnp | A click on an item in the graphical display displays a page with data about < evaSnp | that variant. Data fields include the Reference and Alternate Alleles, the < evaSnp | class of the variant as reported by EVA, the source of the data, the amino acid < evaSnp | change, if any, and the functional class as determined by UCSC's Variant Annotation < evaSnp | Integrator. < evaSnp |

< evaSnp | < evaSnp |

Variants can be filtered using the track controls to show subsets of the < evaSnp | data by either EVA Sequence Ontology (SO) term, UCSC-generated functional effect, or < evaSnp | by color, which bins the UCSC functional effects into general classes.

< evaSnp | < evaSnp |

Mouse-over

< evaSnp |

< evaSnp | Mousing over an item shows the ucscClass, which is the consequence according to the < evaSnp | Variant Annotation Integrator, and < evaSnp | the aaChange when one is available, which is the change in amino acid in HGVS.p < evaSnp | terms. Items may have multiple ucscClasses, which will all be shown in the mouse-over < evaSnp | in a comma-separated list. Likewise, multiple HGVS.p terms may be shown for each rsID < evaSnp | separated by spaces describing all possible AA changes.

< evaSnp |

< evaSnp | Multiple items may appear due to different variant predictions on multiple gene transcripts. < evaSnp | For all organisms the gene models used were ncbiRefSeqCurated, except for mm39 which < evaSnp | used ncbiRefSeqSelect.

< evaSnp |

< evaSnp | < evaSnp |

Track colors

< evaSnp | < evaSnp |

< evaSnp | Variants are colored according to the most potentially deleterious functional effect prediction < evaSnp | according to the Variant Annotation Integrator. Specific bins can be seen in the Methods section < evaSnp | below. < evaSnp |

< evaSnp | < evaSnp |

Color	Variant Type
	Protein-altering variants and splice site variants
	Synonymous codon variants
	Non-coding transcript or Untranslated Region (UTR) variants
	Intergenic and intronic variants

< evaSnp | < evaSnp |

Sequence ontology (SO)

< evaSnp | < evaSnp |

< evaSnp | Variants are classified by EVA into one of the following sequence ontology terms: < evaSnp |

< evaSnp | < evaSnp |

substitution — < evaSnp | A single nucleotide in the reference is replaced by another, alternate allele < evaSnp |
deletion — < evaSnp | One or more nucleotides is deleted. The representation in the database is to < evaSnp | display one additional nucleotide in both the Reference field (Ref) and the < evaSnp | Alternate Allele field (Alt). E.g. a variant that is a deletion of an A < evaSnp | maybe be represented as Ref = GA and Alt = G. < evaSnp |
insertion — < evaSnp | One or more nucleotides is inserted. The representation in the database is to < evaSnp | display one additional nucleotide in both the Reference field (Ref) and the < evaSnp | Alternate Allele field (Alt). E.g. a variant that is an insertion of a T maybe < evaSnp | be represented as Ref = G and Alt = GT < evaSnp |
delins — < evaSnp | Similar to tandemRepeat, in that the runs of Ref and Alt Alleles are of < evaSnp | different length, except that there is more than one type of nucleotide, < evaSnp | e.g., Ref = CCAAAAACAAAAACA, Alt = ACAAAAAC. < evaSnp |
multipleNucleotideVariant — < evaSnp | More than one nucleotide is substituted by an equal number of different < evaSnp | nucleotides, e.g., Ref = AA, Alt = GC. < evaSnp |
sequence alteration — < evaSnp | A parent term meant to signify a deviation from another sequence. Can be < evaSnp | assigned to variants that have not been characterized yet. < evaSnp |

< evaSnp |

< evaSnp | < evaSnp |

Methods

< evaSnp |

< evaSnp | Data were downloaded from the European Variation Archive EVA release 3 (2022-02-24) < evaSnp | current_ids.vcf.gz files corresponding to the proper assembly.

< evaSnp |

< evaSnp | Chromosome names were converted to UCSC-style, a few problematic variants were removed, < evaSnp | and the variants passed through the < evaSnp | Variant Annotation Integrator to < evaSnp | predict consequence. For every organism the ncbiRefSeqCurated gene models were used to < evaSnp | predict the consequences, except for mm39 which used the ncbiRefSeqSelect models.

< evaSnp |

< evaSnp | Variants were then colored according to their predicted consequence in the following fashion: < evaSnp |

Protein-altering variants and < evaSnp | splice site variants < evaSnp | - exon_loss_variant, frameshift_variant, < evaSnp | inframe_deletion, inframe_insertion, initiator_codon_variant, missense_variant, < evaSnp | splice_acceptor_variant, splice_donor_variant, splice_region_variant, stop_gained, < evaSnp | stop_lost, coding_sequence_variant, transcript_ablation
Synonymous codon variants < evaSnp | - synonymous_variant, stop_retained_variant
Non-coding transcript or < evaSnp | Untranslated Region (UTR) variants < evaSnp | - 5_prime_UTR_variant, < evaSnp | 3_prime_UTR_variant, complex_transcript_variant, non_coding_transcript_exon_variant
Intergenic and intronic variants - upstream_gene_variant, downstream_gene_variant, < evaSnp | intron_variant, intergenic_variant, NMD_transcript_variant, no_sequence_alteration

< evaSnp |

< evaSnp | < evaSnp |

< evaSnp | Sequence Ontology ("SO:") < evaSnp | terms were converted to the variant classes, then the files were converted to BED, < evaSnp | and then bigBed format. < evaSnp |

< evaSnp |

< evaSnp | No functional annotations were provided by the EVA (e.g., missense, nonsense, etc). < evaSnp | These were computed using UCSC's Variant Annotation Integrator (Hinrichs, et al., 2016). < evaSnp | Amino-acid substitutions for missense variants are based < evaSnp | on RefSeq alignments of mRNA transcripts, which do not always match the amino acids < evaSnp | predicted from translating the genomic sequence. Therefore, in some instances, the < evaSnp | variant and the genomic nucleotide and associated amino acid may be reversed. < evaSnp | E.g., a Pro > Arg change from the perspective of the mRNA would be Arg > Pro from < evaSnp | the persepective the genomic sequence. < evaSnp | For complete documentation of the processing of these tracks, read the < evaSnp | < evaSnp | EVA Release 3 MakeDoc.

< evaSnp | < evaSnp |

Data Access

< evaSnp |

< evaSnp | Note: It is not recommeneded to use LiftOver to convert SNPs between assemblies, < evaSnp | and more information about how to convert SNPs between assemblies can be found on the following < evaSnp | FAQ entry.

< evaSnp |

< evaSnp | The data can be explored interactively with the Table Browser, < evaSnp | or the Data Integrator. For automated analysis, the data may be < evaSnp | queried from our REST API. Please refer to our < evaSnp | mailing list archives < evaSnp | for questions, or our Data Access FAQ for more < evaSnp | information.

< evaSnp | < evaSnp |

< evaSnp | For automated download and analysis, this annotation is stored in a bigBed file that < evaSnp | can be downloaded from our download server. The file for this track is called evaSnp.bb. < evaSnp | Individual regions or the whole genome annotation can be obtained using our tool < evaSnp | bigBedToBed which can be compiled from the source code or downloaded as a precompiled < evaSnp | binary for your system. Instructions for downloading source code and binaries can be found < evaSnp | here. < evaSnp | The tool can also be used to obtain only features within a given range, e.g. < evaSnp |

< evaSnp | bigBedToBed https://hgdownload.soe.ucsc.edu/gbdb/macFas5/bbi/evaSnp.bb -chrom=chr21 -start=0 -end=100000000 stdout < evaSnp |

< evaSnp | < evaSnp |

Credits

< evaSnp |

< evaSnp | This track was produced from the European < evaSnp | Variation Archive release 3 data. Consequences were predicted using UCSC's Variant Annotation < evaSnp | Integrator and NCBI's RefSeq gene models. < evaSnp |

< evaSnp | < evaSnp |

References

< evaSnp |

< evaSnp | Cezard T, Cunningham F, Hunt SE, Koylass B, Kumar N, Saunders G, Shen A, Silva AF, < evaSnp | Tsukanov K, Venkataraman S et al. The European Variation Archive: a FAIR resource of genomic variation for all < evaSnp | species. Nucleic Acids Res. 2021 Oct 28:gkab960. < evaSnp | doi:10.1093/nar/gkab960. < evaSnp | Epub ahead of print. PMID: 34718739. PMID: PMC8728205. < evaSnp |

< evaSnp |

< evaSnp | Hinrichs AS, Raney BJ, Speir ML, Rhead B, Casper J, Karolchik D, Kuhn RM, Rosenbloom KR, Zweig AS, < evaSnp | Haussler D, Kent WJ. < evaSnp | UCSC Data Integrator and Variant Annotation Integrator. < evaSnp | Bioinformatics. 2016 May 1;32(9):1430-2. < evaSnp | PMID: 26740527; PMC: < evaSnp | PMC4848401 < evaSnp |

< evaSnp | < evaSnp4 | html < evaSnp4 |

Description

< evaSnp4 |

< evaSnp4 | This track contains mappings of single nucleotide variants < evaSnp4 | and small insertions and deletions (indels) < evaSnp4 | from the European Variation Archive < evaSnp4 | (EVA) < evaSnp4 | Release 4 for the crab-eating macaque macFas5 genome. The dbSNP database at NCBI no longer < evaSnp4 | hosts non-human variants. < evaSnp4 |

< evaSnp4 | < evaSnp4 |

Interpreting and Configuring the Graphical Display

< evaSnp4 |

< evaSnp4 | Variants are shown as single tick marks at most zoom levels. < evaSnp4 | When viewing the track at or near base-level resolution, the displayed < evaSnp4 | width of the SNP variant corresponds to the width of the variant in the < evaSnp4 | reference sequence. Insertions are indicated by a single tick mark displayed < evaSnp4 | between two nucleotides, single nucleotide polymorphisms are displayed as the < evaSnp4 | width of a single base, and multiple nucleotide variants are represented by a < evaSnp4 | block that spans two or more bases. The display is set to automatically collapse to < evaSnp4 | dense visibility when there are more than 100k variants in the window. < evaSnp4 | When the window size is more than 250k bp, the display is switched to density graph mode. < evaSnp4 |

< evaSnp4 | < evaSnp4 |

Searching, details, and filtering

< evaSnp4 |

< evaSnp4 | Navigation to an individual variant can be accomplished by typing or copying < evaSnp4 | the variant identifier (rsID) or the genomic coordinates into the Position/Search box on the < evaSnp4 | Browser.

< evaSnp4 | < evaSnp4 |

< evaSnp4 | A click on an item in the graphical display displays a page with data about < evaSnp4 | that variant. Data fields include the Reference and Alternate Alleles, the < evaSnp4 | class of the variant as reported by EVA, the source of the data, the amino acid < evaSnp4 | change, if any, and the functional class as determined by UCSC's Variant Annotation < evaSnp4 | Integrator. < evaSnp4 |

< evaSnp4 | < evaSnp4 |

Variants can be filtered using the track controls to show subsets of the < evaSnp4 | data by either EVA Sequence Ontology (SO) term, UCSC-generated functional effect, or < evaSnp4 | by color, which bins the UCSC functional effects into general classes.

< evaSnp4 | < evaSnp4 |

Mouse-over

< evaSnp4 |

< evaSnp4 | Mousing over an item shows the ucscClass, which is the consequence according to the < evaSnp4 | Variant Annotation Integrator, and < evaSnp4 | the aaChange when one is available, which is the change in amino acid in HGVS.p < evaSnp4 | terms. Items may have multiple ucscClasses, which will all be shown in the mouse-over < evaSnp4 | in a comma-separated list. Likewise, multiple HGVS.p terms may be shown for each rsID < evaSnp4 | separated by spaces describing all possible AA changes.

< evaSnp4 |

< evaSnp4 | Multiple items may appear due to different variant predictions on multiple gene transcripts. < evaSnp4 | For all organisms the gene models used were the NCBI RefSeq curated when available, if not then < evaSnp4 | ensembl genes, or finally UCSC mappings of RefSeq if neither of the previous models was possible. < evaSnp4 |

< evaSnp4 | < evaSnp4 |

Track colors

< evaSnp4 | < evaSnp4 |

< evaSnp4 | Variants are colored according to the most potentially deleterious functional effect prediction < evaSnp4 | according to the Variant Annotation Integrator. Specific bins can be seen in the Methods section < evaSnp4 | below. < evaSnp4 |

< evaSnp4 | < evaSnp4 |

Color	Variant Type
	Protein-altering variants and splice site variants
	Synonymous codon variants
	Non-coding transcript or Untranslated Region (UTR) variants
	Intergenic and intronic variants

< evaSnp4 | < evaSnp4 |

Sequence ontology (SO)

< evaSnp4 | < evaSnp4 |

< evaSnp4 | Variants are classified by EVA into one of the following sequence ontology terms: < evaSnp4 |

< evaSnp4 | < evaSnp4 |

substitution — < evaSnp4 | A single nucleotide in the reference is replaced by another, alternate allele < evaSnp4 |
deletion — < evaSnp4 | One or more nucleotides is deleted. The representation in the database is to < evaSnp4 | display one additional nucleotide in both the Reference field (Ref) and the < evaSnp4 | Alternate Allele field (Alt). E.g. a variant that is a deletion of an A < evaSnp4 | maybe be represented as Ref = GA and Alt = G. < evaSnp4 |
insertion — < evaSnp4 | One or more nucleotides is inserted. The representation in the database is to < evaSnp4 | display one additional nucleotide in both the Reference field (Ref) and the < evaSnp4 | Alternate Allele field (Alt). E.g. a variant that is an insertion of a T maybe < evaSnp4 | be represented as Ref = G and Alt = GT < evaSnp4 |
delins — < evaSnp4 | Similar to tandemRepeat, in that the runs of Ref and Alt Alleles are of < evaSnp4 | different length, except that there is more than one type of nucleotide, < evaSnp4 | e.g., Ref = CCAAAAACAAAAACA, Alt = ACAAAAAC. < evaSnp4 |
multipleNucleotideVariant — < evaSnp4 | More than one nucleotide is substituted by an equal number of different < evaSnp4 | nucleotides, e.g., Ref = AA, Alt = GC. < evaSnp4 |
sequence alteration — < evaSnp4 | A parent term meant to signify a deviation from another sequence. Can be < evaSnp4 | assigned to variants that have not been characterized yet. < evaSnp4 |

< evaSnp4 |

< evaSnp4 | < evaSnp4 |

Methods

< evaSnp4 |

< evaSnp4 | Data were downloaded from the European Variation Archive EVA release 4 (2022-11-21) < evaSnp4 | current_ids.vcf.gz files corresponding to the proper assembly.

< evaSnp4 |

< evaSnp4 | Chromosome names were converted to UCSC-style < evaSnp4 | and the variants passed through the < evaSnp4 | Variant Annotation Integrator to < evaSnp4 | predict consequence. For every organism the NCBI RefSeq curated models were used when available, < evaSnp4 | followed by ensembl genes, and finally UCSC mapping of RefSeq when neither of the previous models < evaSnp4 | were possible.

< evaSnp4 |

< evaSnp4 | Variants were then colored according to their predicted consequence in the following fashion: < evaSnp4 |

Protein-altering variants and < evaSnp4 | splice site variants < evaSnp4 | - exon_loss_variant, frameshift_variant, < evaSnp4 | inframe_deletion, inframe_insertion, initiator_codon_variant, missense_variant, < evaSnp4 | splice_acceptor_variant, splice_donor_variant, splice_region_variant, stop_gained, < evaSnp4 | stop_lost, coding_sequence_variant, transcript_ablation
Synonymous codon variants < evaSnp4 | - synonymous_variant, stop_retained_variant
Non-coding transcript or < evaSnp4 | Untranslated Region (UTR) variants < evaSnp4 | - 5_prime_UTR_variant, < evaSnp4 | 3_prime_UTR_variant, complex_transcript_variant, non_coding_transcript_exon_variant
Intergenic and intronic variants - upstream_gene_variant, downstream_gene_variant, < evaSnp4 | intron_variant, intergenic_variant, NMD_transcript_variant, no_sequence_alteration

< evaSnp4 |

< evaSnp4 | < evaSnp4 |

< evaSnp4 | Sequence Ontology ("SO:") < evaSnp4 | terms were converted to the variant classes, then the files were converted to BED, < evaSnp4 | and then bigBed format. < evaSnp4 |

< evaSnp4 |

< evaSnp4 | No functional annotations were provided by the EVA (e.g., missense, nonsense, etc). < evaSnp4 | These were computed using UCSC's Variant Annotation Integrator (Hinrichs, et al., 2016). < evaSnp4 | Amino-acid substitutions for missense variants are based < evaSnp4 | on RefSeq alignments of mRNA transcripts, which do not always match the amino acids < evaSnp4 | predicted from translating the genomic sequence. Therefore, in some instances, the < evaSnp4 | variant and the genomic nucleotide and associated amino acid may be reversed. < evaSnp4 | E.g., a Pro > Arg change from the perspective of the mRNA would be Arg > Pro from < evaSnp4 | the persepective the genomic sequence. Also, in bosTau9, galGal5, rheMac8, < evaSnp4 | danRer10 and danRer11 the mitochondrial sequence was removed or renamed to match UCSC. < evaSnp4 | For complete documentation of the processing of these tracks, read the < evaSnp4 | < evaSnp4 | EVA Release 4 MakeDoc.

< evaSnp4 | < evaSnp4 |

Data Access

< evaSnp4 |

< evaSnp4 | Note: It is not recommeneded to use LiftOver to convert SNPs between assemblies, < evaSnp4 | and more information about how to convert SNPs between assemblies can be found on the following < evaSnp4 | FAQ entry.

< evaSnp4 |

< evaSnp4 | The data can be explored interactively with the Table Browser, < evaSnp4 | or the Data Integrator. For automated analysis, the data may be < evaSnp4 | queried from our REST API. Please refer to our < evaSnp4 | mailing list archives < evaSnp4 | for questions, or our Data Access FAQ for more < evaSnp4 | information.

< evaSnp4 | < evaSnp4 |

< evaSnp4 | For automated download and analysis, this annotation is stored in a bigBed file that < evaSnp4 | can be downloaded from our download server. The file for this track is called evaSnp4.bb. < evaSnp4 | Individual regions or the whole genome annotation can be obtained using our tool < evaSnp4 | bigBedToBed which can be compiled from the source code or downloaded as a precompiled < evaSnp4 | binary for your system. Instructions for downloading source code and binaries can be found < evaSnp4 | here. < evaSnp4 | The tool can also be used to obtain only features within a given range, e.g. < evaSnp4 |

< evaSnp4 | bigBedToBed https://hgdownload.soe.ucsc.edu/gbdb/macFas5/bbi/evaSnp4.bb -chrom=chr21 -start=0 -end=100000000 stdout < evaSnp4 |

< evaSnp4 | < evaSnp4 |

Credits

< evaSnp4 |

< evaSnp4 | This track was produced from the European < evaSnp4 | Variation Archive release 4 data. Consequences were predicted using UCSC's Variant Annotation < evaSnp4 | Integrator and NCBI's RefSeq as well as ensembl gene models. < evaSnp4 |

< evaSnp4 | < evaSnp4 |

References

< evaSnp4 |

< evaSnp4 | Cezard T, Cunningham F, Hunt SE, Koylass B, Kumar N, Saunders G, Shen A, Silva AF, < evaSnp4 | Tsukanov K, Venkataraman S et al. The European Variation Archive: a FAIR resource of genomic variation for all < evaSnp4 | species. Nucleic Acids Res. 2021 Oct 28:gkab960. < evaSnp4 | doi:10.1093/nar/gkab960. < evaSnp4 | Epub ahead of print. PMID: 34718739. PMID: PMC8728205. < evaSnp4 |

< evaSnp4 |

< evaSnp4 | Hinrichs AS, Raney BJ, Speir ML, Rhead B, Casper J, Karolchik D, Kuhn RM, Rosenbloom KR, Zweig AS, < evaSnp4 | Haussler D, Kent WJ. < evaSnp4 | UCSC Data Integrator and Variant Annotation Integrator. < evaSnp4 | Bioinformatics. 2016 May 1;32(9):1430-2. < evaSnp4 | PMID: 26740527; PMC: < evaSnp4 | PMC4848401 < evaSnp4 |

< evaSnp4 | < evaSnp5 | html < evaSnp5 |

Description

< evaSnp5 |

< evaSnp5 | This track contains mappings of single nucleotide variants < evaSnp5 | and small insertions and deletions (indels) < evaSnp5 | from the European Variation Archive < evaSnp5 | (EVA) < evaSnp5 | Release 5 for the crab-eating macaque macFas5 genome. The dbSNP database at NCBI no longer < evaSnp5 | hosts non-human variants. < evaSnp5 |

< evaSnp5 | < evaSnp5 |

Interpreting and Configuring the Graphical Display

< evaSnp5 |

< evaSnp5 | Variants are shown as single tick marks at most zoom levels. < evaSnp5 | When viewing the track at or near base-level resolution, the displayed < evaSnp5 | width of the SNP variant corresponds to the width of the variant in the < evaSnp5 | reference sequence. Insertions are indicated by a single tick mark displayed < evaSnp5 | between two nucleotides, single nucleotide polymorphisms are displayed as the < evaSnp5 | width of a single base, and multiple nucleotide variants are represented by a < evaSnp5 | block that spans two or more bases. The display is set to automatically collapse to < evaSnp5 | dense visibility when there are more than 100k variants in the window. < evaSnp5 | When the window size is more than 250k bp, the display is switched to density graph mode. < evaSnp5 |

< evaSnp5 | < evaSnp5 |

Searching, details, and filtering

< evaSnp5 |

< evaSnp5 | Navigation to an individual variant can be accomplished by typing or copying < evaSnp5 | the variant identifier (rsID) or the genomic coordinates into the Position/Search box on the < evaSnp5 | Browser.

< evaSnp5 | < evaSnp5 |

< evaSnp5 | A click on an item in the graphical display displays a page with data about < evaSnp5 | that variant. Data fields include the Reference and Alternate Alleles, the < evaSnp5 | class of the variant as reported by EVA, the source of the data, the amino acid < evaSnp5 | change, if any, and the functional class as determined by UCSC's Variant Annotation < evaSnp5 | Integrator. < evaSnp5 |

< evaSnp5 | < evaSnp5 |

Variants can be filtered using the track controls to show subsets of the < evaSnp5 | data by either EVA Sequence Ontology (SO) term, UCSC-generated functional effect, or < evaSnp5 | by color, which bins the UCSC functional effects into general classes.

< evaSnp5 | < evaSnp5 |

Mouse-over

< evaSnp5 |

< evaSnp5 | Mousing over an item shows the ucscClass, which is the consequence according to the < evaSnp5 | Variant Annotation Integrator, and < evaSnp5 | the aaChange when one is available, which is the change in amino acid in HGVS.p < evaSnp5 | terms. Items may have multiple ucscClasses, which will all be shown in the mouse-over < evaSnp5 | in a comma-separated list. Likewise, multiple HGVS.p terms may be shown for each rsID < evaSnp5 | separated by spaces describing all possible AA changes.

< evaSnp5 |

< evaSnp5 | Multiple items may appear due to different variant predictions on multiple gene transcripts. < evaSnp5 | For all organisms the gene models used were the NCBI RefSeq curated when available, if not then < evaSnp5 | ensembl genes, or finally UCSC mappings of RefSeq if neither of the previous models was possible. < evaSnp5 |

< evaSnp5 | < evaSnp5 |

Track colors

< evaSnp5 | < evaSnp5 |

< evaSnp5 | Variants are colored according to the most potentially deleterious functional effect prediction < evaSnp5 | according to the Variant Annotation Integrator. Specific bins can be seen in the Methods section < evaSnp5 | below. < evaSnp5 |

< evaSnp5 | < evaSnp5 |

Color	Variant Type
	Protein-altering variants and splice site variants
	Synonymous codon variants
	Non-coding transcript or Untranslated Region (UTR) variants
	Intergenic and intronic variants

< evaSnp5 | < evaSnp5 |

Sequence ontology (SO)

< evaSnp5 | < evaSnp5 |

< evaSnp5 | Variants are classified by EVA into one of the following sequence ontology terms: < evaSnp5 |

< evaSnp5 | < evaSnp5 |

substitution — < evaSnp5 | A single nucleotide in the reference is replaced by another, alternate allele < evaSnp5 |
deletion — < evaSnp5 | One or more nucleotides is deleted. The representation in the database is to < evaSnp5 | display one additional nucleotide in both the Reference field (Ref) and the < evaSnp5 | Alternate Allele field (Alt). E.g. a variant that is a deletion of an A < evaSnp5 | maybe be represented as Ref = GA and Alt = G. < evaSnp5 |
insertion — < evaSnp5 | One or more nucleotides is inserted. The representation in the database is to < evaSnp5 | display one additional nucleotide in both the Reference field (Ref) and the < evaSnp5 | Alternate Allele field (Alt). E.g. a variant that is an insertion of a T maybe < evaSnp5 | be represented as Ref = G and Alt = GT < evaSnp5 |
delins — < evaSnp5 | Similar to tandemRepeat, in that the runs of Ref and Alt Alleles are of < evaSnp5 | different length, except that there is more than one type of nucleotide, < evaSnp5 | e.g., Ref = CCAAAAACAAAAACA, Alt = ACAAAAAC. < evaSnp5 |
multipleNucleotideVariant — < evaSnp5 | More than one nucleotide is substituted by an equal number of different < evaSnp5 | nucleotides, e.g., Ref = AA, Alt = GC. < evaSnp5 |
sequence alteration — < evaSnp5 | A parent term meant to signify a deviation from another sequence. Can be < evaSnp5 | assigned to variants that have not been characterized yet. < evaSnp5 |

< evaSnp5 |

< evaSnp5 | < evaSnp5 |

Methods

< evaSnp5 |

< evaSnp5 | Data were downloaded from the European Variation Archive EVA release 5 (2023-9-7) < evaSnp5 | current_ids.vcf.gz files corresponding to the proper assembly.

< evaSnp5 |

< evaSnp5 | Chromosome names were converted to UCSC-style < evaSnp5 | and the variants passed through the < evaSnp5 | Variant Annotation Integrator to < evaSnp5 | predict consequence. For every organism the NCBI RefSeq curated models were used when available, < evaSnp5 | followed by ensembl genes, and finally UCSC mapping of RefSeq when neither of the previous models < evaSnp5 | were possible.

< evaSnp5 |

< evaSnp5 | Variants were then colored according to their predicted consequence in the following fashion: < evaSnp5 |

Protein-altering variants and < evaSnp5 | splice site variants < evaSnp5 | - exon_loss_variant, frameshift_variant, < evaSnp5 | inframe_deletion, inframe_insertion, initiator_codon_variant, missense_variant, < evaSnp5 | splice_acceptor_variant, splice_donor_variant, splice_region_variant, stop_gained, < evaSnp5 | stop_lost, coding_sequence_variant, transcript_ablation
Synonymous codon variants < evaSnp5 | - synonymous_variant, stop_retained_variant
Non-coding transcript or < evaSnp5 | Untranslated Region (UTR) variants < evaSnp5 | - 5_prime_UTR_variant, < evaSnp5 | 3_prime_UTR_variant, complex_transcript_variant, non_coding_transcript_exon_variant
Intergenic and intronic variants - upstream_gene_variant, downstream_gene_variant, < evaSnp5 | intron_variant, intergenic_variant, NMD_transcript_variant, no_sequence_alteration

< evaSnp5 |

< evaSnp5 | < evaSnp5 |

< evaSnp5 | Sequence Ontology ("SO:") < evaSnp5 | terms were converted to the variant classes, then the files were converted to BED, < evaSnp5 | and then bigBed format. < evaSnp5 |

< evaSnp5 |

< evaSnp5 | No functional annotations were provided by the EVA (e.g., missense, nonsense, etc). < evaSnp5 | These were computed using UCSC's Variant Annotation Integrator (Hinrichs, et al., 2016). < evaSnp5 | Amino-acid substitutions for missense variants are based < evaSnp5 | on RefSeq alignments of mRNA transcripts, which do not always match the amino acids < evaSnp5 | predicted from translating the genomic sequence. Therefore, in some instances, the < evaSnp5 | variant and the genomic nucleotide and associated amino acid may be reversed. < evaSnp5 | E.g., a Pro > Arg change from the perspective of the mRNA would be Arg > Pro from < evaSnp5 | the persepective the genomic sequence. Also, in bosTau9, galGal5, rheMac8, < evaSnp5 | danRer10 and danRer11 the mitochondrial sequence was removed or renamed to match UCSC. < evaSnp5 | For complete documentation of the processing of these tracks, read the < evaSnp5 | < evaSnp5 | EVA Release 5 MakeDoc.

< evaSnp5 | < evaSnp5 |

Data Access

< evaSnp5 |

< evaSnp5 | Note: It is not recommeneded to use LiftOver to convert SNPs between assemblies, < evaSnp5 | and more information about how to convert SNPs between assemblies can be found on the following < evaSnp5 | FAQ entry.

< evaSnp5 |

< evaSnp5 | The data can be explored interactively with the Table Browser, < evaSnp5 | or the Data Integrator. For automated analysis, the data may be < evaSnp5 | queried from our REST API. Please refer to our < evaSnp5 | mailing list archives < evaSnp5 | for questions, or our Data Access FAQ for more < evaSnp5 | information.

< evaSnp5 | < evaSnp5 |

< evaSnp5 | For automated download and analysis, this annotation is stored in a bigBed file that < evaSnp5 | can be downloaded from our download server. The file for this track is called evaSnp5.bb. < evaSnp5 | Individual regions or the whole genome annotation can be obtained using our tool < evaSnp5 | bigBedToBed which can be compiled from the source code or downloaded as a precompiled < evaSnp5 | binary for your system. Instructions for downloading source code and binaries can be found < evaSnp5 | here. < evaSnp5 | The tool can also be used to obtain only features within a given range, e.g. < evaSnp5 |

< evaSnp5 | bigBedToBed https://hgdownload.soe.ucsc.edu/gbdb/macFas5/bbi/evaSnp5.bb -chrom=chr21 -start=0 -end=100000000 stdout < evaSnp5 |

< evaSnp5 | < evaSnp5 |

Credits

< evaSnp5 |

< evaSnp5 | This track was produced from the European < evaSnp5 | Variation Archive release 5 data. Consequences were predicted using UCSC's Variant Annotation < evaSnp5 | Integrator and NCBI's RefSeq as well as ensembl gene models. < evaSnp5 |

< evaSnp5 | < evaSnp5 |

References

< evaSnp5 |

< evaSnp5 | Cezard T, Cunningham F, Hunt SE, Koylass B, Kumar N, Saunders G, Shen A, Silva AF, < evaSnp5 | Tsukanov K, Venkataraman S et al. The European Variation Archive: a FAIR resource of genomic variation for all < evaSnp5 | species. Nucleic Acids Res. 2021 Oct 28:gkab960. < evaSnp5 | doi:10.1093/nar/gkab960. < evaSnp5 | Epub ahead of print. PMID: 34718739. PMID: PMC8728205. < evaSnp5 |

< evaSnp5 |

< evaSnp5 | Hinrichs AS, Raney BJ, Speir ML, Rhead B, Casper J, Karolchik D, Kuhn RM, Rosenbloom KR, Zweig AS, < evaSnp5 | Haussler D, Kent WJ. < evaSnp5 | UCSC Data Integrator and Variant Annotation Integrator. < evaSnp5 | Bioinformatics. 2016 May 1;32(9):1430-2. < evaSnp5 | PMID: 26740527; PMC: < evaSnp5 | PMC4848401 < evaSnp5 |

Description

< transMapEnsemblV5 | < transMapEnsemblV5 |

< transMapEnsemblV5 | This track contains GENCODE or Ensembl alignments produced by < transMapEnsemblV5 | the TransMap cross-species alignment algorithm from other vertebrate < transMapEnsemblV5 | species in the UCSC Genome Browser. GENCODE is Ensembl for human and mouse, < transMapEnsemblV5 | for other Ensembl sources, only ones with full gene builds are used. < transMapEnsemblV5 | Projection Ensembl gene annotations will not be used as sources. < transMapEnsemblV5 | For closer evolutionary distances, the alignments are created using < transMapEnsemblV5 | syntenically filtered BLASTZ alignment chains, resulting in a prediction of the < transMapEnsemblV5 | orthologous genes in crab-eating macaque. < transMapEnsemblV5 |

< transMapEnsemblV5 | < transMapEnsemblV5 | < transMapEnsemblV5 |

Display Conventions and Configuration

< transMapEnsemblV5 | < transMapEnsemblV5 |

< transMapEnsemblV5 | This track follows the display conventions for < transMapEnsemblV5 | PSL alignment tracks.

< transMapEnsemblV5 |

< transMapEnsemblV5 | This track may also be configured to display codon coloring, a feature that < transMapEnsemblV5 | allows the user to quickly compare cDNAs against the genomic sequence. For more < transMapEnsemblV5 | information about this option, click < transMapEnsemblV5 | here. < transMapEnsemblV5 | Several types of alignment gap may also be colored; < transMapEnsemblV5 | for more information, click < transMapEnsemblV5 | here. < transMapEnsemblV5 | < transMapEnsemblV5 |

Methods

< transMapEnsemblV5 | < transMapEnsemblV5 |

< transMapEnsemblV5 |

Source transcript alignments were obtained from vertebrate organisms < transMapEnsemblV5 | in the UCSC Genome Browser Database. BLAT alignments of RefSeq Genes, GenBank < transMapEnsemblV5 | mRNAs, and GenBank Spliced ESTs to the cognate genome, along with UCSC Genes, < transMapEnsemblV5 | were used as available. < transMapEnsemblV5 |
For all vertebrate assemblies that had BLASTZ alignment chains and < transMapEnsemblV5 | nets to the crab-eating macaque (macFas5) genome, a subset of the alignment chains were < transMapEnsemblV5 | selected as follows: < transMapEnsemblV5 |
- For organisms whose branch distance was no more than 0.5 < transMapEnsemblV5 | (as computed by phyloFit, see Conservation track description for details), < transMapEnsemblV5 | syntenic filtering was used. Reciprocal best nets were used if available; < transMapEnsemblV5 | otherwise, nets were selected with the netfilter -syn command. < transMapEnsemblV5 | The chains corresponding to the selected nets were used for mapping. < transMapEnsemblV5 |
- For more distant species, where the determination of synteny is difficult, < transMapEnsemblV5 | the full set of chains was used for mapping. This allows for more genes to < transMapEnsemblV5 | map at the expense of some mapping to paralogous regions. The < transMapEnsemblV5 | post-alignment filtering step removes some of the duplications. < transMapEnsemblV5 |
< transMapEnsemblV5 |
The pslMap program was used to do a base-level projection of < transMapEnsemblV5 | the source transcript alignments via the selected chains < transMapEnsemblV5 | to the crab-eating macaque genome, resulting in pairwise alignments of the source transcripts to < transMapEnsemblV5 | the genome. < transMapEnsemblV5 |
The resulting alignments were filtered with pslCDnaFilter < transMapEnsemblV5 | with a global near-best criteria of 0.5% in finished genomes < transMapEnsemblV5 | (human and mouse) and 1.0% in other genomes. Alignments < transMapEnsemblV5 | where less than 20% of the transcript mapped were discarded. < transMapEnsemblV5 |

< transMapEnsemblV5 |

< transMapEnsemblV5 | < transMapEnsemblV5 |

< transMapEnsemblV5 | To ensure unique identifiers for each alignment, cDNA and gene accessions were < transMapEnsemblV5 | made unique by appending a suffix for each location in the source genome and < transMapEnsemblV5 | again for each mapped location in the destination genome. The format is: < transMapEnsemblV5 |

< transMapEnsemblV5 |    accession.version-srcUniq.destUniq
< transMapEnsemblV5 |

< transMapEnsemblV5 | < transMapEnsemblV5 | Where srcUniq is a number added to make each source alignment unique, and < transMapEnsemblV5 | destUniq is added to give the subsequent TransMap alignments unique < transMapEnsemblV5 | identifiers. < transMapEnsemblV5 |

< transMapEnsemblV5 |

< transMapEnsemblV5 | For example, in the cow genome, there are two alignments of mRNA BC149621.1. < transMapEnsemblV5 | These are assigned the identifiers BC149621.1-1 and BC149621.1-2. < transMapEnsemblV5 | When these are mapped to the human genome, BC149621.1-1 maps to a single < transMapEnsemblV5 | location and is given the identifier BC149621.1-1.1. However, BC149621.1-2 < transMapEnsemblV5 | maps to two locations, resulting in BC149621.1-2.1 and BC149621.1-2.2. Note < transMapEnsemblV5 | that multiple TransMap mappings are usually the result of tandem duplications, where both < transMapEnsemblV5 | chains are identified as syntenic. < transMapEnsemblV5 |

< transMapEnsemblV5 | < transMapEnsemblV5 |

Data Access

< transMapEnsemblV5 | < transMapEnsemblV5 |

< transMapEnsemblV5 | The raw data for these tracks can be accessed interactively through the < transMapEnsemblV5 | Table Browser or the < transMapEnsemblV5 | Data Integrator. < transMapEnsemblV5 | For automated analysis, the annotations are stored in < transMapEnsemblV5 | bigPsl files (containing a < transMapEnsemblV5 | number of extra columns) and can be downloaded from our < transMapEnsemblV5 | download server, < transMapEnsemblV5 | or queried using our API. For more < transMapEnsemblV5 | information on accessing track data see our < transMapEnsemblV5 | Track Data Access FAQ. < transMapEnsemblV5 | The files are associated with these tracks in the following way: < transMapEnsemblV5 |

TransMap Ensembl - macFas5.ensembl.transMapV4.bigPsl
TransMap RefGene - macFas5.refseq.transMapV4.bigPsl
TransMap RNA - macFas5.rna.transMapV4.bigPsl
TransMap ESTs - macFas5.est.transMapV4.bigPsl

< transMapEnsemblV5 | Individual regions or the whole genome annotation can be obtained using our tool < transMapEnsemblV5 | bigBedToBed which can be compiled from the source code or downloaded as < transMapEnsemblV5 | a precompiled binary for your system. Instructions for downloading source code and < transMapEnsemblV5 | binaries can be found < transMapEnsemblV5 | here. < transMapEnsemblV5 | The tool can also be used to obtain only features within a given range, for example: < transMapEnsemblV5 |

< transMapEnsemblV5 | bigBedToBed http://hgdownload.soe.ucsc.edu/gbdb/macFas5/transMap/V4/macFas5.refseq.transMapV4.bigPsl < transMapEnsemblV5 | -chrom=chr6 -start=0 -end=1000000 stdout < transMapEnsemblV5 | < transMapEnsemblV5 | < transMapEnsemblV5 |

Credits

< transMapEnsemblV5 | < transMapEnsemblV5 |

< transMapEnsemblV5 | This track was produced by Mark Diekhans at UCSC from cDNA and EST sequence data < transMapEnsemblV5 | submitted to the international public sequence databases by < transMapEnsemblV5 | scientists worldwide and annotations produced by the RefSeq, < transMapEnsemblV5 | Ensembl, and GENCODE annotations projects.

< transMapEnsemblV5 | < transMapEnsemblV5 |

References

< transMapEnsemblV5 |

< transMapEnsemblV5 | Siepel A, Diekhans M, Brejová B, Langton L, Stevens M, Comstock CL, Davis C, Ewing B, Oommen S, < transMapEnsemblV5 | Lau C et al. < transMapEnsemblV5 | < transMapEnsemblV5 | Targeted discovery of novel human exons by comparative genomics. < transMapEnsemblV5 | Genome Res. 2007 Dec;17(12):1763-73. < transMapEnsemblV5 | PMID: 17989246; PMC: PMC2099585 < transMapEnsemblV5 |

< transMapEnsemblV5 | < transMapEnsemblV5 |

< transMapEnsemblV5 | Stanke M, Diekhans M, Baertsch R, Haussler D. < transMapEnsemblV5 | < transMapEnsemblV5 | Using native and syntenically mapped cDNA alignments to improve de novo gene finding. < transMapEnsemblV5 | Bioinformatics. 2008 Mar 1;24(5):637-44. < transMapEnsemblV5 | PMID: 18218656 < transMapEnsemblV5 |

< transMapEnsemblV5 | < transMapEnsemblV5 |

< transMapEnsemblV5 | Zhu J, Sanborn JZ, Diekhans M, Lowe CB, Pringle TH, Haussler D. < transMapEnsemblV5 | < transMapEnsemblV5 | Comparative genomics search for losses of long-established genes on the human lineage. < transMapEnsemblV5 | PLoS Comput Biol. 2007 Dec;3(12):e247. < transMapEnsemblV5 | PMID: 18085818; PMC: PMC2134963 < transMapEnsemblV5 |

< transMapEnsemblV5 | < transMapEnsemblV5 | < transMapEstV5 | html < transMapEstV5 |

Description

< transMapEstV5 | < transMapEstV5 |

< transMapEstV5 | This track contains GenBank spliced EST alignments produced by < transMapEstV5 | the TransMap cross-species alignment algorithm < transMapEstV5 | from other vertebrate species in the UCSC Genome Browser. < transMapEstV5 | For closer evolutionary distances, the alignments are created using < transMapEstV5 | syntenically filtered BLASTZ alignment chains, resulting in a prediction of the < transMapEstV5 | orthologous genes in crab-eating macaque. < transMapEstV5 |

< transMapEstV5 | < transMapEstV5 | < transMapEstV5 |

Display Conventions and Configuration

< transMapEstV5 | < transMapEstV5 |

< transMapEstV5 | This track follows the display conventions for < transMapEstV5 | PSL alignment tracks.

< transMapEstV5 |

< transMapEstV5 | This track may also be configured to display codon coloring, a feature that < transMapEstV5 | allows the user to quickly compare cDNAs against the genomic sequence. For more < transMapEstV5 | information about this option, click < transMapEstV5 | here. < transMapEstV5 | Several types of alignment gap may also be colored; < transMapEstV5 | for more information, click < transMapEstV5 | here. < transMapEstV5 | < transMapEstV5 |

Methods

< transMapEstV5 | < transMapEstV5 |

< transMapEstV5 |

Source transcript alignments were obtained from vertebrate organisms < transMapEstV5 | in the UCSC Genome Browser Database. BLAT alignments of RefSeq Genes, GenBank < transMapEstV5 | mRNAs, and GenBank Spliced ESTs to the cognate genome, along with UCSC Genes, < transMapEstV5 | were used as available. < transMapEstV5 |
For all vertebrate assemblies that had BLASTZ alignment chains and < transMapEstV5 | nets to the crab-eating macaque (macFas5) genome, a subset of the alignment chains were < transMapEstV5 | selected as follows: < transMapEstV5 |
- For organisms whose branch distance was no more than 0.5 < transMapEstV5 | (as computed by phyloFit, see Conservation track description for details), < transMapEstV5 | syntenic filtering was used. Reciprocal best nets were used if available; < transMapEstV5 | otherwise, nets were selected with the netfilter -syn command. < transMapEstV5 | The chains corresponding to the selected nets were used for mapping. < transMapEstV5 |
- For more distant species, where the determination of synteny is difficult, < transMapEstV5 | the full set of chains was used for mapping. This allows for more genes to < transMapEstV5 | map at the expense of some mapping to paralogous regions. The < transMapEstV5 | post-alignment filtering step removes some of the duplications. < transMapEstV5 |
< transMapEstV5 |
The pslMap program was used to do a base-level projection of < transMapEstV5 | the source transcript alignments via the selected chains < transMapEstV5 | to the crab-eating macaque genome, resulting in pairwise alignments of the source transcripts to < transMapEstV5 | the genome. < transMapEstV5 |
The resulting alignments were filtered with pslCDnaFilter < transMapEstV5 | with a global near-best criteria of 0.5% in finished genomes < transMapEstV5 | (human and mouse) and 1.0% in other genomes. Alignments < transMapEstV5 | where less than 20% of the transcript mapped were discarded. < transMapEstV5 |

< transMapEstV5 |

< transMapEstV5 | < transMapEstV5 |

< transMapEstV5 | To ensure unique identifiers for each alignment, cDNA and gene accessions were < transMapEstV5 | made unique by appending a suffix for each location in the source genome and < transMapEstV5 | again for each mapped location in the destination genome. The format is: < transMapEstV5 |

< transMapEstV5 |    accession.version-srcUniq.destUniq
< transMapEstV5 |

< transMapEstV5 | < transMapEstV5 | Where srcUniq is a number added to make each source alignment unique, and < transMapEstV5 | destUniq is added to give the subsequent TransMap alignments unique < transMapEstV5 | identifiers. < transMapEstV5 |

< transMapEstV5 |

< transMapEstV5 | For example, in the cow genome, there are two alignments of mRNA BC149621.1. < transMapEstV5 | These are assigned the identifiers BC149621.1-1 and BC149621.1-2. < transMapEstV5 | When these are mapped to the human genome, BC149621.1-1 maps to a single < transMapEstV5 | location and is given the identifier BC149621.1-1.1. However, BC149621.1-2 < transMapEstV5 | maps to two locations, resulting in BC149621.1-2.1 and BC149621.1-2.2. Note < transMapEstV5 | that multiple TransMap mappings are usually the result of tandem duplications, where both < transMapEstV5 | chains are identified as syntenic. < transMapEstV5 |

< transMapEstV5 | < transMapEstV5 |

Data Access

< transMapEstV5 | < transMapEstV5 |

< transMapEstV5 | The raw data for these tracks can be accessed interactively through the < transMapEstV5 | Table Browser or the < transMapEstV5 | Data Integrator. < transMapEstV5 | For automated analysis, the annotations are stored in < transMapEstV5 | bigPsl files (containing a < transMapEstV5 | number of extra columns) and can be downloaded from our < transMapEstV5 | download server, < transMapEstV5 | or queried using our API. For more < transMapEstV5 | information on accessing track data see our < transMapEstV5 | Track Data Access FAQ. < transMapEstV5 | The files are associated with these tracks in the following way: < transMapEstV5 |

TransMap Ensembl - macFas5.ensembl.transMapV4.bigPsl
TransMap RefGene - macFas5.refseq.transMapV4.bigPsl
TransMap RNA - macFas5.rna.transMapV4.bigPsl
TransMap ESTs - macFas5.est.transMapV4.bigPsl

< transMapEstV5 | Individual regions or the whole genome annotation can be obtained using our tool < transMapEstV5 | bigBedToBed which can be compiled from the source code or downloaded as < transMapEstV5 | a precompiled binary for your system. Instructions for downloading source code and < transMapEstV5 | binaries can be found < transMapEstV5 | here. < transMapEstV5 | The tool can also be used to obtain only features within a given range, for example: < transMapEstV5 |

< transMapEstV5 | bigBedToBed http://hgdownload.soe.ucsc.edu/gbdb/macFas5/transMap/V4/macFas5.refseq.transMapV4.bigPsl < transMapEstV5 | -chrom=chr6 -start=0 -end=1000000 stdout < transMapEstV5 | < transMapEstV5 | < transMapEstV5 |

Credits

< transMapEstV5 | < transMapEstV5 |

< transMapEstV5 | This track was produced by Mark Diekhans at UCSC from cDNA and EST sequence data < transMapEstV5 | submitted to the international public sequence databases by < transMapEstV5 | scientists worldwide and annotations produced by the RefSeq, < transMapEstV5 | Ensembl, and GENCODE annotations projects.

< transMapEstV5 | < transMapEstV5 |

References

< transMapEstV5 |

< transMapEstV5 | Siepel A, Diekhans M, Brejová B, Langton L, Stevens M, Comstock CL, Davis C, Ewing B, Oommen S, < transMapEstV5 | Lau C et al. < transMapEstV5 | < transMapEstV5 | Targeted discovery of novel human exons by comparative genomics. < transMapEstV5 | Genome Res. 2007 Dec;17(12):1763-73. < transMapEstV5 | PMID: 17989246; PMC: PMC2099585 < transMapEstV5 |

< transMapEstV5 | < transMapEstV5 |

< transMapEstV5 | Stanke M, Diekhans M, Baertsch R, Haussler D. < transMapEstV5 | < transMapEstV5 | Using native and syntenically mapped cDNA alignments to improve de novo gene finding. < transMapEstV5 | Bioinformatics. 2008 Mar 1;24(5):637-44. < transMapEstV5 | PMID: 18218656 < transMapEstV5 |

< transMapEstV5 | < transMapEstV5 |

< transMapEstV5 | Zhu J, Sanborn JZ, Diekhans M, Lowe CB, Pringle TH, Haussler D. < transMapEstV5 | < transMapEstV5 | Comparative genomics search for losses of long-established genes on the human lineage. < transMapEstV5 | PLoS Comput Biol. 2007 Dec;3(12):e247. < transMapEstV5 | PMID: 18085818; PMC: PMC2134963 < transMapEstV5 |

< transMapEstV5 | < transMapEstV5 | < transMapRefSeqV5 | html < transMapRefSeqV5 |

Description

< transMapRefSeqV5 | < transMapRefSeqV5 |

< transMapRefSeqV5 | This track contains RefSeq Gene alignments produced by < transMapRefSeqV5 | the TransMap cross-species alignment algorithm < transMapRefSeqV5 | from other vertebrate species in the UCSC Genome Browser. < transMapRefSeqV5 | For closer evolutionary distances, the alignments are created using < transMapRefSeqV5 | syntenically filtered BLASTZ alignment chains, resulting in a prediction of the < transMapRefSeqV5 | orthologous genes in crab-eating macaque. < transMapRefSeqV5 |

< transMapRefSeqV5 | < transMapRefSeqV5 | < transMapRefSeqV5 |

Display Conventions and Configuration

< transMapRefSeqV5 | < transMapRefSeqV5 |

< transMapRefSeqV5 | This track follows the display conventions for < transMapRefSeqV5 | PSL alignment tracks.

< transMapRefSeqV5 |

< transMapRefSeqV5 | This track may also be configured to display codon coloring, a feature that < transMapRefSeqV5 | allows the user to quickly compare cDNAs against the genomic sequence. For more < transMapRefSeqV5 | information about this option, click < transMapRefSeqV5 | here. < transMapRefSeqV5 | Several types of alignment gap may also be colored; < transMapRefSeqV5 | for more information, click < transMapRefSeqV5 | here. < transMapRefSeqV5 | < transMapRefSeqV5 |

Methods

< transMapRefSeqV5 | < transMapRefSeqV5 |

< transMapRefSeqV5 |

Source transcript alignments were obtained from vertebrate organisms < transMapRefSeqV5 | in the UCSC Genome Browser Database. BLAT alignments of RefSeq Genes, GenBank < transMapRefSeqV5 | mRNAs, and GenBank Spliced ESTs to the cognate genome, along with UCSC Genes, < transMapRefSeqV5 | were used as available. < transMapRefSeqV5 |
For all vertebrate assemblies that had BLASTZ alignment chains and < transMapRefSeqV5 | nets to the crab-eating macaque (macFas5) genome, a subset of the alignment chains were < transMapRefSeqV5 | selected as follows: < transMapRefSeqV5 |
- For organisms whose branch distance was no more than 0.5 < transMapRefSeqV5 | (as computed by phyloFit, see Conservation track description for details), < transMapRefSeqV5 | syntenic filtering was used. Reciprocal best nets were used if available; < transMapRefSeqV5 | otherwise, nets were selected with the netfilter -syn command. < transMapRefSeqV5 | The chains corresponding to the selected nets were used for mapping. < transMapRefSeqV5 |
- For more distant species, where the determination of synteny is difficult, < transMapRefSeqV5 | the full set of chains was used for mapping. This allows for more genes to < transMapRefSeqV5 | map at the expense of some mapping to paralogous regions. The < transMapRefSeqV5 | post-alignment filtering step removes some of the duplications. < transMapRefSeqV5 |
< transMapRefSeqV5 |
The pslMap program was used to do a base-level projection of < transMapRefSeqV5 | the source transcript alignments via the selected chains < transMapRefSeqV5 | to the crab-eating macaque genome, resulting in pairwise alignments of the source transcripts to < transMapRefSeqV5 | the genome. < transMapRefSeqV5 |
The resulting alignments were filtered with pslCDnaFilter < transMapRefSeqV5 | with a global near-best criteria of 0.5% in finished genomes < transMapRefSeqV5 | (human and mouse) and 1.0% in other genomes. Alignments < transMapRefSeqV5 | where less than 20% of the transcript mapped were discarded. < transMapRefSeqV5 |

< transMapRefSeqV5 |

< transMapRefSeqV5 | < transMapRefSeqV5 |

< transMapRefSeqV5 | To ensure unique identifiers for each alignment, cDNA and gene accessions were < transMapRefSeqV5 | made unique by appending a suffix for each location in the source genome and < transMapRefSeqV5 | again for each mapped location in the destination genome. The format is: < transMapRefSeqV5 |

< transMapRefSeqV5 |    accession.version-srcUniq.destUniq
< transMapRefSeqV5 |

< transMapRefSeqV5 | < transMapRefSeqV5 | Where srcUniq is a number added to make each source alignment unique, and < transMapRefSeqV5 | destUniq is added to give the subsequent TransMap alignments unique < transMapRefSeqV5 | identifiers. < transMapRefSeqV5 |

< transMapRefSeqV5 |

< transMapRefSeqV5 | For example, in the cow genome, there are two alignments of mRNA BC149621.1. < transMapRefSeqV5 | These are assigned the identifiers BC149621.1-1 and BC149621.1-2. < transMapRefSeqV5 | When these are mapped to the human genome, BC149621.1-1 maps to a single < transMapRefSeqV5 | location and is given the identifier BC149621.1-1.1. However, BC149621.1-2 < transMapRefSeqV5 | maps to two locations, resulting in BC149621.1-2.1 and BC149621.1-2.2. Note < transMapRefSeqV5 | that multiple TransMap mappings are usually the result of tandem duplications, where both < transMapRefSeqV5 | chains are identified as syntenic. < transMapRefSeqV5 |

< transMapRefSeqV5 | < transMapRefSeqV5 |

Data Access

< transMapRefSeqV5 | < transMapRefSeqV5 |

< transMapRefSeqV5 | The raw data for these tracks can be accessed interactively through the < transMapRefSeqV5 | Table Browser or the < transMapRefSeqV5 | Data Integrator. < transMapRefSeqV5 | For automated analysis, the annotations are stored in < transMapRefSeqV5 | bigPsl files (containing a < transMapRefSeqV5 | number of extra columns) and can be downloaded from our < transMapRefSeqV5 | download server, < transMapRefSeqV5 | or queried using our API. For more < transMapRefSeqV5 | information on accessing track data see our < transMapRefSeqV5 | Track Data Access FAQ. < transMapRefSeqV5 | The files are associated with these tracks in the following way: < transMapRefSeqV5 |

TransMap Ensembl - macFas5.ensembl.transMapV4.bigPsl
TransMap RefGene - macFas5.refseq.transMapV4.bigPsl
TransMap RNA - macFas5.rna.transMapV4.bigPsl
TransMap ESTs - macFas5.est.transMapV4.bigPsl

< transMapRefSeqV5 | Individual regions or the whole genome annotation can be obtained using our tool < transMapRefSeqV5 | bigBedToBed which can be compiled from the source code or downloaded as < transMapRefSeqV5 | a precompiled binary for your system. Instructions for downloading source code and < transMapRefSeqV5 | binaries can be found < transMapRefSeqV5 | here. < transMapRefSeqV5 | The tool can also be used to obtain only features within a given range, for example: < transMapRefSeqV5 |

< transMapRefSeqV5 | bigBedToBed http://hgdownload.soe.ucsc.edu/gbdb/macFas5/transMap/V4/macFas5.refseq.transMapV4.bigPsl < transMapRefSeqV5 | -chrom=chr6 -start=0 -end=1000000 stdout < transMapRefSeqV5 | < transMapRefSeqV5 | < transMapRefSeqV5 |

Credits

< transMapRefSeqV5 | < transMapRefSeqV5 |

< transMapRefSeqV5 | This track was produced by Mark Diekhans at UCSC from cDNA and EST sequence data < transMapRefSeqV5 | submitted to the international public sequence databases by < transMapRefSeqV5 | scientists worldwide and annotations produced by the RefSeq, < transMapRefSeqV5 | Ensembl, and GENCODE annotations projects.

< transMapRefSeqV5 | < transMapRefSeqV5 |

References

< transMapRefSeqV5 |

< transMapRefSeqV5 | Siepel A, Diekhans M, Brejová B, Langton L, Stevens M, Comstock CL, Davis C, Ewing B, Oommen S, < transMapRefSeqV5 | Lau C et al. < transMapRefSeqV5 | < transMapRefSeqV5 | Targeted discovery of novel human exons by comparative genomics. < transMapRefSeqV5 | Genome Res. 2007 Dec;17(12):1763-73. < transMapRefSeqV5 | PMID: 17989246; PMC: PMC2099585 < transMapRefSeqV5 |

< transMapRefSeqV5 | < transMapRefSeqV5 |

< transMapRefSeqV5 | Stanke M, Diekhans M, Baertsch R, Haussler D. < transMapRefSeqV5 | < transMapRefSeqV5 | Using native and syntenically mapped cDNA alignments to improve de novo gene finding. < transMapRefSeqV5 | Bioinformatics. 2008 Mar 1;24(5):637-44. < transMapRefSeqV5 | PMID: 18218656 < transMapRefSeqV5 |

< transMapRefSeqV5 | < transMapRefSeqV5 |

< transMapRefSeqV5 | Zhu J, Sanborn JZ, Diekhans M, Lowe CB, Pringle TH, Haussler D. < transMapRefSeqV5 | < transMapRefSeqV5 | Comparative genomics search for losses of long-established genes on the human lineage. < transMapRefSeqV5 | PLoS Comput Biol. 2007 Dec;3(12):e247. < transMapRefSeqV5 | PMID: 18085818; PMC: PMC2134963 < transMapRefSeqV5 |

< transMapRefSeqV5 | < transMapRefSeqV5 | < transMapRnaV5 | html < transMapRnaV5 |