198c9b8daecc44fbda6a6494c566c723920f030a lrnassar Wed Mar 11 18:25:21 2026 -0700 Fixing a few hundred clear typos with the help of Claude. Some are less important in code comments, but majority of them are in user-facing places. I manually approved 60%+ of the changes and didn't see any that were an incorrect suggestion, at worst it was potentially uncessesary, like a code comment having cant instead of can't. No RM. diff --git src/hg/makeDb/trackDb/human/encodeEgaspUpdate.html src/hg/makeDb/trackDb/human/encodeEgaspUpdate.html index ab0e231d4bb..44879d67f35 100644 --- src/hg/makeDb/trackDb/human/encodeEgaspUpdate.html +++ src/hg/makeDb/trackDb/human/encodeEgaspUpdate.html @@ -84,31 +84,31 @@ reduced, leading to increasingly complex objects. This process enables the production of alternative transcripts from initial HSPs.
FGenesh++ predictions are based on hidden Markov models and protein similarity to the NR database. For more information, see the reference below.
The GeneID program predicts genes in anonymous genomic sequences designed with a hierarchical structure. In the first step, splice sites, start and stop codons are predicted and scored along the sequence using position weight arrays (PWAs). Next, exons are built from the sites. Exons are scored as the sum of the scores -of the defining sites plus the the log-likelihood ratio of a Markov model for +of the defining sites plus the log-likelihood ratio of a Markov model for coding DNA. Finally, the gene structure is assembled from the set of predicted exons, maximizing the sum of the scores of the assembled exons. The modified version of GeneID used to generate the predictions in this track incorporates models for U12-dependent splice signals in addition to U2 splice signals.
The GeneID subtrack shows all GeneID genes. Only U12 introns and their flanking exons are displayed in the GeneID U12 subtrack. Exons flanking predicted U12-dependent introns are assigned a type attribute reflecting their splice sites, displayed on the details page of the GeneID U12 subtrack as the "Alternate Name" of the item composed of the intron plus flanking exons.
All pseudogenes in this track were manually curated. In the browser, the track details page shows the pseudogene type.
Augustus was written by Mario Stanke at the Department of -Bioinformatics of the University of Göttingen in Germany.
+Bioinformatics of the University of G�ttingen in Germany.Exogean was developed by Sarah Djebali and Hugues Roest Crollius from the Dyogen Lab, Ecole -Normale Supérieure (Paris, France) and Franck Delaplace -from the Laboratoire de Méthodes Informatiques +Normale Sup�rieure (Paris, France) and Franck Delaplace +from the Laboratoire de M�thodes Informatiques (LaMI), (Evry, France).
The FGenesh++ gene predictions were provided by Victor Solovyev of Softberry Inc.
The GeneID-U12 and SGP2-U12 programs were developed by the -Grup de Recerca en Informàtica Biomèdica +Grup de Recerca en Inform�tica Biom�dica (GRIB) at -the Institut Municipal d'Investigació Mèdica (IMIM) in Barcelona. +the Institut Municipal d'Investigaci� M�dica (IMIM) in Barcelona. The version of GeneID on which GeneID-U12 is based (geneid_v1.2) was written by -Enrique Blanco and Roderic Guigó. +Enrique Blanco and Roderic Guig�. The parameter files were constructed by Genis Parra and Francisco Camara. Additional contributions were made by Josep F. Abril, Moises Burset and Xavier Messeguer. Modifications to GeneID that allow for the prediction of U12-dependent splice sites and incorporation of U12 introns into gene models were made by Tyler Alioto.
Jigsaw was developed at The Institute for Genomic Research (TIGR) by Jonathan Allen and Steven Salzberg, with computational gene-finder contributions from Mihaela Pertea and William Majoros. Continued maintenance and development of Jigsaw will be provided by the Salzberg group at the Center for Bioinformatics and Computational Biology (CBCB) at the University of Maryland, College Park.
@@ -212,63 +212,63 @@Stanke, M., Steinkamp, R., Waack, S. and Morgenstern, B. AUGUSTUS: a web server for gene finding in eukaryotes. Nucl. Acids Res., 32, W309-W312 (2004).
Solovyev V.V. "Statistical approaches in Eukaryotic gene prediction". In Handbook of Statistical Genetics (eds. Balding D. et al.) (John Wiley & Sons, Inc., 2001). p. 83-127.
-Blanco, E., Parra, G. and Guigó, R. +Blanco, E., Parra, G. and Guig�, R. "Using geneid to identify genes". In Current Protocols in Bioinformatics, Unit 4.3. (ed. Baxevanis, A.D.) (John Wiley & Sons, Inc., 2002).
-Guigó, R. +Guig�, R. Assembling genes from predicted exons in linear time with dynamic programming. J Comput Biol. 5(4), 681-702 (1998).
-Guigó, R., Knudsen, S., Drake, N. and Smith, T. +Guig�, R., Knudsen, S., Drake, N. and Smith, T. Prediction of gene structure. J Mol Biol. 226(1), 141-57 (1992).
-Parra, G., Blanco, E. and Guigó, R. +Parra, G., Blanco, E. and Guig�, R. GeneID in Drosophila. Genome Research 10(4), 511-515 (2000).
Allen, J.E., Pertea, M. and Salzberg, S.L. Computational gene prediction using multiple sources of evidence. Genome Res., 14(1), 142-8 (2004).
Allen, J.E. and Salzberg, S.L. JIGSAW: integration of multiple sources of evidence for gene prediction. Bioinformatics 21(18), 3596-3603 (2005).
-Guigó, R., Dermitzakis, E.T., Agarwal, P., Ponting, C.P., Parra, G., +Guig�, R., Dermitzakis, E.T., Agarwal, P., Ponting, C.P., Parra, G., Reymond, A., Abril, J.F., Keibler, E., Lyle, R., Ucla, C. et al. Comparison of mouse and human genomes followed by experimental verification yields an estimated 1,019 additional genes. Proc Natl Acad Sci U S A 100(3), 1140-5 (2003).
-Parra, G., Agarwal, P., Abril, J.F., Wiehe, T., Fickett, J.W. and Guigó, R. +Parra, G., Agarwal, P., Abril, J.F., Wiehe, T., Fickett, J.W. and Guig�, R. Comparative gene prediction in human and mouse. Genome Res. 13(1), 108-17 (2003).