03b614da15d6d8f07f0b974735e84d8fc42bf39c bnguy Fri Apr 3 14:25:45 2020 -0700 Update on projects and tools page regarding NS forest and pyle diff --git src/hg/cirm/gateway/htdocs/tools.html src/hg/cirm/gateway/htdocs/tools.html index 9d3b395..ded488d 100755 --- src/hg/cirm/gateway/htdocs/tools.html +++ src/hg/cirm/gateway/htdocs/tools.html @@ -154,33 +154,32 @@ SCIMITAR was created Pablo Cordero.
-

Tag Storm: Metadata made - simple

+

Tag Storm: Metadata made simple

  • The tag storm format offers an easy way to describe a hierarchical set of metadata for your data. The tag storm format is made up of a number of key/value pairs spread across stanzas at various indentation levels. Lower indentation levels inherit information from higher levels, so the most general metadata, e.g. lab or dataset names, are generally found at the highest level and more specifc metadata, e.g. sex or file name, are found at the lower level.

    The Tag Storm GitHub page contains more information about the format as well as a suite of tools to work with and manipulate tag storm files.

  • NDEx Website
  • PMID: 26594663; PMC: PMC4649937
+

NS-Forest: - Necessary and Sufficient Forest (NS-Forest) for Cell Type Marker Determination from cell type clusters

+ Necessary and Sufficient Forest (NS-Forest) for Cell Type Marker Determination

  • Necessary and Sufficient Forest is a method that takes cluster results from single cell/nuclei RNAseq experiments and generates lists of minimal markers needed to define each cell type cluster. The method begins by re-encoding the cluster labels into binary classifications, and Random Forest models are generated comparing each cluster versus all. The top fifteen genes are then reranked using a score measuring how binary they are, e.g., a gene with expression in the target cluster but no expression in the other clusters would have a high binary score. Expression cutoffs for the top six genes ranked by binary score are then determined by generating individual decision trees and extracting the decision path information. Then all permutations of the top six most binary genes are evaluated using f-beta score as an objective function (the beta value default set at 0.5, which weights the f-measure score more toward precision as opposed to recall)

  • NSForest GitHub
+
+