The Cytoscape Ecosystem & NDEx Network Cloud
The Ideker lab is involved in development of several bioinformatic resources for network analysis that are widely used by the biological research community. The most visible of these is a collaborative open-source software project called Cytoscape. Cytoscape is one of the principal tools used by researchers to analyze and visualize molecular interaction networks, with approximately 17,000 downloads per month, and it is also used in almost all of the lab’s research activities. Now in its 16th year, the Cytoscape codebase is curated by four main institutions via funding from the P41 NIGMS National Resource for Network Biology and an NHGRI R01, both of which Dr. Ideker directs. In the past few years the platform has been substantially extended to include a cloud-based storage system for networks, much like Google Drive and DropBox provide for online storage and sharing of other types of documents. This network store, which we call the Network Data Exchange (NDEx), is funded through a separate U24 grant from NCI.
The core Cytoscape application has been frequently extended through a straightforward plug-in architecture, giving ready access to over 330 plug-ins (Cytoscape ‘Apps’) which are presently available and approximately half of which have been described in their own peer-reviewed publications. Approximately 160 of these Apps have been newly published or significantly updated in the past three-year review period, most of which are by independent groups and do not include me or my team as authors. Popular Apps extend Cytoscape in areas such as network query and download; network integration and filtering; attribute-directed network layout; Gene Ontology enrichment analysis; as well as network motif, functional module, protein complex, or domain interaction detection. Our plan for Cytoscape development for the remainder of 2018 is to release Cytoscape 3.7, which will involve major upgrades to the core architecture and seamless roundtrip connectivity to NDEx.
The NDEx Project provides an open-source framework where scientists and organizations can find, store, share and publish biological network knowledge. The project maintains a free Public Server and an informational website with technical documentation.
NAGA is designed to use biological networks to analyze GWAS results. NAGA assigns each gene with an association score based on the given GWAS result. To integrate prior biological knowledge, NAGA downloads a molecular network from the NDEx (hyperlink NDEx?) database and performs network propagation, providing a set of new scores for each gene. The high scoring genes form a new subnetwork, which can be compared to a set of gold standard genes in order to evaluate the enrichment for previously discovered biology.
Related paper: Carlin DE, et al. A Fast and Flexible Framework for Network-Assisted Genomic Association. iScience. 2019 [PDF] [PubMed]
The Clique Extracted Ontologies algorithm (CliXO) infers an ontology in the form of a hierarchical, directed acyclic graph (DAG) from pairwise similarity data. Originally developed for inferring gene ontologies from biological gene networks.
The Network Extracted Ontology (NeXO) is a gene ontology inferred directly from large-scale molecular networks. NeXO uses a principled computational approach which integrates evidence from hundreds of thousands of individual gene and protein interactions to construct a complete hierarchy of cellular components and processes.
Cell Circuit Search: Molecular interaction models provide us with a framework for integrating the large-scale data that we are now able to collect at multiple levels of biological information – genes, RNAs, proteins, and small molecules. Cell Circuit Search is a web-based interface for searching for genes that appear in our library of network models.
NetworkBLAST Software: NetworkBlast analyzes protein interaction networks in order to predict previously unknown relationships. It can compare multiple species’ protein interaction networks and infer interactions through homology. The program is best used in conjunction with Cytoscape to easily visualize the returned data.
PathBLAST Website: Pathway alignment and query against protein interaction databases to identify conserved protein interaction networks between species. PathBLAST searches the protein-protein interaction network of the target organism to extract all protein interaction pathways that align with a pathway query.
VERA and SAM: VERA and SAM was developed to address the need for a better statistical test for identifying differentially-expressed genes. VERA estimates the parameters of a statistical model that describes multiplicative and additive errors influencing an array experiment, using the method of maximum likelihood. SAM gives a value, lambda, for each gene on an array, which describes how likely it is that the gene is expressed differently between the two cell populations and was developed to address the need for a better statistical test for identifying differentially-expressed genes.
CrazyQuant: CrazyQuant is a spot-finding tool developed by Trey Ideker while at the University of Washington (cited in O’Reilly’s Beginning Perl for Bioinformatics). This JAVA-based applet allows the loading and quantitation of your cDNA microarray images and membranes. CrazyQuant is a spot-finding tool developed by Trey Ideker while at the University of Washington (cited in O’Reilly’s ). This JAVA-based applet allows the loading and quantitation of your cDNA microarray images and membranes.
Dapple: Dapple is a program for quantitating spots on a two-color DNA microarray image. Given a pair of images from a comparative hybridization, Dapple finds the individual spots on the image, evaluates their qualities, and quantifies their total fluorescent intensities. Dapple is designed to work with microarrays on glass and is a program for quantitating spots on a two-color DNA microarray image.
enoLOGOS: Program enoLOGOS generates LOGOs of transcription factor DNA binding sites from various types of input matrices. It can utilize standard count matrices, probability matrices or matrices of “energy” values (i.e., log-frequencies).