VISABLE MACHINE LEARNING

Visible machine learning for predictive oncology and other genotype-phenotype challenges. We are developing “visible” machine learning approaches to model the flow of genetic information, in which predictive models learn not only to translate genotype to phenotype but to also identify the molecular functions and mechanisms by which these predictions are made. The central concept is to couple the structure of a machine learning model to the structure and function of a target biological system, creating what we have called visible neural networks (VNNs). Whereas a typical predictive model is not readily interpretable due to many hidden variables and states (a “black box”), VNNs can be more directly inspected to reveal the molecular and cellular events responsible for each prediction. They provide a framework for interpretable deep learning, combining the generality, scale and power of neural networks with a biomechanistic understanding.

Ma et al. Nature Methods 2018 (Cover)
Yu et al. Cell Systems 2016 (Cover) 

These concepts led to DCell, a deep neural network modeling approximately 3500 subsystems in a budding yeast cell and which is able to accurately translate genotypes to growth phenotypes [3a,b below]. Using DCell as a foundation, we created deep neural networks of cancer which predict response to therapy based on the tumor genotype (i.e. its profile of genetic markers) and the drug formula [3c, Personal Statement 3,4]. Visible deep learning models open up the possibility to address a host of biomedical questions which have been recalcitrant to machine learning thus far, with significant applications in cancer genomics.