Event sponsored by:
Computational Biology and Bioinformatics (CBB)
Biostatistics and Bioinformatics
Duke Center for Genomic and Computational Biology (GCB)
Precision Genomics Collaboratory
School of Medicine (SOM)
Contact:
Franklin, MonicaSpeaker:
David Page
I admit the title intentionally overstates the case. But many (most?) high-throughput biology datasets are based on aggregates, where aggregation occurs during either the experiment or data post-processing. As a result, any node in a graphical model of the data (e.g., Bayes net, dynamic Bayes net, Markov net, point process, or CRF) really is an aggregate of many idealized single-measurement nodes, so the real model can be viewed as a high-dimensional tree-structured graphical model. We prove that such models correspond to neural networks, and also that every neural network can be viewed as such a model. Based on this theoretical result, we discuss potential applications, including causal neural networks and the potential for a future "foundation model" for health. We also use examples from clinical data (such as EHRs) in addition to biological data.
CBB Monday Seminar Series