Research
For billions of years, nature has been conducting the greatest experiment of all time. Imagine one-day gaining access to the detailed notes from these experiments. Today, with worldwide expeditions to collect samples from all habitats, single-cell sequencing of unculturable microbes and the rapid drop in sequencing costs, we can finally tap into nature and gain access to these notes.
Making use of this data, our lab is interested in:
Developing a unified statistical model of protein evolution that integrates phylogenetics, genomic, structural, and functional constraints.
Recent Publications
Hettiarachchi R, et al. 2023. ๐Github
Differentiable Search of Evolutionary Trees from Leaves.ยHwang Y, et al. 2023. ๐Github
Genomic language model predicts protein co-regulation and function.ยPetti S, et al. 2022. ๐GitHub
End-to-end learning of multiple sequence alignments with differentiable Smith-Waterman.ย
Zhang Z, Wayment-Steele HK, et al. 2024. ๐Github
Protein language models learn evolutionary statistics of interacting sequence motifs.ยWang H, et al. 2022.๐Github
Disentanglement of Entropy and Coevolution using Spectral Regularization.ยBhattacharya N, et al. 2021. ๐Github
Interpreting Potts and Transformer Protein Models Through the Lens of Simplified Attention.ยMarshall D, et al. 2020. ๐Github
The structure-fitness landscape of pairwise relations in generative sequence models.Dauparas J, et al. 2019. ๐Github
Unified framework for modeling multivariate distributions in biological sequences.ย
Explicit modeling of the protein conformational (and/or folding) landscape for protein structure prediction and design.
Recent Publications
Roney JP, et al. 2022. ๐Github
State-of-the-Art estimation of protein model accuracy using AlphaFold.ยOvchinnikov S, et al. 2021.
Structure-based protein design with deep learning.ยNorn C, et al. 2021. ๐Github
Protein sequence design by conformational landscape optimization.
(image credit Basile Wicky)ย
Applying the models to mine metagenomic โdark matterโ sequences to discover new protein families, functions, and protein-protein interactions. Probing evolution of multicellularity through comparison of structures and interactions in the early tree of life.
Recent Publications
Pavlopoulos GA, et al. 2023.๐Database
Unraveling the functional dark matter through global metagenomics.ยTrinquier J, et al. 2022.๐Github
SWAMPNN: End-to-end protein structures alignment.Ovchinnikov S, et al., 2017.๐Github
Protein structure determination using metagenome sequence data.ย
All Publications