Bioinformatics Seminar
The Bioinformatics Seminar is co-sponsored by the Department of Mathematics at the Massachusetts Institute of Technology and the Theory of Computation group at MIT's Computer Science and Artificial Intelligence Laboratory (CSAIL). The seminar series focuses on highlighting areas of research in the field of computational biology. This year, we are hoping to highlight three topics: (1) deep learning approaches for biology/biomedicine, (2) algorithms for genomics, and (3) computational methods for understanding and modeling evolution.
Fall 2025
Lectures are on Wednesdays, 11:30am - 1:00pm ET
Location: 32G-575 (Stata Center at MIT; Gates Tower; 5th Floor)
Zoom link for virtual attendants: https://mit.zoom.us/j/95319499071
Date | Speaker | Title/Abstract |
---|---|---|
Sept. 10 (virtual) |
Brian Hie (Stanford)* |
Genome modeling and design across all domains of life All of life encodes information with DNA. While tools for sequencing, synthesis, and editing of genomic code have transformed biological research, intelligently composing new biological systems would also require a deep understanding of the immense complexity encoded by genomes. We introduce Evo 2, a biological foundation model trained on 9.3 trillion DNA base pairs from a highly curated genomic atlas spanning all domains of life. We train Evo 2 with 7B and 40B parameters to have an unprecedented 1 million token context window with single-nucleotide resolution. Evo 2 learns from DNA sequence alone to accurately predict the functional impacts of genetic variation—from noncoding pathogenic mutations to clinically significant BRCA1 variants—without task-specific finetuning. Applying mechanistic interpretability analyses, we reveal that Evo 2 autonomously learns a breadth of biological features, including exon–intron boundaries, transcription factor binding sites, protein structural elements, and prophage genomic regions. Beyond its predictive capabilities, Evo 2 generates mitochondrial, prokaryotic, and eukaryotic sequences at genome scale with greater naturalness and coherence than previous methods. Guiding Evo 2 via inference-time search enables controllable generation of epigenomic structure, for which we demonstrate the first inference-time scaling results in biology. We make Evo 2 fully open, including model parameters, training code, inference code, and the OpenGenome2 dataset, to accelerate the exploration and design of biological complexity. |
Sept. 17 (virtual) |
Ron Dror (Stanford)* |
Discovering Safe, Effective Drugs via Machine Learning and Simulation of 3D Structure Recent years have seen dramatic advances in both experimental determination and computational prediction of macromolecular structures. These structures hold great promise for the discovery of highly effective drugs with minimal side effects, but structure-based design of such drugs remains challenging. I will describe recent progress toward this goal, using both atomic-level molecular simulations and machine learning on three-dimensional structures. |
Sept. 24 | L. Aravind (NCBI) |
Discovering new biochemistry from biological conflicts Biological replicators are locked in deeply intertwined genetic conflicts with each other. Using comparative genomics, protein sequence and structure analysis and evolutionary investigations, my lab has uncovered a staggering diversity of molecular armaments and mechanisms regulating their deployment, collectively termed biological conflict systems. These include toxins used in interorganismal interactions and a host of mechanisms involved in self/nonself discrimination, especially in the context of host-selfish element conflicts. Our studies have helped identify shared syntactical features in the organizational logic of biological conflict systems. These principles can be exploited to discover new conflict systems through computational analyses. Further, we find that across the range of biological organization, from intragenomic conflicts to interorganismal conflicts, a circumscribed set of effector protein domain families is deployed, targeting genetic information flow through the Central Dogma, certain membranes, and key molecules like NAD+ and NTPs. This has led to significant advances in discovering new biochemistry of these systems and furnished new biotechnological reagents for genome editing, sequencing and beyond. I’ll discuss this using specific examples of toxins in interorganismal conflict and effectors in antiviral immunity. |
Oct. 1 | Benedict Paten (UC Santa Cruz) |
Furthering our understanding of human genetic variation: the human pangenome reference project second release Human genomics has relied on a single reference genome for the last twenty years. This reference genome is a corner stone of much of what we do in genomics but it can not, by definition, represent the variation present in the human population, and as a reference introduces a pervasive bias into genomic analyses. I will survey our recent efforts, through the Human Pangenome Reference Consortium, to build and use a reference pangenome – a collection of extremely high-quality reference genomes related together by a consensus genome alignment that we intend as a replacement for the reference genome. |
Oct. 8 (virtual) |
Vagheesh Narasimhan (UT Austin)* |
|
Oct. 15 | Marina Sirota (UC San Francisco) |
|
Oct. 22 | Kishwar Shafin (Google) |
|
Oct. 29 | Elinor Karlsson (UMass Medical, Broad Institute) |
|
Nov. 5 | Chirag Patel (Harvard Medical School) |
|
Nov. 12 | Victoria Popic (Broad Clinical Labs) |
|
Nov. 19 | TBD | |
Nov. 26 | Mile Šikić (Genome Institute of Singapore, University of Zagreb) |
|
Dec. 3 (virtual) |
Fabian Theis (Helmholtz Munich)* |
|
Dec. 10 | Mona Singh (Princeton) |
*Indicates the speaker will be presenting over Zoom. Otherwise, they will be presenting in person.
Past Terms
A listing of the Bioinformatics Seminar series home pages from prior terms.
- Fall 2025
- Fall 2024
- Fall 2023
- Fall 2022
- Spring 2022
- Fall 2021
- Spring 2021
- Spring 2020
- Spring 2019
- Spring 2018
- Spring 2017
- Spring 2016
- Spring 2015
- Spring 2013
- Spring 2011
- Spring 2010
- Spring 2009
- Fall 2008
- Fall 2007
- Spring 2007
- Fall 2006
- Spring 2006
- Fall 2005
- Spring 2005
- Fall 2004
- Spring 2004
- Fall 2003
- Spring 2003
- Spring 2001
Organizers and Information
The Bioinformatics Seminar is hosted by MIT Simons Professor of Mathematics and head of the Computation and Biology group at CSAIL Bonnie Berger. Professor Berger is also Faculty of Harvard-MIT Health Sciences & Technology, Associate Member of the Broad Institute of MIT and Harvard, Faculty of MIT CSB, and Affiliated Faculty of Harvard Medical School.
The seminar is announced weekly via email to members of the seminar's mailing list and to those on CSAIL's event calendar list. It is also posted in the BioWeek calendar.
Bonnie Berger: bab@mit.edu
Megan Le (TA): meganle@mit.edu
To be added to the seminar's email announcement list or for any questions you have about the seminar, please mail bioinfo@csail.mit.edu and cc TA Megan Le (meganle@mit.edu).
If you plan to enroll in the associated course, 18.418/HST.504: Topics in Computational Molecular Biology, please contact Professor Berger (bab@mit.edu) and cc TA Megan Le (meganle@mit.edu) for more information.