Skip to main content

Online Talk: Scaling Probabilistic AI with Automatic Differentiation of Probabilistic Programs

Special Programming Languages Seminar - Speaker: Alex Lew, MIT

Event Details

Date
Monday, March 25, 2024
Time
1-2 p.m.
Location
Description

ZOOM LINK: https://uwmadison.zoom.us/j/96126547182

Abstract: By automating the error-prone math behind deep learning, systems such as TensorFlow and PyTorch have supercharged machine learning research, empowering hundreds of thousands of practitioners to rapidly explore the design space of neural network architectures and training algorithms. In this talk, I will show how new programming language techniques, especially generalizations of automatic differentiation, make it possible to generalize and extend such systems to support probabilistic models. Our automation is rigorously proven sound using new semantic techniques for reasoning compositionally about expressive probabilistic programs, and static types are employed to ensure important preconditions for soundness, eliminating large classes of implementation bugs. Providing a further boost, our tools can help users correctly implement fast, low-variance, unbiased estimators of gradients and probability densities that are too expensive to compute exactly, enabling orders-of-magnitude speedups in downstream optimization and inference algorithms.

To illustrate the value of these techniques, I’ll show how they have helped us experiment with new architectures that could address key challenges with today’s dominant AI models. In particular, I’ll showcase systems we’ve built for (1) auditable reasoning and learning in relational domains, enabling the detection of thousands of errors across millions of Medicare records, and (2) probabilistic inference over large language models, enabling small open models to outperform GPT-4 on several constrained generation benchmarks.

Bio: Alex Lew is a final-year PhD student at MIT’s Probabilistic Computing Project, co-advised by Vikash Mansinghka and Josh Tenenbaum, and supported by an NSF Graduate Research Fellowship. Before coming to MIT, he taught high-school computer science at Commonwealth School in Boston. And before that, he was a student at Yale, where he received a B.S. in computer science and math in 2015. (http://alexlew.net/)

Cost
Free

Tags