Cosmic Connections: A ML X Astrophysics Symposium at Simons Foundation

Name: Cosmic Connections: A ML X Astrophysics Symposium at Simons Foundation
Start: 2023-05-22T08:50:00-04:00
End: 2023-05-24T17:00:00-04:00
Location: 162 5th Avenue

May 22 – 24, 2023

162 5th Avenue

America/New_York timezone

Session

Invited Talk

May 22, 2023, 9:50 AM

Ingrid Daubechies Auditorium/2-IDA (162 5th Avenue)

Ingrid Daubechies Auditorium/2-IDA

162 5th Avenue

200

Description

Chair: Julia Kempe

Recently, the theory of infinite-width neural networks led to the first technology, muTransfer, for tuning enormous neural networks that are too expensive to train more than once. For example, this allowed us to tune the 6.7 billion parameter version of GPT-3 using only 7% of its pretraining compute budget, and with some asterisks, we get a performance comparable to the original GPT-3 model with twice the parameter count. In this talk, I will explain the core insight behind this theory. In fact, this is an instance of what I call the Optimal Scaling Thesis, which connects infinite-size limits for general notions of “size” to the optimal design of large models in practice, illustrating a way for theory to reliably guide the future of AI. I'll end with several concrete key mathematical research questions whose resolutions will have incredible impact on how practitioners scale up their NNs.

There are no materials yet.

Building timetable...

Choose timezone

Cosmic Connections: A ML X Astrophysics Symposium at Simons Foundation

Description

Presentation materials