@@ -1463,12 +1463,12 @@ <h5 align="left">Peter Grabowski, Lead of Gemini Applied Research, Google</h5>
 <div class="col-md-12 v-center">
 <h6>Talk Abstract</h6>
 <p>
-Want to get started with LLMs? This lecture will cover an introduction to language modeling and prompt engineering, example use cases and applications, and a discussion of common considerations for LLM usage (cost, efficiency, accuracy, bias).
+Coming soon!
 </p>
 
 <h6>Speaker Bio</h6>
 <p>
-Peter leads the Gemini Applied Research group, focused on developing fast, efficient, and scalable models in partnership with DeepMind, Search, Ads, Cloud, and other teams across Google. Prior to that, he led a group focused on Google's Enterprise AI, worked on making the Google Assistant better for Kids, and led the data integration / machine learning team at Nest. Peter loves to teach, and is a member of the faculty at UC Berkeley's School of Information, where he teaches courses focused on Deep Learning and Natural Language Processing.
+Christopher Bishop is a Microsoft Technical Fellow and a member of Microsoft Research AI for Science. Chris obtained a BA in Physics from Oxford, and a PhD in Theoretical Physics from the University of Edinburgh, with a thesis on quantum field theory. After his PhD he joined the Theoretical Physics Division of Culham Laboratory, where he conducted research into the physics of magnetically confined fusion plasmas. During this time, he developed an interest in machine learning and became Head of the Applied Neurocomputing Centre at AEA Technology. He was subsequently elected to a Chair in the Department of Computer Science and Applied Mathematics at Aston University, where he set up and led the Neural Computing Research Group. He joined Microsoft in 1997 and was Lab Director of Microsoft Research Cambridge from 2015 until 2022, when he founded the new AI for Science team. At Microsoft Research, Chris oversees a global portfolio of research, focussed on machine learning for the natural sciences.
@@ -1502,5 +1502,5 @@
-<h4 align="left">Introduction to LLM Post-Training</h4>
-<h5 align="left">Maxime Labonne, Head of Post-Training, Liquid AI</h5>
+<h4 align="left">Massively Parallel Training</h4>
+<h5 align="left">Mathias Lechner, Co-Founder and Chief Technology Officer, Liquid AI</h5>
 </div>
 </div>
 </div>
@@ -1509,12 +1509,12 @@ <h5 align="left">Maxime Labonne, Head of Post-Training, Liquid AI</h5>
 <div class="col-md-12 v-center">
 <h6>Talk Abstract</h6>
 <p>
-In this talk, we will cover the fundamentals of modern LLM post-training at various scales with concrete examples. High-quality data generation is at the core of this process, focusing on the accuracy, diversity, and complexity of the training samples. We will explore key training techniques, including supervised fine-tuning, preference alignment, and model merging. The lecture will delve into evaluation frameworks with their pros and cons for measuring model performance. We will conclude with an overview of emerging trends in post-training methodologies and their implications for the future of LLM development.
+This lecture covers how to scale the training of deep neural networks to thousands of GPUs. It begins by motivating why GPUs are essential for training (comparing the FLOPs of GPUs and CPUs) and why scaling to larger models and datasets improves performance, drawing on the scaling laws of Kaplan et al. and LLaMA. The talk then explores the memory requirements of training and techniques to reduce them, including activation checkpointing and offloading. The bulk of the lecture covers parallelism strategies: data parallelism, tensor parallelism, pipeline parallelism, and sequence/context parallelism, as well as sharding approaches such as DeepSpeed ZeRO and FSDP. It also touches on sparsity through Mixture of Experts and expert parallelism. Throughout, network bandwidth is highlighted as a key bottleneck. The lecture concludes with a case study of LFM2, showing how these techniques combine in practice.
 </p>
 
 <h6>Speaker Bio</h6>
 <p>
-Maxime Labonne is Head of Post-Training at Liquid AI. He holds a Ph.D. in Machine Learning from the Polytechnic Institute of Paris and is a Google Developer Expert in AI/ML. He has made significant contributions to the open-source community, including the LLM Course, tutorials on fine-tuning, tools such as LLM AutoEval, and several state-of-the-art models like NeuralDaredevil. He is the author of the best-selling books “LLM Engineer’s Handbook” and “Hands-On Graph Neural Networks Using Python”.
+Mathias Lechner is Co-Founder and Chief Technology Officer (CTO) at Liquid AI, as well as a Research Affiliate at the Computer Science and Artificial Intelligence Laboratory (CSAIL) at MIT, where he collaborates with Prof. Daniela Rus. He completed his PhD in 2022 at the Institute of Science and Technology Austria (ISTA), under the supervision of Tom Henzinger. Before his PhD, he earned his master’s (2017) and bachelor’s (2016) degrees in Computer Science from the Vienna University of Technology (TU Wien).
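The outgoing post-training abstract above names supervised fine-tuning as a core technique. For orientation only, a minimal sketch of one SFT step on a Hugging Face causal LM, computing the loss only on response tokens; the gpt2 checkpoint and the toy prompt/response pair are illustrative placeholders, not material from the talk.

```python
# Minimal supervised fine-tuning (SFT) step: train a causal LM on a
# (prompt, response) pair, computing cross-entropy only on the response.
# Model name and data are placeholders.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-5)

prompt = "Question: What is post-training?\nAnswer:"
response = " Adapting a pretrained model to follow instructions."

prompt_ids = tokenizer(prompt, return_tensors="pt").input_ids
full_ids = tokenizer(prompt + response, return_tensors="pt").input_ids

# Mask the prompt positions with -100 so the loss ignores them.
labels = full_ids.clone()
labels[:, : prompt_ids.shape[1]] = -100

loss = model(input_ids=full_ids, labels=labels).loss
loss.backward()
optimizer.step()
optimizer.zero_grad()
```

Preference alignment and model merging, also mentioned in the abstract, build on top of such an SFT checkpoint.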
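The incoming parallel-training abstract surveys sharding approaches such as DeepSpeed ZeRO and FSDP. As a rough sketch under stated assumptions (a torchrun launch with NCCL, and a placeholder two-layer model), fully sharded data parallelism in PyTorch can look like this:

```python
# Sketch: fully sharded data parallelism with PyTorch FSDP.
# Launch with, e.g.: torchrun --nproc_per_node=8 fsdp_sketch.py
# The model and the random batch are placeholders.
import os
import torch
import torch.distributed as dist
from torch.distributed.fsdp import FullyShardedDataParallel as FSDP

dist.init_process_group(backend="nccl")
local_rank = int(os.environ["LOCAL_RANK"])
torch.cuda.set_device(local_rank)

model = torch.nn.Sequential(
    torch.nn.Linear(1024, 4096),
    torch.nn.ReLU(),
    torch.nn.Linear(4096, 1024),
).cuda()

# FSDP shards parameters, gradients, and optimizer state across ranks,
# ZeRO-style, gathering full weights only around each forward/backward.
model = FSDP(model)
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)

x = torch.randn(8, 1024, device="cuda")  # each rank sees its own data shard
loss = model(x).pow(2).mean()
loss.backward()  # gradients are reduce-scattered across ranks
optimizer.step()
optimizer.zero_grad()

dist.destroy_process_group()
```

Because full weights are materialized transiently around each layer's forward and backward pass, inter-GPU bandwidth becomes the key bottleneck the abstract highlights.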
@@ -1558,12 +1558,12 @@
-The potential of AI in biology is immense, yet its success is contingent on interfacing effectively with wet-lab experimentation and remaining grounded in the system, structure, and physics of biology. I will share how, at Microsoft Research, we are developing new AI systems that help us better understand and design biology via generative design and interactive discovery. I will focus on Generative AI models for the design of novel and useful biomolecules, expanding our ability to engineer new proteins for therapeutic, biological, and industrial applications and beyond.
+Coming soon!
 </p>
 
 <h6>Speaker Bio</h6>
 <p>
-Ava Amini is a Senior Researcher at Microsoft, where she develops new AI technologies for precision biology and medicine. She completed her PhD in Biophysics at Harvard University and her BS in Computer Science and Molecular Biology at MIT, and has been recognized by the National Academy of Engineering, the National Science Foundation, TEDx, VentureBeat, and the Association of MIT Alumnae, among others, for her research. Ava is passionate about AI education and outreach: she is a lead organizer and instructor for MIT Introduction to Deep Learning, where she has taught AI to thousands of students in person and over 100,000 globally registered students online, garnering more than 11 million online lecture views, and she co-founded and directed MomentumAI, which ran all-expenses-paid education programs for high schoolers to learn AI.