ucb_agentic_ai

Lecture 04: Open Training Recipes for Reasoning in Language Models

Link to lecture recording on YouTube

Date: 2025-02-24

Speaker: Hannaneh Hajishirzi

Speaker’s social profile: Website / University Profile / Google Scholar / GitHub / LinkedIn / X (Twitter)

Education:

Work:

Notes

AI has reached its current state thanks to open scientific practices and fully open models (transparent, reproducible, accessible). Many advances are still needed to push language models beyond language and to mitigate their biases and risks.
Project OLMo: a fully open ecosystem for developing, studying, and advancing LMs; open, documented, and reproducible.

Stage              Tools
Pre-training       OLMo, OLMo2, OLMoE, Dolma
Post-training      Tulu, OLMo-Instruct
Test-time Scaling  S1, Open Scholar, Self-RAG

Pre-training:

Post-training:

Post-training

Pre-training

Test-time Inference

[Incomplete, work in progress]

References