Yuchen Jin (@Yuchenj_UW) on X – Artofit

Image gallery for: Yuchen Jin (@Yuchenj_UW) on X

Yuchen Jin (@Yuchenj_UW) on X

This is wild - UC Berkeley shows that a tiny 1.5B model beats o1-preview on math using RL! They applied simple RL to DeepSeek-R1-Distill-Qwen-1.5B on 40K math problems, trained at 8K context, then scaled to 16K & 24K. 3,800 A100 hours ($4,500) to beat o1-preview in math! Best
Gokul Swamy (@g_k_swamy) on X

Reinforcement Learning
ℏεsam (@Hesamation) on X

Understanding LLMs and Gen AI
Valeriy M., PhD, MBA, CQF (@predict_addict) on X

Understanding LLMs and Gen AI
Software Development
Youssef Hosni on LinkedIn: If you want to study LLMs check out this series of articles I wrote…

Understanding LLMs and Gen AI
ℏεsam (@Hesamation) on X

Understanding LLMs and Gen AI
ℏεsam (@Hesamation) on X

Understanding LLMs and Gen AI
ℏεsam (@Hesamation) on X

Understanding LLMs and Gen AI
Matrices notes✅

Engineering Mathematics
Maxime Labonne (@maximelabonne) on X

Understanding LLMs and Gen AI
Lenny Rachitsky (@lennysan) on X

SaaS ideas, AI Agents, AI enabled IDE
The internet is going wild for OpenAI's GPT-4o native image generation…

Understanding LLMs and Gen AI