Sam Altman @ Stanford CS183 (2024) — Scale + ChatGPT 故事 + AI Utility

2024 年 41 分钟访谈：Scale 哲学 + ChatGPT 故事 + AI 作为新 Utility + 3 个未来 Fork

核心论点

AI 时代唯一"反共识但正确"的认知 = "scale 永远比共识认为的更有效"。Sam 用 YC 批次规模、神经网络 scaling、OpenAI 内部组织为例：找到"小规模已经在 working 的东西" → 推到前所未有的规模 → 几乎 always 出 interesting results。

教育是 Sam 自承的最大 prediction error："我以为 ChatGPT 上线 1 年内教育会大改，3.5 年过去了没看到任何系统性变化。"

Scale 哲学（Sam 最重要的洞察）

"I offer no theory that I find satisfying to explain it... but empirically it does seem to be true, which is all of the most interesting things I have observed in my career in watching other things happen. All of the most interesting ones have had something to do with emergent properties that scale or scale continuing to provide returns far beyond what the consensus thinks will work."

3 个 scale 案例

案例	共识认为	实际发生
神经网络 scaling	"已知能 scale，不需要再试"	LLM 能力持续突破
YC batch size	"太多公司，应该缩到 10 个/batch"	批次网络效应 — 50+ 公司/batch 反而更好
OpenAI compute scale	"10K-100K GPUs 不可能"	GPT-4 训练需要 ~25K H100s 集群

为什么大多数人不敢 scale

🔧 Stuff breaks

"Stuff breaks at accelerating rate and in an unpredictable way as you scale"

👥 聪明人劝退

"There are always very smart people who say why you shouldn't do this"

🧠 类比陷阱

突破需要 first principles reasoning，类比推理不 work

💡 Sam 类比

"We did not evolve to be good at thinking about exponentials. People have a hard time imagining that scaling laws are going to continue exponentially."

ChatGPT 创业故事（核心）

2020-2022 时间线

GPT-3 阶段（2020 夏）

API 推出 → "shockingly bad"（vs 现在模型）· 唯一 work 的商业用例：copyrighting

Twitter 走红

一个月后 API 莫名走红 · 人们用 API key chat with it（但 OpenAI 不想做 chatbot）

决策：Build Chatbot（2022 秋）

"We can build a good chatbot. People clearly want that."
本意是 research demo → 目的让其他公司 pay API

5 天 emergency 转向

每天 traffic shot up → 跌 → "hype cycle" → 第二天创新高 · Sam 第 4-5 天说："I know how this works"

2 个月 crazy scaling

"Business model later, just don't run out of compute bills" · ChatGPT 上线 → rest is history

"Under the YC principle of see what your users love and do that."

"When something really starts growing and it's not very good, you have like a guaranteed hit on your hands."

Codex / Code 故事

ChatGPT 前计划

内部原本要 all in on code
"Coding 是模型控制 computer 的方式"
"Robotics 是控制物理世界"
模型够 smart + actuators（写 code + 驱动 robot）= 真正能做事

实际进度

时间	事件
早期	Codex 进展一般
2024 初	Codeex 真正变好（5.5 是 inflection point）
2024+	用户做 "incredible things"

Pipeline 现状

Pre-training

Mid-training

Post-training

RL + Supervised

⚠️ Sam 自评

"这 pipeline 有点怪，不太像 optimal solution。我们肯定会 major rewrite，但不知道何时。"

⚡ AI = 新 Utility（最深刻框架）

"We are in the process of creating a new utility. This doesn't happen very often. You know, electricity is a utility, internet's a utility, water, I guess there's not a lot of these."

Electricity 类比

💡 早期 electricity 公司

没卖 electricity — 没人知道要
卖 light at night（"我们不卖电，是卖夜里的光"）
副业："By the way, 你可以用同个东西洗衣"
用户："太远了我不信"

"Even if we're totally right that intelligence is going to become this new utility... I kind of don't think the right way for us to analogize that is 'we're selling intelligence' because people are just like somehow not resonating. I don't know what our equivalent of 'we're selling you light at night' is going to be."

Jensen vs Sam utility 视角

视角	主张	抽象层级
Jensen	Compute 是 utility	硬件层（chips）
Sam	Intelligence 是 utility	服务层（tokens）
真实答案	消费者看到 tokens，hardware 被抽象掉	—

📱 类比：手机

"Pay for cell phone bill... you think about access to the whole system and the particular hardware at the base station and how it connects to the internet. You don't think about that as much."

OpenAI 关键目标（2026-2028）

时间	目标	意义
2026 年 9 月	用 500K A100-equivalent GPUs 作 AI research intern	Compute scale 极限
2028 年 3 月	Full end-to-end very talented researcher（能发明完整新架构）	AGI 实质突破

⚠️ 注意

这些是 Sam 2024 年在 Stanford 讲的目标，可能已更新。但 framework 仍 valid。

跟 Nate Herk "Stop AI" 视频关联

OpenAI 6/8 文章 "Built to Benefit Everyone" 跟 Sam 在 Stanford 讲的目标完全一致：

Sam 在 Stanford: "2028.3 = full researcher" → OpenAI 文章: "2028.3 = AI 自动化研究员"
Sam 在 Stanford: "democratization utility" → OpenAI 文章: "personal AGI for everyone"
Sam 在 Stanford: "international regulation" → OpenAI 文章: "international coordination"

🔀 3 个未来 Fork（10 年视角）

① 民主化 vs 集中

"How much is this technology going to be very widely democratized versus how much is it going to sit in a few companies."

Sam 概率：80% 民主路径

② Compute 分配

"How specifically how we distribute compute."

H100 / Blackwell 5x 价差 · "Compute shortage forever"

③ 经济模型

UBI vs ownership vs capitalism？

Sam 偏好 citizen wealth fund（不是现金分红）

Fork 1：民主化

Default trajectory: 集中到少数公司 → "成为地球财富重要部分"
这显然 terrible → OpenAI 会努力推民主
Utility 模型 = 防止集中
"集中 → alignment failure + 脆弱世界"

"The risk of keeping this concentrated in a handful of companies even though we would be one of these companies is not something we should tolerate."

Fork 2：Compute 短缺

现状（2024 视角）	Sam 预测
H100 / Blackwell 大幅短缺	"As long as we can continue to make progress... there will be a shortage forever"
Long-term vs spot 价差 5x	Demand 永远 > supply
"几乎全部 gone for this year"	"100 个 personal agents 一直 work for you"

⚠️ Why are people not freaking out?

"People assume we will make big inference gains on the hardware we have... I also think there is a tsunami of hardware coming but maybe the demand tsunami is even bigger."

Fork 3：经济模型

✅ Sam 偏好：Citizen Wealth Fund

Ownership stake > 现金分红
参考 Norwegian Sovereign Wealth Fund（1.5% 全球公开公司）
"As leverage shifts from labor to capital"
美国政府已是最大雇主 = 已有 redistribution

❌ Sam 不偏好：纯 UBI

"Fixed monthly cash dividend"
不利用 human psychology
"I funded a big UBI study a while ago"
但看了 startup 投资后改想法

"I would love to see... that we find a way to have something like a citizens wealth fund in the country or in the world eventually where you basically own a slice of capitalism."

⚔️ 反驳 Yann LeCun（"LLMs 是死胡同"）

Sam 的回应

已超人类

LLMs 已经远超人类在某些方面

仍差人远

长 horizon + 高 judgment 任务

数学突破

昨天模型 disprove 一个数学 conjecture（smart scientists 之前说"不会发生"）

需要 World Model

但 robotics 显然需要 · 赌 LLM scaling 不 work 感觉 misguided

"Field held back by a generation of scientists who just were way too certain on what scaling was not going to produce and then some people just looked at the graphs and said, 'Well, it looks like it's continuing beautifully. Let's keep going.'"

On "I told you so"

Sam 的态度

"The data is quite strong on our side and I don't think it'd be that fun to say I told you so."

"You were like she was nervous. You're still going on about it. Like the data is quite strong on our side."

On critics + identity

"If you make your identity about a particular thing is going to work or not work and then the science disproves you and you're too hung up on your identity, you can't let it go. You can't see the truth. I think this is a form of insanity."

🎓 Education 没变（Sam 最大 prediction error）

Sam 2022 预期

"ChatGPT 上线 → 1 年学生 cheating → 教育系统大改 → 教得更好"

2024 现实

❌ Prediction error

"I struggle to point to any significant systemic change that I've seen in the education system at large in the three and a half years since ChatGPT launched."

Sam 的担心

"If we continue to teach and evaluate students as if we were in a pre-agi world, it's not going to work and it is going to lead to like atrophy of learning how to think."

仍要学的

✍️ Writing

Sam 自述靠 writing 来 think

💻 Programming

同样 meta skill

🧠 Meta Skill of Thinking

"Machines can do better, but useful to teach"

Sam 选课建议

大一 3 个 intro sem 一季
"Shallow understanding of lots of different fields was an incredible thing"
如果只上 CS + physics，会少很多 perspective
对 Patrick 启示：跨学科 exposure 价值巨大

💬 关键金句库

Scale 哲学

"All of the most interesting things I have observed in my career have had something to do with emergent properties that scale or scale continuing to provide returns far beyond what the consensus thinks will work."

民主化

"The risk of keeping this concentrated in a handful of companies even though we would be one of these companies is not something we should tolerate."

"I told you so"

"AI is just going to keep going. And I think this is considered I don't think this is like widely believed yet."

Education

"I struggle to point to any significant systemic change that I've seen in the education system at large in the three and a half years since ChatGPT launched."

Identity

"If you make your identity about a particular thing is going to work or not work and then the science disproves you... I think this is a form of insanity."

给学生

"No matter what you all do, we're going to have incredible models. I promise here like pretty quickly. But I think we have not invested enough in being able to deliver at scale huge amounts of cheap intelligence."

Exponentials

"We did not evolve to be good at thinking about exponentials. People have a hard time imagining that scaling laws are going to continue exponentially."

YC principle

"Under the YC principle of see what your users love and do that."

Guaranteed hit

"When something really starts growing and it's not very good, you have like a guaranteed hit on your hands."

Light at night

"I don't think the right way for us to analogize that is 'we're selling intelligence' because people are just like somehow not resonating. I don't know what our equivalent of 'we're selling you light at night' is going to be."

Compute 短缺

"As long as we can continue to make progress on this there will be a shortage forever and things will be bid among above what the price we think the price should be."

Citizen wealth fund

"I would love to see... that we find a way to have something like a citizens wealth fund in the country or in the world eventually where you basically own a slice of capitalism."