Home Videos

Dario Amodei — The highest-stakes financial model in history (YouTube Video Transcript)

Need transcripts for other videos? Try our YouTube Transcript Generator →
Title: Dario Amodei — The highest-stakes financial model in history
Duration: 02:22:20
Total Correct Answers:
Current Caption
Correct

Learning Modes

YouTube Video Transcript Hide

Ask AI Result

The ask AI result will appear here..
(00:00:00) Your YouTube transcript will appear here (00:00:00) So we talked three years ago. I'm (00:00:02) curious in your view, what has been the (00:00:04) biggest update of the last three years? (00:00:05) What has been the biggest difference (00:00:06) between what it felt like last three (00:00:08) years versus now? (00:00:08) >> Yeah, I would say actually the (00:00:11) underlying technology like the (00:00:13) exponential of the technology has has (00:00:15) gone broadly speaking I would say about (00:00:18) about as I expected it to go. I mean (00:00:20) there's like plus or minus you know a (00:00:22) couple there's plus or minus a year or (00:00:24) two here. There's plus or minus a year (00:00:25) or two there. I don't know that I would (00:00:27) have predicted the specific direction of (00:00:29) code. Um but but actually when I look at (00:00:32) the exponential it it is roughly what I (00:00:36) expected in terms of the march of the (00:00:38) models from like you know smart high (00:00:40) school student to smart college student (00:00:41) to like you know beginning to do PhD and (00:00:44) professional stuff and in the case of (00:00:45) code reaching beyond that. So you know (00:00:48) the frontier is a little bit uneven. (00:00:49) It's roughly what I expected. I will (00:00:52) tell you though what the most surprising (00:00:53) thing has been. The most surprising (00:00:55) thing has been the lack of public (00:00:57) recognition of how close we are to the (00:01:01) end of the exponential. To me, it is (00:01:03) absolutely wild that, you know, you have (00:01:07) people, you know, within the bubble and (00:01:09) outside the bubble, you know, but but (00:01:10) you have people talking about these (00:01:12) these, you know, just the same tired old (00:01:15) hot button political issues and like, (00:01:18) you know, around us. We're like near the (00:01:20) end of the exponential. I I want to (00:01:22) understand what that exponential looks (00:01:25) like right now because the first (00:01:26) question I asked you when we recorded (00:01:28) three years ago was, you know, what's up (00:01:29) with scaling? How why does it work? Um I (00:01:32) have a similar question now but I feel (00:01:34) like it's a more complicated question (00:01:35) because at least from the public's point (00:01:36) of view. (00:01:37) >> Yes. (00:01:37) >> Three years ago there were these you (00:01:39) know well-known public trends where (00:01:42) across many orders of magnitude of (00:01:43) compute you could see how the loss (00:01:44) improves and now we have RL scaling and (00:01:47) there's no publicly known scaling law (00:01:49) for it. It's not even clear what exactly (00:01:51) the story is of is this supposed to be (00:01:53) teaching the model skills is supposed to (00:01:54) be teaching metalarning. Um what is the (00:01:57) scaling hypothesis at this point? (00:01:59) >> Yeah. So, so I have actually the same (00:02:01) hypothesis that I had even all the way (00:02:04) back in 2017. So, in 2017, I think I (00:02:07) talked about it last time, but I wrote a (00:02:08) doc called the the big blob of compute (00:02:11) hypothesis. And and and you know, it it (00:02:13) wasn't about the scaling of language (00:02:15) models in particular. When I when I (00:02:16) wrote it, GPT1 had had just come out, (00:02:19) right? So, that was you know, one among (00:02:21) many things, right? There was back in (00:02:23) those days there was robotics. People (00:02:25) tried to work on reasoning as a separate (00:02:27) thing from language models. there was (00:02:29) scaling of the kind of RL that happened (00:02:31) that you know kind of happened in Alph (00:02:33) Go and uh you know that that happened at (00:02:36) Dota at OpenAI and um you know people (00:02:39) remember Starcraft at Deep Mind you know (00:02:41) the Alpha Star um so uh it was written (00:02:44) as a more general document and and the (00:02:46) specific thing I said was the following (00:02:48) that and you know it's it's very you (00:02:50) know Rich Sutton put out the bitter (00:02:52) lesson a couple years later um uh but (00:02:55) you know the the hypothesis is is (00:02:56) basically the same so so what it says is (00:02:59) all the cleverness, all the techniques, (00:03:02) all all the kind of we need a new method (00:03:04) to to do something like that doesn't (00:03:06) matter very much. There are only a few (00:03:08) things that matter and I think I listed (00:03:09) seven of them. One is like how much raw (00:03:12) compute you have. The other is the (00:03:15) quantity of data that you have. Then the (00:03:17) third is kind of the quality and (00:03:19) distribution of data, right? It needs to (00:03:21) be a broad broad distribution of data. (00:03:24) The fourth is I think how long you train (00:03:26) for. Um the fifth is you need an (00:03:29) objective function that can scale to the (00:03:31) moon. So the pre-training objective (00:03:33) function is one such objective function (00:03:36) right another objective function is you (00:03:39) know the the kind of RL objective (00:03:41) function that says like you have a goal (00:03:43) you're going to go out and reach the (00:03:44) goal within that of course there's (00:03:46) objective rewards like you know like you (00:03:48) see in math and coding and there's more (00:03:51) subjective rewards like you see in RL (00:03:53) from human feedback are kind of higher (00:03:54) order higher order versions of that and (00:03:57) and then the sixth and seventh were (00:03:59) things around kind of like normaliz (00:04:02) ation or conditioning like you know just (00:04:03) getting the numerical stability so that (00:04:05) kind of the big blob of compute flows in (00:04:08) this laminer way instead of instead of (00:04:10) running into problems. So that was the (00:04:12) hypothesis and it's a hypothesis I still (00:04:15) hold. I I don't think I've seen very (00:04:17) much that is not in line with that (00:04:20) hypothesis. And so the pre-trained (00:04:22) scaling laws were one example of what of (00:04:24) of of kind of what we see there. And (00:04:27) indeed those have continued going like (00:04:30) you know uh you know I think I think now (00:04:31) it's been it's been widely reported like (00:04:34) you know we feel good about pre-training (00:04:36) like pre-training is continuing to give (00:04:38) us gains. What has changed is that now (00:04:42) we're also seeing the same thing for RL (00:04:45) right so we're seeing a pre-training (00:04:46) phase and then we're seeing like an RL (00:04:48) phase on top of that. Um and with RL (00:04:53) it's it's actually just the same like (00:04:54) you know even even other companies have (00:04:56) have published um uh um like um you know (00:05:01) in some of their in some of their (00:05:02) releases have published things that say (00:05:04) look you know we train the model on math (00:05:06) contests you know aime or or the kind of (00:05:09) other things and you know how well how (00:05:12) well the model does is log linear and (00:05:15) how long we've trained it and we see (00:05:17) that as well and it's not just math (00:05:19) contest it's a wide variety of RL tasks. (00:05:21) And so we're seeing the same scaling in (00:05:24) RL that we saw for pre-training. Um you (00:05:27) mentioned Richard Sutton and the bitter (00:05:29) lesson. Yeah, (00:05:30) >> I interviewed him last year and he is (00:05:33) actually very non LLM pill. And if I'm (00:05:37) if I I don't know if this is his (00:05:38) perspective, but one way to paraphrase (00:05:40) this objection is something like look (00:05:43) something which possesses the true core (00:05:44) of human learning would not require all (00:05:47) these billions of dollars of data and (00:05:50) compute and these bespoke environments (00:05:51) to learn how to use Excel or how does an (00:05:55) you know how to how to use PowerPoint, (00:05:56) how to navigate a web browser. And the (00:05:58) fact that we have to build in these (00:06:00) skills using these RL environments hints (00:06:03) that we're actually lacking this core (00:06:07) human learning algorithm. Uh and so (00:06:11) we're scaling the wrong thing. And so (00:06:12) yeah, that does raise a question. Why (00:06:13) are we doing all this RL scaling if we (00:06:15) do think there's something that's going (00:06:16) to be humanlike in its ability to learn (00:06:18) on the fly? (00:06:19) >> Yeah. Yeah. So I think I think this kind (00:06:21) of puts together several things that (00:06:24) should be kind of thought of thought of (00:06:25) differently. (00:06:26) >> Yeah. I think there is a genuine puzzle (00:06:28) here, but it it may not matter. Um, in (00:06:32) fact, I would guess it probably it (00:06:33) probably doesn't matter. So, let's take (00:06:36) the RL out of it for a second because I (00:06:37) actually think RL and it's a red herring (00:06:40) to say that RL is any different from (00:06:41) pre-training in this matter. Um, so if (00:06:44) we if we look at pre-training scaling, (00:06:46) um, it it was very interesting back in, (00:06:49) you know, 2017 when Alec Radford was (00:06:52) doing GPT1. If you look at the models (00:06:55) before GPT1, they were trained on these (00:06:59) data sets that didn't represent a wide, (00:07:02) you know, distribution of text, right? (00:07:04) You had like, you know, these very (00:07:06) standard, you know, kind of language (00:07:08) modeling benchmarks and GBT1 itself was (00:07:11) trained on a bunch of, I think it was (00:07:12) fanfiction actually. Um, but you know, (00:07:14) it was it was like literary, you know, (00:07:16) it's like literary text, which is a very (00:07:18) small fraction of the text that you get. (00:07:20) And what we found with that, you know, (00:07:22) and in those days it was like a billion (00:07:23) words or something. So small data sets (00:07:26) and represented a pretty narrow (00:07:29) distribution, right? Like a narrow (00:07:30) distribution of kind of what what you (00:07:32) can see what you can see in the world. (00:07:34) And it didn't generalize well. If you (00:07:36) did better on um you know the the you (00:07:40) know I forget what but some some kind of (00:07:42) fanfiction corpus um it wouldn't (00:07:45) generalize that well to kind of the (00:07:46) other hat. you know, we had all these (00:07:48) measures of like, you know, how well (00:07:50) does how well does a model do at (00:07:52) predicting all of these other kinds of (00:07:53) texts. You really didn't see the (00:07:55) generalization. It was only when you (00:07:57) trained over all the tasks on the you, (00:07:59) you know, the internet when you when you (00:08:00) kind of did a general internet scrape, (00:08:02) right, from something like, you know, (00:08:04) common crawl or scraping links on (00:08:06) Reddit, which is what we did for GPT2. (00:08:08) It's only when you do that that you kind (00:08:10) of started to get generalization. Um, (00:08:13) and I think we're seeing the same thing (00:08:15) on RL that we're starting with first (00:08:18) very simple RL tasks like training on (00:08:20) math competitions. Then we're kind of (00:08:22) moving to, you know, kind of broader (00:08:25) broader training that involves things (00:08:26) like code as a task. And now we're (00:08:29) moving to do kind of many many other (00:08:31) tasks. And then I think we're going to (00:08:33) increasingly get generalization. So that (00:08:36) that kind of takes out the RL versus the (00:08:38) pre-training side of it. But I think (00:08:40) there is a puzzle here either way which (00:08:42) is that on pre-training when we train (00:08:45) the model on pre-training you know we we (00:08:48) use like trillions of tokens right and (00:08:50) and humans don't see trillions of words (00:08:53) so there is an actual sample efficiency (00:08:55) difference here there is actually (00:08:57) something different that's that's (00:08:59) happening here which is that the models (00:09:00) start from scratch and you know they (00:09:03) have to get much more much more training (00:09:06) but we also see that once they're (00:09:08) trained if we give them a long context (00:09:10) length. The only thing blocking a long (00:09:12) context length is like inference. But if (00:09:14) we give them like a context length of a (00:09:16) million, they're very good at learning (00:09:18) and adapting within that context length. (00:09:20) And and so I don't know the full answer (00:09:23) to this, but but I think there's (00:09:25) something going on that pre-training (00:09:27) it's it's not like the process of humans (00:09:30) learning. It's somewhere between the (00:09:32) process of humans learning and the (00:09:33) process of human evolution. It's like (00:09:35) it's somewhere between like we get many (00:09:37) of our priors from evolution. Our brain (00:09:40) isn't just a blank slate, right? Whole (00:09:41) books have been written about. I think (00:09:43) the language models, they're much more (00:09:45) blank slates. They literally start as (00:09:46) like random weights. Whereas the human (00:09:48) brain starts with all these regions. (00:09:50) It's connected to all these inputs and (00:09:52) outputs. Um and and so maybe we should (00:09:55) think of pre-training and for that (00:09:57) matter RL as well as as being something (00:10:00) that exists in the middle space between (00:10:03) human evolution and you know kind of (00:10:06) human on on the spot learning and as the (00:10:09) in context learning that the models do (00:10:13) as as something between long-term human (00:10:15) learning and short-term human learning. (00:10:17) So, you know, there there's this (00:10:19) hierarchy of like there's evolution, (00:10:21) there's long-term learning, there's (00:10:22) short-term learning, and there's just (00:10:24) human reaction. And the LOM phases exist (00:10:28) along this spectrum, but not necessarily (00:10:30) exactly at the same points that there's (00:10:33) no analog to some of the human modes of (00:10:35) learning. The LOMs are kind of falling (00:10:37) between the points. Does that make (00:10:39) sense? (00:10:40) >> Um, yes. Although some things are still (00:10:42) a bit confusing. For example, if the (00:10:44) analogy is that this is like evolution, (00:10:45) so it's fine that it's not that sample (00:10:46) efficient, then like well, if we're (00:10:49) going to get the kind of super sample (00:10:51) efficient agent from in context (00:10:53) learning, why are we bothering to build (00:10:55) in, you know, there's RL environment (00:10:57) companies which are it seems like what (00:10:58) they're doing is they're teaching it how (00:11:00) to use this API, how to use Slack, how (00:11:01) to use whatever. It's confusing to me (00:11:03) why there's so much emphasis on that if (00:11:06) the kind of agent that can just learn on (00:11:07) the fly is emerging or is going to soon (00:11:09) emerge or has already emerged. (00:11:11) >> Yeah. Yeah. So I I I mean I can't speak (00:11:12) for the emphasis of anyone else. I can I (00:11:14) can only talk about how we how we think (00:11:16) about it. I think the way we think about (00:11:19) it is the goal is not to teach the model (00:11:23) every possible skill within RL just as (00:11:26) we don't do that within pre-training. (00:11:27) Right? Within pre-training we're not (00:11:29) trying to expose the model to you know (00:11:32) every every possible uh you know way (00:11:34) that words could be put together. Right? (00:11:36) you know, we're it's it's rather that (00:11:38) the model trains on a lot of things and (00:11:40) then and then it reaches generalization (00:11:42) across pre-training, right? That was (00:11:44) that was the transition from GPT1 to (00:11:46) GPT2 that I saw up close which is like (00:11:49) you know the the model reaches a point (00:11:52) you know I I I I like had these moments (00:11:54) where I was like oh yeah you just give (00:11:56) the model like you just give the model a (00:11:58) list of numbers that's like you know um (00:12:01) you know this is the cost of the house (00:12:02) this is the square feet of the house and (00:12:04) the model completes the pattern and does (00:12:06) linear regression like not great but it (00:12:08) does it but it's never seen that exact (00:12:10) thing before and and so to you know to (00:12:14) to the extent that we are building these (00:12:16) RL environments the the goal is is very (00:12:20) similar to what is be you know to what (00:12:22) was done five or 10 years ago with (00:12:24) pre-training with we're trying to get a (00:12:27) we're trying to get a whole bunch of (00:12:28) data not because we want to cover a (00:12:31) specific document or a specific skill (00:12:33) but because we want to generalize. I (00:12:35) mean I I think the framework you're (00:12:39) laying down obviously makes sense like (00:12:40) we're making progress towards AGI. I (00:12:42) think the crux is something like nobody (00:12:44) at this point disagrees that we're going (00:12:46) to achieve AGI in this century. And the (00:12:48) crux is you say we're hitting the end of (00:12:50) the exponential um and somebody else (00:12:53) looks at this and says, "Oh yeah, we (00:12:55) we're making progress. We've been making (00:12:56) progress since 2012 and then 2035 we'll (00:12:59) have a humanlike agent." And so I want (00:13:01) to understand what it is that you're (00:13:02) seeing which makes you think um yeah (00:13:05) obviously we're seeing the kinds of (00:13:06) things that evolution did or that human (00:13:08) within the human lifetime learning is (00:13:10) like in these models and why think that (00:13:12) it's one year away and not 10 years (00:13:14) away. (00:13:14) >> I I I actually think of it as like two (00:13:18) there's kind of two cases to be made (00:13:20) here or like two two claims you could (00:13:22) make. One of which is like stronger and (00:13:24) the other of which is weaker. So, I (00:13:26) think starting starting with the weaker (00:13:28) claim, you know, when when I first saw (00:13:31) the scaling back in like, you know, (00:13:33) 2019, (00:13:35) um, you know, I wasn't sure. You know, (00:13:36) this was the whole this was kind of a (00:13:38) 50/50 thing, right? I thought I saw (00:13:40) something that was, you know, and and my (00:13:43) claim was this is much more likely than (00:13:45) anyone thinks it is. Like, this is wild. (00:13:47) No one else would even consider this. (00:13:48) Maybe there's a 50% chance this happens. (00:13:51) um on the basic hypothesis of you know (00:13:54) as you put it within 10 years we'll get (00:13:57) to you know you know what I call kind of (00:14:00) country of geniuses in a data center I'm (00:14:02) at like 90% on that um and it's hard to (00:14:05) go much higher than 90% cuz the world is (00:14:08) so unpredictable um maybe the (00:14:10) irreducible uncertainty would be if we (00:14:12) were at 95% where you get to things like (00:14:15) I don't know may maybe multi you know (00:14:17) multiple companies have you know kind of (00:14:20) internal turmoil and nothing happens and (00:14:22) then Taiwan gets invaded and like all (00:14:24) the all the fabs get blown up by (00:14:26) missiles and and you know and then now (00:14:28) you scenario you know just you could (00:14:31) construct a scenario where there's like (00:14:32) a 5% chance that it it or you know you (00:14:35) can construct a 5% world where like (00:14:38) things things get delayed for for for (00:14:40) for for 10 years that's maybe 5%. (00:14:44) There's another 5% which is that I'm (00:14:47) very confident on tasks that can be (00:14:49) verified. So I think I think with coding (00:14:51) I'm just except for that irreducible (00:14:53) uncertainty there's just there's I mean (00:14:54) I think we'll be there in one or two (00:14:56) years there's no way we will not be (00:14:57) there there in 10 years in terms of (00:14:59) being able to do it end to end coding. (00:15:01) My one little bit the one little bit of (00:15:04) of fundamental uncertainty even on long (00:15:07) time scales is this thing about tasks (00:15:09) that aren't verifiable like planning a (00:15:11) mission to Mars like uh you know doing (00:15:15) some fundamental scientific discovery (00:15:17) like like crisper like you know writing (00:15:19) a writing a novel hard to hard to verify (00:15:22) those tasks. I am almost certain that (00:15:27) we have a reliable path to get there. (00:15:29) But like if there was a little bit (00:15:32) uncertainty, it's there. So, so, so, so, (00:15:34) so on the 10 years, I'm like, you know, (00:15:37) 90% which is about as certain as you can (00:15:39) be. Like I think it's I think it's crazy (00:15:43) to say that this won't happen by by by (00:15:45) 2035. Like in some sane world, it would (00:15:48) be outside the mainstream. But but the (00:15:49) emphasis on verification hints to me as (00:15:55) a lack of a lack of uh belief that these (00:15:58) models are generalized. If you think (00:15:59) about humans, (00:16:00) >> we are good at things that both which we (00:16:02) get verifiable reward and things which (00:16:04) we don't. You're like you have no no (00:16:07) this is this is why I'm almost sure we (00:16:08) already see substantial generalization (00:16:10) from things that that verify to things (00:16:12) that don't ver. We're already seeing (00:16:14) that. But but it seems like you were (00:16:15) emphasizing this as a spectrum which (00:16:17) will (00:16:19) uh split apart which domains we see more (00:16:21) progress and I'm like but that's it (00:16:22) doesn't seem like how humans get there. (00:16:23) >> The world in which we don't make it or (00:16:25) or or the world in which we don't get (00:16:26) there is the world in which we do we do (00:16:29) all the things that are that are (00:16:30) verifiable and then they like you know (00:16:34) many of them generalize but but we kind (00:16:36) of don't get fully there. We don't we (00:16:37) don't we don't fully you know we don't (00:16:39) fully color in this side of the box. (00:16:41) It's it's it's not a it's not a binary (00:16:43) thing. But but it also seems to me even (00:16:45) if even if in the world where (00:16:46) generalization is weak when you only say (00:16:48) baref domains it's not clear to me in (00:16:49) such a world you could automate software (00:16:52) engineering because software like in (00:16:54) some sense you are quote unquote a (00:16:56) software engineer but part of being a (00:16:58) software engineer for you involves (00:16:58) writing these like long memos about your (00:17:00) grand vision about different things and (00:17:01) so I don't think that's part of the job (00:17:03) of sui that's part that's part of the (00:17:04) job of the company but I do think (00:17:06) involves like design documents and other (00:17:08) things like that um which by the way the (00:17:11) models are not bad they're already (00:17:12) pretty good at writing comments and (00:17:13) start and so with with again I'm making (00:17:16) like much weaker claims here than I (00:17:18) believe to like you know to to to to (00:17:20) kind of set up a you know to to (00:17:23) distinguish between two things like (00:17:24) we're we're already almost there for (00:17:26) software engineering and we are already (00:17:27) almost there by by what metric there's (00:17:29) one metric which is like how many lines (00:17:30) of code are written by AI and if you use (00:17:33) if you consider other productivity (00:17:34) improvements in the course of the (00:17:35) history of software engineering (00:17:37) compilers write all the lines of (00:17:38) software and but we there's a difference (00:17:40) between how many lines are written and (00:17:41) how big the productivity improvement Oh (00:17:43) yeah. And um and then like we're almost (00:17:46) there, meaning like the how big is the (00:17:48) productivity improvement, not just how (00:17:49) many lines are written. (00:17:50) >> Yeah. Yeah. So so I actually um I (00:17:52) actually I actually agree with you on (00:17:53) this. So I I've made this series of (00:17:55) predictions on um code and software (00:17:58) engineering and and and I think people (00:18:00) have repeatedly kind of misunderstood (00:18:02) them. So So let me let me let me let me (00:18:04) let me lay out the spectrum, right? Like (00:18:06) I think it was like you know like you (00:18:08) know eight eight or nine months ago or (00:18:09) something I said you know the AI model (00:18:11) will be writing 90 90% of the lines of (00:18:14) code in like you know 3 to 6 months (00:18:17) which which happened at least at some (00:18:19) places right happened happened at (00:18:20) entropic happened with many people (00:18:23) downstream using our models but but (00:18:25) that's actually a very weak criterion (00:18:27) right people thought I was saying like (00:18:29) we won't need 90% of the software (00:18:31) engineers those things are worlds apart (00:18:34) right like I would put the spectrum term (00:18:35) as 90% of code is written by the model, (00:18:39) 100% of code is written by the model and (00:18:41) that's a big difference in productivity. (00:18:43) Um 90% of the end toend SWE tasks, (00:18:47) right? Including things like compiling, (00:18:50) including things like setting up (00:18:51) clusters and environments, testing (00:18:54) features, writing memos, 90% of the SU (00:18:56) tasks are written by the models. 100% of (00:18:59) today's suite tasks are are are written (00:19:01) by the models. And and even when when (00:19:03) when that happen doesn't mean software (00:19:04) engineers are out of a job like there's (00:19:06) like new higher level things they can do (00:19:08) where they can they can manage and then (00:19:10) there's a further down the spectrum like (00:19:12) you know there's 90% less demand for (00:19:15) SWES which I think will happen but like (00:19:17) this is this this is a spectrum and you (00:19:19) know I wrote about it in in the (00:19:21) adolescence of technology where I went (00:19:23) through this kind of spectrum with (00:19:24) farming um uh uh and so I I actually (00:19:27) totally agree with you on that. It's (00:19:29) just these are very different benchmarks (00:19:32) from each other but we're proceeding (00:19:33) through them super fast. It seems like (00:19:35) in part of your vision it's like going (00:19:36) from 90 to 100 um first it's going to (00:19:39) happen fast and two that somehow that (00:19:43) leads to huge productivity improvements. (00:19:45) Um whereas when I noticed even in green (00:19:47) field projects that people start with (00:19:48) cloud code or something people report (00:19:51) starting a lot of projects and I'm like (00:19:52) do we see in the world out there a (00:19:55) renaissance of software all these new (00:19:57) features that wouldn't exist otherwise (00:19:58) and at least so far it doesn't seem like (00:20:00) we see that and so that does make me (00:20:02) wonder even if even if like I never had (00:20:04) to intervene on cloud code um there is (00:20:06) this thing of like there's just the (00:20:08) world is complicated jobs are (00:20:10) complicated and (00:20:12) closing the loop on self-contained (00:20:14) systems whether just writing software or (00:20:15) something how much sort of how much (00:20:17) broader gains we would see just from (00:20:19) that. And so maybe that makes us this (00:20:21) should dilute our estimation of the (00:20:24) country of geniuses. (00:20:25) >> I well well I actually I I like I like (00:20:27) simultaneously (00:20:29) I simultaneously agree with you agree (00:20:32) that it's a reason why these things (00:20:34) don't happen instantly but at the same (00:20:36) time I think the the the effect is going (00:20:39) to be very fast. So like I don't know (00:20:41) you could have these two poles right one (00:20:43) is like um you know AI is like you know (00:20:46) it's not going to make progress it's (00:20:48) slow like it's going to take you know (00:20:50) kind of forever to diffuse within the (00:20:52) economy right economic diffusion has (00:20:54) become one of these buzzwords that's (00:20:55) like a a reason why we're not going to (00:20:57) make AI progress or why AI progress (00:20:59) doesn't matter and and you know the (00:21:01) other axis is like we'll get recursive (00:21:03) self-improvement you know the whole (00:21:04) thing you know can't you just draw an (00:21:06) exponential line on the on the curve you (00:21:08) know it's it's we're going to have you (00:21:09) know Dyson spheres around the sun in (00:21:11) like you know (00:21:13) you know so many nanconds after you know (00:21:16) after after we get recursive I mean I'm (00:21:19) completely caricaturing the view here (00:21:20) but like you know there there there are (00:21:22) these two extremes but what we've seen (00:21:25) from from the beginning you know at (00:21:27) least if you look within anthropic (00:21:29) there's this bizarre 10x per year growth (00:21:33) in revenue that we've seen right so you (00:21:35) know in 2023 it was like 0 to 100 (00:21:38) million 2024 it was 100 million to a (00:21:41) billion. 2025 it was a billion to like 9 (00:21:44) or 10 billion. And then (00:21:46) >> you guys should have just bought like a (00:21:47) billion dollars with your own product so (00:21:48) you could just like have a clean 10V and (00:21:51) and the first month of this year like (00:21:53) that that exponential is you would think (00:21:55) it would slow down but it would like you (00:21:56) know we added another few billion to (00:21:59) like you know to to to we added another (00:22:01) few billion to revenue in January and (00:22:04) and so you know obviously that curve (00:22:07) can't go on forever right you know the (00:22:09) GDP is only so large I don't you know I (00:22:11) I would even guess that it bends that it (00:22:13) bends bends somewhat this here. But like (00:22:16) that is like a fast curve, right? That's (00:22:19) like a that's like a really fast curve. (00:22:21) And I would bet it stays pretty fast (00:22:23) even as the scale goes to the entire (00:22:25) economy. So like I I think we should be (00:22:28) thinking about this middle world where (00:22:30) things are like extremely fast but not (00:22:34) instant where they take time because of (00:22:36) economic diffusion because of the need (00:22:39) to close the loop because you know it's (00:22:41) like this fiddly oh man I have to do (00:22:43) change management within my enterprise (00:22:45) you know I have to like you know uh uh (00:22:48) you know I I I I like I set this up but (00:22:50) but you know I have to change the (00:22:52) security permissions on this in order to (00:22:54) make it actually work or you know, I had (00:22:56) this like old piece of software that, (00:22:58) you know, that like, you know, checks (00:23:00) the model before it's compiled and and (00:23:02) and like released and I have to rewrite (00:23:04) it. And yes, the model can do that, but (00:23:05) I have to tell the model to do that and (00:23:07) it has to it has to take time to do (00:23:09) that. and and and so I think everything (00:23:12) we've seen so far is is compatible with (00:23:15) the idea that there's one fast (00:23:18) exponential that's the the capability of (00:23:20) the model and then there's another fast (00:23:22) exponential that's downstream of that (00:23:23) which is the diffusion of the model into (00:23:25) the economy not instant (00:23:28) not slow much faster than any previous (00:23:32) technology but it has its limits and and (00:23:34) and and this is what we you know when I (00:23:37) when I look inside anthropic when I look (00:23:39) at our customers fast adoption but not (00:23:42) infinitely fast. (00:23:43) >> Um can I try a hot take on you? (00:23:45) >> Yeah. (00:23:46) >> I feel like diffusion is cope that (00:23:47) people use to say when it's like if the (00:23:50) model isn't able to do something they're (00:23:52) like oh but the diffus it's like a (00:23:53) diffusion issue. But then you should use (00:23:55) the comparison to humans. You would (00:23:57) think that the inherent advantages that (00:23:59) AIs have would make diffusion a much (00:24:01) easier problem for new AIs getting (00:24:04) onboarded than new humans getting (00:24:05) onboarded. So an AI can read your entire (00:24:07) Slack and your drive in minutes. They (00:24:09) can share all the knowledge that the (00:24:10) other copy other copies of the same (00:24:12) instance have. You don't have this (00:24:13) adverse selection problem when you're (00:24:14) hiring AI because you can just hire (00:24:15) copies of a vetted AI model. Um hiring a (00:24:18) human is like so much more hassle. And (00:24:20) people hire humans all the time, right? (00:24:22) We pay humans upwards of $50 trillion in (00:24:24) wages because they're useful. Uh even (00:24:26) though it's like in principle it would (00:24:29) be much easier to integrate AI into the (00:24:31) economy than it is to hire humans. I (00:24:32) think like the diffusion I feel like (00:24:34) doesn't really I I think diffusion is (00:24:36) very real and and and and and doesn't (00:24:40) have to you know doesn't exclusively (00:24:42) have to do with limitation limitation (00:24:44) limitations on the AI models like again (00:24:47) there are people who use diffusion to to (00:24:50) you know as kind of a buzzword to say (00:24:51) this isn't a big deal. I'm not talking (00:24:53) about that. I'm not talking about, you (00:24:55) know, AI will diffuse at the speed that (00:24:58) previous I think AI will diffuse much (00:25:00) faster than previous technologies have, (00:25:02) but but not infinitely fast. So, I'll (00:25:04) I'll just give an example of this, (00:25:05) right? Like there's like claude code. (00:25:07) Like claude code is extremely easy to (00:25:09) set up. Um, you know, if you're a (00:25:12) developer, you can kind of just start (00:25:13) using cla code. There is no reason why a (00:25:16) developer at a large enterprise should (00:25:18) not be adopting claude code as quickly (00:25:21) as you know individual developer (00:25:24) developer at a startup and we do (00:25:26) everything we can to promote it right we (00:25:28) sell uh we sell cla code to enterprises (00:25:31) and big enterprises like you know big (00:25:34) big financial companies big (00:25:35) pharmaceutical companies all of them (00:25:37) they're adopting claude code much faster (00:25:41) than enterprises typically adopt new (00:25:44) technology, right? But but again, it (00:25:47) like it it it it takes time like any (00:25:50) given feature or any given product like (00:25:52) claude code or like co-work will get (00:25:55) adopted by the you know the individual (00:25:58) developers who are on Twitter all the (00:26:00) time by the like series A startups (00:26:03) many months faster than than you know (00:26:05) than they will get adopted by like you (00:26:08) know a like large enterprise that does (00:26:10) food sales. Um there are a number of (00:26:13) factors like you have to go through (00:26:14) legal, you have to provision it for (00:26:16) everyone. It has to you know like it has (00:26:18) to pass security and compliance. The (00:26:21) leaders of the company who are further (00:26:23) away from the AI revolution you know are (00:26:26) are forwardlooking but they have to say (00:26:28) oh it makes sense for us to spend 50 (00:26:31) million. This is what this claud code (00:26:32) thing is. This is why it helps our (00:26:35) company. This is why it makes us more (00:26:36) productive. And then they have to (00:26:37) explain to the people two levels below. (00:26:39) and they have to say, "Okay, we have (00:26:41) 3,000 developers. Like, here's how we're (00:26:43) going to roll it out to our developers." (00:26:45) And we have conversations like this (00:26:47) every day. Like, you know, we are doing (00:26:49) everything we can to make Anthropics (00:26:51) revenue grow 20 or 30x a year instead of (00:26:55) 10x a year. Um, you know, and and and (00:26:57) again, you know, many enterprises are (00:26:59) just saying this is so productive like, (00:27:02) you know, we're going to take shortcuts (00:27:03) in our usual procurement process, right? (00:27:05) they're moving much faster than you know (00:27:07) when we tried to sell them just the (00:27:09) ordinary API which many of them use but (00:27:11) quad code is a more compelling product (00:27:14) um but it's not an infinitely compelling (00:27:16) product and I don't think even AGI or (00:27:18) powerful AI or country of geniuses in (00:27:20) the data center will be an infinitely (00:27:22) compelling product it will be a (00:27:24) compelling product enough maybe to get (00:27:26) three or five or 10x a year growth even (00:27:28) when you're in the hundreds of billions (00:27:30) of dollars which is extremely hard to do (00:27:31) and has never been done in history (00:27:33) before but not infinitely fast I I buy (00:27:35) that it would be a slight slowdown and (00:27:37) maybe this is not your claim but (00:27:38) sometimes people talk about this like oh (00:27:40) the capabilities are there but because (00:27:41) of diffusion um otherwise like we're (00:27:44) basically at AGI and then (00:27:46) >> I I I don't believe we're basically at (00:27:48) AGI. (00:27:48) >> I think if you had the country of (00:27:50) geniuses in a data center if your (00:27:51) company didn't (00:27:53) geniuses in a data center we would know (00:27:55) it. We would know it if you had the (00:27:57) country of geniuses in a data center (00:27:59) like everyone in this room would know (00:28:01) it. Everyone in Washington would know (00:28:03) it. like you know people in rural rural (00:28:07) parts might not know it but but but like (00:28:10) we would know it we don't have that now (00:28:12) that that's very clear as Dario was (00:28:14) ending at to get generalization you need (00:28:16) to train across a wide variety of (00:28:18) realistic tasks and environments for (00:28:20) example with a sales agent the hardest (00:28:22) part isn't teaching it to mash buttons (00:28:24) in a specific database in Salesforce (00:28:26) it's training the agent's judgment (00:28:28) across ambiguous situations how do you (00:28:30) sort through a database with thousands (00:28:31) of leads to figure out which ones are (00:28:33) How do you actually reach out? What do (00:28:35) you do when you get ghosted? When an AI (00:28:37) lab wanted to train a sales agent, (00:28:38) Labelbox brought in dozens of Fortune (00:28:41) 500 sales people to build a bunch of (00:28:43) different RL environments. They created (00:28:45) thousands of scenarios where the sales (00:28:46) agent had to engage with the potential (00:28:48) customer, which was roleplayed by a (00:28:50) second AI. Limblebox made sure that this (00:28:52) customer AI had a few different (00:28:54) personas. Because when you cold call, (00:28:55) you have no idea who's going to be on (00:28:57) the other end. You need to be able to (00:28:58) deal with a whole range of (00:29:00) possibilities. Limblebox's sales experts (00:29:02) monitored these conversations turn by (00:29:04) turn, tweaking the role playinging agent (00:29:05) to ensure it did the kinds of things an (00:29:07) actual customer would do. Label Box (00:29:09) could iterate faster than anybody else (00:29:11) in the industry. This is super important (00:29:12) because RL is an empirical science. It's (00:29:14) not a solve problem. Labelbox has a (00:29:16) bunch of tools for monitoring agent (00:29:18) performance in real time. This lets (00:29:20) their experts keep coming up with tasks (00:29:22) so that the model stays in the right (00:29:23) distribution of difficulty and gets the (00:29:25) optimal reward signal during training. (00:29:27) Label box can do this sort of thing in (00:29:29) almost every domain. They've got (00:29:30) headphone managers, radiologists, even (00:29:32) airline pilots. So, whatever you're (00:29:34) working on, Labelbox can help. Learn (00:29:37) more at labelbox.com/vorcash. (00:29:42) Coming back to concrete predictions (00:29:44) because I think because there's so many (00:29:46) different things to dis ambiguate, it (00:29:48) can be easy to talk past each other when (00:29:49) we're talking about capabilities. So, (00:29:51) for example, when I interviewed 3 years (00:29:52) ago, I asked her a prediction about what (00:29:55) should we expect 3 years from now. I (00:29:56) think you were right. So you said we (00:29:59) should expect systems which if you talk (00:30:01) to them for the course of an hour it's (00:30:03) hard to tell them apart from a generally (00:30:05) well educated human. Yes. (00:30:06) >> I think you were right about that and I (00:30:07) think spiritually I feel unsatisfied (00:30:10) because my internal expectation was was (00:30:12) that such a system could automate large (00:30:14) parts of white collar work and so it (00:30:16) might be more productive to talk about (00:30:18) the actual end capabilities. You want (00:30:20) such a system. (00:30:20) >> So so I will I will I will basically (00:30:22) tell you what what you know where where (00:30:25) I think we are. So but let me let me ask (00:30:27) it in a very specific question so that (00:30:28) we can figure out exactly what kinds of (00:30:30) capabilities we should expect soon. So (00:30:32) maybe I'll ask about it in the context (00:30:34) of a job I understand well not because (00:30:36) it's the most relevant job but um just (00:30:38) because I can evaluate the claims about (00:30:39) it. Um take video editors right I have (00:30:42) video editors and part of their invol (00:30:45) job involves learning about our (00:30:47) audience's preferences learning about my (00:30:48) preferences and tastes and the different (00:30:50) trade-offs we have and how just over the (00:30:52) course of many months building up this (00:30:53) understanding of context. And so the (00:30:56) skill and ability they have six months (00:30:58) into the job, a model that can pick up (00:30:59) that skill on the job on the fly. When (00:31:02) should we be expect such an AI system? (00:31:04) Yeah. So I guess what you're talking (00:31:05) about is like you know we've we're we're (00:31:07) doing this interview for 3 hours and (00:31:09) then like you know someone's going to (00:31:11) come in, someone's going to edit it, (00:31:12) they're going to be like oh you know you (00:31:14) know I don't know Dario like you know (00:31:16) scratched his head and you know we could (00:31:18) we could edit that out and you magnify (00:31:20) that. there was this like long there was (00:31:22) this like long discussion that like is (00:31:24) less interesting to people and then then (00:31:25) you know then there's other thing that's (00:31:27) like more interesting to people so you (00:31:28) know let's let's let's kind of make this (00:31:30) this edit so you know I think the (00:31:33) country of geniuses in a data center w (00:31:35) will be able to do that the the way it (00:31:36) will be able to do that is you know it (00:31:38) will have general control of a computer (00:31:40) screen right like you know and and and (00:31:42) you'll be able to feed this in and it'll (00:31:44) be able to also use the computer screen (00:31:45) to like go on the web look at all your (00:31:48) previous look at all your previous (00:31:49) interviews like look at what people are (00:31:51) saying on Twitter in response to your (00:31:53) interviews like talk to you ask you (00:31:55) questions talk to your staff look at the (00:31:58) history of kind of edits edits that you (00:32:00) did and from that like do the job um so (00:32:03) I think that's dependent on several (00:32:04) things one that's dependent and and and (00:32:06) and I think this is one of the things (00:32:08) that's actually blocking deployment um (00:32:10) getting to the point on computer use (00:32:12) where the models are really masters at (00:32:14) using the computer right and you know (00:32:16) we've seen this climb in in benchmarks (00:32:18) and benchmarks are always you know (00:32:20) imperfect measures but like you know OS (00:32:22) world is you know went from you know (00:32:24) like 5% a you know like uh I think when (00:32:27) we first re released you know uh uh (00:32:30) computer use like a a year and a quarter (00:32:32) ago it was like maybe 15% I don't (00:32:34) remember exactly but we've climbed from (00:32:36) that to like 65 or 70%. Um and and you (00:32:40) know there may be harder measures as (00:32:42) well but but I think computer use has to (00:32:44) pass a point of reliability. Can I just (00:32:47) ask to follow up on that before you move (00:32:48) to the next point? Um I often for years (00:32:50) I've been trying to build different (00:32:52) internal LLM tools for myself and I (00:32:54) often I have these text in text out (00:32:58) tasks which should be dead center in the (00:33:00) repertoire of these models and yet I (00:33:02) still hire humans to do them just (00:33:03) because it's if it's something like make (00:33:06) identify what the best clips would be in (00:33:07) this transcript and maybe they'll do (00:33:08) like a seven out of 10 job at them but (00:33:10) there's not this ongoing way I can (00:33:13) engage with them to help them get better (00:33:14) at the job the way I could with a human (00:33:15) employee and so that missing ability (00:33:18) even if you saw computer use would still (00:33:20) block my ability to like offload an (00:33:23) actual job to them. (00:33:24) >> Again, there's there's this gets back to (00:33:26) what to to kind to kind of what we were (00:33:28) talking about before with learning on (00:33:29) the job where it's it's very (00:33:31) interesting. You know, I think I think (00:33:32) with the coding agents like I don't (00:33:35) think people would say that learning on (00:33:36) the job is what is what is you know (00:33:38) preventing the coding agents from like (00:33:41) you know doing everything end to end (00:33:43) like they keep they keep getting better. (00:33:45) We have engineers at Enthropic who like (00:33:48) don't write any code. And when I look at (00:33:50) the productivity to your to your (00:33:52) previous question, you know, we have (00:33:53) folks who say this this GPU kernel, this (00:33:57) chip, I used to write it myself. I just (00:33:58) have Claude do it. And so there's this (00:34:00) there's this enormous improvement in (00:34:02) productivity. And I don't know like when (00:34:05) I see Claude code like familiarity with (00:34:08) the code base or like it you know or or (00:34:12) a feeling that the model hasn't worked (00:34:14) at the company for for a year that's not (00:34:16) high up on the list of complaints I see. (00:34:18) And so I think what I'm saying is we're (00:34:20) we're like we're kind of taking a (00:34:22) different path. Don't don't you think (00:34:23) with coding that's because there is an (00:34:24) external scaffold of memory which exists (00:34:26) instantiated in the codebase which I (00:34:29) don't know how many other jobs have (00:34:31) coding made fast progress precisely (00:34:33) because it has this unique um advantage (00:34:36) that other economic activity doesn't (00:34:38) >> but but when you say that what you're (00:34:40) what you're implying is that by reading (00:34:43) the code base into the context I have (00:34:45) everything that the human needed to (00:34:47) learn on the job. So that would be an (00:34:49) example of whether it's written or not, (00:34:53) whether it's available or not, a case (00:34:56) where everything you needed to know, you (00:34:58) got from the context window, right? And (00:35:00) that and that what we think of as (00:35:01) learning like, oh man, I started this (00:35:03) job, it's going to take me 6 months to (00:35:05) understand the codebase, the model just (00:35:06) did it in the context. (00:35:08) >> Yeah. I honestly don't know how to think (00:35:09) about this because there there are (00:35:11) people who qualitatively report what (00:35:13) you're saying. Um there was a meter (00:35:16) study I'm sure you saw last year where (00:35:18) they (00:35:19) >> had experienced developers try to close (00:35:23) uh pull requests in repositories that (00:35:25) they were familiar with and those (00:35:27) developers reported an uplift. They they (00:35:30) reported that they felt more productive (00:35:31) with the use of these models but in fact (00:35:32) if you look at their output and how much (00:35:34) was actually merged back in there's a (00:35:35) 20% downlift. They were less productive (00:35:37) as a result of using these models. And (00:35:38) so I'm trying to square the qualitative (00:35:40) feeling that people feel with these (00:35:42) models versus um one in a macro level (00:35:45) where are all these where is this like (00:35:46) renaissance of software and two when (00:35:48) people do these independent evaluations (00:35:50) why are we not seeing the creative (00:35:52) benefits that we would expect (00:35:53) >> within anthropic this is just really (00:35:55) unambiguous right we're under an (00:35:57) incredible amount of commercial pressure (00:36:00) and make it even hard harder for (00:36:01) ourselves because we have all this (00:36:03) safety stuff we do that I think we do (00:36:05) more than than than other companies so (00:36:07) like the the the pressure to survive (00:36:10) economically while also keeping our (00:36:13) values is is just incredible, right? (00:36:14) We're trying to keep this 10x revenue (00:36:17) curve going. There's like there is zero (00:36:20) time for [ __ ] There is zero time (00:36:22) for feeling like we're productive when (00:36:25) we're not. Like these tools make us a (00:36:28) lot more productive. Like why why do you (00:36:32) think we're concerned about competitors (00:36:33) using the tools? because we think we're (00:36:35) ahead of the competitors and like we (00:36:37) don't we don't want to accel we we we we (00:36:40) wouldn't be going through all this (00:36:42) trouble if this was secretly reducing (00:36:45) reducing our productivity like we see (00:36:48) the end productivity every few months in (00:36:50) the form of model launches like there's (00:36:53) no kidding yourself about this like the (00:36:55) models make you more productive (00:36:56) >> um one that is people feeling like (00:37:00) they're more productive is qualitatively (00:37:01) predicted by studies like this but two (00:37:03) if I Look at the end output. Obviously, (00:37:05) you guys are making fast progress. But (00:37:07) the fact, you know, the the the idea was (00:37:10) supposed to be with recursive (00:37:11) self-improvement is that you make a (00:37:13) better AI, the AI helps you build a (00:37:14) better next AI, etc., etc. And what I (00:37:16) see instead, if I look at the you open (00:37:19) AI, deep mind, is that people are just (00:37:21) shifting around the podium every few (00:37:22) months. And maybe you think that stops (00:37:24) because you you won or whatever, but um (00:37:26) but why why are we not seeing the person (00:37:28) with the best coding model have this (00:37:31) lasting advantage if in fact there are (00:37:34) these enormous productivity gains from (00:37:35) the last model? (00:37:36) >> So no, no, no. I I I mean I mean I mean (00:37:38) I think it's all like my my model of the (00:37:41) situation is there's there's an (00:37:43) advantage that's gradually growing. Like (00:37:45) I would say right now the coding models (00:37:49) give maybe I don't know a a like 15 (00:37:53) maybe 20% total factor speed up like (00:37:56) that's my view. Um uh and 6 months ago (00:37:59) it was maybe 5%. And so and so it didn't (00:38:02) matter like 5% doesn't register. It's (00:38:04) now just getting to the point where it's (00:38:06) like one of several factors that that (00:38:08) kind of matters and and that's gonna (00:38:10) that's going to keep speeding up. And so (00:38:12) I think 6 months ago like you know there (00:38:16) were several there were several (00:38:17) companies that were at roughly the same (00:38:18) point because uh you know this this (00:38:22) wasn't uh this wasn't a notable factor (00:38:23) but I think it's starting to speed up (00:38:25) more and more. I you know I I would I (00:38:27) would also say there are multiple (00:38:29) companies that you know write models (00:38:30) that are used for code and you know (00:38:32) we're not perfectly good at you know (00:38:34) preventing some of these other companies (00:38:36) from from from using from from from kind (00:38:38) of using our models internally. Um, so, (00:38:42) uh, you know, I think I think everything (00:38:44) we're kind kind of everything we're (00:38:45) seeing is consistent with this kind of, (00:38:48) um, this kind of snowball model where (00:38:50) where, you know, there's no hard. Again, (00:38:53) my my my my theme in all of this is like (00:38:57) all of this is soft takeoff, like soft (00:39:00) smooth exponentials, although the (00:39:01) exponentials are relatively steep. And (00:39:03) so and so we're seeing this snowball (00:39:05) gather momentum where it's like 10% 20% (00:39:08) 25% you know for 40% and as you go yeah (00:39:12) AMD doll's law you have to get all the (00:39:14) like things that are preventing you from (00:39:16) from closing the loop out of the way but (00:39:18) like this is one of the biggest (00:39:19) priorities within anthropic. Um (00:39:23) stepping back I think before in the (00:39:25) stack we were talking about um well when (00:39:28) do we get this on the job learning and (00:39:29) it seems like the coding the point you (00:39:31) were making the coding thing is we (00:39:33) actually don't need on the job learning (00:39:34) uh that you can have tremendous (00:39:36) productivity improvements you can have (00:39:37) potentially trillions of dollars of (00:39:38) revenue for AI companies without this (00:39:41) basic human abil maybe that's not your (00:39:42) claim you should clarify um but without (00:39:44) this basic human ability to learn on the (00:39:47) job but I just look at like in in in (00:39:50) most domain of economic activity. People (00:39:52) say, "I hired somebody. They weren't (00:39:54) that useful for the first few months and (00:39:55) then over time they built up the context (00:39:58) understanding. It's actually hard to (00:39:59) define what we're talking about here, (00:40:00) but they they got something and then now (00:40:02) now they're they're a power horse and (00:40:04) they're so valuable to us." And if AI (00:40:06) doesn't develop this ability to learn on (00:40:08) the fly, I'm not I'm a bit skeptical (00:40:10) that we're going to see huge changes to (00:40:13) the world without (00:40:14) >> So I think I think I think two things (00:40:16) here, right? There's the state of the (00:40:18) technology right now um which is again (00:40:21) we have these two stages. We have the (00:40:22) pre-training and RL stage where you (00:40:25) throw you throw a bunch of data and (00:40:27) tasks into the models and then they (00:40:29) generalize. So it's like learning but (00:40:31) it's like learning from more data and (00:40:33) and not you know not learning over kind (00:40:36) of one human or one model's lifetime. So (00:40:39) again this is situated between evolution (00:40:41) and and and human learning. But once you (00:40:43) learn all those skills, you have them. (00:40:45) And and just like with pre-training, (00:40:47) just how the models know more, you know, (00:40:50) if if I look at a pre-trained model, you (00:40:52) know, it knows more about the history of (00:40:54) samurai in Japan than I do. It knows (00:40:56) more about baseball than I do. It knows, (00:40:59) you know, it knows more about, you know, (00:41:02) lowass filters and electronics and, you (00:41:05) know, all all of these things. Its (00:41:07) knowledge is way broader than mine. So I (00:41:09) think I think even even just that um you (00:41:12) know may get us to the point where the (00:41:14) models are better at you know kind of (00:41:17) better at everything and then we also (00:41:19) have again just with scaling the kind of (00:41:21) existing setup we have the in context (00:41:23) learning which I would describe as kind (00:41:26) of like human on the job learning but (00:41:28) like a little weaker and a little short (00:41:30) term like you look at in context (00:41:32) learning the you give the model a bunch (00:41:34) of examples it does get it there's real (00:41:36) learning that happens in context and (00:41:38) like a million tokens is a lot. That's (00:41:40) that's you know that can be days of (00:41:41) human learning right you know if you (00:41:43) think about the model you know you know (00:41:46) kind of read reading reading a million (00:41:47) words you know you know it takes me how (00:41:50) long would it take me to read a million (00:41:51) I mean you know like days or weeks at (00:41:53) least um uh so you have these two things (00:41:57) and and I think these two these two (00:41:58) things within the existing paradigm may (00:42:00) just be enough to get you the country of (00:42:02) geniuses in the data center I don't know (00:42:04) for sure but I think they're going to (00:42:05) get you a large fraction of it there may (00:42:08) be gaps but I I certainly think just as (00:42:11) things are this I believe is enough to (00:42:13) generate trillions of dollars of (00:42:14) revenue. That's one that's all one. Two (00:42:18) is this idea of continual learning. This (00:42:21) idea of a single model learning on the (00:42:23) job. Um I think we're working on that (00:42:26) too and I think there's a good chance (00:42:28) that in the next year or two we also (00:42:31) make we also solve that. Um I I again I (00:42:35) I I I you know I think you get most of (00:42:37) the way there without it. I think the (00:42:40) trillions of dollars of of you know the (00:42:43) the I think the trillions of dollars a (00:42:44) year market maybe all the national (00:42:46) security implications and the safety (00:42:48) implications that I wrote about in (00:42:49) adolescence of technology can happen (00:42:51) without it. But I I I also think we and (00:42:55) I imagine others are working on it. And (00:42:58) I think there's a good chance that that (00:43:00) you know that we get there within the (00:43:02) next year or two. There are a bunch of (00:43:04) ideas. I won't go into all of them in (00:43:05) detail, but um you know, one is just (00:43:08) make the context longer. There's there's (00:43:10) nothing preventing longer context from (00:43:12) working. You just have to train at (00:43:14) longer context and then learn to to (00:43:16) serve them at inference. And both of (00:43:17) those are engineering problems that we (00:43:19) are working on and that I would assume (00:43:20) others are working on as well. Yeah. So (00:43:22) this context length increase, it seemed (00:43:24) like there was a period from 2020 to (00:43:25) 2023 where from GPD3 to GP4 Turbo, there (00:43:28) was an increase from like 2,000 context (00:43:30) lines to 128K. I feel like for the next (00:43:33) for the twoish years since then, we've (00:43:35) been in the sameish ballpark. Yeah. And (00:43:37) when model context lines get much longer (00:43:39) than that, people report qualitative (00:43:41) degradation in the ability of the model (00:43:44) to consider that full context. Um, so (00:43:47) I'm curious what you're internally (00:43:49) seeing that makes you think like, oh, 10 (00:43:50) million context, 100 million contexts to (00:43:52) get human like six months learning (00:43:53) billion. (00:43:54) >> This isn't a research problem. This is a (00:43:56) this is an engineering and inference (00:43:58) problem, right? If you want to serve (00:44:00) long context, you have to like store (00:44:02) your entire KV cache. You have to, you (00:44:04) know, um, uh, you know, it's it's it's (00:44:07) it's difficult to store all the memory (00:44:09) in the GPUs to juggle the memory around. (00:44:12) I don't even know the detail, you know, (00:44:14) at this point. this is at a level of (00:44:15) detail that that that that I'm no longer (00:44:17) able to follow although you know I I (00:44:18) knew it in the GPD3 era of like you know (00:44:21) these are the weights these are the (00:44:22) activations you have to store um uh but (00:44:25) you know you know these days the whole (00:44:26) thing has flipped because we have models (00:44:28) and and and kind of all of that but um (00:44:32) uh and and this degradation you're (00:44:33) talking about like again without getting (00:44:35) too specific like a question I would ask (00:44:37) is like there's two things there's the (00:44:39) context length you train at and there's (00:44:41) a context length that you serve at if If (00:44:44) you train at a small context length and (00:44:46) then try to serve at a long context (00:44:47) length like maybe you get these (00:44:48) degradations. (00:44:50) >> It's better than nothing. You might (00:44:51) still offer it but you get these (00:44:52) degradations and maybe it's harder to (00:44:54) train at a long context length. Yeah. (00:44:55) So, you know, there's there's a lot I I (00:44:57) I want to at the same time ask about (00:44:59) like maybe some rabbit holes of like (00:45:01) well wouldn't you expect that if you had (00:45:03) to train on longer context length that (00:45:05) would mean that um you're able to get (00:45:08) sort of like less samples in for the (00:45:09) same amount of compute. But before maybe (00:45:11) it's not worth diving deep on that. I I (00:45:13) want to get an answer to the bigger (00:45:15) picture question, which is like, okay, (00:45:17) so (00:45:18) um I don't feel a preference for a human (00:45:22) editor that's been working for me for 6 (00:45:24) months versus an AI that's been working (00:45:25) with me for 6 months. What year do you (00:45:28) predict that that will be the case? (00:45:31) >> I my I mean, you know, my guess for that (00:45:35) is, you know, there's there's a lot of (00:45:36) problems that are basically like we can (00:45:38) do this when we have the country of (00:45:39) geniuses in a data center. Um, and so, (00:45:41) you know, my my my my picture for that (00:45:44) is, you know, again, if you if you if (00:45:46) you if you know, if you made me guess, (00:45:48) it's like one to two years, maybe one to (00:45:49) three years. It's really hard to tell. I (00:45:51) have a I have a strong view 99 95% that (00:45:55) like all this will happen in 10 years. (00:45:57) Like that's I think that's just a super (00:45:59) safe bet. And then I have a hunch this (00:46:02) is more like a 50/50 thing that it's (00:46:04) going to be more like 1 to two, maybe (00:46:06) more like 1 to three. (00:46:07) >> So 1 to three years. country of genius (00:46:09) says um and then the slightly less (00:46:11) economically valuable task of editing (00:46:12) videos (00:46:14) I (00:46:14) >> it seems pretty economically valuable (00:46:16) let me tell you it's just there are a (00:46:18) lot of use cases like that right there (00:46:19) are a lot of similar ones so you're (00:46:21) predicting that within 1 to 3 years um (00:46:24) and in gerally enthropic has predicted (00:46:26) that by late 26 early 27 we will have AI (00:46:29) systems that are quote um have the (00:46:31) ability to navigate interfaces available (00:46:33) to humans doing digital work today (00:46:34) intellectual capabilities matching or (00:46:36) exceeding that of Nobel prize winners (00:46:38) and the ability to interface with the (00:46:40) physical world. And then you gave an (00:46:42) interview two months ago with Dealbook (00:46:43) where you were emphasizing your um your (00:46:47) company's more responsible comput (00:46:49) scaling as compared to your competitors. (00:46:51) And I'm trying to square these two views (00:46:52) where if you really believe that we're (00:46:54) going to have a country of geniuses, you (00:46:57) you want as big a data center as you can (00:46:59) get, there's no reason to slow down. The (00:47:01) TAM of a Nobel Prize winner that is (00:47:03) actually can do everything a Nobel Prize (00:47:05) winner can do is like trillions of (00:47:06) dollars. And so I'm trying to square (00:47:09) this conservatism (00:47:10) uh which seems rational if you have more (00:47:12) moderate timelines with your stated (00:47:15) views about AI progress. (00:47:16) >> Yeah. So so it actually all fits (00:47:18) together and and we go back to this fast (00:47:20) but not infinitely fast diffusion. So (00:47:23) like let's say that we're making (00:47:25) progress at this rate. Um you know the (00:47:27) the the technology is making progress (00:47:29) this fast. Again, I have, you know, very (00:47:32) high conviction that like it's going, (00:47:35) you know, the the, you know, we're going (00:47:37) to get there within within a few years. (00:47:39) I have a hunch that we're going to get (00:47:40) there within a year or two. So, a little (00:47:43) uncertainty on the technical side, but (00:47:45) like, you know, pretty pretty strong (00:47:47) confidence that it won't be off by much. (00:47:49) What I'm less certain about is again the (00:47:52) economic diffusion side. Like I really (00:47:55) do believe that we could have models (00:47:58) that are a country of geniuses country (00:48:00) of geniuses in a data center in one to (00:48:03) two years. One question is how many (00:48:06) years after that do the trillions in you (00:48:09) know do do the do the trillions in (00:48:10) revenue start rolling in? Um I don't (00:48:15) think it's guaranteed that it's going to (00:48:17) be immediate. Um, you know, I think it (00:48:20) could be um one year, it could be two (00:48:24) years, I could even stretch it to five (00:48:27) years, although I'm like I'm skeptical (00:48:29) of that. And so we have this uncertainty (00:48:32) which is even if the technology goes as (00:48:35) fast as I suspect that it will, we we (00:48:38) don't know exactly how fast it's going (00:48:40) to drive revenue. We we know it's (00:48:42) coming, but with the way you buy these (00:48:45) data centers, if you're off by a couple (00:48:47) years, that can be ruinous. It is just (00:48:50) like how I wrote, you know, in machines (00:48:51) of loving grace, I said, look, I think (00:48:54) we might get this powerful AI, this (00:48:55) country of genius in the data center. (00:48:57) That description you gave comes from the (00:48:58) machines of loving grace. I said, we'll (00:49:00) get that 2026, maybe 2027. Again, that (00:49:03) is that is my hunch. Wouldn't be (00:49:04) surprised if I'm off by a year or two, (00:49:06) but like that is my hunch. Let's say (00:49:09) that happens. That's the starting gun. (00:49:10) How long does it take to cure all the (00:49:12) diseases, right? That's that's one of (00:49:14) the ways that like drives a huge amount (00:49:16) of of of of economic value, right? Like (00:49:18) you cure you cure every disease. You (00:49:21) know, there's a question of how much of (00:49:22) that goes to the pharmaceutical company (00:49:23) to the AI company, but there's an (00:49:25) enormous consumer surplus because (00:49:27) everyone, you know, assuming we can get (00:49:29) access for everyone, which I care about (00:49:30) greatly. We, you know, we we cure all of (00:49:32) these diseases. How long does it take? (00:49:34) You have to do the biological discovery. (00:49:36) you have to, you know, go you have to, (00:49:38) you know, manufacture the new drug. You (00:49:41) have to, you know, go through the (00:49:42) regulatory process. I mean, we saw this (00:49:44) with like vaccines and COVID, right? (00:49:45) Like it there's just this we we got the (00:49:48) vaccine out to everyone, but it took a (00:49:50) year and a half, right? And and so my (00:49:52) question is, how long does it take to (00:49:54) get the cure for everything, which AI is (00:49:58) the genius that can in theory invent out (00:50:01) to everyone? How long from when that AI (00:50:02) first exists in the lab to when diseases (00:50:06) have actually been cured for everyone (00:50:08) right in in you know we've had a polio (00:50:11) vaccine for 50 years we're still trying (00:50:13) to eradicate it in the most remote (00:50:15) corners of Africa and you know the (00:50:17) Gateson nation is trying as hard as they (00:50:19) can others are trying as hard as they (00:50:20) can but you know that's difficult again (00:50:23) I you know I don't expect most of the (00:50:25) economic diffusion to be as difficult as (00:50:27) that right that's like the most (00:50:28) difficult case but but there's a There's (00:50:31) a real dilemma here and and where I've (00:50:33) settled on it is it will be it will be a (00:50:36) it will be faster than anything we've (00:50:39) seen in the world but it still has its (00:50:41) limits and and so then when we go to (00:50:44) buying data centers you know you again (00:50:47) again again the curve I'm looking at is (00:50:50) okay we you know we've had a 10x a year (00:50:53) increase every year so beginning of this (00:50:55) year we're looking at 10 billion in in (00:50:58) in annual in you know rate of annual ize (00:51:01) revenue at the beginning of the year. We (00:51:03) have to decide how much compute to buy. (00:51:05) Um (00:51:06) and you know it takes a year or two to (00:51:10) actually build out the data centers to (00:51:12) reserve the data centers. So basically (00:51:13) I'm saying like in uh 2027 how much (00:51:17) compute do I get? Well I could assume um (00:51:21) uh that uh the uh revenue will continue (00:51:26) growing 10x a year. So it'll be uh one (00:51:29) uh one uh 100 billion at the end of 2026 (00:51:32) and 1 trillion at the end of 2027. And (00:51:35) so I could buy a trillion dollar (00:51:38) actually would be like5 trillion dollars (00:51:40) of compute because it would be a (00:51:41) trillion dollar a year for for five (00:51:43) years, right? I could buy a trillion (00:51:45) dollars of compute that starts at the (00:51:47) end of 2027. And if my if my revenue is (00:51:50) not a trillion dollars, if it's even (00:51:53) 800 billion, there's no force on earth, (00:51:56) there's there's no hedge on earth that (00:51:59) could stop me from going bankrupt if I (00:52:01) if I buy that much compute. And and so (00:52:03) even though a part of my brain wonders (00:52:05) if it's going to keep growing 10x, I (00:52:08) can't buy a trillion dollars a year of (00:52:10) compute in in in in in in in (00:52:13) 2027. If I'm just off by a year in that (00:52:17) rate of growth, or if the the growth (00:52:18) rate is 5x a year instead of 10x a year, (00:52:21) then then you know that you go bankrupt. (00:52:24) Um and and and so you end up in a world (00:52:28) where you know you're supporting (00:52:29) hundreds of billions not trillions and (00:52:32) you accept you accept some risk that (00:52:36) there's so much demand that you can't (00:52:37) support the revenue and you accept still (00:52:40) some risk that you know you got it wrong (00:52:42) and it still slow and so when I talked (00:52:44) about behaving responsibly what I meant (00:52:47) actually was not the absolute amount (00:52:49) that that actually was not um you know I (00:52:52) think it is true we're spending somewhat (00:52:54) less than some of the other players. (00:52:56) It's actually the other things like have (00:52:58) we been thoughtful about it or are we (00:53:00) yoloing and saying, "Oh, we're going to (00:53:02) do $100 billion here, hundred billion (00:53:04) dollars there." I kind of get the (00:53:06) impression that, you know, some of the (00:53:08) other companies have not written down (00:53:10) the spreadsheet that they don't really (00:53:11) understand the risk they're taking. (00:53:13) They're just kind of doing stuff because (00:53:14) it sounds cool. Um uh and and we've (00:53:18) thought carefully about it, right? We're (00:53:19) an enterprise business. Therefore, you (00:53:22) know, we can rely more on revenue. It's (00:53:25) less fickle than consumer. We have (00:53:27) better margins, which is the buffer (00:53:28) between buying too much and buying too (00:53:30) little. And so, I think we bought an (00:53:33) amount that allows us to capture pretty (00:53:36) strong upside worlds. It won't capture (00:53:38) the full 10x a year. Um, and things (00:53:41) would have to go pretty badly for us to (00:53:43) be for us to be in financial trouble. (00:53:45) So, I think we've thought carefully and (00:53:46) we've made that balance. And and that's (00:53:48) what I mean when I say that we're being (00:53:50) responsible. Okay. So, it seems like um (00:53:53) it's possible that we're we actually (00:53:54) just have different definitions of the (00:53:55) country of a genius in a data center (00:53:57) because when I think of like actual (00:53:59) human geniuses, an actual country of (00:54:01) human geniuses in a data center, I'm (00:54:03) like (00:54:05) I would happily buy $5 trillion worth of (00:54:07) compute to run actual country of human (00:54:08) genius in a data center. So, let's say (00:54:10) JP Morgan or Madna or whatever doesn't (00:54:12) want to use them. Also, I've got a (00:54:14) country of geniuses. they they'll start (00:54:15) their own company and if like they they (00:54:17) can't start their own company and (00:54:18) they're bottlenecked by clinical trials. (00:54:19) It is worth stating with clinical trials (00:54:21) like most clinical trials fail because (00:54:22) the drug doesn't work. There's not (00:54:23) efficacy, right? (00:54:24) >> And I make exactly that point in in (00:54:27) machines of love and grace. I say the (00:54:28) clinical trials are going to go much (00:54:30) faster than we're used to, but not not (00:54:32) instant not infinitely fast. (00:54:34) >> And then suppose it takes a year to for (00:54:36) the clinical trials to work out so that (00:54:37) you're getting revenue from that and you (00:54:38) can make more drugs. Okay. Hey, well, (00:54:40) you've got a country of geniuses and (00:54:41) you're an AI lab and you have you could (00:54:44) use uh many more AI researchers. Um, you (00:54:48) also think that there's these like (00:54:49) self-reinforcing gains from, you know, (00:54:52) smart people working on AI tech. So, (00:54:54) like, okay, you can have the right you (00:54:55) can have the data center working on like (00:54:57) AI progress. (00:54:58) >> Is there more gains from buying (00:55:00) >> like substantially more gains from (00:55:03) buying a trillion dollars a year of (00:55:05) compute versus $300 billion a year of (00:55:07) compute? If your competitor is buying a (00:55:09) trillion, yes, there is. (00:55:10) >> Well, no, there's some gain, but then (00:55:12) but again, there's this chance that they (00:55:14) go bankrupt before, (00:55:16) you know, be again, if you're off by (00:55:19) only a year, you destroy yourselves. (00:55:22) That's the That's the balance. We're (00:55:24) buying a lot. We're buying a hell of a (00:55:26) lot. Like, we're not we're you know, (00:55:27) we're buying an amount that's comparable (00:55:30) to that that, you know, the the the the (00:55:32) biggest players in the game are buying. (00:55:34) Um but but if you're asking me why why (00:55:37) haven't we signed you know 10 10 (00:55:40) trillion of compute starting in starting (00:55:42) in mid 2027 first of all it can't be (00:55:44) produced there isn't that much in the (00:55:46) world um uh but but second um what if (00:55:50) the country of geniuses comes but it (00:55:52) comes in mid 2028 instead of mid2027 you (00:55:55) go bankrupt. So if your projection is 1 (00:55:58) to 3 years, it seems like you should (00:56:00) want $10 trillion of compute by um 2029 (00:56:04) 2020 maybe 2020 latest (00:56:06) >> like I mean you know you are like it (00:56:09) seems like even in your the longest (00:56:11) version of the timelines you state the (00:56:13) compute you are ramping up to build (00:56:14) doesn't seem (00:56:16) >> in accordance what what makes you think (00:56:17) that (00:56:18) >> well as you said you would want the 10 (00:56:20) trillion the human wages let's say are (00:56:22) um on the order of 50 trillion a year (00:56:24) >> if you look at so so I won't I won't (00:56:26) talk about entropic in particular but if (00:56:28) you talk about the industry like um the (00:56:32) amount of compute the industry had you (00:56:34) know the the the the amount of compute (00:56:36) the industry is building this year is (00:56:38) probably in the you know I don't know (00:56:41) very low tens of you know call it 10 15 (00:56:44) gawatts next year I you know it goes up (00:56:47) by roughly 3x a year so like next year's (00:56:50) 30 or 40 gigawatts and um 2028 might be (00:56:54) 100 202 might like three 300 gigawatts (00:56:58) and like each gigawatt costs like um (00:57:03) maybe 10 I mean I'm doing the math in my (00:57:05) head but each gigawatt costs maybe 10 (00:57:07) billion you know order 10 to 15 billion (00:57:09) a year so you know you kind of you you (00:57:12) know you put that all together and (00:57:14) you're getting about about what you (00:57:15) described you're getting multiple (00:57:16) trillions a year by 2028 or 2029 so (00:57:19) you're you're getting exactly that (00:57:20) you're getting you're getting exactly (00:57:21) what you predict (00:57:23) >> um that's for the industry (00:57:24) >> that that's for the industry That's (00:57:25) right. So suppose anthropics comput (00:57:27) keeps 3xing a year and then by like 27 (00:57:30) you have uh or 27 28 you have 10 gawatt (00:57:34) and like multiply that by as you say um (00:57:38) 10 billion. So then it's like 100 (00:57:40) billion a year but then you were saying (00:57:41) the TAM by 2028. (00:57:43) >> I I don't want to give exact numbers for (00:57:45) anthropic but but these numbers are too (00:57:46) small. These numbers are too small. (00:57:48) >> Okay. Interesting. I'm really proud that (00:57:51) the puzzles I've worked on with Jane (00:57:52) Street have resulted in them hiring a (00:57:53) bunch of people for my audience. Well, (00:57:55) they're still hiring and they just sent (00:57:57) me another puzzle. For this one, they (00:57:59) spent about 20,000 GPU hours training (00:58:01) backd doors into three different (00:58:02) language models. Each one has a hidden (00:58:04) prompt that elicits completely different (00:58:07) behavior. You just have to find the (00:58:08) trigger. This is particularly cool (00:58:10) because finding backd doorors is (00:58:11) actually an open question in Frontier AI (00:58:13) research. Enthropic actually released a (00:58:15) couple of papers about sleep agents and (00:58:17) they showed that you can build a simple (00:58:19) classifier on the residual stream to (00:58:21) detect when a back door is about to (00:58:23) fire, but they already knew what the (00:58:25) triggers were because they built them. (00:58:27) Here you don't. And it's not feasible to (00:58:29) check the activations for all possible (00:58:31) trigger phrases. Unlike the other (00:58:33) puzzles they made for this podcast, Jane (00:58:35) Street isn't even sure this one is (00:58:36) solvable, but they've set aside $50,000 (00:58:38) for the best attempts and write-ups. The (00:58:40) puzzle's live at (00:58:41) janestreet.com/themarcash (00:58:44) and they're accepting submissions until (00:58:46) April 1st. All right, back to Dario. (00:58:49) You've told investors that you plan to (00:58:51) be profitable starting 28 and this is (00:58:54) the year where we're like potentially (00:58:55) getting the country of geniuses at a (00:58:57) data center and you know this is like (00:59:00) going to now unlock all this uh progress (00:59:02) and uh medicine and uh health and etc (00:59:06) etc and new technologies. Wouldn't this (00:59:09) be a particular exactly the time where (00:59:10) you'd like want to reinvest in the (00:59:12) business and build bigger countries so (00:59:14) they can make more discoveries? (00:59:15) >> So I mean profit profitability is this (00:59:17) kind of like weird thing in this field. (00:59:19) I I like like I don't think I I don't (00:59:22) think in this field profitability is (00:59:24) actually a measure of (00:59:29) uh um you know kind of spending down (00:59:32) versus investing in the business like (00:59:34) let's let's just let's just take a model (00:59:36) of this. I actually think profitability (00:59:38) happens when you underestimated the (00:59:40) amount of demand you were going to get (00:59:42) and loss happens when you overestimated (00:59:44) the amount of demand you were going to (00:59:45) get. Um because you're buying the data (00:59:47) centers ahead of time. So think about it (00:59:49) this way. Um ideally you would like and (00:59:52) again these are stylized facts. These (00:59:54) numbers are not exact for I'm just (00:59:55) trying to make a toy model here. Let's (00:59:57) say half of your compute is for training (00:59:59) and half of your compute is for (01:00:01) inference. Um, and you know the (01:00:03) inference has some gross margin that's (01:00:05) like more than 50%. Um, and so what that (01:00:08) means is that if you were in steady (01:00:10) state, you build a data center, if you (01:00:12) knew exactly exactly exactly the demand (01:00:14) you were getting, you would um uh uh uh (01:00:18) you know you would you would you you (01:00:20) would you would get a certain amount of (01:00:21) revenue. say, I don't know, uh, uh, (01:00:23) let's say you pay $100 billion a year (01:00:25) for compute and on $50 billion a year, (01:00:28) you support $150 billion on of of of of (01:00:31) revenue and the other 50 billion the (01:00:33) other 50 billion are used for training. (01:00:35) Um, so basically, you're profitable. You (01:00:37) make you make you make $50 billion of (01:00:40) profit. Those are the economics of the (01:00:41) industry today or or sorry, not today, (01:00:44) but like that's where we're where we're (01:00:46) projecting forward in a year or two. The (01:00:48) only thing that makes that not the case (01:00:50) is if you get less demand than 50 (01:00:53) billion um then you have more than 50% (01:00:56) of your your data center for research (01:00:59) and you're not profitable. So you you (01:01:00) know you train stronger models but (01:01:02) you're like not profitable. Um if you uh (01:01:05) get more demand than you thought then (01:01:07) your research gets squeezed um but uh (01:01:10) you know you're you're you're kind of (01:01:12) able to support more inference and (01:01:14) you're more profitable. So it's maybe (01:01:16) I'm not explaining it well but but the (01:01:18) thing I'm trying to say is you decide (01:01:19) the amount of compute first and then you (01:01:23) have some target desire of of inference (01:01:25) versus versus training but that gets (01:01:28) determined by demand. It doesn't get (01:01:29) determined by (01:01:30) >> what I'm hearing is the reason you're (01:01:31) predicting profit is that you are (01:01:33) systematically underestimate uh (01:01:34) underinvesting in compute right because (01:01:36) if you actually (01:01:37) >> No I'm saying I'm saying it's hard to (01:01:39) predict. So, so these things about 2028 (01:01:42) and when it will happen, that's our (01:01:44) that's our attempt to do the best we can (01:01:45) with investors, all of this stuff is (01:01:48) really uncertain because of the cone of (01:01:49) uncertainty. Like we could be profitable (01:01:52) in 2026 if the if the revenue grows fast (01:01:55) enough and then and then um uh you know (01:01:58) if we if we overestimate or (01:02:00) underestimate the next year that could (01:02:02) swing wildly. Like I I I what I'm trying (01:02:05) to get at is you have a model in your (01:02:07) head of like the the business invests (01:02:09) invests invests invests gets scale and (01:02:12) and and and kind of then becomes (01:02:13) profitable. There's a single point at (01:02:15) which things turn around. I don't think (01:02:17) the economics of this industry work that (01:02:19) way. (01:02:19) >> I see. So if I'm understanding (01:02:22) correctly, you're saying because of the (01:02:24) discrepancy between the amount of (01:02:25) compute we should have gotten and the (01:02:26) amount of compute we got, we we were (01:02:28) like sort of forced to make profit. But (01:02:30) that that doesn't mean we're going to (01:02:31) continue making profit. But we're going (01:02:32) to like reinvest the money because well (01:02:34) now AI has made so much progress and we (01:02:36) want the bigger country of geniuses and (01:02:38) so then back into uh revenue is high but (01:02:42) losses are also high. If we if we (01:02:44) predict if every year we predict exactly (01:02:47) what the demand is going to be will be (01:02:48) profitable every year because grow (01:02:52) because spending spending 50% of your (01:02:54) compute on on 50% of your compute on (01:02:56) research roughly um plus a gross margin (01:03:00) that's higher than 50%. And and correct (01:03:02) demand prediction leads to profit. (01:03:04) That's the prof that's that's the (01:03:05) profitable business model that I think (01:03:07) is kind of like there but like obsc (01:03:10) obscured by these like building ahead (01:03:12) and prediction errors. I (01:03:14) >> I guess you're treating the 50% as a uh (01:03:17) as a sort of like you know just like a (01:03:19) given constant whereas you in fact if (01:03:21) you if AI progress is fast and you can (01:03:23) increase the progress by scaling up more (01:03:24) you just have more than 50% and not make (01:03:26) profit. (01:03:26) >> Here's what I'll say. You might want to (01:03:27) scale up it more. you might want to (01:03:29) scale it up more, but but but you know, (01:03:31) remember the log returns to scale, (01:03:33) right? If if 70% would get you a very (01:03:38) little bit of a smaller model through a (01:03:39) factor of of 1.4x, right? like that (01:03:43) extra $20 billion is is is is you know (01:03:46) that each each dollar there is worth (01:03:48) much less to you because because the log (01:03:50) linear setup and so you might find that (01:03:53) it's better to invest that that that (01:03:54) that it's better to invest that $20 (01:03:56) billion in you know in in serving (01:03:59) inference or in hiring engineers who are (01:04:01) who who are kind of better who are who (01:04:03) are kind of better who are kind of (01:04:04) better at what they're doing. So the the (01:04:06) reason I said 50% that's not that's not (01:04:08) exactly our target. It's not exactly (01:04:09) going to be 50%. It will probably vary (01:04:12) vary over time. What what I'm saying is (01:04:14) the the the the like log linear return (01:04:17) what it leads to is you spend of order (01:04:20) one fraction of the business, right? (01:04:22) Like not 5% not 95%. And then it then (01:04:27) you know then then then you get (01:04:28) diminishing returns because of the (01:04:29) because the walls that I'm like (01:04:32) convincing Dario to like believe in AI (01:04:33) progress or something but like uh okay (01:04:35) you you don't invest in research because (01:04:37) it has diminishing returns but you (01:04:38) invest in the other things you mentioned (01:04:40) >> again again we're talking about (01:04:41) diminishing returns (01:04:43) after you're spending 50 billion a year (01:04:45) right like this is a point I'm sure you (01:04:47) would make but like diminishing returns (01:04:50) on a genius is could be quite high and (01:04:53) more generally like what is profit put (01:04:55) in a market economy profit is basically (01:04:57) saying the other companies in the market (01:04:59) can like do more things with this money (01:05:01) that I (01:05:02) >> yeah put aside entropic I'm just trying (01:05:03) to like because I you know I don't want (01:05:05) to give information about entropic is (01:05:07) why I'm giving these stylized numbers (01:05:08) but like let's just derive the (01:05:10) equilibrium of the industry right I (01:05:12) think the so so so why doesn't everyone (01:05:15) spend 100% of their um uh you know 100% (01:05:21) of their compute on training and not (01:05:22) serve any customers right it's because (01:05:24) if They didn't get any revenue. They (01:05:26) couldn't raise money. They couldn't do (01:05:27) comput deals. They couldn't buy more (01:05:28) compute the next year. So, there's going (01:05:30) to be an equilibrium where every every (01:05:32) company spends less than 100% on on on (01:05:36) on on training and certainly less than (01:05:38) 100% on inference. It should be clear (01:05:39) why you don't just serve the current (01:05:41) models and and you know and and and (01:05:44) never train another model because then (01:05:46) you don't have any demand because you (01:05:47) because you'll fall behind. So, there's (01:05:49) some equilibrium. It's it's not going to (01:05:51) be 10%, it's not going to be 90%. Let's (01:05:54) just say as a stylized fact it's 50%. (01:05:56) That's what I'm getting at. And and and (01:05:58) I think we're going to be in a position (01:05:59) where that equilibrium of how much you (01:06:01) spend on training is less than the gross (01:06:04) margins that that you're that that that (01:06:07) you're able to get on compute. And so (01:06:09) the the the the underlying economics are (01:06:11) profitable. The problem is you have this (01:06:13) this hellish demand prediction problem (01:06:16) when you're when you're buying the next (01:06:17) year of compute and you might guess (01:06:19) under and be very profitable but have no (01:06:23) compute for research or you might guess (01:06:25) over and you know you're you're you're (01:06:28) um uh you you are not profitable and you (01:06:31) have all the compute compute for (01:06:33) research in the world. (01:06:35) >> Does does that make sense just as a (01:06:37) dynamic model of the industry? May maybe (01:06:39) stepping back I'm like uh I I I'm not (01:06:42) saying I I think the country of genius (01:06:43) is going to come in two years and (01:06:45) therefore you should buy this compute. (01:06:46) Um to me what you're saying the end (01:06:49) conclusion you're arriving at makes a (01:06:51) lot of sense but uh that's because like (01:06:54) oh it seems like country geniuses is (01:06:56) hard and there's a long way to go. And (01:06:58) so the stepping back the thing I'm (01:07:00) trying to get at is more like (01:07:02) it seems like your worldview is (01:07:04) compatible with somebody who says uh (01:07:05) we're like 10 years away from a world in (01:07:07) which like we're generating trillions of (01:07:09) dollars. (01:07:09) >> That's just that's just not my view. (01:07:11) That is that is not my view. Like I I so (01:07:13) so I'll like I'll like make another (01:07:15) prediction. It is hard for me to see (01:07:18) that that there won't be trillions of (01:07:20) dollars in revenue before 2030. Um like (01:07:24) uh I can I can construct a plausible (01:07:26) world. It takes maybe three years. So (01:07:29) that that you know that would be the end (01:07:30) of what I think it's plausible like in (01:07:32) 2028 we get the the real country of (01:07:35) geniuses in a data center. You know the (01:07:37) revenue's been been go you know the (01:07:39) revenue has been going into the maybe is (01:07:41) is in the low hundreds of billions by by (01:07:43) by by 2028 and and and then the country (01:07:46) of geniuses accelerates it to trillions, (01:07:49) you know, and and we're basically we're (01:07:50) basically on the slow end of diffusion. (01:07:52) It takes two years to get to the (01:07:54) trillions. that that that would that (01:07:55) that that would be the world where it (01:07:57) takes until that would be the world (01:07:58) where it takes until 2030. I I I suspect (01:08:01) even composing the technical exponential (01:08:04) and the diffusion exponential will get (01:08:06) there before 2030. So you laid out a (01:08:08) model where anthropic makes profit (01:08:11) because it seems like fundamentally (01:08:13) we're in a computed world and so it's (01:08:15) like eventually we keep growing comput. (01:08:17) Well, I think I think the way the profit (01:08:19) comes is again and and you know, let's (01:08:21) let's just abstract the whole industry (01:08:23) here. Like we have a you know, let's (01:08:24) just imagine we're we're in like an (01:08:26) economics textbook. We have a small (01:08:28) number of firms each can invest a (01:08:31) limited amount in you know or or or like (01:08:34) each can invest some fra fraction in (01:08:36) R&D. They have some marginal cost to (01:08:38) serve. the margins on that the profit (01:08:40) mar the gross profit margins on that (01:08:42) marginal cost are like very high because (01:08:45) because because because inference is (01:08:46) efficient there's some competition but (01:08:48) the models are also differentiated (01:08:50) there's some there's some um you know (01:08:53) companies will compete to push their (01:08:54) research budgets up but like because (01:08:57) there's a small number of players you (01:08:58) know we have the what is it called in (01:09:01) the corno equilibrium I think is what (01:09:02) the what the small number of firm (01:09:04) equilibrium is it the point is it it (01:09:07) doesn't equilibrate to perfect (01:09:09) competition with with with with with (01:09:10) with (01:09:12) zero margins. If there's like three (01:09:15) firms, if there's three firms in the (01:09:17) economy, all are kind of independently (01:09:19) behaving behaving rationally, it doesn't (01:09:21) equilibrate to zero. (01:09:23) >> Um, help me understand that because (01:09:25) right now we do have three leading firms (01:09:26) and they're not making profit. Um, and (01:09:29) so what what uh yeah, what what is (01:09:31) changing? Yeah. So the the again the (01:09:35) gross margins right now are very (01:09:37) positive. What's happen what what's (01:09:40) happening is a combination of two (01:09:41) things. One is we're still in the (01:09:43) exponential scale up phase of compute. (01:09:46) Um, so what basically what that means is (01:09:49) we're training like a model gets (01:09:51) trained, it costs, you know, let's say a (01:09:53) model got trained that costs uh a (01:09:55) billion last year. Um, and then uh this (01:09:59) year it produced uh $4 billion of (01:10:03) revenue and cost $1 billion to to uh to (01:10:08) to to inference from. Um so you know (01:10:10) again I'm using stylized number here but (01:10:12) you know that would be 75% you know (01:10:14) gross gross gross margins and you know (01:10:17) this this 25% tax. So that model as a (01:10:20) whole makes $2 billion. Um but at the (01:10:24) same time we're spending $10 billion to (01:10:26) train the next model because there's an (01:10:28) exponential scale up and so the company (01:10:30) loses money. Each model makes money but (01:10:32) the company loses money. The equilibrium (01:10:34) I'm talking about is an equilibrium (01:10:36) where we have the country of geniuses. (01:10:38) We have the country of geniuses in a (01:10:40) data center, but that that um model (01:10:44) training scale up has equilibrated more. (01:10:47) Maybe maybe it's still it's still going (01:10:48) up. We're still trying to predict the (01:10:50) demand, but it's more it's more um (01:10:53) leveled out. Um I'll give you a couple (01:10:55) things there. So, um let's start with (01:10:57) the current world. Um in the current (01:10:59) world, you're right that as you said (01:11:02) before, if you treat each individual (01:11:03) model as a company, it's profitable. But (01:11:06) of course, a big part of the production (01:11:07) function of being a frontier lab is (01:11:11) training the next model, right? So if (01:11:13) you didn't do that, then you'd make (01:11:15) profit for two months and then you (01:11:16) wouldn't have margins because you (01:11:17) wouldn't have the best model and then so (01:11:19) yeah, you can make profits for two (01:11:20) months on the current at some point that (01:11:21) reaches the biggest scale that it can (01:11:23) reach and then and then in equilibrium (01:11:25) we have algorithmic improvements, but (01:11:27) we're spending roughly the same amount (01:11:29) to train the next model as as as we (01:11:32) spent to train the current model. Um so (01:11:34) this equilibrium relies (01:11:36) >> I mean at some point at some at at some (01:11:38) point you run out of money in the (01:11:39) economy (01:11:40) >> uh a fixed lump of labor files the (01:11:42) economy is going to grow right that's (01:11:43) one of your predictions we're going to (01:11:44) have but this is but this is another (01:11:47) example of the theme I was talking about (01:11:49) which is that the economy will grow much (01:11:53) faster with AI than I think it ever has (01:11:55) before but it's not like right now the (01:11:58) compute is growing 3x a year I don't (01:12:00) believe the economy is going to grow (01:12:02) 300% % a year. Like I said this in (01:12:04) Machines of Loving Grace, like I think (01:12:06) we we may get 10 or 20% per year growth (01:12:09) in the economy, but we're not going to (01:12:11) get 300% growth in the economy. So I (01:12:14) think I think in the end, you know, if (01:12:16) if compute becomes the majority of what (01:12:18) the economy produces, it's it's going to (01:12:20) be capped by that. So okay, now let's (01:12:22) assume a model where compute stays (01:12:23) capped. Yeah. Um, the world where (01:12:26) Frontier Labs are making money is one (01:12:28) where they continue to make um, fast (01:12:31) progress because fundamentally your (01:12:32) margin is limited by how good the (01:12:35) alternative is. And so you are able to (01:12:37) make money because you have a frontier (01:12:38) model. Um, if you didn't have frontier (01:12:39) model, you wouldn't be making money. Um, (01:12:42) >> and and so this this model requires (01:12:45) there never to be a steady state like (01:12:46) forever and ever you keep making more (01:12:49) progress. (01:12:50) >> I don't think that's true. True. I mean (01:12:51) I I feel feel like we're we're like (01:12:53) we're talk we're we're you know I feel (01:12:54) like this is an economics uh like uh you (01:12:57) know this is like an economics class you (01:12:59) know quote we never stop talking about (01:13:01) economics. (01:13:01) >> We never we never stop talking about (01:13:03) economics. So no but but there there are (01:13:06) worlds in which um you know there so I I (01:13:10) don't think this field's going to be a I (01:13:11) don't think this field's going to be a (01:13:12) monopoly. All my lawyers never want me (01:13:14) to say the word monopoly. Um but I don't (01:13:16) think this field's going to be a (01:13:17) monopoly. But but you do get you get (01:13:19) industries in which there are small (01:13:21) number of players, not one but a small (01:13:23) number of players. And ordinarily like (01:13:26) the the way you get monopolies like (01:13:29) Facebook or or Meta, I always call them (01:13:32) Facebook but um uh uh is is these kind (01:13:35) of is these kind of these kind of (01:13:36) network effects. The way you get (01:13:38) industries in which there are small (01:13:40) number of players are very high costs of (01:13:43) entry, right? Um so you know uh cloud is (01:13:47) like this. I think cloud is a good (01:13:48) example of this. You have three maybe (01:13:51) four players within cloud. I think I (01:13:53) think that's the same for AI. Three (01:13:54) maybe four. Um uh and the reason is that (01:13:58) it's it's so expensive. It requires so (01:14:00) much expertise and so much capital to (01:14:04) like run a cloud company, right? And so (01:14:06) you have to put up all this capital. And (01:14:08) then in addition to putting up all this (01:14:10) capital, you have to get all of this (01:14:11) other stuff that like, you know, (01:14:13) requires a lot of skill to, you know, to (01:14:15) make it happen. And so it's like if you (01:14:16) go to someone and you're like, I want to (01:14:18) disrupt this industry. Here's hundred (01:14:19) billion dollars. You're like, okay, I'm (01:14:21) putting $100 billion and also betting (01:14:24) that you can do all these other things (01:14:25) that these people have been doing (01:14:26) >> decrease the profit in the industry. (01:14:27) >> Yeah. And and and then and then the (01:14:28) effect of your entering is the profit (01:14:30) margins go down. And so you know we have (01:14:32) equilibria like this all the time in the (01:14:34) economy where we have a few we have a (01:14:36) few players profits are not astronomical (01:14:39) margins are not astronomical but they're (01:14:41) they're not zero right um uh and and you (01:14:44) know I think I think that's what we see (01:14:46) on cloud cloud is very undifferentiated (01:14:48) models are more differentiated than (01:14:50) cloud right like everyone knows claude (01:14:53) is claude claude is good at different (01:14:55) things than GPT is good at is than than (01:14:57) Gemini is good at and it's not just (01:15:00) claude's good at coding GP PT is good at (01:15:02) you know math and reasoning you know um (01:15:05) uh it's more subtle than that like (01:15:07) models are good at different types of (01:15:08) coding models have different styles like (01:15:11) I think I think these things are (01:15:12) actually you know quite different from (01:15:14) each other and so I would expect more (01:15:16) differentiation than you see in in um (01:15:20) cloud now there there actually is a uh (01:15:24) counter there there there is one (01:15:25) counterargument um and that (01:15:27) counterargument is that if all of that (01:15:29) the process of producing models (01:15:31) um becomes uh if AI models can do that (01:15:34) themselves, then that could spread (01:15:36) throughout the economy. But that is not (01:15:38) an argument for commoditizing AI models (01:15:40) in general. That's kind of an argument (01:15:42) for commoditizing the whole economy at (01:15:44) once. Um I don't know what what quite (01:15:47) happens in that world where basically (01:15:49) anyone can do anything, anyone can build (01:15:51) anything, and there's like no mode (01:15:52) around anything at all. I mean, I don't (01:15:54) know, maybe we want that world like like (01:15:56) maybe that's the maybe that's the end (01:15:58) state here. like maybe maybe um you know (01:16:00) when maybe when when when kind of AI (01:16:02) models can do you know when when when (01:16:04) when AI models can do everything if (01:16:06) we've solved all the safety and security (01:16:08) problems like you know that's one of the (01:16:10) one of the one of the mechanisms for for (01:16:13) uh you know um uh uh you know just just (01:16:16) kind of the economy flattening itself (01:16:18) again but but that's kind of like post (01:16:19) like far post country of geniuses in a (01:16:22) data center. Um maybe a finer way to put (01:16:25) that uh potential point is one um it (01:16:29) seems like AI research is especially (01:16:32) loaded on raw intellectual power which (01:16:35) will be especially abundant in a world (01:16:37) with AGI and two if you just look at the (01:16:39) world today there's very few (01:16:41) technologies that seem to be diffusing (01:16:42) as fast as um as AI algorithmic progress (01:16:48) and so that does hint that this industry (01:16:50) is sort of structurally diffusive (01:16:52) So I think coding is going fast but I (01:16:54) think AI research is a supererset of (01:16:56) coding and there are aspects of it that (01:16:57) are not going fast. Um uh but I but I do (01:17:01) think again once we get coding once we (01:17:03) get AI models going fast then you know (01:17:06) AI you know that will speed up the (01:17:08) ability of AI models to kind to kind of (01:17:09) do everything else. So I think while (01:17:11) coding is going fast now I think once (01:17:14) the AI models are building the next AI (01:17:16) models and building everything else the (01:17:18) kind of whole the whole economy will (01:17:20) kind of go at the same pace. I am I am (01:17:23) worried geographically though. I'm a (01:17:25) little worried that like just proximity (01:17:28) to AI having heard about AI um uh that (01:17:32) that that may be one differentiator. And (01:17:34) so when I said the like you know 10 or (01:17:36) 20% growth rate a worry I have is that (01:17:40) the growth rate could be like 50% in (01:17:42) Silicon Valley and you know parts of the (01:17:45) world that are kind of socially (01:17:46) connected to Silicon Valley and you know (01:17:50) not that much faster than its current (01:17:51) pace elsewhere and I think that'd be a (01:17:53) pretty messed up world. So I one of the (01:17:55) things I think about a lot is how to (01:17:56) prevent that. (01:17:57) >> Yeah. Do you think that once we have uh (01:17:59) this country of geniuses in a data (01:18:00) center that robotics is sort of quickly (01:18:03) solved afterwards because it seems like (01:18:05) a big problem with robotics is that um a (01:18:08) human can learn how to teleoperate (01:18:10) current hardware but current AI models (01:18:12) can't at least not not in a way that's (01:18:14) super productive and so if we have this (01:18:16) ability to learn like a human should it (01:18:18) solve robotics immediately as well (01:18:19) >> I don't think it's dependent on learning (01:18:21) like a human it could happen in (01:18:22) different ways again we could have (01:18:24) trained the model on many different (01:18:26) video games which like robotic controls (01:18:28) or many different simulated robotics (01:18:30) environments or just you know train them (01:18:32) to control computer screens and they (01:18:34) learn to generalize. So it will happen. (01:18:38) It's not necessarily dependent on (01:18:41) humanlike learning. Humanlike learning (01:18:42) is one way it could happen. If the (01:18:44) model's like, "Oh, I pick up a robot. I (01:18:45) don't know how to use it. I learn that. (01:18:47) That could happen because we discovered (01:18:49) discovering continual learning." That (01:18:51) could also happen because we train the (01:18:53) model on a bunch of environments and (01:18:54) then it generalized. Or it could happen (01:18:56) because the model learns that in the (01:18:58) context length. It it doesn't actually (01:19:00) matter which way. If we go back to the (01:19:02) discussion we had like like an hour ago, (01:19:04) that type of thing can happen in that (01:19:06) type of thing can happen in several (01:19:08) different ways. Um uh uh but but I do (01:19:11) think when for for whatever reason the (01:19:13) models have those skills then uh (01:19:16) robotics will be revolutionized both the (01:19:18) design of robots because the models will (01:19:20) be much better than humans at that um (01:19:22) and also the the ability to kind of (01:19:25) control robots. So we'll get better at (01:19:27) the physical building the physical (01:19:29) hardware building the physical robots (01:19:30) and we'll also get better at controlling (01:19:32) it. Now you know does that mean the (01:19:34) robotics industry will also be (01:19:36) generating trillions of dollars of (01:19:38) revenue? My answer there is yes but (01:19:40) there will be the same extremely fast (01:19:42) but not infinitely fast diffusion. So (01:19:44) will robotics be be revolutionized? (01:19:47) Yeah, maybe tack on another year or two. (01:19:49) >> That's that's my that's the way I think (01:19:51) about these things. (01:19:52) Uh there's a general skepticism about (01:19:55) extremely fast progress. Like here (01:19:57) here's my view which is like it sounds (01:19:59) like you are going to solve continual (01:20:00) learning one way or another within a (01:20:02) matter of years. But just as people (01:20:04) weren't talking about continual learning (01:20:06) a couple years ago and then we realized (01:20:08) oh why aren't these models as useful as (01:20:09) they could be right now even though they (01:20:11) are clearly passing the touring test and (01:20:13) are experts in so many different (01:20:14) domains. Maybe it's this thing. And then (01:20:15) we solve this thing and we realize (01:20:16) actually there's another (01:20:18) um another thing that human intelligence (01:20:21) can do. that's a basis of human labor (01:20:23) that these models can't do. And then so (01:20:24) why not think there will be more things (01:20:26) like this? Why I think that like we're (01:20:28) we're you know we've like found the (01:20:30) pieces of human intelligence. (01:20:31) >> Well well to be clear I mean I think (01:20:32) continual learning as I've said before (01:20:34) might not be a barrier at all right like (01:20:36) like you know I think I think we maybe (01:20:38) just get there by pre-training (01:20:40) generalization and and and and and and (01:20:43) RL generalization. Like I I think there (01:20:45) just might not be um there basically (01:20:48) might not be such a thing at all. In (01:20:50) fact, I would point to the history in in (01:20:52) ML of people coming up with things that (01:20:55) are barriers that end up kind of (01:20:57) dissolving within the big blob of (01:20:58) compute, right? That, you know, people (01:21:00) talked about, you know, (01:21:03) you know, how do you have, you know, how (01:21:05) do how do your models keep track of (01:21:07) nouns and verbs and, you know, how do (01:21:09) they, you know, they can understand (01:21:11) semant syntactically, but they can't (01:21:13) understand semantically. You know, it's (01:21:15) only statistical correlations. You can (01:21:17) understand a paragraph, but you can't (01:21:19) understand a word. There's reasoning, (01:21:20) you can't do reasoning, but then (01:21:22) suddenly it turns out you can do code (01:21:24) and math very well at all. So (01:21:26) >> I I think there actually there's there's (01:21:27) actually a stronger history of some of (01:21:30) these things seeming like a big deal and (01:21:32) then and then kind of and then kind of (01:21:34) dissolving. Some of them are real. I (01:21:36) mean the need for data is real. May (01:21:38) maybe continual (01:21:40) continual learn continual learning is a (01:21:42) real thing. But again, I would ground us (01:21:44) in something like code. Like I think we (01:21:47) may get to the point in like a year or (01:21:49) two where the models can just do end to (01:21:51) end. Like that's a whole task. That's a (01:21:54) whole sphere of human activity that that (01:21:56) we're just saying models can do it now. (01:21:59) Um when you say end to end, do you mean (01:22:01) um setting technical direction, (01:22:04) understanding the context of the (01:22:05) problem, etc.? Okay. Yes. I mean all of (01:22:07) that. Interesting. I mean that that is I (01:22:11) feel like AGI complete um which maybe is (01:22:14) internally consistent but um it's not (01:22:16) like saying 90% of code or 100% of code (01:22:18) it's like no no the other parts of the (01:22:20) >> no no I gave this I gave the spectrum (01:22:23) 90% of code 100% of code 90% of N10 (01:22:26) suite 100% of N10 suite new tasks are (01:22:29) created for SWES eventually those get (01:22:31) done as well but it's a long spectrum (01:22:33) there but we're traversing the spectrum (01:22:35) very quickly (01:22:36) >> um I do think it's funny that I I've (01:22:38) seen a couple podcasts you've done where (01:22:40) um the host will be like a butcher wrote (01:22:42) the essay about the control learning (01:22:43) thing and it always makes me crack up (01:22:45) because you're like you know you've been (01:22:46) an AI researcher for like 10 years and (01:22:49) I'm sure there's like some uh feeling of (01:22:51) like okay so podcaster wrote an essay (01:22:54) like every interview I get asked about (01:22:55) it (01:22:56) >> you know the the truth of the truth of (01:22:58) the matter is that we're all trying to (01:22:59) figure this out together right there (01:23:02) there are some ways in which I'm able to (01:23:05) see things that others aren't these days (01:23:08) That probably has more to do with like I (01:23:10) can see a bunch of stuff within (01:23:11) Enthropic and have to make a bunch of (01:23:13) decisions than I have any great research (01:23:15) insight that that that others don't. (01:23:17) Right? I you know I'm running a 2500 (01:23:19) person company. Like it's it's actually (01:23:21) pretty hard for me to have have concrete (01:23:24) research insight you know much harder (01:23:26) than you know than than it would have (01:23:28) been you know 10 years ago or or you (01:23:30) know or even two or three years ago. Um, (01:23:33) as we go towards a world of a full drop (01:23:36) in remote worker replacement, does a API (01:23:40) pricing model still make the most sense? (01:23:42) And if not, what is the correct way to (01:23:43) price AGI or serve AGI? (01:23:45) >> Yeah, I mean, I think there's going to (01:23:47) be a bunch of different business models (01:23:48) here sort of all at once that are going (01:23:50) to be that are going to be experimented (01:23:53) with. Um, I I I actually do think that (01:23:56) the the API (01:23:58) um (01:24:00) model is is more durable than many (01:24:02) people think. Um, one way I think about (01:24:04) it is if the technology is kind of (01:24:08) advancing quickly, if it's advancing (01:24:09) exponentially, what that means is (01:24:11) there's there's always kind of like a (01:24:13) surface area of of kind of new use cases (01:24:16) that have been developed in in the last (01:24:18) uh in the last three months. And any (01:24:20) kind of product surface you put in place (01:24:23) is always at risk of sort of becoming (01:24:27) irrelevant, right? Any given product (01:24:29) surface probably makes sense for, you (01:24:30) know, a range of capabilities of the (01:24:32) model, right? The the chatbot is already (01:24:34) running into limitations of, you know, (01:24:38) making it smarter doesn't really help (01:24:39) the average consumer that much. But I (01:24:42) don't think that's a limitation of AI (01:24:43) models. I don't think that's evidence (01:24:45) that, you know, the models are are the (01:24:47) models are good enough and they're (01:24:48) they're, you know, them getting better (01:24:50) doesn't matter to the economy. It (01:24:52) doesn't matter to that particular (01:24:53) product. Um, and and so I think the (01:24:56) value of the API is the API always (01:24:59) offers an opportunity, you know, very (01:25:02) close to the bare metal to build on what (01:25:04) the latest thing is. Um, and so there, (01:25:06) you know, there's there's there's kind (01:25:08) of always going to be this, you know, (01:25:10) this this kind of front of new startups (01:25:13) and new ideas that weren't possible a (01:25:15) few months ago and are possible because (01:25:17) the model is advancing. And and so I I (01:25:20) actually I I I kind of actually predict (01:25:23) that we are it's going to exist (01:25:27) alongside other models, but we're always (01:25:29) going to have the API business model (01:25:31) because there's there's always going to (01:25:33) be a need for a thousand different (01:25:35) people to try experimenting with the (01:25:37) model in different way and a hundred of (01:25:39) them become startups and 10 of them (01:25:41) become big successful startups and you (01:25:42) know two or three really end up being (01:25:44) the the way that people use the model of (01:25:46) a of a given generation. So I I (01:25:49) basically think it's always going to (01:25:50) exist. At the same time, I'm sure (01:25:53) there's going to be other models as (01:25:55) well. Like not every token that's output (01:25:58) by the model is worth the same amount. (01:26:00) Think about, you know, how how what is (01:26:04) the value of the tokens that are like, (01:26:06) you know, that the model outputs when (01:26:07) someone, you know, call, you know, (01:26:10) someone, you know, calls them up and (01:26:11) says, "My Mac isn't working or (01:26:13) something." You know, the model's like, (01:26:14) "Restart it, right?" Yeah. (01:26:15) >> And like you know someone hasn't heard (01:26:17) that before but like you know the model (01:26:19) said that like 10 million times you know (01:26:23) that maybe that's worth like a dollar or (01:26:25) a few cents or something. Um whereas if (01:26:28) uh the model you know the model goes to (01:26:32) you know one of the one of the (01:26:33) pharmaceutical companies and it says oh (01:26:35) you know this molecule you're developing (01:26:37) you should take the aromatic ring from (01:26:39) that end of the molecule and put it on (01:26:41) that end of the molecule. Um and and you (01:26:43) know if you do that wonderful things (01:26:44) will happen. um uh like like those (01:26:47) tokens could be worth, you know, tens of (01:26:49) millions of dollars, right? Um uh so so (01:26:53) I think we're definitely going to see (01:26:54) business models that that recognize (01:26:56) that, you know, at some point we're (01:26:58) going to see, you know, pay for results (01:27:00) or you know, in some in some form or we (01:27:04) may see forms of compensation that are (01:27:07) like labor um uh you know, that that (01:27:10) kind of work by the hour. Um I I I you (01:27:14) know, I don't Oh, I think I think I (01:27:15) think because it's a new industry, a lot (01:27:18) of things are going to be tried and I (01:27:19) you know I don't know what will turn out (01:27:20) to be the right thing. (01:27:21) >> Um what I find uh I I take your point (01:27:24) that people will have to try things to (01:27:26) figure out what is the best way to use (01:27:27) this blob of intelligence. But what I (01:27:30) find striking is clawed code. So I don't (01:27:34) think in the history of startups there (01:27:35) has been a single application that has (01:27:37) been as hotly competed in as coding (01:27:40) agents and um and and cloud code is a (01:27:46) category leader here and that seems (01:27:48) surprising to me like it doesn't seem (01:27:50) intrinsically like enthropic had to (01:27:52) build this and I wonder if you have an (01:27:53) accounting of why it had to be enthropic (01:27:55) or why how enthropic ended up building (01:27:57) an application in addition to the model (01:27:59) underlying it. Yeah. So it actually (01:28:01) happened in a pretty simple way which is (01:28:02) we had our own um you know we had our (01:28:07) coding models which were good at coding (01:28:09) and and you know around the beginning of (01:28:11) 2025 I said I I think the time has come (01:28:13) where you can have non-trivial (01:28:15) acceleration of your own research um if (01:28:19) you're an AI company by using these (01:28:21) models and of course you know we you (01:28:23) need an interface you need a harness to (01:28:25) use them and so I encourage people (01:28:27) internally you know I didn't say this is (01:28:28) one thing that you know you have to use. (01:28:31) I I just said people should experiment (01:28:33) with this and then you know this thing I (01:28:37) think it might have been originally (01:28:38) called claude CLI and then the name (01:28:40) eventually got changed to cloud code (01:28:42) internally um was the thing that kind of (01:28:46) everyone was using and it was seeing (01:28:47) fast internal adoption and I looked at (01:28:49) it and I said probably we should launch (01:28:51) this externally right um uh you know (01:28:53) it's it's seen such fast adoption within (01:28:55) anthropic like you know like you know (01:28:58) coding is a lot of what we do and and so (01:29:00) you know we have a we have audience of (01:29:02) many many hundreds of people that's in (01:29:04) some ways at least representative of the (01:29:06) external audience. So it looks like we (01:29:08) already have product market fit. Let's (01:29:09) launch this thing. Um and and then we (01:29:11) launched it and and and I think you know (01:29:13) just just the fact that we ourselves are (01:29:16) kind of developing the model and we (01:29:19) ourselves know what we most need to use (01:29:21) the model. I think it's it's kind of (01:29:22) creating this feedback loop. (01:29:24) >> I see. in the sense that you let's say a (01:29:26) developer at Enthropic is like ah it it (01:29:29) would be better if it was better at this (01:29:30) X thing and then you bake that into the (01:29:33) next model that you build that that's (01:29:36) that's one version of it but but then (01:29:37) there's just the ordinary product (01:29:39) iteration of like you know we have a (01:29:41) bunch of we have a bunch of coders (01:29:42) within anthropic like we um you know (01:29:46) they like use cloud code every day and (01:29:48) so we get fast feedback that was more (01:29:49) important in the early days now of (01:29:51) course there are millions of people (01:29:52) using it um and so we get a bunch of (01:29:54) external feedback as well. But it's, you (01:29:56) know, it's just great to be able to get, (01:29:58) you know, kind of kind of uh um fast (01:30:01) fast internal feedback. You know, I (01:30:03) think this is the reason why we launched (01:30:04) a coding model and, you know, didn't (01:30:06) launch a pharmaceutical company, right? (01:30:08) It you know, you know, my background is (01:30:10) in in my background's in in like (01:30:12) biology, but like we don't have any of (01:30:14) the resources that are needed to launch (01:30:15) a pharmaceutical company. So, there's (01:30:18) been a ton of hype around OpenClaw, and (01:30:19) I wanted to check it out for myself. (01:30:20) I've got a day coming up this weekend (01:30:22) and I don't have anything planned yet. (01:30:24) So, I gave Open Claw a Mercury debit (01:30:26) card. I set a couple hundred limit and I (01:30:28) said, "Surprise me." Okay, so here's the (01:30:30) Mac Mini it's on. And besides having (01:30:32) access to my Mercury, it's totally (01:30:34) quarantined. They actually felt quite (01:30:36) comfortable giving it access to a debit (01:30:37) card because Mercury makes it super easy (01:30:39) to set up guardrails. I was able to (01:30:40) customize permissions, cap the spend, (01:30:42) and restrict the category of purchases. (01:30:44) I wanted to make sure the debit card (01:30:45) worked, so I asked OpenCloud to just (01:30:46) make a test transaction and decided to (01:30:48) donate a couple bucks to Wikipedia. (01:30:50) Besides that, I have no idea what's (01:30:51) going to happen. I will report back on (01:30:53) the next episode about how it goes. In (01:30:55) the meantime, if you want a personal (01:30:57) banking solution that can accommodate (01:30:58) all the different ways that people use (01:31:00) their money, even experimental ones like (01:31:01) this one, visit mercury.com/personal. (01:31:05) Mercury is a fintech company, not an (01:31:08) FDIC insured bank. Banking services (01:31:10) provided through Choice Financial Group (01:31:12) and column NA members FDIC. You know, (01:31:14) she thinks we're getting coffee and (01:31:16) walking around the neighborhood. (01:31:19) Um let me ask you about now um making AI (01:31:22) go well. Um it seems like whatever (01:31:25) vision we have about how AI goes well (01:31:28) has to be compatible with two things. (01:31:30) One is the ability to build and run AIs (01:31:34) is diffusing extremely rapidly and two (01:31:37) is that the population of AIS the amount (01:31:40) we have in their intelligence will also (01:31:42) increase very rapidly and that means (01:31:45) that lots of people will be able to (01:31:46) build huge populations of misaligned AIs (01:31:49) or uh AIs which are just like companies (01:31:52) which are trying to increase their uh (01:31:54) footprint or have weird psyches like (01:31:56) Sydney Bing but now they're superhuman. (01:31:58) What is a vision for a world in which we (01:32:01) have an equilibrium that is compatible (01:32:03) with lots of different AI some of which (01:32:04) are misaligned running around? (01:32:06) >> Yeah. Yeah. So I think you know in the (01:32:08) adolescence of technology I was kind of (01:32:10) you know skeptical of like the balance (01:32:13) of power but I I think I was (01:32:15) particularly skeptical of or the thing I (01:32:18) was specifically skeptical of is you (01:32:20) have like three or four of these (01:32:22) companies like kind of all building (01:32:24) models that are kind of derive you know (01:32:26) sort of sort of um uh uh like derived (01:32:29) from the like derived from the same (01:32:32) thing and uh you know that that these (01:32:35) would check each or or even that kind of (01:32:38) you know any number of them would would (01:32:39) would would uh would would check each (01:32:40) other like we might live in a offense (01:32:43) dominant world where you know like one (01:32:45) person or one AI model is like smart (01:32:47) enough to do something that like causes (01:32:49) damage for everything else. Um I think (01:32:52) in the I mean in the short run we have a (01:32:54) limited number of players now. So we can (01:32:56) start by within the limited number of (01:32:57) players we uh you know we kind of you (01:33:01) know we we need to put in place the you (01:33:03) know the safeguards. We need to make (01:33:04) sure everyone does the right alignment (01:33:05) work. we need to make sure everyone has (01:33:07) bio classifiers like you know those are (01:33:10) those are kind of the immediate things (01:33:11) we need to do. I agree that you know (01:33:13) that that doesn't solve the problem in (01:33:14) the long run particularly if the ability (01:33:17) of AI models to make other AI models (01:33:19) proliferates then you know the the whole (01:33:22) thing can kind of um you know can become (01:33:25) harder to solve. You know I think I (01:33:27) think in the long run we need some (01:33:29) architecture of governance right some (01:33:31) some architecture of governance that (01:33:33) preserves human freedom but but kind of (01:33:36) also allows us to like you know govern (01:33:39) the the very large number of kind of um (01:33:43) you know uh uh uh human systems AI (01:33:46) systems hybrid hybrid human human um you (01:33:50) know hybrid hybrid human AI like you (01:33:54) know companies or or like or like or (01:33:57) like economic units. So, you know, we're (01:33:59) we're going to need to think about like, (01:34:01) you know, how do we how do we protect (01:34:02) the world against, you know, (01:34:04) bioteterrorism? How do we protect the (01:34:06) world against like, you know, against (01:34:08) like against like mirror life? Like, you (01:34:10) know, pro probably we're going to need (01:34:12) to, you know, need some kind of like AI (01:34:15) monitoring system that like moni, you (01:34:17) know, kind of monitors for for all these (01:34:19) things, but then we need to build this (01:34:20) in a way that like, you know, preserves (01:34:23) civil liberties and like our (01:34:24) constitutional rights. So I think just (01:34:26) just as is as is anything else like it's (01:34:29) it's like a new security landscape with (01:34:31) a new set of you know a new set of tools (01:34:35) and a new set of vulnerabilities. And I (01:34:37) I think my worry is if we had a hundred (01:34:40) years for this to happen all very (01:34:42) slowly, we'd get used to it, you know, (01:34:44) like we've gotten used to like, you (01:34:46) know, the presence of, you know, the (01:34:48) presence of explosives in society or (01:34:50) like the, you know, the presence of (01:34:51) various um, you know, like new weapons (01:34:55) or the, you know, the pre the presence (01:34:57) of video cameras. Um, we would get used (01:34:59) to it over over over over 100 and we'd (01:35:01) develop governance mechanisms. We'd make (01:35:03) our mistakes. My my worry is just that (01:35:06) this is happening all so fast and so I (01:35:08) think maybe we need to do our thinking (01:35:10) faster about how to make these (01:35:11) governance mechanisms work. (01:35:13) >> Yeah, it seems like in a offense (01:35:15) dominant world (01:35:17) over the course of the next century. So (01:35:19) the idea is AI is making the progress (01:35:20) that would happen over the next century (01:35:21) happen in some period of 5 to 10 years. (01:35:24) But we would still need the same (01:35:26) mechanisms or balance of power would be (01:35:28) similarly intractable even if humans (01:35:30) were the only game in town. Um, and so I (01:35:34) guess we have the advice of AI. We it (01:35:38) fundamentally doesn't seem like a (01:35:39) totally different ballgame here. If (01:35:42) checks and balances were going to work, (01:35:43) they would work with humans as well. If (01:35:44) they aren't going to work, they wouldn't (01:35:45) work with AIS as well. Um, and so maybe (01:35:48) this just dooms human checks and (01:35:50) balances as well. But yeah, again, I (01:35:52) think there's some way to I think (01:35:54) there's some way to make this happen. (01:35:56) Like it, you know, it just it just, you (01:35:58) know, the governments of the world may (01:36:00) have to work together to make it happen. (01:36:01) like you know we may have to you may (01:36:04) have to talk to AIS about kind of you (01:36:07) know building societal structures in (01:36:09) such a way that like these these (01:36:10) defenses are possible. I I I don't know. (01:36:12) I mean this is so this is you know I (01:36:14) don't want to say so far ahead in time (01:36:16) but like so far ahead in technological (01:36:19) ability that may happen over a short (01:36:21) period of time that it's hard for us to (01:36:22) anticipated in advance. Um, speaking of (01:36:24) governments getting involved, on (01:36:26) December 26th, the Tennessee legislature (01:36:28) introduced a bill which uh said, quote, (01:36:31) um, it would be an offense for a person (01:36:33) to knowingly train artificial (01:36:34) intelligence to provide emotional (01:36:35) support, including through open-ended (01:36:38) conversations with a user. And of (01:36:40) course, one of the things that Claude (01:36:42) attempts to do is be uh a thoughtful um (01:36:46) thoughtful friend, thoughtful, (01:36:48) knowledgeable friend. And in general, it (01:36:50) seems like we're going to have this (01:36:51) patchwork of state laws. A lot of the (01:36:53) benefits that normal people could (01:36:54) experience as a result of AI are going (01:36:56) to be curtailed, especially when we get (01:36:57) into the kinds of things you discussed (01:36:59) in machines of love and grace, (01:37:00) biological freedom, mental health (01:37:02) improvements, etc., etc. It seems easier (01:37:04) to imagine worlds in which these get (01:37:05) whack-a-ole away by different laws. Um, (01:37:08) whereas (01:37:10) bills like this don't seem to address (01:37:12) the actual existential threats that (01:37:15) you're concerned about. So I'm curious (01:37:16) about to understand in the context of (01:37:18) things like this your anthropics (01:37:20) position against the federal moratorium (01:37:22) on state AI laws. (01:37:24) >> Yes. So I don't know there's there's (01:37:25) many different things going on at once, (01:37:27) right? I think I think that that I think (01:37:29) that particular law is is dumb. Like you (01:37:32) know I think it was it was clearly made (01:37:33) by legislators who just probably had (01:37:36) little idea what AI models could do and (01:37:38) not do. They're like AI models serving (01:37:40) as that just sounds scary. Like I don't (01:37:42) want I don't want that to happen. So, (01:37:43) you know, we're we're we're not we're (01:37:45) not in favor of that, right? But but but (01:37:48) that, you know, that that wasn't the (01:37:49) thing that was being voted on. The thing (01:37:50) that was being voted on is we're going (01:37:53) to ban all state regulation of AI for 10 (01:37:56) years with no apparent plan to to do any (01:38:00) federal regulation of AI, which would (01:38:02) take Congress to pass, which is a very (01:38:04) high bar. Um so you know the idea that (01:38:07) we'd ban states from doing anything for (01:38:08) 10 years and people said they had a plan (01:38:11) for federal government but you know (01:38:13) there was no actual there was no (01:38:14) proposal on the table. There was no (01:38:16) actual attempt. Um given the serious (01:38:19) dangers that I lay out in adolescence of (01:38:22) technology around things like the you (01:38:24) know kind of biological weapons and (01:38:26) bioteterrorism autonomy risk and the (01:38:29) timelines we've been talking about like (01:38:31) 10 years is an eternity. like that's (01:38:33) that's a that's a I I think that's a (01:38:36) crazy thing to do. So if if that's the (01:38:38) choice, if that's what you force us to (01:38:40) choose, then then we're going to we're (01:38:42) going to choose not to have that (01:38:43) moratorum. And you know, I think the the (01:38:46) the benefits of that position exceed the (01:38:48) costs. But it's it's not a perfect (01:38:50) position if that's the choice. Now, I (01:38:52) think the thing that we should do, the (01:38:54) thing that I would support is the (01:38:56) federal government should step in, not (01:38:59) saying states you can't regulate, but (01:39:01) here's what we're going to do and and (01:39:03) states you can't differ from this, (01:39:06) right? Like I think preeemption is fine (01:39:08) in the sense of saying that federal (01:39:10) government says here's our standard. (01:39:12) This applies to everyone. States can't (01:39:14) do something different. That would be (01:39:15) something I would support if it would be (01:39:17) done in the right way. What um but but (01:39:20) this idea of states you can't do (01:39:22) anything and we're not doing anything (01:39:23) either that that struck that struck us (01:39:27) as you know very much not making sense (01:39:29) and I think will not age well it's (01:39:31) already starting to not age well with (01:39:33) with all the um backlash that that (01:39:36) you've seen now in terms of in terms of (01:39:37) what we would want I mean you know the (01:39:39) things we've talked about are are (01:39:41) starting with transparency standards um (01:39:44) uh uh you know in order to monitor some (01:39:46) of these autonomy risks and bio (01:39:48) terrorism risks as the risks become more (01:39:50) serious um as we as we get more evidence (01:39:54) for them then I think we could be more (01:39:56) aggressive in some targeted ways and and (01:39:58) say hey AI bioteterrorism is really a (01:40:01) threat let's let's pass a law that kind (01:40:04) of forces people to have classifiers and (01:40:06) I could even imagine it it depends it (01:40:08) depends how serious a threat it ends up (01:40:10) being we don't know for sure and we need (01:40:12) to pursue this in an intellectually (01:40:13) honest way where we say ahead of time (01:40:15) the risk has not emerged yet but I could (01:40:17) certainly imagine with the pace that (01:40:19) things are going that you know I could (01:40:21) imagine a world where later this year we (01:40:23) say hey this this AI bioteterrorism (01:40:26) stuff is really serious we should do (01:40:27) something about it we should put it in a (01:40:29) federal we should you know put it in a (01:40:31) federal standard and if the federal (01:40:32) government won't act we should put it in (01:40:34) a state standard I could totally see (01:40:35) that I I I'm concerned about a world (01:40:38) where (01:40:40) if you just consider the the pace of (01:40:42) progress you're expecting the life cycle (01:40:44) of of legislation you the the benefits (01:40:48) are, as you say, because of diffusion (01:40:49) lag, the benefits are slow enough that I (01:40:51) really do think this patchwork of on the (01:40:54) current trajectory, this patchwork of (01:40:56) state laws would prohibit. I mean, (01:40:58) having an emotional chatbot friend is (01:41:00) something that freaks people out, then (01:41:01) just imagine the kinds of actual (01:41:03) benefits from AI we want normal people (01:41:05) to be able to experience from (01:41:06) improvements in health and health span (01:41:08) and improvements in mental health and so (01:41:10) forth. whereas at the same time uh it (01:41:13) seems like you think the dangers are (01:41:14) already on the horizon and I just don't (01:41:16) see that much um it seems like would be (01:41:20) especially injurious to the benefits of (01:41:21) AI uh as compared to the the dangers of (01:41:24) AI and so that that's maybe the where (01:41:26) the cost benefit makes less sense to me. (01:41:28) So, so, so there's a few things here, (01:41:29) right? I mean, people talk about there (01:41:31) being thousands of these state laws. (01:41:33) First of all, the vast vast majority of (01:41:35) them do not pass. Um, and you know, the (01:41:38) the the the you know, the world works a (01:41:40) certain way in theory, but like just (01:41:42) because a law has been passed doesn't (01:41:43) mean it's really enforced, right? The (01:41:45) people the people you know implementing (01:41:47) it may be like, "Oh my god, this is (01:41:49) stupid." It would mean shutting off (01:41:50) like, you know, everything that's ever (01:41:52) been built and everything that's ever (01:41:54) been built in Tennessee. So, you know, (01:41:55) very often laws are interpreted in like, (01:41:58) you know, a way that makes them that (01:42:00) that that makes them not as dangerous or (01:42:02) not as harmful. On on the same side, of (01:42:04) course, you have to worry if you're (01:42:05) passing a law to stop a bad thing, you (01:42:07) had this you had this problem as well. (01:42:09) Yeah. Um uh look my my look I mean my (01:42:12) basic view is you know if if if (01:42:16) you know we could decide you know what (01:42:17) laws were passed and how things were (01:42:19) done which you know we're only one small (01:42:21) input input into that you know I would (01:42:24) deregulate a lot of the stuff around the (01:42:27) health benefits of AI. Um I think you (01:42:29) know I I don't worry as much about the (01:42:31) like the the the kind of chatbot laws. I (01:42:34) I actually worry more about the drug (01:42:37) approval process where I think AI models (01:42:40) are going to greatly accelerate um the (01:42:44) rate at which we discover drugs and just (01:42:46) the the pipeline will get jammed up like (01:42:48) the pipeline will not be prepared to (01:42:50) like process all all of the stuff that's (01:42:52) going through it. So um you know I I I (01:42:55) think I think reform of the regulatory (01:42:58) process to buy us more towards we have a (01:43:00) lot of things coming where the safety (01:43:02) and the efficacy is actually going to be (01:43:05) really crisp and clear like I mean a (01:43:07) beautiful thing really really crisp and (01:43:09) clear and like really really effective (01:43:11) but you know and and and maybe we don't (01:43:13) need all this all this um uh uh like um (01:43:17) all this superructure around it that was (01:43:20) designed around an era of drugs that (01:43:21) barely work and often and have serious (01:43:23) side effects. Um but at the same time I (01:43:26) think we should be ramping up quite (01:43:28) significantly the um uh you know this (01:43:33) this kind of safety and security (01:43:34) legislation and you know like I've said (01:43:37) um you know starting with transparency (01:43:39) is is my view of trying not to hamper (01:43:42) the industry right trying to find the (01:43:44) right balance. I'm worried about it. (01:43:46) Some people criticize my essay for (01:43:48) saying that's too slow. The dangers of (01:43:50) AI will come too soon if we do that. (01:43:52) Well, basically I kind of think like the (01:43:55) last 6 months and maybe the next few (01:43:56) months are going to be about (01:43:58) transparency. And then if these ris if (01:44:01) these risks emerge when we're more (01:44:02) certain of them, which I think we might (01:44:03) be as soon as as later this year, then I (01:44:06) think we need to act very fast in the (01:44:08) areas that we've actually seen the risk. (01:44:10) Like I think the only way to do this is (01:44:12) to be nimble. Now the legislative (01:44:14) process is normally not nimble but we we (01:44:17) need to emphasize to everyone involved (01:44:21) the urgency of this. That's why I'm (01:44:23) sending this message of urgency, right? (01:44:24) That's why I wrote adolescence of (01:44:26) technology. I wanted policy makers to (01:44:28) read it. I wanted economists to read it. (01:44:30) I want national security professionals (01:44:32) to read it. You know, I want decision (01:44:34) makers to read it so that they have some (01:44:36) hope of acting faster than they would (01:44:38) have otherwise. Is there anything you (01:44:41) can do or advocate that would (01:44:45) make it more certain that the benefits (01:44:47) of AI are um are better instantiated (01:44:51) where I feel like you have worked with (01:44:53) legislators to be like okay we're going (01:44:54) to prevent biotterism here way we're (01:44:56) going to increase we're going to (01:44:57) increase whistleblower protection and I (01:45:00) just think by default the actual ben (01:45:01) like the things we're looking forward to (01:45:02) here it just seems very easy they seem (01:45:05) very fragile to uh different kinds of (01:45:08) moral panics or political economy (01:45:09) problems. (01:45:10) >> Yeah, I don't actually so so I don't (01:45:12) actually agree that much in the (01:45:14) developed world. I feel like, you know, (01:45:16) in the developed world like markets (01:45:18) function pretty well and when there's (01:45:21) when there's like a lot of money to be (01:45:24) made on something and it's clearly the (01:45:26) best available alternative, it's (01:45:27) actually hard for the regulatory system (01:45:28) to stop it. You know, we're we're seeing (01:45:30) that in AI itself, right? I you know, (01:45:33) like a thing I've been trying to fight (01:45:35) for is export controls on chips to (01:45:37) China, right? And like that's in the (01:45:39) national security interests of the US (01:45:42) like you know that's like square within (01:45:45) the you know the the policy beliefs of (01:45:47) you know every almost everyone in (01:45:49) Congress of both parties but and you (01:45:52) know I think the case is very clear. The (01:45:54) counterarguments against it are I'll (01:45:56) politely call them fishy. Um uh and yet (01:46:00) it doesn't happen and we sell the chips (01:46:02) because there's there's so much money (01:46:04) there's so much money riding on it. um (01:46:07) and you know the the that money wants to (01:46:09) be made and and in that case in my (01:46:11) opinion that's a bad thing. Um and but (01:46:13) but it also it also applies when when (01:46:16) it's a good thing and and so I I don't (01:46:18) think that if we're talking about drugs (01:46:22) and benefits of the technology I I I I (01:46:26) am not as worried about those benefits (01:46:28) being hampered in the developed world. I (01:46:31) am a little worried about them going too (01:46:33) slow and I as I said I do think we (01:46:36) should work to speed the approval (01:46:38) process in the FDA. I do think we should (01:46:41) fight against these chatbot bills that (01:46:42) you're describing right described (01:46:44) individually. I'm against them. I think (01:46:46) they're stupid. Um but I actually think (01:46:49) the bigger worry is a developing world (01:46:51) um where we don't have functioning (01:46:53) markets where um you know we often can't (01:46:56) build on the technology that that we've (01:46:58) had. I worry more that those folks will (01:47:00) get left behind. And I worry that even (01:47:02) if the cures are developed, you know, (01:47:04) maybe there's someone in rural (01:47:05) Mississippi who who doesn't get it as (01:47:07) well. Right? That's a that's a that's a (01:47:08) kind of smaller version of the thing the (01:47:10) concern we have in the in the developing (01:47:12) world. And so the things we've been (01:47:14) doing are you know you know we work with (01:47:17) you know we work with you know (01:47:18) philanthropists right you know we work (01:47:20) with folks um who you know who you know (01:47:24) deliver you know medicine and health (01:47:27) interventions to you know to to (01:47:29) developing world to subsaharan Africa (01:47:31) you know India Latin America you know (01:47:35) you know other other developing parts of (01:47:38) the world that's the thing I think that (01:47:39) won't happen on its own you mentioned (01:47:42) export controls Yeah. (01:47:43) >> Why can't US and China both have a (01:47:45) country of geniuses (01:47:47) >> on a data center? (01:47:47) >> Why can't you know why won't it happen (01:47:49) or why should (01:47:51) >> why shouldn't it happen? (01:47:51) >> Why shouldn't it happen? Um, you know, I (01:47:54) think I think if this does happen, um, (01:47:57) you know, then then we kind of have a (01:48:01) well, we could have a few situ if we (01:48:03) have like an offense dominant situation. (01:48:04) We could have a situation like nuclear (01:48:06) weapons, but like more dangerous, right? (01:48:08) Where it's like um, you know, kind of (01:48:09) kind of either side could could easily (01:48:12) destroy everything. Um, we could also (01:48:15) have a world where it's kind of it's (01:48:17) unstable. Like the nuclear equilibrium (01:48:19) is stable, right? Because it's, you (01:48:20) know, it's like deterrence. But let's (01:48:22) say there were uncertainty about like if (01:48:24) the two AIs fought, which AI would win. (01:48:27) Um, that could create instability, (01:48:29) right? You often have conflict when the (01:48:31) two sides have a different assessment of (01:48:33) their likelihood of winning, right? If (01:48:34) one side is like, oh yeah, there's a 90% (01:48:37) chance I'll win. And the other side's (01:48:38) like, there's a 90% chance I'll win, (01:48:40) then then a fight is much more likely. (01:48:42) Um, they can't both be right, but they (01:48:44) can both think that. But this seems like (01:48:45) a fully general argument against the (01:48:47) diffusion of AI technology which it may (01:48:50) which is that's the implication of this (01:48:52) world. (01:48:52) >> Let me let me just go on because I think (01:48:54) we will get diffusion eventually. The (01:48:56) other concern I have is that people the (01:48:58) governments will oppress their own (01:49:00) people with AI and and and so um you (01:49:03) know I'm I'm just I'm worried about some (01:49:05) world where you have a country that's (01:49:07) already you know kind of a uh you know (01:49:11) you know there's there's a government (01:49:12) that kind of kind of already um you know (01:49:15) is is kind of kind of building a you (01:49:17) know a tech high-tech authoritarian (01:49:19) state. Um and to be clear this is about (01:49:21) the government. this is not about the (01:49:22) people like people we need to find a way (01:49:24) for people everywhere to benefit. Um my (01:49:26) worry here is about governments. Um so (01:49:29) yeah my you know my my worry is the (01:49:31) world gets carved up into two pieces one (01:49:33) of those two pieces could be (01:49:35) authoritarian or totalitarian in a way (01:49:37) that's very difficult to displace. Um (01:49:39) now will will governments eventually get (01:49:42) powerful AI and and you know there's (01:49:44) risk of authoritarianism? Yes. Will (01:49:46) governments eventually get powerful AI (01:49:48) and there's risk of um uh you know of of (01:49:51) kind of bad bad bad equilibria? Yes, I (01:49:54) think both things, but the initial (01:49:56) conditions matter, right? You know, at (01:49:59) at some point we're need we're going to (01:50:00) need to set up the rules of the road. (01:50:03) I'm not saying that one country, either (01:50:05) the United States or a coalition of (01:50:07) democracies, which I think is would be a (01:50:09) better setup, although it requires more (01:50:10) international cooperation than we (01:50:12) currently seem to want to make. Um, but (01:50:14) you know, I don't I don't think a (01:50:15) coalition of democracies or or certainly (01:50:17) one country should just say these are (01:50:20) the rules of the road. There's going to (01:50:21) be some negotiation, right? The world is (01:50:23) going to have to grapple with this. And (01:50:26) what I would like is that the the the (01:50:29) democratic nations of the world, those (01:50:32) with, you know, who are clo whose (01:50:35) governments have represent closer to (01:50:37) prohuman values are are holding a (01:50:39) stronger hand then have have more (01:50:41) leverage when the rules of the road are (01:50:43) set. And and so I'm I'm very concerned (01:50:45) about that initial condition. I um I was (01:50:48) relisting to an interview from three (01:50:49) years ago and one of the ways it aged (01:50:52) poorly is that I kept asking questions (01:50:54) assuming there's going to be some key (01:50:56) fulcrum moment two to three years from (01:50:58) now when in fact being that far out it (01:51:00) just seems like progress continues AI (01:51:02) improves AI is more diffused and people (01:51:05) will use it for more things. It seems (01:51:06) like you're imagining a world in the (01:51:08) future where the countries get together (01:51:10) and here's the rules of the road and (01:51:11) here's the leverage we have, here's the (01:51:12) leverage you have when it seems like on (01:51:14) current trajectory, everybody will have (01:51:16) more AI. Um, some of that AI will be (01:51:19) used by authoritarian countries. Some of (01:51:20) that within the authoritarian countries (01:51:21) will be by private actors versus state (01:51:24) actors. It's not clear who will benefit (01:51:26) more. It's always unpredictable to tell (01:51:28) tell in advance. You know, it seems like (01:51:29) the internet privileged authoritarian (01:51:31) countries more than you would have (01:51:32) expected. Um, and maybe the AI will be (01:51:34) the opposite way around. Um so I I I (01:51:38) want to better understand what you're (01:51:39) imagining here. (01:51:40) >> Yeah. Yeah. So so just to be precise (01:51:42) about it, I think the exponential of the (01:51:45) underlying technology will continue as (01:51:46) it has before, right? The models get (01:51:48) smarter and smarter even when they get (01:51:50) to country of geniuses in a data center. (01:51:52) You know, I I think you can continue to (01:51:55) make the model smarter. There's a (01:51:56) question of like getting diminishing (01:51:59) returns on their value in the world, (01:52:02) right? How much does it matter after (01:52:04) you've already solved human biology or (01:52:06) you know you know at some point you can (01:52:08) do harder math you can do more abstuse (01:52:10) math problems but nothing after that (01:52:12) matters but putting that aside I do (01:52:14) think the the exponential will continue (01:52:17) but there will be certain distinguished (01:52:19) points on the exponential and companies (01:52:23) individuals countries will reach those (01:52:25) points at different times um and and so (01:52:28) you know there's there's you know could (01:52:30) there be some you know you know I talk (01:52:31) about is a nuclear deterrent still in (01:52:33) adolescence of technology is a nuclear (01:52:35) deterrent still stable uh in the world (01:52:37) of of of AI I don't know but that's (01:52:39) that's an example of like one thing (01:52:41) we've taken for granted that like the (01:52:43) technology could reach such a level that (01:52:44) it's no longer like you know we can no (01:52:47) longer be certain of it at least um uh (01:52:49) you know think of think of others you (01:52:51) know there there you know there there (01:52:53) are kind of points where if you if you (01:52:55) reach a certain point you maybe you have (01:52:57) offensive cyber dominance and like every (01:53:00) every computer system is transparent to (01:53:02) you after that. Um, unless the other (01:53:04) side has a has a kind of equivalent (01:53:06) defense. So, I don't know what the (01:53:09) critical moment is or if there's a (01:53:10) single critical moment. But I think (01:53:12) there will be either a critical moment, (01:53:14) a small number of critical moments or (01:53:16) some critical window where it's like AI (01:53:20) is AI confers some large advantage from (01:53:26) the perspective of national security and (01:53:29) one country or coalition has reached it (01:53:32) before others that that you know that (01:53:34) that that you know I'm not advocating (01:53:36) that they're just like okay we're in (01:53:37) charge now. That's not that's not how (01:53:40) that's not how I think about it. you (01:53:41) know that there's always the the other (01:53:43) side is catching up. There's extreme (01:53:45) actions you're not willing to take and (01:53:46) and and it's not right to take, you (01:53:48) know, to take complete um to take (01:53:51) complete control anyway. But but at at (01:53:54) the point that that happens, I think (01:53:55) people are going to understand that the (01:53:56) world has changed. And there there's (01:53:58) going to be some negotiation, implicit (01:54:02) or implicit, about what what is the what (01:54:05) is the post AI world order look like? (01:54:08) And and I think my interest is in, (01:54:12) you know, making that ne negotiation (01:54:16) be one in which, you know, classical (01:54:20) liberal democracy has, you know, has a (01:54:23) strong hand. Well, I want to understand (01:54:25) what that better means because you say (01:54:26) in the essay, quote, autocracy is simply (01:54:29) not a form of government that people can (01:54:31) accept in the post powerful AI. And that (01:54:34) sounds like you're saying the CCP as an (01:54:36) institution cannot exist after we get (01:54:39) AGI. Um, and that seems like (01:54:43) a like a very strong demand and it seems (01:54:45) to imply a world where the leading lab (01:54:48) or the leading country will be able to (01:54:51) and by that language should get to (01:54:54) determine how the world is governed or (01:54:56) what kinds of governments are allowed (01:54:59) and not allowed. (01:55:00) >> Yeah. So when I when I um I I believe (01:55:04) that paragraph was I think I said (01:55:06) something like you could take it even (01:55:08) further and say X. So I wasn't I wasn't (01:55:11) necessarily endorsing that that that I (01:55:14) wasn't necessarily endorsing that view. (01:55:15) I you know I was saying like here's if (01:55:17) first you know here here's a weaker (01:55:19) thing that I believe you know I think I (01:55:20) you know I think I said you know we have (01:55:22) to worry a lot about authoritarians and (01:55:24) you know we should try and you know kind (01:55:26) of kind of check them and limit their (01:55:27) power. Like you could take this kind of (01:55:30) further much more interventionist view (01:55:31) that says like authoritarian countries (01:55:34) with AI are these you know the the you (01:55:37) know the these kind of self-fulfilling (01:55:38) cycles that that you can't that are very (01:55:40) hard to displace and so you just need to (01:55:42) get rid of them from from the beginning (01:55:44) that that has exactly all the problems (01:55:45) you say which is you know you know if (01:55:48) you were to make a commitment to (01:55:49) overthrowing every authoritarian country (01:55:51) I mean they then they would take a bunch (01:55:53) of actions now that like you know that (01:55:55) that that could could lead to (01:55:56) instability so that that may or you know (01:55:59) that that just that just may not be (01:56:01) possible. But the point I was making (01:56:04) that I do endorse is that it is it is (01:56:07) quite possible that you know today you (01:56:10) know the view or at least my view or the (01:56:13) view in most the western world is is (01:56:15) democracy is a better form of government (01:56:17) than authoritarianism. But it's not like (01:56:19) if a country is authoritarian, we don't (01:56:22) react the way we reacted if they (01:56:24) committed a genocide or something, (01:56:25) right? And and I'm I guess what I'm (01:56:28) saying is I'm a little worried that in (01:56:30) the age of AGI, authoritarianism will (01:56:33) have a different meaning. It will be a (01:56:34) graver thing. Um and and we have to (01:56:36) decide one way or another how to h how (01:56:39) to deal with that. And the (01:56:40) interventionist view is one possible (01:56:42) view. I was exploring such views. um you (01:56:45) know (01:56:47) it may end up being the right view. It (01:56:48) it may end up being too extreme to be (01:56:50) the right view. But I do have hope and (01:56:53) one piece of hope I have is (01:56:55) there there is we have seen that as new (01:56:59) technologies are invented (01:57:03) forms of government become obsolete. I I (01:57:05) mentioned this in adolescence of (01:57:08) technology where I said you know like (01:57:10) feudalism was basically you know like a (01:57:12) form of government right and and then (01:57:14) when when we invented industrialization (01:57:18) feudalism was no longer sustainable it (01:57:20) no longer made sense why is that hope (01:57:22) couldn't that imply that democracy is no (01:57:24) longer going to be a competitive system (01:57:26) >> it it could right it could go it could (01:57:29) go either way right but but I actually (01:57:32) so I these problems with (01:57:36) authoritarianism, right? That the (01:57:38) problems with authoritarianism get (01:57:39) deeper. I just I wonder if that's an (01:57:43) indicator of other problems that (01:57:45) authoritarianism will have, right? In (01:57:48) other words, people become because (01:57:51) authoritarianism becomes worse, people (01:57:54) are more afraid of authoritarianism. (01:57:56) They work harder to stop it. It's it's (01:57:59) more of a like you have to think in (01:58:00) terms of total equilibrium, right? Um, I (01:58:03) just wonder if it will motivate new ways (01:58:07) of thinking about with the with with the (01:58:09) new technology how to preserve and (01:58:11) protect freedom (01:58:13) >> and and even more optimistically, will (01:58:15) it lead to a collective reckoning and, (01:58:18) you know, a kind of a a more emphatic (01:58:22) realization of how important some of the (01:58:25) things we take as individual rights are, (01:58:27) right? a more emphatic realization that (01:58:30) we just we really can't give these away. (01:58:32) There's there we've seen there's no (01:58:34) other way to live that actually works. (01:58:36) Um I I I I am actually I am actually (01:58:41) hopeful that I I guess one way to say it (01:58:44) it sounds too idealistic but I actually (01:58:46) believe it could be the case is is that (01:58:48) is that dictatorships become morally (01:58:50) obsolete. They become morally unworkable (01:58:52) forms of government. Um and that and (01:58:55) that and that the the the the crisis (01:58:57) that that creates is is is sufficient to (01:59:00) force us to find another way. Um I I (01:59:04) think there is genuinely a tough (01:59:05) question here which I'm not sure how you (01:59:07) resolve for and we've had to come out (01:59:09) one way or another on it through history (01:59:11) right so with China in the 70s and ' 80s (01:59:13) we decided even though it's an (01:59:15) authoritarian system we will engage with (01:59:17) it and I think in retrospect that was (01:59:19) the right call because it has stayed (01:59:20) authoritarian system but a billion plus (01:59:22) people are much wealthier and better off (01:59:24) than they would have otherwise been um (01:59:26) and it's not clear that it would have (01:59:27) stopped being an authoritarian country (01:59:28) otherwise you can just look at North (01:59:30) Korea uh as an example of that, right? (01:59:32) And I don't know if that takes that much (01:59:34) that much intelligence to remain an (01:59:37) authoritarian country that continues to (01:59:39) coales its own power. And so you can (01:59:41) just imagine a North Korea with an AI (01:59:43) that's much worse than everybody else's (01:59:45) but still enough to keep power and and (01:59:47) and and then so in general it seems like (01:59:49) should we just have this attitude of the (01:59:51) benefits of AI will in the form of all (01:59:54) these empowerments of humanity and (01:59:56) health and so forth will be big and in (01:59:58) historically we have decided it's good (02:00:00) to spread the benefits of technology (02:00:02) widely even with even to people whose (02:00:04) governments are authoritarian and I (02:00:06) think I guess it is a tough question how (02:00:07) to think about it with AI but um (02:00:09) historically we have said Yes, this (02:00:11) there this is a positive some world and (02:00:12) it's still worth diffusing technology. (02:00:14) >> Yeah. So so there are a number of (02:00:16) choices we have. I you know I think (02:00:17) framing this as a kind of government to (02:00:20) government decision and and you know in (02:00:23) in national security terms that's like (02:00:26) one lens but there are a lot of other (02:00:27) lenses like you could imagine a world (02:00:29) where you know we produce all these (02:00:30) cures to diseases and like the you know (02:00:33) the the the cures to diseases are fine (02:00:35) to sell to authoritarian countries. The (02:00:37) data centers just aren't right. the (02:00:38) chips and the data centers just aren't (02:00:40) um and that the AI industry itself. Um (02:00:43) uh you know like like another (02:00:45) possibility is and and I think folks (02:00:47) should think about this like you know (02:00:49) could there be developments we can make (02:00:52) either that naturally happen as a result (02:00:54) of AI or that we could make happen by (02:00:58) building technology on AI. Could we (02:01:00) create an equilibrium where where it (02:01:02) becomes infeasible for authoritarian (02:01:05) countries to deny their people kind of (02:01:07) private use of the benefits of the (02:01:09) technology? Um uh you know are there are (02:01:12) there are there are there equilibria (02:01:13) where we can kind of give everyone in (02:01:16) authoritarian country their own AI model (02:01:18) that kind of you you know like defends (02:01:20) themselves from surveillance and there (02:01:22) isn't a way for the authoritarian (02:01:23) country to like crack crack down on this (02:01:26) while while retaining power. I don't (02:01:28) know that that sounds to me like if that (02:01:29) went far enough it would be it would be (02:01:31) a reason why authoritarian countries (02:01:33) would disintegrate from the inside. Um (02:01:35) but but maybe there's a middle world (02:01:36) where like there there's an equilibrium (02:01:38) where if they want to hold on to power (02:01:40) the authoritarians can't deny kind of (02:01:42) individualized access access to the (02:01:45) technology. But I actually do have a (02:01:46) hope for the for the um for the for the (02:01:49) more radical version which is you know (02:01:51) is it possible that the technology might (02:01:53) inherently have properties or that by (02:01:56) building on it in certain ways we could (02:01:57) create properties um that that that that (02:02:00) have this kind of dissolving effect on (02:02:02) authoritarian structures. Now, we we (02:02:05) hoped originally, if we think back to (02:02:07) the beginning of the Obama (02:02:08) administration, we thought originally (02:02:10) that that, you know, social media and (02:02:12) and the internet would have that (02:02:14) property, and it turns out not to. But, (02:02:16) but I I don't know what what if we could (02:02:18) uh what if we could try again with with (02:02:20) the knowledge of how many things could (02:02:21) go wrong and that this is a different (02:02:23) technology. I don't know that it would (02:02:24) work, but it's worth a try. (02:02:26) >> Yeah. I think it's just it's very (02:02:28) unpredictable. Like there's first (02:02:29) principles reasons why authoritarianism. (02:02:31) >> It's all very unpredictable. I I don't (02:02:33) think I mean we got to we we just got to (02:02:35) we kind of we got to recognize the (02:02:38) problem and then we got to come up with (02:02:39) 10 things we can try and we got to try (02:02:41) those and then assess whether they're (02:02:42) working or which ones are working if any (02:02:44) and and then try new ones if the old (02:02:46) ones aren't. I guess what that nets out (02:02:47) to today is you say we will not sell (02:02:51) data centers or sorry chips and then the (02:02:54) ability to make chips to China and so in (02:02:56) some sense you are denying there would (02:02:58) be some benefits to that's right the (02:03:00) Chinese economy Chinese people etc (02:03:02) because we're doing that and then (02:03:03) there'd also be benefits to the American (02:03:04) economy because it's a positive sum (02:03:06) world we could trade they could have (02:03:07) their country data centers doing one (02:03:09) thing we could have ours doing another (02:03:10) and already we you're saying it's not (02:03:13) worth that positive sum (02:03:16) stipend to empower this country's (02:03:19) >> what what I would say is that you know (02:03:21) we are we are about to be in a world (02:03:23) where growth and economic value will (02:03:26) come very easily if right if we're able (02:03:28) to build these powerful AI models growth (02:03:30) and economic value will come very easily (02:03:32) what will not come easily is (02:03:35) distribution of benefits distribution of (02:03:37) wealth political freedom um you know (02:03:41) these are the things that are going to (02:03:42) be hard to achieve and so when I think (02:03:44) about policy (02:03:46) I think I think that the technology and (02:03:49) the market will deliver all the (02:03:50) fundamental benefits, you know, almost (02:03:52) almost faster than we can take them. Um (02:03:55) uh and and that these questions about (02:03:57) about distribution and political freedom (02:04:00) and rights are are are the ones that (02:04:02) that will actually matter and that (02:04:03) policy should focus on. Okay. So (02:04:05) speaking of distribution, as you're (02:04:06) mentioning, we have developing countries (02:04:09) and (02:04:10) >> um (02:04:11) >> in many cases catchup growth has weak (02:04:13) been weaker than we would have hoped (02:04:14) for. When catchup growth does happen, (02:04:15) it's fundamentally because they have (02:04:17) underutilized labor and we can bring the (02:04:19) capital and knowhow from developed (02:04:20) countries to these countries and then (02:04:22) they can grow quite rapidly. (02:04:23) >> Obviously in a world where labor is no (02:04:27) longer the constraining factor, this (02:04:28) mechanism no longer works. (02:04:30) >> And so is the hope basically to rely on (02:04:33) philanthropy from the people who (02:04:34) immediately get wealthy from AI or from (02:04:36) the countries that get wealthy from AI? (02:04:37) What what is the hope for? Yeah, I I (02:04:39) mean philanthropy should obviously play (02:04:41) some role as it has the you know as has (02:04:44) in the past but I think growth is always (02:04:46) growth is always better and stronger if (02:04:48) we can make it endogenous. So you know (02:04:51) what are the relevant industries in like (02:04:52) in like in like in like an AI driven (02:04:54) world. Look there's lots of stuff you (02:04:56) know like there's you know I said I said (02:04:58) we shouldn't build data centers in China (02:05:00) but there's no reason we shouldn't build (02:05:01) data centers in Africa right? Um in fact (02:05:04) I think it'd be great to build data (02:05:05) centers in Africa. um you know as not (02:05:07) long as they're not owned by China. We (02:05:08) should we should build we should build (02:05:10) data centers in Africa. I think that's a (02:05:11) that's that's I think that's a great (02:05:13) thing to do. um you know we should also (02:05:15) build you know there's no reason we (02:05:17) can't build you know a pharmaceutical (02:05:20) industry that's like AIdriven like you (02:05:22) know the the if if AI is accelerating (02:05:24) accelerating drug discovery then you (02:05:27) know there will be a bunch of biotech (02:05:28) startups like let's make sure some of (02:05:30) those happen in the developing world and (02:05:31) certainly during the transition I mean (02:05:33) we can talk about the point where humans (02:05:35) have no role but but humans will have (02:05:37) still have some role in starting up (02:05:38) these companies and supervising (02:05:40) supervising the AI models so let's make (02:05:42) sure some of those humans are humans in (02:05:43) developing world so that fast growth can (02:05:45) happen there as well. (02:05:47) >> You guys recently announced quad is (02:05:48) going to have a constitution that's (02:05:49) aligned to a set of values and not (02:05:50) necessarily just the end user and (02:05:53) there's a world you can imagine where if (02:05:55) it is aligned to the end user it (02:05:56) preserves the balance of power we have (02:05:58) in the world today because everybody (02:05:59) gets to have their own AI that's (02:06:01) advocating for them and so the ratio of (02:06:03) bad actors to good actors stays (02:06:04) constant. It seems to work out for our (02:06:06) world today. Um why is it better not to (02:06:09) do that but to have a specific set of (02:06:12) values that the AI should carry forward? (02:06:14) >> Uh yeah so I'm not sure I'd quite draw (02:06:17) the distinction in that way. Um there (02:06:19) there are maybe two relevant (02:06:21) distinctions here which are I think (02:06:23) you're talking about a mix of the two (02:06:25) like one is should we give the model a (02:06:27) set of instructions about do this and (02:06:30) versus don't do this (02:06:31) >> and the other you know versus should we (02:06:34) give the model a set of principles for (02:06:36) you know for kind of how to act. Um and (02:06:39) and and there it's it's you know it's I (02:06:43) you know it's it's just p it's kind of (02:06:45) purely a practical and empirical thing (02:06:47) that we've observed that by teaching the (02:06:50) model principles getting it to learn (02:06:52) from principles its behavior is more (02:06:54) consistent. It's easier to cover edge (02:06:56) cases and the model is more likely to do (02:06:59) what people want it to do. In other (02:07:01) words, if you, you know, if you're like, (02:07:02) you know, don't tell people how to (02:07:04) hotwire a car, don't speak in Korea, and (02:07:06) don't, you know, you know, just, you (02:07:08) know, if you give it a list of rules, it (02:07:11) doesn't really understand the rules, and (02:07:12) it's kind of hard to generalize from (02:07:14) them. Um, you know, if if it's just kind (02:07:16) of a like, you know, list of do dos and (02:07:19) don'ts. Whereas, if you give it (02:07:20) principles and then, you know, it has (02:07:22) some hard guard rails like don't make (02:07:23) biological weapons. But overall you're (02:07:25) trying to understand (02:07:27) what it should be aiming to do h how it (02:07:30) should be aiming to operate. So just (02:07:32) from a practical perspective that turns (02:07:33) out to be just a more effective way to (02:07:35) train the model. That's one piece of it. (02:07:37) So that you know it's the kind of rules (02:07:39) versus principles trade-off. Then (02:07:41) there's another thing you're talking (02:07:42) about which is kind of like the (02:07:43) cageability versus um like you know I (02:07:47) would say kind of intr intrinsic (02:07:50) motivation trade-off which is like how (02:07:52) much should the model be a kind of I (02:07:54) don't know like a a skin suit or (02:07:56) something where you know you know you (02:07:58) know you just kind of you know it just (02:08:01) kind of directly follows the (02:08:02) instructions that are given to it by (02:08:04) whoever is giving it those instructions. (02:08:06) um versus how much should the model have (02:08:08) an inherent set of values and and go off (02:08:10) and do things on its own. Um and and and (02:08:14) and and there I I would actually say (02:08:18) everything about the model is actually (02:08:19) closer to the direction of of like you (02:08:22) know it should mostly do what people (02:08:23) want. It should mostly follow the we're (02:08:25) not trying to build something that like (02:08:27) you know goes off and runs the world on (02:08:29) its own. We're actually pretty far on (02:08:31) the corable side. Now, now what we do (02:08:32) say is there are certain things that the (02:08:35) model won't do, right? That it's like (02:08:37) you know that that that I think we say (02:08:39) it in various ways in the constitution (02:08:41) that under normal circumstances if (02:08:43) someone asks the model to do a task it (02:08:45) should do that task that that should be (02:08:47) the default. Um but if you've asked it (02:08:50) to do something dangerous or if you've (02:08:52) you know if you've um asked it to um you (02:08:56) know uh uh uh to kind of harm someone (02:08:58) else um then the model is unwilling to (02:09:01) do that. So I I actually think of it as (02:09:03) like a mostly a mostly corable model (02:09:07) that has some limits but those limits (02:09:09) are based on principles. (02:09:10) >> Yeah. I mean then the fundamental (02:09:11) question is how are those principles (02:09:13) determined? And this is not a special (02:09:15) question for anthropic. would be a (02:09:16) question for any company but um (02:09:19) >> uh because you have been the ones to (02:09:21) actually write down the principles I get (02:09:24) to ask you this question normally a (02:09:26) constitution is like you write it down (02:09:28) it's set in stone and there's a process (02:09:29) of updating it and changing it and so (02:09:32) forth in this case it seems like a (02:09:34) document that people anthropic write (02:09:36) that can be changed at any time that (02:09:38) guides the behavior of systems are going (02:09:41) to be the basis of a lot of economic (02:09:43) activity what is H how do you think (02:09:46) about ho how those principles should be (02:09:49) set? (02:09:50) >> Yes. Um so I think there's there's two (02:09:52) there's maybe three (02:09:55) three kind of sizes of loop here, right? (02:09:57) Three three ways to iterate. One is you (02:09:59) can iterate. We iterate within (02:10:00) enthropic. We train the model. We're not (02:10:02) happy with it and we kind of change the (02:10:03) constitution. (02:10:04) >> And I think that's good to do. Um and (02:10:06) you know putting out publicly you know (02:10:08) making updates to the constitution every (02:10:10) once in a while saying here's a new (02:10:11) constitution. I think that's good to do (02:10:13) because people can comment on it. The (02:10:14) second level of loop is different (02:10:16) companies will have different (02:10:17) constitutions. Um and you know I think (02:10:20) it's useful for like anthropic puts out (02:10:22) a constitution and you know you the (02:10:24) Gemini model puts out a constitution and (02:10:27) you know other companies put out a (02:10:28) constitution and then then they can kind (02:10:30) of look at them compare outside (02:10:32) observers can critique and say this this (02:10:35) I like this one this thing from this (02:10:37) constitution and this thing for that (02:10:38) constitution and and then kind of that (02:10:40) that creates some kind of you know soft (02:10:43) incentive and feedback for all the (02:10:45) companies to like take the best of each (02:10:47) elements and improve Then I think (02:10:48) there's a third loop which is you know (02:10:50) society beyond the AI companies and (02:10:53) beyond just those who kind of you know (02:10:55) who who comment on the constitutions (02:10:57) without hard power and and there you (02:11:00) know we've done some experiments like (02:11:01) you know a couple years ago we did an (02:11:03) experiment with I think it was called (02:11:04) the collective intelligence project to (02:11:06) like um you know to to basically pull (02:11:09) people and ask them what should be in (02:11:11) our AI constitution. um uh and and you (02:11:14) know I think at the time we incorporated (02:11:16) some of those changes and so you could (02:11:18) imagine with the new approach we've (02:11:20) taken to the constitution doing (02:11:21) something like that it's a little harder (02:11:24) because it's like that was actually an (02:11:25) easier approach to take when the (02:11:26) constitution was like a list of dos and (02:11:28) don'ts um at the level of principles it (02:11:30) has to have a certain amount of (02:11:31) coherence um but but you could you could (02:11:34) still imagine getting views from a wide (02:11:36) variety of people and I think you could (02:11:38) also imagine and this is like a crazy (02:11:40) idea but hey you know this whole (02:11:42) interview is about crazy ideas, right? (02:11:44) So, um uh you know, you could even (02:11:46) imagine systems of of kind of (02:11:48) representative government having having (02:11:50) input, right? Like, you know, I wouldn't (02:11:52) I wouldn't do this today because the (02:11:54) legislative process is so slow. Like (02:11:56) this is exactly why I think we should be (02:11:57) careful about the legislative process (02:11:59) and AI regulation. But there's no reason (02:12:01) you couldn't in principle say like you (02:12:03) know all AI you know all AI models have (02:12:06) to have a constitution that starts with (02:12:08) like these things and then like you can (02:12:10) append you can append other things after (02:12:12) it but like there has to be this special (02:12:14) section that like takes precedence. I (02:12:16) wouldn't do that. That's too rigid. That (02:12:18) that sounds um you know that that that (02:12:21) that sounds kind of overly prescriptive (02:12:23) in a way that I think overly aggressive (02:12:25) legislation is. But like that is a thing (02:12:27) you could you know like like that is (02:12:29) that is a thing you could try to do. Is (02:12:30) is there some much less heavy-handed (02:12:33) version of that? Maybe I really like (02:12:35) control loop too. um where obviously (02:12:38) this is not how constitutions of actual (02:12:40) governments do or should work where (02:12:42) there there's not this vague sense in (02:12:44) which the Supreme Court will feel out (02:12:47) how people are feeling and what are (02:12:48) their vibes and then update the of the (02:12:49) constitution accordingly. There's with (02:12:51) actual governments there's a more (02:12:53) procedural process. Exactly. But you (02:12:56) actually have a vision (02:12:58) >> of competition between constitutions (02:13:00) which is actually very reminiscent of (02:13:02) how um (02:13:03) >> some libertarian charter cities people (02:13:05) used to talk about what an archipelago (02:13:07) of different kinds of governments could (02:13:08) look like and then there would be (02:13:10) selection among them of who could (02:13:11) operate the most effectively in which (02:13:13) place people would be the happiest. And (02:13:15) in a sense you're actually yeah there's (02:13:17) this vision. I'm I'm kind of recreating (02:13:19) that. (02:13:20) >> Yeah. Yeah. This utopia of archipelago, (02:13:22) >> you know, again, you know, I think I (02:13:24) think that vision has has, you know, (02:13:26) things to recommend it and things that (02:13:27) things things that will kind of kind of (02:13:29) go wrong with it. You know, I think I (02:13:31) think it's a I think it's an interesting (02:13:33) in some ways compelling vision, but also (02:13:34) things will go wrong with it that you (02:13:36) hadn't that you hadn't imagined. So, you (02:13:39) know, I I I like loop 2 as well, but I I (02:13:41) I feel like the whole thing has got to (02:13:43) be some some mix of loops one, two, and (02:13:46) three. And it's it's a matter of the (02:13:48) proportions, right? I I think that's got (02:13:50) to be the the answer. (02:13:52) >> Um (02:13:53) >> when somebody eventually writes the (02:13:55) equivalent of the making of the atomic (02:13:57) bomb for this era, what is the thing (02:14:00) that will be hardest to glean from the (02:14:02) historical record that they're most (02:14:03) likely to miss? I think a few things. (02:14:06) One is at every moment of this (02:14:08) exponential, the extent to which the (02:14:11) world outside it didn't understand it. (02:14:13) This is this is a bias that's often (02:14:14) present in history where anything that (02:14:16) actually happened looks inevitable in (02:14:18) retrospect and and so you know I I think (02:14:21) when people when people look back it (02:14:24) will be hard for them to put themselves (02:14:26) in the place of people who were actually (02:14:30) making a bet on this thing to happen (02:14:33) that wasn't inevitable that we had these (02:14:36) arguments like the arguments you know (02:14:38) that I make for scaling or that (02:14:39) continual learning will be solved um uh (02:14:43) uh you know that that you know some of (02:14:46) us internally in our heads put a high (02:14:48) probability on this happening but but (02:14:50) it's like there's there's a world (02:14:52) outside us that's not that's not acting (02:14:54) on it's not kind of not acting on that (02:14:56) at all. Um uh and and and I think I (02:14:59) think the the weirdness of it um I I I I (02:15:03) think unfortunately like the insolerity (02:15:04) of it like you know (02:15:07) if if we're one year or two years away (02:15:09) from it happening like the average (02:15:11) person on the street has no idea and (02:15:12) that's one of the things I'm trying to (02:15:14) change like with the memos with talking (02:15:15) to policy makers but like I don't know I (02:15:18) think I I I think that's just a that's (02:15:20) just like a crazy that's just like a (02:15:23) crazy thing. Yeah. Um, finally I would (02:15:26) say and and this probably applies to (02:15:28) almost all historical moments of crisis. (02:15:31) Um, how absolutely fast it was (02:15:34) happening, how everything was happening (02:15:35) all at once. And so decisions that you (02:15:38) might think, you know, were kind of (02:15:40) carefully calculated, well, actually you (02:15:42) have to make that decision and then you (02:15:43) have to make 30 other decisions on the (02:15:45) on the same day because it's all (02:15:47) happening so fast and and you don't even (02:15:48) know which decisions are going to turn (02:15:50) out to be consequential. So, you know, (02:15:52) one of my one of my I guess worries, (02:15:55) although it's also an insight into into, (02:15:58) you know, into kind of what's happening (02:15:59) is that, you know, some very critical (02:16:02) decision will be will be some decision (02:16:04) that, you know, someone just comes into (02:16:05) my office and is like, Dario, you have (02:16:07) two minutes like, you know, should we (02:16:09) should we do, you know, should we do (02:16:11) thing thing A or thing B on this like, (02:16:14) you know, someone gives me this random, (02:16:16) you know, half page half page memo and (02:16:18) is like, should we should we do A or B? (02:16:20) And I'm like, I don't know. I have to (02:16:21) eat lunch. Let's do B. And and you know, (02:16:23) that ends up being the most (02:16:24) consequential thing ever. (02:16:26) >> So, final question. Uh, it seems like (02:16:29) you have (02:16:31) there's not tech CEOs who are usually (02:16:33) writing 50page memos every few months, (02:16:35) and it seems like you have managed to (02:16:37) build a role for yourself and a company (02:16:40) around you which is compatible with this (02:16:43) more intellectual type role of CEO. And (02:16:47) I want to understand how you construct (02:16:50) that and how like how does that work to (02:16:52) be you just go away for a couple weeks (02:16:54) and then you tell your company this is (02:16:55) the memo like here's what we're doing. (02:16:57) It's also reported you write a bunch of (02:16:58) these internally. (02:16:59) >> Yeah. So I mean for this particular one (02:17:01) you know I wrote it over winter break. (02:17:02) Um uh so it was the time you know and I (02:17:04) was having a a hard time finding the (02:17:06) time to actually find it to actually (02:17:08) write it. But I actually think about (02:17:10) this in a broader way. Um I actually (02:17:12) think it relates to the culture of the (02:17:13) company. So I probably spend a third (02:17:15) maybe 40% of my time making sure the (02:17:18) culture of Enthropic is good. As (02:17:19) Enthropic has gotten larger, it's it's (02:17:22) gotten harder to just, you know, get (02:17:24) involved in like, you know, directly (02:17:26) involved in like the training of the (02:17:27) models, the launch of the models, the (02:17:29) building of the products. Like it's 2500 (02:17:31) people. It's like, you know, there's (02:17:32) just, you know, I have certain (02:17:34) instincts, but like there's only, you (02:17:35) know, I it's very difficult to get to (02:17:38) get to get involved in every single (02:17:39) detail. You know, I like I I try as much (02:17:42) as possible, but one thing that's very (02:17:44) leveraged is making sure Anthropic is a (02:17:47) good place to work. People like working (02:17:49) there. Everyone thinks of themselves as (02:17:51) team members. Everyone works together (02:17:53) instead of against each other. And you (02:17:55) know, we've seen as some of the other AI (02:17:57) companies have grown without naming any (02:17:59) names, you know, we're starting to see (02:18:01) decoherence and people fighting each (02:18:03) other. And you know, I would argue there (02:18:04) was even a lot of that from the (02:18:05) beginning, but but you know, that it's (02:18:06) it's gotten worse. But I I think we've (02:18:09) done an extraordinarily good job, even (02:18:11) if not perfect, of holding the company (02:18:15) together, making everyone feel the (02:18:17) mission, that we're sincere about the (02:18:19) mission, and that, you know, everyone (02:18:21) has faith that everyone else there is (02:18:23) working for the right reason, that we're (02:18:24) a team, that people aren't trying to get (02:18:26) ahead at each other's expense or (02:18:28) backstab each other, which again, I (02:18:30) think happens a lot at some of the other (02:18:31) places. Um, and and how do you make that (02:18:34) the case? I mean it's a lot of things (02:18:36) you know it's me it's it's it's Daniela (02:18:38) who you know runs the company dayto-day (02:18:40) it's the co-founders it's the other (02:18:42) people we hire it's the environment we (02:18:44) try to create but I think an important (02:18:46) thing in the culture is I some and you (02:18:50) know the the you know the other leaders (02:18:52) as well but especially me have to (02:18:54) articulate what the company is about why (02:18:57) it's doing what it's doing what its (02:18:59) strategy is what its values are what its (02:19:02) mission is and what it stands for And um (02:19:05) you know when you get to 2500 people you (02:19:08) can't do that person by person. You have (02:19:10) to write or you have to speak to the (02:19:12) whole company. This is why I get up in (02:19:14) front of the whole company every two (02:19:16) weeks and speak for an hour. It's (02:19:17) actually I mean I wouldn't say I write (02:19:20) essays internally. I do two things. One (02:19:22) I write this thing called DVQ Dario (02:19:24) Vision Quest. Um uh uh I wasn't the one (02:19:27) who named it that. That's the name it it (02:19:28) it received and it's one of these names (02:19:30) that I kind of I tried to fight it (02:19:32) because it made it sound like I was like (02:19:33) going off and smoking peyote or (02:19:34) something. Um uh but but the name just (02:19:37) stuck. Um so I get up in front of the (02:19:39) company every two weeks. I have like a (02:19:41) three or four page document and I just (02:19:44) kind of talk through like three or four (02:19:46) different topics about what's going on (02:19:47) internally the you know the the models (02:19:50) we're producing the products the outside (02:19:52) industry the world as a whole as it (02:19:54) relates to AI and geopolitically in (02:19:57) general you know just some mix of that (02:19:59) and I just go through very very honestly (02:20:01) I just go through and I just I just say (02:20:04) you know this is this is what I'm (02:20:05) thinking this is what anthropic (02:20:06) leadership is thinking and then I answer (02:20:08) questions and and that direct connection (02:20:12) I think has a lot of value that is hard (02:20:14) to achieve when you're passing things (02:20:15) down the chain you know six six levels (02:20:18) deep um uh and you know large fraction (02:20:21) of the company comes comes to attend (02:20:23) either either in person or um either in (02:20:26) person or virtually and you know it (02:20:28) really means that you can communicate a (02:20:30) lot and then the other thing I do is I (02:20:32) just you know I have a channel in Slack (02:20:33) where I just write a bunch of things and (02:20:35) comment a lot um and often that's in (02:20:38) response to you know just things I'm (02:20:40) seeing at the company or questions (02:20:42) people ask or like you know we do (02:20:45) internal surveys and there are things (02:20:47) people are concerned about and so I'll (02:20:48) write them up and I'm like I'm you know (02:20:50) I'm I'm I'm just I'm very honest about (02:20:52) these things you know I just I just say (02:20:54) them very directly and the point is to (02:20:57) get a reputation of telling the company (02:21:00) the truth about what's happening to call (02:21:02) things what they are to acknowledge (02:21:04) problems to avoid the sort of corpo (02:21:06) speak the kind of defensive (02:21:08) communication that often is necessary in (02:21:11) public because you know the world is (02:21:13) very large and full of people who are (02:21:15) you know interpreting things in bad (02:21:18) faith. Um but you know if you have a (02:21:20) company of people who you trust and we (02:21:22) try to hire people that we trust then (02:21:25) then you know you can you can you can (02:21:27) you know you can you can really just be (02:21:28) entirely unfiltered. Um and uh you know (02:21:31) I think I think that's an enormous (02:21:33) strength of the company. It makes it a (02:21:34) better place to work. It makes people (02:21:36) more you know more the sum of their (02:21:38) parts and increases likelihood that we (02:21:40) accomplish the mission because everyone (02:21:41) is on the same page about the mission (02:21:42) and everyone is debating and discussing (02:21:44) how best to accomplish the mission. (02:21:46) >> Well in lie of an external Dario vision (02:21:48) quest we have this interview. (02:21:50) >> This this interview is a little like (02:21:51) that. (02:21:53) >> Uh this is fun Dario. Thanks for doing (02:21:54) it. (02:21:54) >> Yeah thank you Dash. Hey everybody I (02:21:57) hope you enjoyed that episode. If you (02:21:59) did the most helpful thing you can do is (02:22:01) just share it with other people who you (02:22:02) think might enjoy it. It's also helpful (02:22:04) if you leave a rating or a comment on (02:22:07) whatever platform you're listening on. (02:22:09) If you're interested in sponsoring the (02:22:11) podcast, you can reach out at (02:22:13) dwarcash.com/advertise. (02:22:17) Otherwise, I'll see you on the next one.

Leave a Reply

Your email address will not be published. Required fields are marked *