Home Videos

Ilya Sutskever – We’re moving from the age of scaling to the age of research (YouTube Video Transcript)

Need transcripts for other videos? Try our YouTube Transcript Generator →
Title: Ilya Sutskever – We’re moving from the age of scaling to the age of research
Duration: 01:36:03
Total Correct Answers:
Current Caption
Correct

Learning Modes

YouTube Video Transcript Hide

Ask AI Result

The ask AI result will appear here..
(00:00:00) Your YouTube transcript will appear here (00:00:00) You know what's crazy that all of this (00:00:03) is real? (00:00:04) >> Yeah. Meaning what? (00:00:05) >> Don't you think so? (00:00:06) >> Meaning what? (00:00:07) >> Like all this AI stuff and all this (00:00:09) area. Yeah. That it's happen like (00:00:11) >> isn't it straight out of science (00:00:13) fiction? (00:00:14) >> Yeah. Another thing that's crazy is like (00:00:16) how normal the slow takeoff feels. The (00:00:19) idea that we'd be investing 1% of GDP in (00:00:23) AI, like I feel like it would have felt (00:00:25) like a bigger deal, you know, where (00:00:26) right now it just feels like (00:00:27) >> you get used to things pretty fast. (00:00:29) Turns out Yeah. But also it's kind of (00:00:31) like it's abstract like what does it (00:00:33) mean? What it means that you see it in (00:00:35) the news. (00:00:36) >> Yeah. (00:00:36) >> That such and such company announced (00:00:38) such and such dollar amount, (00:00:39) >> right? (00:00:40) >> That's that's all you see, (00:00:42) >> right? (00:00:43) >> It's not really felt in any other way so (00:00:45) far. (00:00:45) >> Yeah. Should we actually begin here? I (00:00:47) think this is an interesting discussion. (00:00:48) >> Sure. (00:00:49) >> I think your point about well from the (00:00:51) average person's point of view, nothing (00:00:54) is that different will continue being (00:00:55) true even into the singularity. (00:00:57) >> No, I don't think so. Okay. Interesting. (00:01:00) >> So the thing which I was referring to (00:01:03) not feeling different (00:01:06) is okay. So such and such company (00:01:08) announced some difficult to comprehend (00:01:11) dollar amount of investment (00:01:12) >> right (00:01:13) >> I don't think anyone knows what to do (00:01:14) with that. (00:01:15) >> Yeah. (00:01:16) >> But I think that the impact of AI is (00:01:20) going to be felt. (00:01:22) >> AI is going to be diffused through the (00:01:24) economy. There are very strong economic (00:01:26) forces for this and I think the impact (00:01:28) is going to be felt very strongly. (00:01:31) >> When do you expect that impact? I think (00:01:33) the models seem smarter than their (00:01:36) economic impact would imply. (00:01:38) >> Yeah, this is (00:01:41) one of the very confusing things about (00:01:43) the models right now. How to reconcile (00:01:47) the fact that they are doing so well on (00:01:52) evals. (00:01:52) >> Mhm. And you look at the evals and you (00:01:54) go, those are pretty hard evals, (00:01:56) >> right? (00:01:57) >> They're doing so well, (00:02:00) >> but the economic impact seems to be (00:02:03) dramatically behind. And it's almost (00:02:06) like (00:02:08) it's it's very difficult to make sense (00:02:10) of how can the model on the one hand do (00:02:12) these amazing things and then on the (00:02:14) other hand like repeat itself twice in (00:02:18) some situation in a kind of a an example (00:02:21) would be let's say you use VIP coding to (00:02:23) do something and you go to some place (00:02:25) and then you get a bug and then you tell (00:02:28) the model can you please fix the bug? (00:02:30) >> Yeah. And the model says, "Oh my god, (00:02:32) you're so right. I have a bug. Let me go (00:02:34) fix that." And it introduces a second (00:02:35) bug. (00:02:36) >> Yeah. (00:02:37) >> And then you tell it you have you have (00:02:38) this new second bug and it tells you, (00:02:40) "Oh my god, how could I've done it? (00:02:41) You're so right again." (00:02:43) >> And brings back the first bug. And you (00:02:45) can alternate between those. (00:02:46) >> Yeah. (00:02:46) >> And it's like, how is that possible? (00:02:48) >> Yeah. (00:02:49) >> It's like I'm not sure. But it does (00:02:52) suggest that the (00:02:55) something strange is going on. I have (00:02:57) two possible explanations. So here this (00:02:59) is the more kind of a whimsical (00:03:02) explanation is that maybe a real (00:03:04) training makes the models a little bit (00:03:06) too single-minded and narrowly focused a (00:03:09) little bit too (00:03:11) I don't know unaware (00:03:14) even though it also makes them aware in (00:03:15) some other ways (00:03:17) and because of this they can't do basic (00:03:20) things but there is another explanation (00:03:22) which is (00:03:24) back when people were doing pre-training (00:03:28) the question of what data to train on (00:03:31) was answered because the that answer was (00:03:34) everything. (00:03:35) >> Yeah. (00:03:36) >> When you do pre-training, you need all (00:03:39) the data. (00:03:41) So you don't have to think is it going (00:03:43) to be this data or that data. (00:03:44) >> Yeah. (00:03:45) >> But when people do RL training, they do (00:03:48) need to think. They say okay, we want to (00:03:50) have this kind of RL training for this (00:03:52) thing and that kind of RL training for (00:03:54) that thing. And from what I hear, all (00:03:57) the companies have teams that just (00:03:59) produce new RL environments and just add (00:04:01) it to the training mix. And then the (00:04:03) question is, well, what are those? There (00:04:04) are so many degrees of freedom. There is (00:04:06) such a huge variety of environments you (00:04:08) could produce. And one of the (00:04:13) one thing you could do, and I think (00:04:14) that's something that is done (00:04:16) inadvertently, (00:04:17) is that people take inspiration from the (00:04:21) evals. you say, "Hey, I would love our (00:04:24) model to do really well when we release (00:04:26) it. I want the EVOS to look great." (00:04:29) What would be RL training that could (00:04:32) help on this task, right? I think that (00:04:35) is something that happens and I think it (00:04:37) could explain a lot of what's going on. (00:04:39) If you combine this with generalization (00:04:42) of the models actually being inadequate, (00:04:45) that has the potential to explain a lot (00:04:47) of what we are seeing. this disconnect (00:04:49) between eval performance and actual real (00:04:53) real world performance which is (00:04:54) something that we don't today exactly (00:04:57) even understand what what we mean by (00:05:00) that I I like this idea that the real (00:05:03) reward hacking is a human researchers (00:05:05) who are too focused on the evals um I (00:05:09) think there's two ways to understand or (00:05:12) to try to think about what what you have (00:05:14) just pointed out one is look if it's the (00:05:18) case that simply by becoming superhuman (00:05:20) at a coding competition, a model will (00:05:23) not automatically become more tasteful (00:05:26) and exercise better judgment about how (00:05:28) to improve your codebase. Well, then you (00:05:30) should expand the suite of environments (00:05:33) such that you're not just testing it on (00:05:35) having the best performance in coding (00:05:36) competition. It should also be able to (00:05:38) make the best kind of application for X (00:05:40) thing or Y thing or Z thing. And another (00:05:43) maybe this is what you're hinting at is (00:05:45) to say why should it be the case in the (00:05:47) first place that becoming super human at (00:05:50) coding competitions doesn't make you a (00:05:52) more tasteful programmer more generally. (00:05:54) Maybe the thing to do is not to keep (00:05:57) stacking up the amount of environments (00:05:59) and the diversity of environments to (00:06:00) figure out approach with let you learn (00:06:02) from one environment and improve your (00:06:06) performance on something else. So I have (00:06:09) I have an analog a human analogy which (00:06:11) might be helpful. So even the case let's (00:06:14) take the case of competitive programming (00:06:16) since you mentioned that and suppose you (00:06:18) have two students (00:06:20) one of them work decided they want to be (00:06:23) the best competitive programmer. So they (00:06:24) will practice 10,000 hours for that (00:06:28) domain. They will solve all the (00:06:30) problems, memorize all the proof (00:06:31) techniques and be very very you know (00:06:36) be very skilled at quickly and correctly (00:06:38) implementing all the algorithms and by (00:06:40) doing by doing so they became the best (00:06:43) one of the best student number two (00:06:46) thought oh competitive programming is (00:06:47) cool maybe they practiced for 100 hours (00:06:50) >> much much less and they also did really (00:06:52) well which one do you think is going to (00:06:54) do better in their career later on (00:06:56) >> the second (00:06:57) >> right and I think that's basically (00:06:59) what's going on. The models are much (00:07:00) more like the first student but even (00:07:01) more because then we say okay so the (00:07:04) model should be good at competitive (00:07:05) programming so let's get every single (00:07:07) competitive programming problem ever and (00:07:10) then let's do some data augmentation so (00:07:12) we have even more competitive (00:07:13) programming problems (00:07:14) >> yes (00:07:15) >> and we train on that and so now you got (00:07:17) this great competitive programmer and (00:07:18) with this analogy I think it's more (00:07:20) intuitive I think it's more intuitive (00:07:22) with this analogy that yeah okay so if (00:07:25) it's so well trained okay it's like all (00:07:27) the different algorithms and all the (00:07:29) proof techniques are like right at it at (00:07:31) its fingertips (00:07:32) and it's more intuitive that with this (00:07:34) level of preparation it not would not (00:07:36) necessarily generalize to other things. (00:07:40) >> But then what is the um analogy for what (00:07:42) the second student is doing before they (00:07:45) do the 100 hours of fine-tuning. (00:07:48) >> I think it's like (00:07:51) they have it. I think it's the it (00:07:53) factor. (00:07:54) >> Yeah. (00:07:54) >> Right. And like I know like when I was (00:07:56) in undergrad, I remember there was there (00:07:58) was a student like this that studied (00:07:59) with me. So I I know it exists. (00:08:01) >> Yeah. I think it's interesting to (00:08:03) distinguish it from whatever (00:08:05) pre-training does. So one way to (00:08:07) understand what you just said about we (00:08:09) don't have to choose the data in (00:08:10) pre-training is to say actually it's not (00:08:13) dissimilar to the 10,000 hours of (00:08:15) practice. It's just that you get that (00:08:16) 10,000 hours of practice for free (00:08:19) because it's already somewhere in the (00:08:21) pre-training distribution. But it's like (00:08:23) maybe you're suggesting actually there's (00:08:25) actually not that much generalization (00:08:26) for pre-training. There's just so much (00:08:27) data in pre-training but it's like it's (00:08:29) not necessarily generalizing better than (00:08:30) RL. (00:08:31) >> Like the main the main strength of (00:08:33) pre-training is that there is a so much (00:08:35) of it. (00:08:35) >> Yeah. (00:08:36) >> And b you don't have to think hard about (00:08:40) what data to put into pre-training. (00:08:43) >> And it's a very kind of natural data and (00:08:45) it does include in it a lot of what (00:08:48) people do. (00:08:49) >> Yeah. (00:08:50) people's thoughts and a lot of the (00:08:54) features of you know it's like the whole (00:08:56) world as projected by people onto text. (00:08:59) >> Yeah. (00:09:00) >> And pre-training tries to capture that (00:09:02) using a huge amount of data. (00:09:04) It's it's very the pre-training is very (00:09:08) difficult to reason about because it's (00:09:10) so hard to understand the manner in (00:09:14) which the model relies on pre-training (00:09:16) data. And whenever the model makes a (00:09:19) mistake, could it be because something (00:09:22) by chance is not as supported by the (00:09:24) pre-training data? You know, and pre (00:09:26) support by pre-training is maybe a loose (00:09:28) term. (00:09:31) I I don't know if I can add anything (00:09:33) more useful on this, but (00:09:36) I don't think there is a human analog to (00:09:38) pre-training. Um, here's analogies that (00:09:40) people have proposed for what the human (00:09:42) analogy to pre-training is, and I'm (00:09:44) curious to get your thoughts on why (00:09:46) they're potentially wrong. One is to (00:09:49) think about the first 18 or 15 or 13 (00:09:52) years of a person's life when they (00:09:54) aren't necessarily economically (00:09:56) productive, but they are doing something (00:09:58) that is making them understand the world (00:10:02) better and so forth. And the other is to (00:10:05) think about evolution as doing some kind (00:10:08) of search for three billion years which (00:10:09) then results in a human lifetime (00:10:13) instance. And then I'm curious if you (00:10:16) think either of these are actually (00:10:16) analogous to pre-training or how how (00:10:18) would you think about at least what (00:10:19) lifetime human learning is like if not (00:10:22) pre-training. I think there are some (00:10:24) similarities between both of these two (00:10:27) pre-training and pre-training tries to (00:10:29) play the role of both of these (00:10:31) >> but I think there are some big (00:10:32) differences as well. (00:10:35) The amount of pre-training data is (00:10:38) very very staggering. (00:10:40) >> Yes. (00:10:41) >> And somehow a a human being after even (00:10:45) 15 years with a tiny fraction of that (00:10:47) pre-training data they know much less. (00:10:49) >> Yeah. But whatever they do know they (00:10:51) know much more deeply somehow and the (00:10:54) mistakes like like already at that age (00:10:57) you would not make mistakes that are (00:10:59) make. (00:10:59) >> Yeah. There is another thing you might (00:11:01) say could it be something like evolution (00:11:03) and the answer is maybe but in this case (00:11:05) I think evolution might actually have an (00:11:07) edge like there is this I remember (00:11:11) reading about this case where some you (00:11:14) know that one thing that neuroscientists (00:11:16) do or rather one way in which (00:11:19) neuroscientists can learn about the (00:11:20) brain is by studying people with brain (00:11:23) damage to different parts of the brain (00:11:26) >> and and so and some people have the most (00:11:28) strange symptoms you could imagine. It's (00:11:30) actually really really interesting. And (00:11:32) there was one case that comes to mind (00:11:34) that's relevant. (00:11:36) I read about this person who had some (00:11:39) kind of brain damage that took out I (00:11:42) think a stroke or an accident that took (00:11:45) out his emotional (00:11:48) processing. So he stopped feeling any (00:11:50) emotion (00:11:52) and as a result of that you know he (00:11:55) still remained very articulate and he (00:11:57) could solve little puzzles and on tests (00:11:59) he seemed to be just fine but he felt no (00:12:02) emotion he didn't feel sad he didn't (00:12:04) feel angry he didn't feel animated and (00:12:07) he became somehow extremely bad at (00:12:09) making any decisions at all it would (00:12:12) take him hours to decide on which socks (00:12:14) to wear and he would make very bad (00:12:16) financial decisions (00:12:19) And that's very (00:12:22) but does what what does it say about (00:12:26) the role of our built-in emotions (00:12:30) in making us like a viable agent (00:12:33) essentially (00:12:34) >> and I guess to connect to your question (00:12:35) about pre-training (00:12:37) >> it's like maybe pre- like maybe if you (00:12:40) are good enough at like getting (00:12:41) everything out of pre-training you can (00:12:43) get you could get that as well but (00:12:45) that's the kind of thing which seems is (00:12:51) well it may or may not be possible to (00:12:53) get that from pre-training. What is (00:12:58) that clearly not just directly emotion? (00:13:01) And it seems like some (00:13:04) almost value function like thing which (00:13:06) is giving telling you which decision to (00:13:08) be like what the end reward for any (00:13:11) decision should be and you think that (00:13:13) doesn't sort of implicitly come from (00:13:15) >> I think it could I'm just saying it's (00:13:17) not one it's not 100% obvious. (00:13:19) >> Yeah. But what is that like what how do (00:13:23) you think about emotions and what is the (00:13:24) ML analogy for emotions? (00:13:26) >> It should be some kind of a value (00:13:28) function thing. (00:13:29) >> Yeah. But I don't think there is a great (00:13:31) ML analogy because right now value (00:13:32) functions don't play a very prominent (00:13:34) role in uh the things people do. (00:13:36) >> It might be worth defining for the (00:13:38) audience what a value function is if if (00:13:39) you want to do that. (00:13:40) >> I mean certainly I I'll be very happy to (00:13:42) do that. Right. So (00:13:49) so when people do reinforcement learning (00:13:51) the way reinforcement learning is done (00:13:53) right now how do they do how do people (00:13:55) train those agents? So you have your (00:13:57) neural net and you give it a problem and (00:14:00) then you tell the model go solve it and (00:14:01) the model takes maybe thousands hundreds (00:14:04) of thousands of actions (00:14:06) or thoughts or something and then it (00:14:08) produces a solution. The solution is (00:14:09) created and then the score (00:14:13) is used to provide a training signal for (00:14:15) every single action (00:14:18) in your trajectory. (00:14:19) >> Mhm. So that means that if you are doing (00:14:23) something that goes for a long time, if (00:14:25) you're training a task that takes a long (00:14:28) time to solve, you will do no learning (00:14:30) at all until you solve the until you (00:14:32) came up with a proposed solution. That's (00:14:34) how reinforcement learning is done (00:14:35) naively. That's how O1 R1 ostensibly are (00:14:39) done. (00:14:41) The value function says something like (00:14:43) okay look maybe I could sometimes not (00:14:46) always could tell you if you're doing (00:14:49) well or badly. The notion of a value (00:14:52) function is more useful in some domains (00:14:53) than others. So for example when you (00:14:55) play chess (00:14:57) and you lose a piece you know I messed (00:14:59) up. You don't need to play the whole (00:15:01) game to know that what I just did was (00:15:03) bad and therefore whatever um whatever (00:15:07) preceded it was also bad. So the value (00:15:09) function lets you short circuit the (00:15:13) weight until the very end. Like let's (00:15:15) suppose that you started to pursue some (00:15:18) kind of um okay let's suppose that you (00:15:20) are doing some kind of a math thing or a (00:15:22) programming thing and you're trying to (00:15:24) explore a particular solution direction (00:15:26) and after let's say after a thousand uh (00:15:30) steps of thinking you concluded that (00:15:31) this direction is unpromising. (00:15:34) As soon as you conclude this, you could (00:15:37) already get a reward signal a thousand (00:15:40) time steps previously when you decided (00:15:42) to pursue down this path. You say, "Oh, (00:15:44) next time I shouldn't pursue this path (00:15:47) in a similar situation long before you (00:15:50) actually came up with a proposed (00:15:51) solution." M this was in the deepcar one (00:15:53) paper is that the (00:15:56) space of trajectories is so wide that (00:16:01) maybe it's hard to learn a mapping from (00:16:04) an intermediate trajectory and value and (00:16:07) also given that you know in coding for (00:16:08) example you'll have the wrong idea then (00:16:11) you'll go back then you'll change (00:16:12) something (00:16:13) >> this sounds like such lack of faith in (00:16:15) deep learning (00:16:16) >> like I mean sure it might be difficult (00:16:18) but (00:16:19) >> nothing deep learning can't Yeah. (00:16:22) >> So my expectation is that (00:16:27) like value functions should be useful (00:16:29) and (00:16:31) and I fully I fully expect that they (00:16:32) will be used in the future if not (00:16:34) already. What was I alluding to with the (00:16:36) person whose emotional center got (00:16:40) >> um damaged is more that (00:16:45) maybe what it suggests is that the value (00:16:48) function of humans is modulated by (00:16:50) emotions in some important way that's (00:16:53) hardcoded by evolution (00:16:55) and maybe that is important for people (00:16:57) to be effective in the world. (00:17:00) >> That that's the thing I was actually (00:17:02) planning on asking you. There's (00:17:03) something really interesting about (00:17:04) emotions as a value function, which is (00:17:05) that it's impressive that they have this (00:17:08) much utility while still being rather (00:17:13) um simple to understand. (00:17:16) So I have two responses. I do agree that (00:17:20) compared to (00:17:24) the kind of things that we learn and the (00:17:25) things we are talking about, the kind of (00:17:27) ads we talking about, emotions are (00:17:28) relatively simple. (00:17:31) They might even be so simple that maybe (00:17:33) you could map them out in a human (00:17:35) understandable way. I think it would be (00:17:36) cool to do. (00:17:39) In terms of utility though, I think (00:17:41) there is a thing where you know there is (00:17:44) this complexity robustness trade-off (00:17:48) where complex things can be very useful (00:17:53) but simple things are very useful in a (00:17:57) very broad range of situations. And so I (00:18:00) think what what one way to interpret (00:18:01) what we are seeing is that we've got (00:18:03) these emotions that essentially evolved (00:18:06) mostly mostly from our mammal ancestors (00:18:09) and then fine-tuned a little bit while (00:18:11) we were homminids just a bit. We do have (00:18:14) like a decent amount of social emotions (00:18:15) though which mammals may lack (00:18:19) but they're not very sophisticated (00:18:22) and because they're not sophisticated (00:18:23) they serve us so well in this very (00:18:25) different world compared to the one that (00:18:27) we've been living in. Actually they they (00:18:29) also make mistakes. For example, our (00:18:31) emotions well I don't know does hunger (00:18:33) count as an emotion (00:18:36) debate. It's debatable but I think for (00:18:37) example our intuitive feeling of hunger (00:18:42) is not succeeding in guiding us (00:18:46) correctly in this world with an (00:18:48) abundance of food. (00:18:49) >> Yeah. People have been talking about (00:18:51) scaling data, scaling parameters, (00:18:54) scaling compute. (00:18:56) Is there a more general way to think (00:18:58) about scaling? What are the other (00:18:59) scaling axes? (00:19:01) So (00:19:03) the thing so so here here is a (00:19:05) perspective here's a perspective I think (00:19:07) might be might be true. (00:19:10) So (00:19:12) the way ML used to work is that people (00:19:15) would just think of it with stuff and (00:19:17) try to (00:19:20) and try to get interesting results. (00:19:22) That's what's been going on in the past. (00:19:26) Then (00:19:28) the scaling insight arrived, right? (00:19:31) Scaling laws, GPT3. (00:19:34) And suddenly everyone realized we should (00:19:37) scale. (00:19:39) And it's just this, this is an example (00:19:42) of how language affects thought. (00:19:46) Scaling is what just one word, but it's (00:19:48) such a powerful word because it informs (00:19:50) people what to do. They say, "Okay, (00:19:52) let's let's try to scale things." And so (00:19:54) you say okay so what are we scaling and (00:19:57) pre-training was a thing to scale it was (00:19:59) a particular scaling recipe. (00:20:02) >> Yes (00:20:02) >> the big breakthrough of pre-training is (00:20:05) the realization that this recipe is (00:20:07) good. So you say hey if you mix some (00:20:11) compute with some data into a neural net (00:20:14) of a certain size you will get results (00:20:17) and you will know that it will be better (00:20:19) if you just scale the recipe up. And (00:20:22) this is also great. Companies love this (00:20:24) because it gives you a very uh lowrisk (00:20:28) way of investing. (00:20:30) >> Yeah. (00:20:31) >> Your resources. (00:20:32) >> Yeah. (00:20:32) >> Right. It's much harder to invest your (00:20:34) resources in research. Compare that. You (00:20:37) know, if you research, you need to have (00:20:38) like go forth researchers and research (00:20:40) and come up with something versus get (00:20:44) more data, get more compute. You know, (00:20:46) you'll get something from pre-training. (00:20:49) And indeed, you know, it looks like I (00:20:51) based on various um (00:20:54) um things people say on some people say (00:20:56) on Twitter, maybe it appears that Gemini (00:20:59) have found a way to get more out of (00:21:01) pre-training. At some point though, (00:21:03) pre-training will run out of data. The (00:21:04) data is very clearly finite. And so (00:21:06) then, okay, what do you do next? Either (00:21:08) you do some kind of a souped-up (00:21:10) pre-training, different recipe from the (00:21:12) one you've done before, or you're doing (00:21:14) a RL or maybe something else. But now (00:21:16) that compute is big, computer is now (00:21:19) very big. In some sense, we are back to (00:21:21) the age of research. So maybe here's (00:21:23) another way to put it. Up until 2020, (00:21:25) from 201 from 2012 to 2020, it was the (00:21:29) age of research. Now from 2020 to 2025, (00:21:33) it was the age of scaling or maybe plus (00:21:36) minus. Let's add arrow bars to those (00:21:37) years because people say this is (00:21:39) amazing. You got to scale more. Keep (00:21:40) scaling. The one word scaling. But now (00:21:44) the scale is so big. Like is is it is (00:21:46) the belief really that oh it's so big (00:21:50) but if you had 100x more everything (00:21:52) would be so different. Like it would be (00:21:54) different for sure but like is the (00:21:56) belief that if you just 100x the scale (00:22:00) everything would be transformed. (00:22:02) I don't think that's true. So it's back (00:22:04) to the age of research again just with (00:22:06) big computers. (00:22:07) >> That's a very interesting way to put it. (00:22:10) But let me ask you the question you just (00:22:12) posed then. What are we scaling and what (00:22:14) what is what would it mean to have a (00:22:16) recipe? Because I guess I'm not aware of (00:22:20) a very clean (00:22:22) relationship that almost looks like a (00:22:24) law of physics which existed in (00:22:25) pre-training that was a power law (00:22:26) between data or computer parameters and (00:22:30) loss. What is the kind of relationship (00:22:34) we should be seeking and how how should (00:22:36) we think about what this new recipe (00:22:37) might look like? So, we've we've already (00:22:42) witnessed a transition from (00:22:45) one type of scaling to a different type (00:22:47) of scaling, from pre-training to RL. (00:22:51) Now, people are scaling RL. Now, based (00:22:54) on what people say on Twitter, they (00:22:57) spend more compute on RL than on (00:22:59) pre-training at this point because RL (00:23:01) can actually consume quite a bit of (00:23:03) compute. You know, you do very very long (00:23:06) rollouts. (00:23:06) >> Yes. So it takes a lot of compute to (00:23:08) produce those rollouts and then you get (00:23:10) relatively small amount of learning per (00:23:12) roll out. So you really can spend you (00:23:14) really can spend a lot of compute and I (00:23:17) could imagine like I wouldn't at this at (00:23:20) this st it's it's more like I wouldn't (00:23:22) even call it a scale um scaling I would (00:23:25) say hey like what are you doing and is (00:23:28) the thing you are doing the the the the (00:23:30) most productive thing you could be (00:23:31) doing? Yeah. (00:23:32) >> Can you find a most more productive way (00:23:34) of using your compute? We've discussed (00:23:37) the value function business earlier and (00:23:40) maybe once people get good at value (00:23:42) functions they will be using their their (00:23:45) um resources more productively (00:23:47) and if you find a whole other way of (00:23:51) training models you could say is this (00:23:54) scaling or is it just using your (00:23:55) resources I think it becomes a little (00:23:57) bit ambiguous in a sense that when (00:23:59) people were in the age of research back (00:24:01) then it was like people say hey let's (00:24:03) try this and this and this let's try (00:24:04) that and that and that oh look something (00:24:06) interesting is happening and I think (00:24:08) there will be a return to that. (00:24:10) >> So if we're back in the era of research, (00:24:12) stepping back, what is the part of the (00:24:14) recipe that we need to think most about? (00:24:17) When you say value function, people are (00:24:19) already trying the current recipe, but (00:24:21) then having LLM as a judge and so forth, (00:24:24) you could say that's a value function, (00:24:25) but it sounds like you have something (00:24:26) much more fundamental in mind. Do we (00:24:27) need do we need to go back to (00:24:30) should we even rethink pre-training at (00:24:33) all and not just add more steps to the (00:24:35) end of that process? (00:24:36) >> Yeah. So the the the the discussion (00:24:39) about value function I think it was (00:24:41) interesting. I want to like emphasize (00:24:43) that I think the value function is (00:24:45) something like (00:24:47) it's going to make RL more efficient (00:24:50) and I think that makes a difference but (00:24:53) I think that anything you can do with a (00:24:55) value function you can do without just (00:24:57) more slowly. (00:24:58) >> Mhm. (00:25:00) >> The thing which I think is the most (00:25:02) fundamental is that these models somehow (00:25:04) just generalize dramatically worse than (00:25:06) people. (00:25:07) >> Yes. (00:25:08) >> And it's super obvious. (00:25:10) That's that seems like a very (00:25:12) fundamental thing. (00:25:13) >> Okay, so this is the crux of (00:25:15) generalization and there's two (00:25:18) sub questions. (00:25:21) There's one which is about sample (00:25:22) efficiency which is why should it take (00:25:24) so much more data for these models to (00:25:25) learn than humans. There's a second (00:25:27) about even separate from the amount of (00:25:30) data it takes, there's a question of why (00:25:32) is it so hard to teach the thing we want (00:25:34) to a model than to a human which is to (00:25:36) say for to a human that we don't (00:25:39) necessarily need a verifiable reward to (00:25:41) be able to (00:25:43) you're probably mentoring a bunch of (00:25:45) researchers right now and you're you (00:25:47) know talking with them you're showing (00:25:48) them your code and you're showing them (00:25:50) how you think and from that they're (00:25:52) picking up your way of thinking and how (00:25:53) they should do research. You don't have (00:25:55) to set like a verifiable reward for them (00:25:57) that's like okay this is the next part (00:25:58) of the curriculum and now this is the (00:25:59) next part of your curriculum and oh it (00:26:01) was this training was unstable and we (00:26:03) gota there's not this shleppy bespoke (00:26:05) process. So perhaps these two issues are (00:26:08) actually related in some way but I'd be (00:26:10) curious to (00:26:12) explore this this second thing which (00:26:14) feels more like continual learning and (00:26:15) this first thing which feels just like (00:26:17) um sample efficiency. (00:26:19) >> Yeah. So you know you could actually (00:26:21) wonder one one possible explanation for (00:26:25) the human sample efficiency that needs (00:26:28) to be considered is evolution (00:26:31) and evolution has given us a small (00:26:34) amount of the mo the most useful (00:26:36) information possible (00:26:39) and for things like vision (00:26:41) hearing and locomotion (00:26:44) I think there's a pretty strong case (00:26:46) that evolution actually has given us a (00:26:48) lot. Mhm. (00:26:50) >> So for example, human dexterity far (00:26:53) exceeds I mean robots can become (00:26:56) dexterous too if you subject them to (00:26:58) like a huge amount of training in (00:26:59) simulation. But to train a robot in the (00:27:02) real world to quickly like pick up a new (00:27:04) skill like a person does seems very out (00:27:06) of reach. And here you could say, oh (00:27:08) yeah, like locomotion all our ancestors (00:27:12) needed great locomotion squirrels like (00:27:16) so locomotion maybe like we've got like (00:27:18) some unbelievable prior. You could make (00:27:20) the same case for vision. You know, I I (00:27:22) believe Yan Lakhan made the point, oh, (00:27:24) like um children learn to drive after 16 (00:27:28) hour after like 10 hours of practice, (00:27:30) which is true, but our vision is so (00:27:33) good. At least for me, when I remember (00:27:36) myself being 5 years old, my I was I was (00:27:38) very excited about cars back then, and (00:27:41) I'm pretty sure my car recognition was (00:27:44) more than adequate for self-driving (00:27:45) already. As a 5-year-old, you don't get (00:27:48) to see that much data as a 5-year-old. (00:27:49) You spend most of your time in your (00:27:50) parents house, so you have very low data (00:27:52) diversity. But you could say maybe (00:27:54) that's evolution, too. But then language (00:27:56) and math and coding, probably not. (00:28:00) >> It still seems better than models. I (00:28:03) mean, obviously models are better than (00:28:04) the average human at language and math (00:28:06) and coding, but are they better at the (00:28:08) average human at learning? (00:28:10) >> Oh, yeah. Oh, yeah. Absolutely. What I (00:28:12) meant to say is that language math and (00:28:14) coding and especially math and coding (00:28:17) suggests that whatever it is that makes (00:28:19) people (00:28:21) good at learning (00:28:23) is probably not so much a complicated (00:28:26) prior but something more some (00:28:28) fundamental thing. (00:28:29) >> Wait, I'm not sure understood. Why (00:28:31) should that be the case? So consider a (00:28:33) skill (00:28:35) that people exhibit some kind of great (00:28:37) reliability or you know um (00:28:42) >> if the skill is one that was very useful (00:28:45) to our ancestors for many millions of (00:28:47) years, hundreds of millions of years, (00:28:49) you could say you could argue that maybe (00:28:52) humans are good at it because of (00:28:56) evolution because we have a prior (00:28:59) >> an evolutionary prior that's encoded in (00:29:02) some very nonobvious way. (00:29:04) >> Yeah. (00:29:05) >> That somehow makes us so good at it. (00:29:06) >> Yeah. (00:29:07) >> But if people exhibit great ability, (00:29:11) reliability, robustness, ability to (00:29:14) learn in a domain that really did not (00:29:17) exist until recently, (00:29:20) then this is more an indication that (00:29:23) people might have (00:29:26) just better machine learning period. (00:29:28) >> Mhm. But then how should we think about (00:29:31) what that is? Is it a matter of (00:29:34) Yeah. What is the ML analogy for what? (00:29:38) There's a couple interesting things (00:29:39) about it. It takes fewer samples. It's (00:29:41) more unsupervised. You don't have to set (00:29:43) a ver like a child learning to drive a (00:29:45) car. Children are not learning to drive (00:29:47) a car. A teenager learning how to drive (00:29:48) a car is like not exactly getting (00:29:53) some pre-built verifiable reward. (00:29:56) it comes from their interaction with the (00:29:59) machine and the with the environment. Um (00:30:02) and yet it takes much fewer samples. It (00:30:04) seems more unsupervised. It seems more (00:30:06) robust. Much more robust. The robustness (00:30:09) of people is really staggering. (00:30:12) >> Yeah. So like Okay. And do you have a (00:30:15) unified way of thinking about why are (00:30:16) all these things happening at once? What (00:30:18) is the ML analogy that would that could (00:30:22) be could realize something like this? (00:30:24) So, so, so, um, this is where, you know, (00:30:26) one of the things that you've been (00:30:27) asking about is how can, you know, the (00:30:30) teenage driver kind of self-correct and (00:30:33) learn from their experience without an (00:30:36) external teacher. (00:30:37) >> And the answer is well, they have their (00:30:40) value function, (00:30:41) >> right? They have a general sense which (00:30:44) is also by the way extremely robust in (00:30:46) people like (00:30:48) whatever it is the human value function (00:30:50) whatever the human value function is (00:30:53) with a few exceptions around addiction (00:30:56) it's actually very very robust (00:30:59) and so for something like a teenager (00:31:01) that's learning to drive they start to (00:31:03) drive and they already have a sense of (00:31:06) how they're driving immediately how (00:31:09) badly they're unconfident and then they (00:31:11) See? Okay. And they and then of course (00:31:13) the the learning speed of any teenager (00:31:15) is so fast after 10 hours you're good to (00:31:17) go. (00:31:17) >> Yeah. It seems like humans has some (00:31:19) solution, but I'm curious about like (00:31:20) well how are they doing it and like why (00:31:22) is it so hard to like how do we need to (00:31:24) reconceptualize the way we're training (00:31:25) models to make something like this (00:31:27) possible? (00:31:28) >> You know that is a great question to ask (00:31:31) and it's a question I have a lot of (00:31:35) opinions about. (00:31:37) But unfortunately (00:31:39) we live in a world where not not all (00:31:42) machine learning ideas are discussed (00:31:43) freely and this is this is one of them. (00:31:45) So there's probably a way to do it. (00:31:49) I think it can be done. The fact that (00:31:52) people are like that I think it's a (00:31:55) proof that it can be done. There may be (00:31:57) another blocker though which is there is (00:31:59) a possibility (00:32:02) that the human neurons actually do more (00:32:05) compute than we think. And if that is (00:32:08) true and if that plays an important role (00:32:11) then things might be more difficult. But (00:32:14) regardless I do think it points to the (00:32:16) existence of some (00:32:19) machine learning principle (00:32:22) that I have opinions on. But (00:32:24) unfortunately, circumstances make it (00:32:26) hard to to discuss in detail. Even (00:32:28) though (00:32:28) >> nobody nobody listens to this podcast, (00:32:30) Ilia. (00:32:32) >> Yeah. (00:32:32) >> So, I have to say that prepping for Ilia (00:32:35) was pretty tough because neither I nor (00:32:37) anybody else had any idea what he's (00:32:39) working on and what SSI is trying to do. (00:32:42) I had no basis to come up with my (00:32:44) questions and the only thing I could go (00:32:46) off honestly was trying to think from (00:32:48) first principles about what are the (00:32:50) bottlenecks to hi because clearly Ilia (00:32:52) is working on them in some way. Part of (00:32:55) this question involved thinking about RL (00:32:56) scaling because everybody's asking how (00:32:58) well RL will generalize and how we can (00:33:00) make it generalize better. As part of (00:33:02) this I was reading this paper that came (00:33:03) out recently on RL scaling and it showed (00:33:06) that actually the learning curve on RL (00:33:08) looks like a sigmoid. I found this very (00:33:10) curious. Why should it be a sigmoid (00:33:12) where it learns very little for a long (00:33:14) time and then it quickly learns a lot (00:33:16) and then it asmmptotes. This is very (00:33:18) different from the power law you see in (00:33:20) pre-training where the model learns a (00:33:22) bunch at the very beginning and then (00:33:23) less and less over time. And it actually (00:33:25) reminded me of a note that I had written (00:33:27) down after I had a conversation with a (00:33:29) researcher friend where he pointed out (00:33:31) that the number of samples that you need (00:33:33) to take in order to find a correct (00:33:35) answer scales exponentially with how (00:33:38) different your current probability (00:33:39) distribution is from the target (00:33:40) probability distribution. And I was (00:33:42) thinking about how these two ideas are (00:33:43) related. I had this vague idea that they (00:33:45) should be connected, but I really didn't (00:33:47) know how. I don't have a math (00:33:48) background, so I couldn't really (00:33:49) formalize it. But I wondered if Gemini 3 (00:33:52) could help me out here. And so I took a (00:33:54) picture of my notebook and I took the (00:33:56) paper and I put them both in the context (00:33:57) of Gemini 3 and I asked it to find the (00:34:00) connection. And it thought a bunch and (00:34:02) then it realized that the correct way to (00:34:05) model the information you gain from a (00:34:07) single yes or no outcome in RL is as the (00:34:10) entropy of a random binary variable. It (00:34:12) made a graph which showed how the bits (00:34:15) you gain for a sample in RL versus (00:34:17) supervised learning scale as a pass rate (00:34:20) increases. And as soon as I saw the (00:34:21) graph that Gemini 3 made, immediately a (00:34:24) ton of things started making sense to (00:34:25) me. Then I wanted to see if there was (00:34:27) any empirical basis to this theory. So I (00:34:29) asked Gemini to code my experiment to (00:34:32) show whether the improvement in loss (00:34:35) scales in this way with pass rate. I (00:34:37) just took the code that Gemini outputed. (00:34:39) I copy pasted it into a Google Collab (00:34:41) notebook and I was able to run this toy (00:34:43) ML experiment and visualize its results (00:34:45) without a single bug. It's interesting (00:34:47) because the results look similar but not (00:34:50) identical to what we should have (00:34:51) expected. And so I downloaded this chart (00:34:53) and I put it into Gemini and I asked it (00:34:55) what is going on here. I came up with a (00:34:56) hypothesis that I think is actually (00:34:57) correct which is that we're capping how (00:35:00) much supervised learning can improve in (00:35:02) the beginning by having a fixed learning (00:35:04) rate and in fact we should decrease the (00:35:06) learning rate over time. It actually (00:35:07) gives us an intuitive understanding for (00:35:09) why in practice we have learning rate (00:35:11) schedulers that decrease the learning (00:35:13) rate over time. I did this entire flow (00:35:15) from coming up with this vague initial (00:35:18) question to building a theoretical (00:35:20) understanding to running some toy ML (00:35:22) experiments all with Gemini 3. This (00:35:24) feels like the first model where it can (00:35:27) actually come up with new connections (00:35:28) that I wouldn't have anticipated. It's (00:35:30) actually now become the default place I (00:35:32) go to when I want to brainstorm new ways (00:35:35) to think about a problem. If you want to (00:35:36) read more about RL scaling, you can (00:35:38) check out the blog post that I wrote (00:35:39) with a little help from Gemini 3. And if (00:35:41) you want to check out Gemini 3 yourself, (00:35:43) go to gemini.google. (00:35:46) I am curious if you say we are back in (00:35:47) an era of research. (00:35:50) You were there from 2012 to 2020 (00:35:53) and do do you have Yeah. What what is (00:35:57) now the vibe going to be if we go back (00:35:59) to the era of research? For example, (00:36:02) even after Alexet, the amount of compute (00:36:05) that was used to run experiments kept (00:36:07) increasing and the size of frontier (00:36:10) systems kept increasing. (00:36:12) And do you think now that this era of (00:36:16) research will still require tremendous (00:36:17) amounts of compute? Um, do you think it (00:36:20) will require going back into the (00:36:22) archives and reading old papers? What is (00:36:26) maybe what was the vibe of like you were (00:36:28) at Google and um OpenAI and Stanford (00:36:32) these places when there was like a more (00:36:33) of a vibe of research. What what kind of (00:36:36) thing should we be expecting in the (00:36:37) community? (00:36:39) >> So one consequence of um the age of (00:36:43) scaling is that there was this (00:36:47) um scaling sucked out all the air in the (00:36:49) room. (00:36:50) >> Yeah. (00:36:51) And so (00:36:53) because scaling sucked out all the air (00:36:55) in the room, (00:36:57) everyone started to do the same thing. (00:37:00) We got to the point where (00:37:03) uh we are in a world where there are (00:37:06) more companies than ideas by quite a (00:37:08) bit. (00:37:09) >> Actually on that you know there is this (00:37:11) Silicon Valley saying that says that (00:37:14) ideas are cheap, execution is everything (00:37:18) and people say that a lot. (00:37:20) >> Yeah. And there is truth to that. But (00:37:22) then I saw I saw someone say on Twitter (00:37:25) um something like if ideas are are so (00:37:28) cheap, how come no one's having any (00:37:30) ideas? (00:37:31) >> And I think it's true too. I think like (00:37:34) if you think about um research progress (00:37:37) in terms of bottlenecks, (00:37:40) there are several bottlenecks. If you go (00:37:43) back to the if if you and um one of them (00:37:45) is ideas and one of them is your ability (00:37:47) to bring them to life. (00:37:48) >> Yeah. which might be compute but also (00:37:50) engineering. (00:37:52) So if you go back to the '9s let's say (00:37:54) you had people who had had pretty good (00:37:56) ideas and if they had much larger (00:37:58) computers maybe they could demonstrate (00:38:00) that their ideas were viable but they (00:38:02) could not. So they could only have very (00:38:04) very small demonstration that did not (00:38:06) convince anyone. (00:38:06) >> Yeah. (00:38:08) >> So the bottleneck was compute. Then in (00:38:10) the age of scaling computers increased a (00:38:13) lot and of course there is a question of (00:38:17) how much comput is needed but compute is (00:38:20) large so compute is large enough such (00:38:24) that (00:38:26) it's like not obvious that you need that (00:38:28) much more compute to prove some idea (00:38:33) like I'll give you an analogy. Alexet (00:38:36) was built on two GPUs. That was the (00:38:39) total amount of compute used for it. The (00:38:41) transformer (00:38:43) was built on 8 to 64 GPUs. No single (00:38:47) transformer paper experiment used more (00:38:49) than 64 GPUs of 2017 which would be like (00:38:52) what two GPUs of today. So the ResNet (00:38:57) right many like even even the the um you (00:39:01) could argue that the like 01 (00:39:04) reasoning was not the most comput heavy (00:39:07) thing in the world. So there definitely (00:39:10) for for research (00:39:13) you need like definitely some amount of (00:39:16) compute but it's far from obvious that (00:39:18) you need the absolutely largest amount (00:39:20) of compute ever for research. H (00:39:22) >> you might argue and I think it is true (00:39:25) that if you want to build the absolutely (00:39:27) best system, if you want to build the (00:39:30) absolutely best system, then it helps to (00:39:32) have much more compute and especially if (00:39:34) everyone is within the same paradigm, (00:39:37) then compute becomes one of the big (00:39:40) differentiators. (00:39:42) >> Yeah, I guess while it was possible to (00:39:45) develop these ideas, I'm asking you for (00:39:47) the history because you were actually (00:39:48) there. I'm not sure what actually (00:39:49) happened, but it sounds like it was (00:39:50) possible to develop these ideas using (00:39:53) minimal amounts of compute, but it (00:39:55) wasn't the transformer didn't (00:39:56) immediately become famous. It became the (00:39:59) thing everybody started doing and then (00:40:00) started experimenting on top of and (00:40:02) building on top of because it was (00:40:04) validated at higher and higher levels of (00:40:06) compute. (00:40:06) >> Correct. And if you at SSI have 50 (00:40:10) different ideas, how will you know which (00:40:12) one is the next transformer and which (00:40:15) one is you know brittle without having (00:40:20) the kinds of compute that other frontier (00:40:21) labs have. So I can I can comment on (00:40:24) that which is (00:40:26) the short comment is that you know you (00:40:29) mentioned SSI specifically for us (00:40:34) the amount of compute that SSI has for (00:40:37) research is really not that small and I (00:40:41) want to explain why like a simple math (00:40:44) can explain why the amount of compute (00:40:45) that we have is actually a lot more (00:40:47) comparable for research than one might (00:40:50) think. And I'll explain. So (00:40:56) SSI has raised $3 billion which is like (00:41:01) not small by it's like a lot by any (00:41:04) absolute sense but you could say but (00:41:05) look at the other companies raising (00:41:07) >> much more (00:41:09) but a lot of what their a lot of their (00:41:11) compute goes for inference (00:41:14) like these big numbers these big loans (00:41:16) it's earmarked for inference. That's (00:41:19) number one. Number two, you need if you (00:41:22) want to have a product on which you do (00:41:24) inference, you need to have a big staff (00:41:26) of engineers of salespeople. A lot of (00:41:29) the research needs to be dedicated for (00:41:31) producing all kinds of product related (00:41:34) features. So then when you look at (00:41:36) what's actually left for research, the (00:41:39) difference becomes a lot smaller. (00:41:42) Now the other thing is is that if you (00:41:45) are doing something different do you (00:41:47) really need the absolute maximal scale (00:41:50) to prove it? I don't think it's true at (00:41:52) all. I think that in our case we have (00:41:57) sufficient compute to prove to convince (00:42:00) ourselves and anyone else that what (00:42:01) we're doing is correct. (00:42:03) There's been public estimates that you (00:42:05) know companies like OpenAI spend on the (00:42:06) order of56 (00:42:08) billion dollars a year even just so far (00:42:10) on experiments. (00:42:12) >> This is separate from the amount of (00:42:14) money they're spending on inference and (00:42:16) so forth. So seems like they're spending (00:42:18) more a year running exper like research (00:42:20) experiments than you guys have in total (00:42:22) funding. (00:42:23) >> I think it's a question of what you do (00:42:24) with it. It's a question of what you do (00:42:26) with it. like they have a like the more (00:42:29) I think in in in their case in the case (00:42:31) of others I think there's a lot more (00:42:33) demand on the training compute there's a (00:42:36) lot more different work streams there is (00:42:38) there are different modalities there is (00:42:41) just more stuff and so it becomes (00:42:43) fragmented (00:42:44) >> how will SSI make money (00:42:46) >> you know (00:42:48) my answer to this question is something (00:42:50) like (00:42:53) we just f right now we just focus on the (00:42:55) research and then the answer to that (00:42:57) will reveal itself. I think there will (00:42:59) be lots of possible answers. (00:43:01) >> Is SSI's plan still to straightshot (00:43:03) super intelligence? (00:43:05) >> Maybe. (00:43:07) I think that there is merit to it. (00:43:09) >> I think there's a lot of merit because I (00:43:11) think that it's very nice to not be (00:43:13) affected by the day-to-day market (00:43:16) competition. (00:43:18) But I think there are two reasons that (00:43:23) may cause us to change the plan. one is (00:43:26) pragmatic if timelines turn out to be (00:43:28) long (00:43:30) which they might and second I think (00:43:32) there is a lot of value in the best and (00:43:38) most powerful AI being out there (00:43:41) impacting the world (00:43:44) >> I think this is a meaningfully valuable (00:43:46) thing (00:43:46) >> but then so why is your default plan to (00:43:48) straight shot super intelligence because (00:43:50) it sounds like you know openai anthropic (00:43:53) all these other companies their explicit (00:43:55) thinking is look we have weaker and (00:43:56) weaker intelligences that the public can (00:43:58) get used to and prepare for and why is (00:44:01) it potentially better to build a super (00:44:05) intelligence directly (00:44:06) >> so I'll make the case for and against (00:44:09) >> the case for is that you are so one of (00:44:12) the challenges (00:44:14) that people face when they're in the (00:44:16) market is that they have to participate (00:44:18) in the rat race and the rat race is (00:44:21) quite difficult in that it exposes you (00:44:23) to to to difficult trade-offs which you (00:44:26) need to make (00:44:28) and there is it is it is nice to say (00:44:31) we'll insulate ourselves from all this (00:44:33) and just focus on the research and come (00:44:35) out only when we are ready and not (00:44:37) before but the counterpoint is valid too (00:44:41) and those those are opposing forces the (00:44:44) counterpoint is hey it is useful for the (00:44:48) world to see powerful AI it is useful (00:44:52) for the world to see powerful AI because (00:44:53) that's the only way you and communicate (00:44:55) it. (00:44:55) >> Well, I guess not even just that you can (00:44:57) communicate the idea, but (00:44:58) >> communicate the AI, not the idea. (00:45:01) Communicate the AI. (00:45:03) >> What do you mean communicate the AI? (00:45:04) >> So, okay. So, let's suppose you read an (00:45:06) essay about AI (00:45:07) >> and the essay says AI is going to be (00:45:09) this and AI is going to be that and it's (00:45:11) going to be this (00:45:12) >> and you read it and you say, okay, this (00:45:14) is an interesting essay (00:45:15) >> right (00:45:16) >> now. Suppose you see an AI doing this (00:45:18) and AI doing that. (00:45:20) >> It is incomparable. Like basically I (00:45:23) think I think that there is a big (00:45:26) benefit from AI being in the public and (00:45:31) that would be a reason for us to not be (00:45:34) quite straight shot. (00:45:36) >> Yeah. Well, I guess it's not even that (00:45:38) which I but I do think that is an (00:45:39) important part of it. The other big (00:45:41) thing is I can't think of another (00:45:44) discipline in human engineering and (00:45:45) research where (00:45:47) the end artifact was made safer (00:45:52) mostly through just thinking about how (00:45:53) to make it safe as opposed to why are (00:45:56) airplane crashes per mile so much lower (00:45:58) today than they were decades ago? Why is (00:46:00) it so much harder to find a bug in Linux (00:46:03) than it would have been decades ago? And (00:46:05) I think it's mostly because these (00:46:06) systems were deployed to the world. you (00:46:09) noticed failures, those failures were (00:46:11) corrected and the systems became more (00:46:13) robust. Now, I'm not sure why AGI and (00:46:16) superhuman intelligence would be any (00:46:18) different, especially given, and I hope (00:46:20) we can talk, we're going to get to this. (00:46:23) It seems like the harms of super (00:46:25) intelligence are not just about like (00:46:27) having some malevolent uh paper clipper (00:46:30) out there, but it just like this is a (00:46:31) really powerful thing and we don't even (00:46:33) know how to conceptualize how people (00:46:34) interact with it, what people will do (00:46:35) with it and having gradual access to it (00:46:38) seems like a um better way to maybe (00:46:42) spread out the impact of it and to help (00:46:44) people prepare for it. Well, I think I (00:46:46) think on this point even in the straight (00:46:49) shot scenario, you would still do a (00:46:52) gradual release of it is how I would (00:46:55) imagine it. (00:46:57) The the gra gradualism would be an (00:47:00) inherent inherent component of any plan. (00:47:03) It's just a question of what is the (00:47:04) first thing that you get out of the (00:47:06) door. That's number one. Number two, I (00:47:08) also think you know I believe you have (00:47:11) advocated for continual learning more (00:47:13) than other people (00:47:14) >> and I actually think that this is an (00:47:16) important and correct thing and here is (00:47:20) why (00:47:22) so one of the things so I'll give you (00:47:24) another example of how thinking how (00:47:27) language affects thinking and in this (00:47:30) case it will be two words two words that (00:47:33) have shaped everyone's thinking I (00:47:36) maintain (00:47:37) F first word AGI (00:47:40) second word pre-training let me explain. (00:47:44) So the word the term AGI, (00:47:48) why does this term exist? It's a very (00:47:50) particular term. Why does it exist? (00:47:53) There's a reason. (00:47:55) The reason that the term AGI exists is (00:47:58) in my opinion not so much because it's (00:48:00) like a very important essential (00:48:03) descriptor of of of some end state of (00:48:05) intelligence but (00:48:10) because it is a reaction to a different (00:48:14) term that existed and the term is narrow (00:48:16) AI. If you go back to ancient history of (00:48:20) gameplay AI, of checkers AI, chess AI, (00:48:23) computer games AI, everyone would say, (00:48:26) look at this narrow intelligence. Sure, (00:48:28) the chess AI can beat Casper off, but it (00:48:30) can't do anything else. It is so narrow, (00:48:32) artificial narrow intelligence. So in (00:48:35) response, as a reaction to this, some (00:48:38) people said, well, this is not good. It (00:48:41) is so narrow. What we need is general (00:48:44) AI. (00:48:46) general AI, an AI that can just do all (00:48:48) the things. (00:48:51) The second and and that term just got a (00:48:55) lot of traction. (00:48:56) >> Yeah. (00:48:57) >> The second thing that got a lot of (00:48:58) traction is pre-training. (00:49:01) Specifically, the recipe of (00:49:03) pre-training. I think the current the (00:49:05) way people do RL now is maybe um un is (00:49:09) undoing the conceptual imprint of (00:49:12) pre-training. But pre-training had the (00:49:14) property. you do more pre-training and (00:49:17) the model gets better at everything more (00:49:19) or less uniformly. Yeah, (00:49:21) >> general AI pre-training gives AGI (00:49:27) but (00:49:29) the thing that happened with AGI and (00:49:32) pre-training is that in some sense they (00:49:34) overshock the target (00:49:36) because by the kind if you think about (00:49:39) the term AGI you will realize and (00:49:42) especially in the context of (00:49:43) pre-training you will realize that a (00:49:44) human being is not an AGI (00:49:48) because a human being Yes, there is (00:49:51) definitely a foundation of skills. (00:49:53) A human being, (00:49:56) a human being lacks a huge amount of (00:49:59) knowledge. Instead, we rely on continual (00:50:02) learning. We rely on continual learning. (00:50:05) And so then when you think about okay, (00:50:07) so let's suppose that we achieve success (00:50:09) and we produce a safe super some kind of (00:50:11) safe super intelligence. The question is (00:50:13) but how do you define it? Where on the (00:50:15) curve of continual learning is it going (00:50:17) to be? I produce like um a super (00:50:20) intelligent 15 year old that's very (00:50:22) eager to go and you say okay I'm going (00:50:23) to they don't know very much at all the (00:50:26) great student very eager you go and be a (00:50:29) programmer you go and be a doctor (00:50:32) go and learn so you could imagine that (00:50:34) the deployment itself will involve some (00:50:36) kind of a learning trial and error (00:50:38) period (00:50:39) >> it's a process as opposed to you drop (00:50:42) the finished thing (00:50:44) >> okay I I I I see so you're you're (00:50:46) suggesting (00:50:47) that the thing you're pointing out with (00:50:49) super intelligence (00:50:51) is not some finished (00:50:56) mind which knows how to do every single (00:50:58) job in the economy cuz the way say the (00:51:01) original I think openi charter or (00:51:03) whatever defines AGI is like it can do (00:51:05) every single job that a every single (00:51:07) thing a human can do. You're proposing (00:51:09) instead a mind which can learn to do any (00:51:13) single every single job. (00:51:14) >> Yes. (00:51:14) >> And that is super intelligence. And then (00:51:16) but once you have the learning (00:51:18) algorithm, (00:51:20) it gets deployed into the world the same (00:51:22) way a human laborer might join an (00:51:24) organization. (00:51:25) >> And it seems like one of these two (00:51:27) things might happen. Maybe neither of (00:51:29) these happens. One, this super efficient (00:51:33) learning algorithm (00:51:35) becomes superhuman becomes as good as (00:51:38) you and potentially even better at the (00:51:41) task of ML research. And as a result the (00:51:46) algorithm itself becomes more and more (00:51:47) superhuman. The other is even if that (00:51:49) doesn't happen if you have a single (00:51:52) model I mean this this is explicitly (00:51:53) your vision. If you have a single model (00:51:55) or instances of a model which are (00:51:57) deployed through the economy doing (00:51:59) different jobs learning how to do those (00:52:00) jobs continually learning on the job (00:52:04) picking up all the skills that any human (00:52:05) could pick up but actually picking them (00:52:06) all up at the same time and then (00:52:08) amalgamating the learnings. (00:52:10) you basically have a model which (00:52:12) functionally becomes super intelligent (00:52:14) even without any sort of recursive (00:52:16) self-improvement in software right (00:52:19) because you now have one model that can (00:52:20) do every single job in the economy and (00:52:22) humans can't merge our minds in the same (00:52:24) way and so do you expect some sort of (00:52:26) like intelligence explosion from broad (00:52:28) deployment (00:52:29) >> I think that it is likely that we will (00:52:33) have rapid economic growth (00:52:37) I think the broad deployment And (00:52:41) like there are two arguments you could (00:52:44) make which are conflicting. (00:52:47) One is that look if indeed you get once (00:52:50) indeed you get to a point where you have (00:52:53) an AI that can learn to do (00:52:57) things quickly (00:52:59) and you have many of them then they will (00:53:02) then there will be a strong force to (00:53:05) deploy them in the economy. Unless there (00:53:08) will be some kind of a regulation that (00:53:10) stops it, which by the way there might (00:53:12) be. But I think the idea of very rapid (00:53:17) economic growth for some time, I think (00:53:19) it's very possible from broad (00:53:21) deployment. The other question is how (00:53:23) rapid it's going to be. (00:53:25) So I think this is hard to know because (00:53:27) on the one hand you have this very (00:53:29) efficient worker. on the other hand (00:53:31) there is the world is just really big (00:53:34) and there's a lot of stuff (00:53:36) and that stuff moves at a different (00:53:38) speed but then on the other hand now the (00:53:40) AI could you know so I think very rapid (00:53:43) economic growth is possible and we will (00:53:45) see like all kinds of things like (00:53:48) different countries with different rules (00:53:49) and the ones which have the friendlier (00:53:51) rules the economic growth will be faster (00:53:53) hard to predict (00:53:54) >> some people in our audience like to read (00:53:56) the transcripts instead of listening to (00:53:58) the episode and so we put a ton of (00:54:00) effort into making the transcripts read (00:54:02) like they are standalone essays. The (00:54:04) problem is that if you just transcribe a (00:54:07) conversation verbatim using a speech to (00:54:09) text model, it'll be full of all kinds (00:54:11) of fits and starts and confusing (00:54:13) phrasing. We mentioned this problem to (00:54:15) Labelbox and they asked if they could (00:54:16) take a stab. Working with them on this (00:54:18) is probably the reason that I'm most (00:54:20) excited to recommend Labelbox to people. (00:54:22) It wasn't just, oh, hey, tell us what (00:54:24) kind of data you need and we'll go get (00:54:25) it. They walked us through the entire (00:54:27) process from helping us identify what (00:54:29) kind of data we needed in the first (00:54:30) place to assembling a team of expert (00:54:33) aligners to generate it. Even after we (00:54:35) got all the data back, Labelbox stayed (00:54:37) involved. They helped us choose the (00:54:40) right base model and set up auto QA on (00:54:42) the model's output so that we could (00:54:44) tweak and refine it. And now we have a (00:54:46) new transcriber tool that we can use for (00:54:48) all our episodes moving forward. This is (00:54:50) just one example of how Labelbox meets (00:54:53) their customers at the ideas level and (00:54:55) partners with them through their entire (00:54:56) journey. If you want to learn more or if (00:54:58) you want to try out the transcriber tool (00:55:00) yourself, go to labelbox.com/barcash. (00:55:08) It seems to me that this is a very (00:55:10) precarious situation to be in where (00:55:13) looking the limit we know that this (00:55:15) should be possible because if you have (00:55:17) something that is as good as a human at (00:55:19) learning but which can merge its brains (00:55:22) merge there are different instances in a (00:55:24) way that humans can't merge already. (00:55:26) This seems like a thing that should (00:55:28) physically be possible. Humans are (00:55:29) possible, digital computers are (00:55:30) possible. You just need both of those (00:55:32) combined to produce this thing. And it (00:55:34) also seems like this kind of thing is (00:55:36) extremely um powerful (00:55:40) and economic growth is one way to put (00:55:43) it. Um I mean Dyson spear is a lot of (00:55:45) economic growth but another way to put (00:55:46) it is just like you will have (00:55:49) potentially a very short period of time (00:55:50) because a human on the job can you know (00:55:52) you you're hiring people at SSI in six (00:55:54) months they're like net productive (00:55:55) probably right um a human like learns (00:55:57) really fast and so this thing is (00:55:58) becoming smarter and smarter very fast (00:56:01) what is how do you think about making (00:56:02) that go well and why is SSI positioned (00:56:05) to do that well or what is SSI's plan (00:56:07) there basically is what I'm trying to (00:56:08) ask (00:56:08) >> yeah (00:56:10) so one of the one of the ways in which (00:56:14) my thinking has been changing is that (00:56:19) I now place more importance on AI (00:56:25) being (00:56:26) deployed (00:56:28) incrementally and in advance. One very (00:56:32) difficult thing about AI is that we are (00:56:35) talking about systems that don't yet (00:56:39) exist (00:56:41) and it's hard to imagine them. (00:56:44) I think that one of the things that's (00:56:45) happening is that in practice it's very (00:56:50) hard to feel the AGI. (00:56:52) It's very hard to feel the AGI. (00:56:55) We can talk about it, but it's like it's (00:56:58) like talking about like the long f like (00:57:01) imagine like having a conversation about (00:57:02) like how is it like to be old when (00:57:05) you're like old and and frail and you (00:57:08) can have a conversation. You can try to (00:57:09) imagine it, but it's just hard and you (00:57:12) come back to reality. Well, that's not (00:57:14) the case. And I think that a lot of the (00:57:19) issues around (00:57:21) AGI and its future power stem from the (00:57:26) fact that it's very difficult to imagine (00:57:31) future AI is going to be diff different. (00:57:34) It's going to be powerful. Indeed, the (00:57:36) whole problem, what is the problem of AI (00:57:39) and AGI? The whole problem is the power. (00:57:43) The whole problem is the power. (00:57:47) When the power is really big, what's (00:57:49) going to happen? (00:57:51) And one of the one of the ways in which (00:57:53) I've changed my mind over the past year (00:57:55) and so that that change of mind may back (00:58:00) may I'll say I I'll I'll hedge a little (00:58:02) bit may back propagate into into the (00:58:04) plans of our of our company is that (00:58:09) so if it's hard to imagine (00:58:12) what do you do you got to be showing the (00:58:14) thing you got to be showing the thing (00:58:16) and I maintain that I think I think most (00:58:19) people who work Con AI also can't (00:58:22) imagine it because it's too different (00:58:24) from what people see on a day-to-day (00:58:26) basis. (00:58:29) I do maintain here is something which I (00:58:32) predict will happen. That's a (00:58:33) prediction. (00:58:35) I maintain (00:58:37) that as AI becomes more powerful (00:58:41) then people will change their behaviors (00:58:46) and we will see all kinds of (00:58:47) unprecedented things which are not (00:58:50) happening right now and I'll give some (00:58:52) examples. I do like I I think I think (00:58:56) for better or worse the the frontier (00:58:58) companies will play a very important (00:59:00) role in what happens as will the (00:59:02) government and the kind of things that I (00:59:04) think we'll see which you see the (00:59:07) beginnings of (00:59:09) companies that are fierce competitors (00:59:12) starting collaborate to to collaborate (00:59:14) on AI safety you may have seen open AI (00:59:17) and anthropic event doing a first small (00:59:21) step but that did not exist That's (00:59:23) actually something which I predicted in (00:59:25) one of my talks about three years ago (00:59:27) that such a thing will happen. I also (00:59:30) maintain that as AI continues to become (00:59:32) more powerful, more visibly powerful, (00:59:36) there will also be a desire from (00:59:38) governments and the public to do (00:59:40) something (00:59:42) and I think that this is a very (00:59:44) important force (00:59:46) of showing the AI. That's number one. (00:59:49) Number two, okay, so then the AI is (00:59:51) being built. what needs to what needs to (00:59:53) be done. (00:59:56) So one thing that I maintain that will (00:59:58) happen is that right now people who are (01:00:00) working on AI I maintain that the AI (01:00:03) doesn't feel powerful because of its (01:00:05) mistakes. (01:00:07) I do think that at some point the AI (01:00:08) will start to feel powerful actually and (01:00:11) I think when that happens we will see a (01:00:13) big change in the way (01:00:16) all AI companies approach safety. (01:00:20) they'll become much more paranoid. I (01:00:22) think I I say this as a predict as a as (01:00:24) a as a prediction that we will see (01:00:26) happen. We'll see if I'm right, but I (01:00:28) think this is something that will happen (01:00:30) because they will see the AI becoming (01:00:31) more powerful. Everything that's (01:00:33) happening right now, I maintain is (01:00:36) because people look at today's AI and (01:00:38) it's hard to imagine the future AI. (01:00:42) And there is a third thing which needs (01:00:44) to happen. And I think this is this this (01:00:47) and I'm talking about it in in broader (01:00:49) terms not just from the perspective of (01:00:51) SSI (01:00:53) because you ask me about our company but (01:00:55) the question is okay so then what should (01:00:56) what should the companies aspire to (01:00:58) build (01:00:58) >> what should they aspire to build and (01:01:00) there has been one big idea that (01:01:02) actually every that um everyone has been (01:01:05) locked in locked into which is the the (01:01:07) self-improving AI (01:01:10) and why why did it happen because there (01:01:12) is fewer ideas than companies (01:01:15) But I maintain that there is something (01:01:17) that's better to build and I think that (01:01:19) everyone will actually want that. It's (01:01:22) like the AI that's robustly aligned to (01:01:26) care about sentient life specifically. (01:01:30) I think in particular it will be there's (01:01:32) a case to be made that it will be easier (01:01:35) to build an AI that cares about sentient (01:01:37) life than an AI that cares about human (01:01:39) life alone because the AI itself will be (01:01:42) sentient. (01:01:44) And if you think about things like (01:01:46) mirror neurons and human empathy for (01:01:48) animals which is you know you might (01:01:50) argue it's not big enough but it exists. (01:01:53) I think it's an emerging property from (01:01:55) the fact that we model others with the (01:01:58) same circuit that we used to model (01:02:00) ourselves because that's the most (01:02:02) efficient thing to do. (01:02:04) >> So even if you got an AI to care about (01:02:06) sentient beings and it's not actually (01:02:09) clear to me that that's what you should (01:02:10) try to do if you solved alignment. It (01:02:12) would still be the case that most (01:02:14) sentient beings will be AIS. There will (01:02:17) be trillions eventually quadrillions of (01:02:19) AIs. Humans will be a very small (01:02:21) fraction of sentient beings. (01:02:23) So, it's not clear to me if the goal is (01:02:26) some kind of human control over (01:02:30) this future civilization (01:02:32) that this is the best criterion. (01:02:35) >> It's true. I I think that (01:02:39) it's possible it's not the best (01:02:40) criterion. I'll say two things. I think (01:02:44) that thing number one (01:02:48) I think that if there so (01:02:51) I think that care for sentient life I (01:02:53) think there is merit to it. I think it (01:02:55) should be considered. I think that it (01:02:57) will be helpful if there was some kind (01:02:59) of a (01:03:01) short list of ideas that then the (01:03:06) companies when they are in this (01:03:07) situation could use. That's number two. (01:03:10) Number three, I think it would be really (01:03:12) materially helpful if the power of the (01:03:16) most powerful super intelligence was (01:03:18) somehow capped (01:03:20) because it would address a lot of these (01:03:22) concerns. (01:03:24) The question of how to do it, I'm not (01:03:26) sure, but I think that would be (01:03:28) materially helpful when you're talking (01:03:30) about really really powerful systems. (01:03:32) >> Yeah. Um, before we continue the element (01:03:35) discussion, I I want to double click on (01:03:36) that. How much room is there at the top? (01:03:39) How do you think about super (01:03:40) intelligence? Do you think I mean using (01:03:43) this learning efficiency idea maybe is (01:03:45) just extremely fast at learning new (01:03:47) skills or new knowledge and does it just (01:03:49) have a bigger pool of strategies? Is (01:03:51) there a single cohesive it in the center (01:03:55) that's more powerful or bigger? And if (01:03:58) so, (01:04:00) do you do you imagine that this will be (01:04:02) sort of godlike in comparison to the (01:04:03) rest of human civilization? or does it (01:04:04) just feel like another agent or another (01:04:07) cluster of agents? (01:04:09) >> So this is an area where different (01:04:10) people have different intuitions. (01:04:12) >> I think it will be very powerful for (01:04:14) sure. I think that what I think is most (01:04:17) likely to happen (01:04:20) is that there will be multiple (01:04:22) such AIS being created roughly at the (01:04:26) same time. (01:04:28) I think that (01:04:30) if the cluster is big enough, like if (01:04:33) the cluster is literally continent (01:04:35) sized, that thing could be really (01:04:38) powerful indeed, right? If you literally (01:04:40) have a continentsized cluster, like (01:04:43) those those AIs can be very powerful. (01:04:45) And I like all I can tell you is that if (01:04:49) you're talking about extremely powerful (01:04:51) AIs, like truly dramatically powerful, (01:04:53) then yeah, it would be nice if they (01:04:55) could be restrained in some ways or if (01:05:00) there was some kind of an agreement or (01:05:02) something (01:05:04) because I think that if you are saying (01:05:05) hey like if if you really like what what (01:05:09) is the the concern of super (01:05:11) intelligence? What is one way to explain (01:05:12) the concern? If you imagine a system (01:05:15) that is sufficiently powerful, like (01:05:18) really sufficiently powerful, and you (01:05:21) could say, okay, you need to do (01:05:22) something sensible like care for (01:05:24) sentient life, let's say, in a very (01:05:26) single-minded way, we might not like the (01:05:28) results. That's really what it is. And (01:05:30) so maybe, by the way, the answer is that (01:05:32) you do not build a single you do not (01:05:34) build an RL agent in the usual sense. (01:05:37) And actually, I'll point I'll point (01:05:38) several things out. I think human beings (01:05:41) are a semi agent. You know, we pursue a (01:05:44) reward and then the emotions or whatever (01:05:47) make us tire out of the reward. We (01:05:49) pursue a different reward. (01:05:51) The market is like kind it's like a very (01:05:54) shortsighted (01:05:55) kind of agent. Evolution is the same. (01:05:58) Evolution is very intelligent in some (01:05:59) ways but very dumb in other ways. The (01:06:02) government has been designed to be a (01:06:04) never- ending fight between three parts (01:06:06) which has an effect. So I think things (01:06:09) like this (01:06:12) another thing that makes this discussion (01:06:13) difficult is that we are talking about (01:06:15) systems that don't exist that we don't (01:06:17) know how to build (01:06:19) right that's the other thing and that's (01:06:21) actually my belief I think what people (01:06:22) are doing right now will go some (01:06:24) distance and then peter out it will (01:06:27) continue to improve but it will also not (01:06:29) be it so the it we don't know how to (01:06:32) build and I think that a lot h a lot (01:06:35) hinges on (01:06:37) understanding and in reliable (01:06:39) generalization (01:06:42) and I'll say another thing which is like (01:06:45) you know one of the things that you (01:06:46) could say is that cause alignment to be (01:06:48) difficult is that human val that it's (01:06:51) it's um (01:06:53) your ability to learn human values is (01:06:55) fragile then your ability to optimize (01:06:57) them is fragile will you actually learn (01:06:59) to optimize them and then can't you say (01:07:01) are these not all instances of (01:07:03) unreliable generalization (01:07:07) why is it that human beings appear to (01:07:08) generalize so much better. What if (01:07:10) generalization was much better? What (01:07:12) would happen in this case? What would be (01:07:13) the effect? But those we can't we can't (01:07:16) like those questions are right now still (01:07:18) unanswerable. (01:07:19) >> Um how does one think about what AI (01:07:23) going well looks like because I think (01:07:25) you've scoped out how AI might evolve. (01:07:27) We'll have these sort of continual (01:07:28) learning agents. AI will be very (01:07:30) powerful. Maybe there will be many (01:07:32) different AIs. How do you think about (01:07:35) lots of continent computes size (01:07:37) intelligences going around? How (01:07:40) dangerous is that? How do we make that (01:07:43) less dangerous? And how do we do that in (01:07:46) a way that (01:07:49) protects a equilibrium where there might (01:07:52) be misaligned AIs out there and bad (01:07:54) actors out there? So, one reason why I (01:07:57) liked the AI that cares for sentient (01:08:00) life, (01:08:01) >> you know, and we can debate on whether (01:08:02) it's good or bad, but (01:08:05) if the first N of these dramatic systems (01:08:11) actually do care for, you know, (01:08:14) love humanity or something, you know, (01:08:16) care for sentient life. Obviously, this (01:08:18) also needs to be achieved. This needs to (01:08:21) be achieved. (01:08:22) So if this is achieved by the first n of (01:08:25) those systems (01:08:28) then then I can see it go well at least (01:08:31) for quite some time and then there is (01:08:33) the question of what happens in the long (01:08:35) run what happens in the long run how do (01:08:37) you achieve a long run equilibrium (01:08:40) >> and I think that there there is an (01:08:43) answer as well and I don't like this (01:08:46) answer (01:08:48) but it needs to be considered (01:08:52) In the long run, you might say, okay, so (01:08:53) if you have a world where powerful AI (01:08:56) exist. In the short term, you could say, (01:08:58) okay, you have universal high income. (01:09:01) You have universal high income and we (01:09:04) all doing well. But we know that what do (01:09:07) the Buddhists say? Change is the only (01:09:09) constant. And so things change and there (01:09:11) is some kind of government political (01:09:13) structure thing and it changes because (01:09:16) these things have a shelf life. you know (01:09:19) some new new government thing comes up (01:09:20) and it functions and then after some (01:09:22) time it stops functioning (01:09:25) that's something that you see happening (01:09:26) all the time and so I think that for the (01:09:29) long run equilibrium (01:09:32) one approach you could say okay so maybe (01:09:35) every person will have an AI that will (01:09:37) do their bidding and that's good and if (01:09:40) that could be maintained indefinitely (01:09:42) that's true but the downside with that (01:09:45) is okay so then the AI goes and like (01:09:49) earns earn earn you know earns money for (01:09:51) for the person and you know advocates (01:09:53) for their needs in like the political (01:09:55) sphere and maybe then writes a little (01:09:57) report saying okay here's what I've done (01:09:58) here's the situation and the person says (01:10:00) great keep it up but the person is no (01:10:03) longer a participant (01:10:06) and then you can say that's a precarious (01:10:07) place to be in but so I'm going to (01:10:11) preface by saying (01:10:14) I don't like this solution but it is a (01:10:17) solution (01:10:19) And the solution is if people become (01:10:21) part AI with some kind of neural link++ (01:10:24) because what will happen as a result is (01:10:26) that now the AI understands something (01:10:28) and we understand it too like (01:10:31) because now the understanding is (01:10:33) transmitted wholesale. So now if the AI (01:10:35) is in some situation now it's like you (01:10:39) are involved in the situation yourself (01:10:40) fully (01:10:42) and I think this is the answer to the (01:10:44) equilibrium. I wonder if uh the fact (01:10:47) that emotions which were (01:10:50) developed (01:10:51) millions or in many cases billions of (01:10:53) years ago in a totally different (01:10:55) environment are still guiding our (01:10:58) actions so strongly is an example of (01:11:02) alignment success to maybe spell out (01:11:04) what I mean the brain stem has these (01:11:10) I don't know if it's more accurate to (01:11:11) call it a value function or reward (01:11:12) function but the brain stem has a (01:11:15) directive where it's saying mate with (01:11:16) somebody who's more successful. The (01:11:18) cortex is the part that understands what (01:11:20) does success mean in the modern context (01:11:22) but the brain stem is able to align the (01:11:25) cortex and say however you recognize (01:11:27) success to be and I I'm not smart enough (01:11:28) to understand what that is. You're still (01:11:30) going to pursue this directive. (01:11:32) >> I think I think there is (01:11:34) so I think there's a more general point. (01:11:37) I think it's actually really mysterious (01:11:39) how the brain encodes high level (01:11:43) desires. Sorry, how evolution encodes (01:11:45) high level desires. (01:11:47) >> Like it's pretty easy to understand how (01:11:49) evolution would would endow us with the (01:11:52) desire for food that smells good cuz (01:11:54) smell is a chemical (01:11:57) and so just pursue that chemical. It's (01:11:59) very easy to imagine such a me evolution (01:12:01) doing such a thing. But evolution also (01:12:05) has has endowed us with all these social (01:12:07) desires like we we really care about (01:12:10) being seen positively by society. We (01:12:13) care about being in a good standing. We (01:12:16) like all these social intuitions that we (01:12:19) have. I feel strongly that they are (01:12:21) baked in and I don't know how evolution (01:12:25) did it because it's a high level (01:12:27) concept. It's represented in the brain. (01:12:30) like what people think like let's say (01:12:32) you are like you care about (01:12:35) some social thing. (01:12:37) It's not like a low-level signal like (01:12:40) smell. It's not something that for which (01:12:43) there's a sensor like the brain needs to (01:12:45) do a lot of processing to piece together (01:12:47) lots of bits of information to (01:12:49) understand what's going on socially and (01:12:51) somehow evolution said that's what you (01:12:53) should care about. (01:12:54) >> Yes. (01:12:55) >> How did it do it? And it did it quickly (01:12:57) too. (01:12:57) >> Yeah. because I think all these (01:12:59) sophisticated social things that um we (01:13:02) care about I think they evolved pretty (01:13:04) recently. So evolution had an easy time (01:13:06) hardcoding this high level desire and (01:13:10) >> I maintain or you know at least I'll say (01:13:12) I'm unaware of good hypothesis for how (01:13:15) it's done. I I had some ideas I was (01:13:18) kicking around but none of them none of (01:13:21) them uh are satisfying. (01:13:24) >> Yeah. And what's especially impressive (01:13:26) is if it was a desire that you learned (01:13:27) in your lifetime, it kind of makes sense (01:13:30) because your brain is intelligent. It (01:13:32) makes sense why we be able to learn (01:13:33) intelligent desires. But your point is (01:13:35) that the desire is maybe this is not (01:13:38) your point, but one way to understand it (01:13:39) is the desire is built into the genome (01:13:42) and the genome is not intelligent, (01:13:44) right? But it's able to you're somehow (01:13:46) able to describe this feature that (01:13:47) requires like it's not even clear how (01:13:49) you define that feature and you can get (01:13:51) it into the you can build it into the (01:13:53) genes. Yeah, essentially, or maybe I'll (01:13:55) put it differently. If you think about (01:13:56) the tools that are available to the (01:13:59) genome, (01:14:01) it says, okay, here's a recipe for (01:14:02) building a brain. And you could say, (01:14:04) here is a recipe for connecting the (01:14:06) dopamine neurons to like the smell (01:14:08) sensor. (01:14:08) >> Yeah. (01:14:09) >> And if the smell is a certain kind of, (01:14:11) you know, good smell, you want to eat (01:14:12) that. I could imagine the genome doing (01:14:14) that. I'm I'm claiming that it is harder (01:14:17) to imagine. It's harder to imagine the (01:14:20) genome saying you should care about some (01:14:24) complicated computation that your entire (01:14:26) brain that like a big chunk of your (01:14:28) brain does. That's all I'm claiming. I I (01:14:30) can tell you like a speculation. I was (01:14:32) wondering how it could be done. And let (01:14:34) me offer a speculation and I'll explain (01:14:36) why the speculation is probably false. (01:14:38) So the speculation is okay. So the brain (01:14:43) it's like the brain has those regions. (01:14:46) You know the brain regions. We have our (01:14:48) cortex, right? (01:14:49) >> Yeah. (01:14:50) >> It has all those brain regions and the (01:14:52) cortex is uniform. But the brain regions (01:14:55) and and and the neurons in the cortex, (01:14:57) they kind of speak to their neighbors (01:14:58) mostly. And that's explains why you get (01:15:00) brain regions because if you want to do (01:15:02) some kind of speech processing, all the (01:15:04) neurons that do speech need to talk to (01:15:06) each other and they can and because (01:15:07) neurons can only speak to their nearby (01:15:08) neighbors for the most part, it has to (01:15:10) be a region. All the regions are mostly (01:15:13) located in the same place from person to (01:15:14) person. So maybe evolution hardcoded (01:15:17) literally a location on the brain. (01:15:21) So it says, "Oh, like when when like you (01:15:24) know the GPS of the brain, GPS (01:15:26) coordinates, such and such, when that (01:15:28) fires, that's what you should care (01:15:29) about." Like maybe that's what evolution (01:15:30) did cuz that would be within the toolkit (01:15:32) of evolution. Yeah. Although there are (01:15:35) examples where for example people who (01:15:37) are born blind have that area of their (01:15:39) cortex adopted by (01:15:42) another sense and I have no idea but I'd (01:15:47) be surprised if the desires or the (01:15:51) reward functions which require visual (01:15:54) signal no longer worked. You know people (01:15:57) who have their different areas of their (01:15:58) cortex co-opted. For example, if you no (01:16:00) longer have vision, (01:16:03) can you still feel the sense that I want (01:16:05) people around me to like me and so (01:16:07) forth, which usually there's also visual (01:16:09) cues for. (01:16:10) >> So, I actually fully agree with that. I (01:16:11) I think there's an even stronger counter (01:16:13) argument to this theory, (01:16:14) >> which is like if you think about people, (01:16:17) so there are people who get half of (01:16:19) their brain removed in childhood. (01:16:23) >> Yeah. and they still have all their (01:16:25) brain regions, but they all somehow move (01:16:26) to just one hemisphere, which suggests (01:16:28) that the brain regions the the location (01:16:31) is not fixed. And so that theory is not (01:16:33) true. It would have been cool if it was (01:16:35) true, but it's not. And so I think (01:16:37) that's a mystery, but it's an (01:16:38) interesting mystery. Like the fact is (01:16:40) somehow (01:16:42) >> evolution was able to endow us to care (01:16:44) about social stuff very very reliably. (01:16:47) And even people who have like all kinds (01:16:48) of strange mental conditions and (01:16:50) deficiencies and emotional problems tend (01:16:53) to care about this. Also, AI tools like (01:16:55) defakes, voice clones, and agents have (01:16:59) dramatically increased the (01:17:00) sophistication of fraud and abuse. So, (01:17:03) it's more important than ever to (01:17:05) actually understand the identity and (01:17:07) intent of whoever or whatever is using (01:17:10) your platform. That's exactly what (01:17:12) Sardine helps you do. Sardine brings (01:17:14) together thousands of device behavior (01:17:17) and identity signals to help you assess (01:17:19) risk. Everything from how a user types (01:17:22) or moves their mouse or holds their (01:17:24) device to whether they're hiding their (01:17:26) true location behind a VPN to whether (01:17:29) they're injecting a fake camera feed (01:17:31) during KYC selfie checks. Sardine (01:17:33) combines these signals with insights (01:17:36) from their network of almost 4 billion (01:17:38) devices. things like a user's history of (01:17:40) fraud or their associations with other (01:17:42) high-risisk accounts so you can spot bad (01:17:45) actors before they do damage. This would (01:17:48) literally be impossible if you only use (01:17:50) data from your own application. Sardine (01:17:53) doesn't stop at detection. They offer a (01:17:54) suite of agents to streamline onboarding (01:17:57) checks and automate investigations. So, (01:17:59) as fraudsters use AI to scale their (01:18:01) attacks, you can use AI to scale your (01:18:04) defenses. Go to sardine.ai/warcash. (01:18:07) AI/Swarcash (01:18:08) to learn more and download their guide (01:18:10) on AI fraud detection. (01:18:13) What is SSI planning on doing (01:18:15) differently? So presumably your plan is (01:18:17) to be one of the frontier companies when (01:18:20) this time arrives (01:18:22) and then what is (01:18:25) presumably you started SSI because (01:18:26) you're like I I think I have a way of (01:18:28) approaching how to do this safely in a (01:18:30) way that the other companies don't. What (01:18:32) what is that difference? So the way I (01:18:35) would describe it as (01:18:38) there are some ideas that I think are (01:18:40) promising and I want to investigate them (01:18:42) and see if they are indeed promising or (01:18:44) not. It's really that simple. It's an (01:18:46) attempt. I think that if the ideas turn (01:18:49) out to be correct, these ideas that we (01:18:51) discussed around understanding (01:18:54) generalization, (01:18:56) >> if these ideas turn out to be correct, (01:19:01) then I think we will have something (01:19:02) worthy. Will it turn out to be correct? (01:19:05) We are doing research. We are squarely (01:19:08) age of research company. We are making (01:19:10) progress. We've actually made quite good (01:19:12) progress over the past year. But we need (01:19:13) to keep making more progress, (01:19:15) >> more research. (01:19:16) >> And that's how I see it. I see it as an (01:19:19) attempt to be (01:19:23) an attempt to be a voice and a (01:19:25) participant. (01:19:27) Um people have asked uh your co-founder (01:19:31) and previous CEO left to go to Meta (01:19:34) recently and people have asked well if (01:19:38) there was a lot of breakthroughs being (01:19:40) made that seems like a thing that should (01:19:41) have been unlikely. I wonder how you (01:19:43) respond. (01:19:43) >> Yeah. So I in for for this I will simply (01:19:47) remind a few facts that may have been (01:19:50) forgotten and I think this these facts (01:19:52) which provide the context I think they (01:19:54) explain the situation. So the context (01:19:56) was that we were fundraising at a 32 (01:20:00) billion valuation (01:20:02) and then Meta um came in and offered to (01:20:06) to acquire us and I said no (01:20:11) but my former co-founder (01:20:14) like in some sense said yes and as a (01:20:17) result he also was able to enjoy from a (01:20:20) lot of near-term liquidity and he was (01:20:23) the only person from SSI to join Meta. (01:20:25) It sounds like SSI's plan is to be a (01:20:27) company that is at the frontier when you (01:20:29) get to this (01:20:31) very important period in human history (01:20:34) where you have superhuman intelligence (01:20:35) and you have these ideas about how to (01:20:37) make superhuman intelligence go well but (01:20:40) other companies will be trying their own (01:20:41) ideas. What distinguishes SSI's approach (01:20:45) to making super intelligence go well? (01:20:48) The (01:20:48) >> the main thing that distinguishes SSI is (01:20:52) its technical approach. (01:20:55) So we have a different technical (01:20:56) approach that I think is worthy (01:20:59) and we are pursuing it. (01:21:02) I maintain that in the end there will be (01:21:04) a convergence of strategies. So I think (01:21:06) there will be a convergence of (01:21:08) strategies where (01:21:10) at some point as AI becomes more (01:21:13) powerful (01:21:15) it's going to become more or less (01:21:17) clearer to everyone what the strategy (01:21:18) should be. And it should be something (01:21:20) like, yeah, you need to find some way to (01:21:23) talk to each other. And you want your (01:21:25) first (01:21:27) actual like real super intelligent AI to (01:21:29) be aligned and somehow be, (01:21:36) you know, care for sentient life, care (01:21:38) for people, democratic, one of those, (01:21:41) some combination of thereof. And I think (01:21:45) this is the condition (01:21:47) that everyone should strive for and (01:21:50) that's what SSI is striving for and I (01:21:53) think that with time if not already all (01:21:57) the other companies will realizing that (01:21:59) they're striving towards the same thing (01:22:00) and we'll see. I think that the world (01:22:02) will truly change as AI becomes more (01:22:03) powerful. (01:22:04) >> Yeah. (01:22:04) >> And I think a lot of these forecasts (01:22:06) will like I think things will be really (01:22:09) different and people will be acting (01:22:11) really differently. What speaking of (01:22:13) forecast what are your forecasts to this (01:22:16) system you're describing which can learn (01:22:17) as well as a human and (01:22:21) subsequently as a result become (01:22:22) superhuman. (01:22:24) >> I think like uh 5 to 20 (01:22:27) >> 5 to 20 years. (01:22:28) >> Mhm. (01:22:28) >> So I just want to unroll your (01:22:32) how you might see the world coming. It's (01:22:34) like we have a couple more years where (01:22:36) these other companies are continuing the (01:22:38) current approach and it stalls out and (01:22:40) stalls out here meaning they earn no (01:22:42) more than low hundreds of billions in (01:22:44) revenue or how do you think about what (01:22:45) stalling out means? (01:22:47) >> Yeah, (01:22:49) I think the re I think it could I think (01:22:50) it could stall out and (01:22:54) I think stalling out will look like (01:22:57) it will all look very similar. (01:22:58) >> Yeah. (01:22:59) >> Among all the different companies (01:23:01) something like this. I'm not sure (01:23:02) because I think I think I think even (01:23:04) with I think even I think even with (01:23:06) stolen out I think these companies could (01:23:07) make a stupendous stupendous revenue (01:23:10) maybe not profits because they will be (01:23:12) it will be they will need to work hard (01:23:14) to differentiate each other from (01:23:16) themselves but revenue definitely (01:23:18) >> but there's something in your model (01:23:20) implies that (01:23:23) the when the correct solution does (01:23:25) emerge there will be convergence between (01:23:26) all the companies and I'm curious why (01:23:29) you think that's the case (01:23:30) >> well I was talking more about converg (01:23:31) convergence on their larger strategies. (01:23:33) >> I think eventual convergence on the (01:23:35) technical approach is probably going to (01:23:36) happen as well but I I was alluding to (01:23:39) convergence to the larger strategies. (01:23:41) What what what exactly is the thing that (01:23:42) should be done? (01:23:43) >> I I just want to better understand how (01:23:45) you see the future on rolling. So (01:23:47) currently we have these different (01:23:48) companies and you expect their approach (01:23:49) to continue generating revenue. Yes. (01:23:51) >> But not get to this humanlike learner. (01:23:53) >> Yes. (01:23:54) >> So now we have these different forks of (01:23:56) companies. We have you we have thinking (01:23:58) machines. There's a bunch of other labs. (01:24:00) >> Yes. and maybe one of them figures out (01:24:01) the correct approach (01:24:03) >> but then the release of their product (01:24:05) makes it clear to other people how to do (01:24:07) this thing. (01:24:08) >> I think it won't be clear how to do it (01:24:10) thing but it will be clear that (01:24:11) something different is possible (01:24:12) >> right (01:24:13) >> and that is information and I think (01:24:15) people will will then be trying to (01:24:18) figure out how how that's how that (01:24:20) works. I do think though that one of the (01:24:23) things that's that I think you know not (01:24:27) addressed here not discussed is that (01:24:30) with each increase in the AI's (01:24:33) capabilities I think there will be some (01:24:35) kind of changes but I don't know exactly (01:24:38) which ones in how things are being done. (01:24:41) So like I think it's going to be (01:24:44) important yet I can't spell out what (01:24:46) that is exactly. (01:24:47) >> And how how are the (01:24:50) by default you would expect the company (01:24:52) that has the model company that has that (01:24:54) model to be getting all these gains (01:24:55) because they have the model that is (01:24:57) learning how to do all has the skills (01:25:00) and knowledge that it's building up in (01:25:01) the world. What is the reason to think (01:25:03) that the benefits of that would be (01:25:04) widely distributed and not just end up (01:25:06) at whatever model company gets this (01:25:08) continuous learning loop going first? (01:25:11) >> Like I think that empirically what (01:25:13) happen so here here is what I think is (01:25:16) going to happen. Number one, I think (01:25:18) empirically when (01:25:23) let's let's look at let's look at how (01:25:25) things have gone so far with um the AIs (01:25:28) of the past. So one company produced an (01:25:30) advance and the other company scrambled (01:25:34) and produced some competi some some (01:25:37) similar things after some amount of time (01:25:40) and they started to compete in the (01:25:42) market and push their push the prices (01:25:45) down (01:25:46) >> and so I think from the market (01:25:47) perspective I think something similar (01:25:49) will happen there as well even if (01:25:51) someone okay so okay so okay so okay so (01:25:52) okay so okay so okay so okay so okay so (01:25:52) okay so okay we talking about the good (01:25:53) world by the way where (01:25:57) what's the good world. What's the good (01:25:59) world? (01:26:01) Where we have these powerful humanlike (01:26:05) learners that are also like and by the (01:26:08) way maybe there there's another thing we (01:26:10) haven't discussed on the on the the spec (01:26:13) of the super intelligent AI that I think (01:26:15) is worth considering is that you make it (01:26:18) narrow (01:26:20) can be useful and narrow at the same (01:26:21) time. So you can have lots of narrow (01:26:23) super intelligent AIs. But suppose you (01:26:25) have many of them (01:26:29) and you have some and you have some (01:26:30) company that's producing a lot of um (01:26:33) profits from it and then you have (01:26:35) another company that comes in and starts (01:26:37) to compete and the way the competition (01:26:39) is going to work is through (01:26:40) specialization. (01:26:42) I think what's going to happen is that (01:26:45) the way (01:26:48) competition like competition loves (01:26:50) specialization and you see it in the (01:26:53) market, you see it in evolution as well. (01:26:54) So you're going to have lots of (01:26:55) different niches and you're going to (01:26:56) have lots of different companies who are (01:26:58) occupying different niches in (01:27:02) in this kind of world where you might (01:27:04) say yeah like one AI company is really (01:27:06) quite a bit better at some area of (01:27:09) really complicated economic activity and (01:27:11) a different company is better at another (01:27:13) area and the third company is really (01:27:14) good at litigation and that's (01:27:16) contradicted by what humanlike learning (01:27:18) implies is that like it can learn (01:27:20) >> it can but but you have accumulated (01:27:23) learning you have a big investment. You (01:27:25) spent a lot of compute to become really (01:27:28) really really good really phenomenal at (01:27:30) this thing and someone else spent a huge (01:27:33) amount of comput and a huge amount of (01:27:34) experience to get really really good at (01:27:35) some other thing (01:27:36) >> right (01:27:36) >> you apply a lot of human learning to get (01:27:38) there but now like you you are at this (01:27:41) high point where someone else would say (01:27:43) look like I don't want to start learning (01:27:45) what you've learned to go (01:27:46) >> I guess that would require many (01:27:47) different companies to begin at the (01:27:49) human like continual learning agent at (01:27:52) the same time so that they can start (01:27:54) their different research in different (01:27:57) branches. But if one company, (01:28:01) you know, gets that agent first or gets (01:28:03) that learner first, (01:28:05) it does then seem like well, you know, (01:28:08) they could like if you just think about (01:28:10) every single job in the economy, (01:28:13) you just have uh instance learning each (01:28:16) one seems tractable for a company. (01:28:18) >> Yeah, that's that's that's a valid (01:28:20) argument. My my strong intuition is that (01:28:22) it's not how it's going to go. (01:28:24) My strong intuition is that yeah like (01:28:26) the argument says it will go this way. (01:28:28) >> Yeah. (01:28:28) >> But my strong intuition is that it will (01:28:30) not go this way that this is the you (01:28:34) know in in theory there is no difference (01:28:36) between theory and practice. In practice (01:28:37) there is and I think that's going to be (01:28:38) one of those (01:28:39) >> a lot of people's models of recursive (01:28:41) self-improvement literally explicitly (01:28:44) state we will have a million Ilias in a (01:28:47) server that are coming in with different (01:28:48) ideas and this will lead to a super (01:28:50) intelligence emerging very fast. Do you (01:28:52) have some intuition about how (01:28:53) parallelizable the thing you are doing (01:28:55) is? How how what are the gains from (01:28:59) making copies of Ilia? (01:29:01) >> I don't know. I think (01:29:05) I think there'll definitely be there'll (01:29:07) be diminishing returns because you want (01:29:09) you want people who think differently (01:29:10) rather than the same. I think that if (01:29:12) they were literal copies of me, I'm not (01:29:14) sure how much more incremental value (01:29:16) you'd get. I think that (01:29:20) but people who think differently that's (01:29:22) what you want. (01:29:23) >> Why is it that it's been if you look at (01:29:26) different models even released by (01:29:27) totally different companies trained on (01:29:30) potentially non-over overlapping data (01:29:32) sets it's actually crazy how similar (01:29:35) LLMs are to each other. (01:29:36) >> Maybe the data sets are not as non-over (01:29:37) overlapping as it seems. But there's (01:29:41) there's some sense there's like even if (01:29:43) an individual human might be less (01:29:44) productive than the future AI. Maybe (01:29:45) there's something to the fact that human (01:29:46) teams have more diversity than teams of (01:29:49) AIs might have. But how do we elicit (01:29:51) meaningful diversity among AI? So I (01:29:54) think just raising the temperature just (01:29:55) results in gibberish. I think you want (01:29:57) something more like (01:29:58) >> different scientists have different (01:30:00) different prejudices or different ideas. (01:30:01) How do you get that kind of diversity (01:30:03) among AI agents? So the reason there has (01:30:06) been no diversity I believe is because (01:30:09) of pre-training. (01:30:11) All the pre-trained models are the same (01:30:13) pretty much because the pre-train on the (01:30:16) same data. Now RL and postraining is (01:30:19) where some differentiation starts to (01:30:21) emerge because different people come up (01:30:23) with different RL training. (01:30:25) >> Yeah. And then I've heard you hint in (01:30:28) the past about selfplay as a way to (01:30:31) either get data or match agents to other (01:30:34) agents of equivalent intelligence to (01:30:36) kick off learning. How should we think (01:30:39) about why there's no public (01:30:44) um proposals of this kind of thing and (01:30:46) working with LLM? (01:30:47) >> I would say there are two things to say. (01:30:49) I would say that the reason why I (01:30:51) thought selfplayful is interesting (01:30:54) is because it offered a way to create (01:30:57) models using compute only without data, (01:31:00) right? And if you think that data is the (01:31:02) ultimate bottleneck, then using compute (01:31:04) only is very interesting. So that's what (01:31:07) makes it interesting. Now the (01:31:11) the thing is (01:31:14) that selfplay at least the way it was (01:31:17) done in the past when you have agents (01:31:19) which are somehow compete with each (01:31:20) other it's only good for developing a (01:31:23) certain set of skills it is too narrow. (01:31:26) It's only good for like negotiation (01:31:29) uh conflict (01:31:31) certain social skills (01:31:33) strategizing that kind of stuff. And so (01:31:35) if you care about those skills then (01:31:37) selfplay will be useful. Now actually I (01:31:40) think that selfplay (01:31:42) did (01:31:43) find a home but just in a different form (01:31:47) in a different form. So things like (01:31:49) debate prove a verifier. You have some (01:31:53) kind of an LLM as a judge which is also (01:31:56) incentivized to find mistakes in your (01:31:58) work. You could say this is not exactly (01:32:00) selfplay but this is you know a related (01:32:02) adversarial setup that people are doing. (01:32:04) believe (01:32:04) >> and really selfplay is an example of um (01:32:07) is a special case of more general like (01:32:10) um competition between between agents, (01:32:13) >> right? The response the natural response (01:32:14) to competition is to try to be (01:32:16) different. And so if you were to put (01:32:17) multiple agents and you tell them, you (01:32:19) know, you all need to work on some (01:32:21) problem and you're an agent and you're (01:32:24) inspecting what everyone else is (01:32:25) working, you're going to say, well, if (01:32:28) they already taken this approach, it's (01:32:30) not clear I should pursue it. they (01:32:32) should pursue something differentiated (01:32:34) and so I think that something like this (01:32:35) could also create an incentive for um a (01:32:38) diversity of approaches. (01:32:39) >> Yeah. Um final question, (01:32:43) what is research taste? You're obviously (01:32:47) the person in the world who is (01:32:50) considered to have the best taste in (01:32:54) doing research in AI. you were uh the (01:32:58) co-author on many of the biggest the (01:33:01) biggest things that have happened in the (01:33:02) history of deep learning from Alex net (01:33:03) to GBT3 to so on what is it that how do (01:33:06) you characterize how (01:33:09) you come up with these ideas (01:33:11) >> I can answer so I can comment on this (01:33:13) for myself (01:33:14) >> I think different people do it (01:33:16) differently (01:33:18) >> but one thing that um guides me (01:33:21) personally (01:33:23) is (01:33:24) an aesthetic (01:33:26) of how AI should be (01:33:29) >> by thinking about how people are but (01:33:31) thinking correctly (01:33:33) >> like it's very easy to think about how (01:33:35) people are incorrectly but what does it (01:33:37) mean to think about people correctly (01:33:39) >> so I'll give you some examples (01:33:42) the idea of the artificial neuron is (01:33:46) directly inspired by the brain and it's (01:33:48) a great idea why because you say sure (01:33:50) the brain has all these different organs (01:33:52) has the faults but the faults probably (01:33:54) don't matter M (01:33:55) >> why do we think that the neurons matter? (01:33:56) Because there's many of them. It kind of (01:33:59) feels right. So you want the neuron. (01:34:01) >> Yeah. (01:34:01) >> You want some kind of local learning (01:34:03) rule that will change the connections. (01:34:04) You want some local learning rule rule (01:34:06) that will change the connections between (01:34:07) the neurons, (01:34:10) >> right? It feels plausible that the brain (01:34:12) does it. The idea of the distributed (01:34:13) representation, (01:34:16) the idea that the brain, (01:34:18) you know, the brain responds to (01:34:19) experience or neural network should (01:34:20) learn from experience, not response. The (01:34:22) brain learns from experience. (01:34:24) the neural network of experience and you (01:34:27) kind of ask yourself is some is (01:34:29) something fundamental or not fundamental (01:34:30) how things should be (01:34:32) >> and I think that's been guiding me a (01:34:34) fair bit kind of thinking from multiple (01:34:37) angles and looking for almost beauty (01:34:40) beauty simplicity ugliness there's no (01:34:42) room for ugliness it's just beauty (01:34:44) simplicity elegance correct inspiration (01:34:47) from the brain and all of those things (01:34:49) need to be present at the same time and (01:34:51) the more they are present the more (01:34:53) confident you can be in a top- down (01:34:55) belief. And then the top down belief is (01:34:58) the thing that sustains you when the (01:35:00) experiments contradict you. Because if (01:35:02) you just trust the data all the time, (01:35:05) well, sometimes you can be doing a (01:35:06) correct thing, but there's a bug. (01:35:08) >> But you don't know that there is a bug. (01:35:09) How can you tell that there is a bug? (01:35:11) >> How do you know if you should keep (01:35:12) debugging or you conclude it's the wrong (01:35:14) direction? Well, it's the top down. (01:35:16) Well, how should you can say the things (01:35:18) have to be this way? Something like this (01:35:20) has to work. Therefore, we got to keep (01:35:22) going. That's the top down. And it's (01:35:24) based on this like multifaceted beauty (01:35:27) and inspiration by the brain. (01:35:30) >> All right, we'll leave it there. (01:35:32) >> Thank you so much. (01:35:32) >> Thank you so much. (01:35:35) >> All right. Appreciate it. (01:35:36) >> That was great. (01:35:36) >> Yeah, I enjoyed it. (01:35:38) >> Yes, me too. (01:35:39) >> Hey everybody, I hope you enjoyed that (01:35:41) episode. If you did, the most helpful (01:35:43) thing you can do is just share it with (01:35:45) other people who you think might enjoy (01:35:46) it. It's also helpful if you leave a (01:35:49) rating or a comment on whatever platform (01:35:51) you're listening on. If you're (01:35:53) interested in sponsoring the podcast, (01:35:55) you can reach out at (01:35:56) dwarcash.com/advertise. (01:36:00) Otherwise, I'll see you on the next one.

Leave a Reply

Your email address will not be published. Required fields are marked *