↔
Title: Ilya Sutskever – We’re moving from the age of scaling to the age of research
Duration: 01:36:03
Total Correct Answers:
Current Caption
Correct
Learning Modes
YouTube Video Transcript Hide
Ask AI:
Export as:
Ask AI Result
The ask AI result will appear here..
(00:00:00) Your YouTube transcript will appear here
(00:00:00)
You know what's crazy that all of this
(00:00:03)
is real?
(00:00:04)
>> Yeah. Meaning what?
(00:00:05)
>> Don't you think so?
(00:00:06)
>> Meaning what?
(00:00:07)
>> Like all this AI stuff and all this
(00:00:09)
area. Yeah. That it's happen like
(00:00:11)
>> isn't it straight out of science
(00:00:13)
fiction?
(00:00:14)
>> Yeah. Another thing that's crazy is like
(00:00:16)
how normal the slow takeoff feels. The
(00:00:19)
idea that we'd be investing 1% of GDP in
(00:00:23)
AI, like I feel like it would have felt
(00:00:25)
like a bigger deal, you know, where
(00:00:26)
right now it just feels like
(00:00:27)
>> you get used to things pretty fast.
(00:00:29)
Turns out Yeah. But also it's kind of
(00:00:31)
like it's abstract like what does it
(00:00:33)
mean? What it means that you see it in
(00:00:35)
the news.
(00:00:36)
>> Yeah.
(00:00:36)
>> That such and such company announced
(00:00:38)
such and such dollar amount,
(00:00:39)
>> right?
(00:00:40)
>> That's that's all you see,
(00:00:42)
>> right?
(00:00:43)
>> It's not really felt in any other way so
(00:00:45)
far.
(00:00:45)
>> Yeah. Should we actually begin here? I
(00:00:47)
think this is an interesting discussion.
(00:00:48)
>> Sure.
(00:00:49)
>> I think your point about well from the
(00:00:51)
average person's point of view, nothing
(00:00:54)
is that different will continue being
(00:00:55)
true even into the singularity.
(00:00:57)
>> No, I don't think so. Okay. Interesting.
(00:01:00)
>> So the thing which I was referring to
(00:01:03)
not feeling different
(00:01:06)
is okay. So such and such company
(00:01:08)
announced some difficult to comprehend
(00:01:11)
dollar amount of investment
(00:01:12)
>> right
(00:01:13)
>> I don't think anyone knows what to do
(00:01:14)
with that.
(00:01:15)
>> Yeah.
(00:01:16)
>> But I think that the impact of AI is
(00:01:20)
going to be felt.
(00:01:22)
>> AI is going to be diffused through the
(00:01:24)
economy. There are very strong economic
(00:01:26)
forces for this and I think the impact
(00:01:28)
is going to be felt very strongly.
(00:01:31)
>> When do you expect that impact? I think
(00:01:33)
the models seem smarter than their
(00:01:36)
economic impact would imply.
(00:01:38)
>> Yeah, this is
(00:01:41)
one of the very confusing things about
(00:01:43)
the models right now. How to reconcile
(00:01:47)
the fact that they are doing so well on
(00:01:52)
evals.
(00:01:52)
>> Mhm. And you look at the evals and you
(00:01:54)
go, those are pretty hard evals,
(00:01:56)
>> right?
(00:01:57)
>> They're doing so well,
(00:02:00)
>> but the economic impact seems to be
(00:02:03)
dramatically behind. And it's almost
(00:02:06)
like
(00:02:08)
it's it's very difficult to make sense
(00:02:10)
of how can the model on the one hand do
(00:02:12)
these amazing things and then on the
(00:02:14)
other hand like repeat itself twice in
(00:02:18)
some situation in a kind of a an example
(00:02:21)
would be let's say you use VIP coding to
(00:02:23)
do something and you go to some place
(00:02:25)
and then you get a bug and then you tell
(00:02:28)
the model can you please fix the bug?
(00:02:30)
>> Yeah. And the model says, "Oh my god,
(00:02:32)
you're so right. I have a bug. Let me go
(00:02:34)
fix that." And it introduces a second
(00:02:35)
bug.
(00:02:36)
>> Yeah.
(00:02:37)
>> And then you tell it you have you have
(00:02:38)
this new second bug and it tells you,
(00:02:40)
"Oh my god, how could I've done it?
(00:02:41)
You're so right again."
(00:02:43)
>> And brings back the first bug. And you
(00:02:45)
can alternate between those.
(00:02:46)
>> Yeah.
(00:02:46)
>> And it's like, how is that possible?
(00:02:48)
>> Yeah.
(00:02:49)
>> It's like I'm not sure. But it does
(00:02:52)
suggest that the
(00:02:55)
something strange is going on. I have
(00:02:57)
two possible explanations. So here this
(00:02:59)
is the more kind of a whimsical
(00:03:02)
explanation is that maybe a real
(00:03:04)
training makes the models a little bit
(00:03:06)
too single-minded and narrowly focused a
(00:03:09)
little bit too
(00:03:11)
I don't know unaware
(00:03:14)
even though it also makes them aware in
(00:03:15)
some other ways
(00:03:17)
and because of this they can't do basic
(00:03:20)
things but there is another explanation
(00:03:22)
which is
(00:03:24)
back when people were doing pre-training
(00:03:28)
the question of what data to train on
(00:03:31)
was answered because the that answer was
(00:03:34)
everything.
(00:03:35)
>> Yeah.
(00:03:36)
>> When you do pre-training, you need all
(00:03:39)
the data.
(00:03:41)
So you don't have to think is it going
(00:03:43)
to be this data or that data.
(00:03:44)
>> Yeah.
(00:03:45)
>> But when people do RL training, they do
(00:03:48)
need to think. They say okay, we want to
(00:03:50)
have this kind of RL training for this
(00:03:52)
thing and that kind of RL training for
(00:03:54)
that thing. And from what I hear, all
(00:03:57)
the companies have teams that just
(00:03:59)
produce new RL environments and just add
(00:04:01)
it to the training mix. And then the
(00:04:03)
question is, well, what are those? There
(00:04:04)
are so many degrees of freedom. There is
(00:04:06)
such a huge variety of environments you
(00:04:08)
could produce. And one of the
(00:04:13)
one thing you could do, and I think
(00:04:14)
that's something that is done
(00:04:16)
inadvertently,
(00:04:17)
is that people take inspiration from the
(00:04:21)
evals. you say, "Hey, I would love our
(00:04:24)
model to do really well when we release
(00:04:26)
it. I want the EVOS to look great."
(00:04:29)
What would be RL training that could
(00:04:32)
help on this task, right? I think that
(00:04:35)
is something that happens and I think it
(00:04:37)
could explain a lot of what's going on.
(00:04:39)
If you combine this with generalization
(00:04:42)
of the models actually being inadequate,
(00:04:45)
that has the potential to explain a lot
(00:04:47)
of what we are seeing. this disconnect
(00:04:49)
between eval performance and actual real
(00:04:53)
real world performance which is
(00:04:54)
something that we don't today exactly
(00:04:57)
even understand what what we mean by
(00:05:00)
that I I like this idea that the real
(00:05:03)
reward hacking is a human researchers
(00:05:05)
who are too focused on the evals um I
(00:05:09)
think there's two ways to understand or
(00:05:12)
to try to think about what what you have
(00:05:14)
just pointed out one is look if it's the
(00:05:18)
case that simply by becoming superhuman
(00:05:20)
at a coding competition, a model will
(00:05:23)
not automatically become more tasteful
(00:05:26)
and exercise better judgment about how
(00:05:28)
to improve your codebase. Well, then you
(00:05:30)
should expand the suite of environments
(00:05:33)
such that you're not just testing it on
(00:05:35)
having the best performance in coding
(00:05:36)
competition. It should also be able to
(00:05:38)
make the best kind of application for X
(00:05:40)
thing or Y thing or Z thing. And another
(00:05:43)
maybe this is what you're hinting at is
(00:05:45)
to say why should it be the case in the
(00:05:47)
first place that becoming super human at
(00:05:50)
coding competitions doesn't make you a
(00:05:52)
more tasteful programmer more generally.
(00:05:54)
Maybe the thing to do is not to keep
(00:05:57)
stacking up the amount of environments
(00:05:59)
and the diversity of environments to
(00:06:00)
figure out approach with let you learn
(00:06:02)
from one environment and improve your
(00:06:06)
performance on something else. So I have
(00:06:09)
I have an analog a human analogy which
(00:06:11)
might be helpful. So even the case let's
(00:06:14)
take the case of competitive programming
(00:06:16)
since you mentioned that and suppose you
(00:06:18)
have two students
(00:06:20)
one of them work decided they want to be
(00:06:23)
the best competitive programmer. So they
(00:06:24)
will practice 10,000 hours for that
(00:06:28)
domain. They will solve all the
(00:06:30)
problems, memorize all the proof
(00:06:31)
techniques and be very very you know
(00:06:36)
be very skilled at quickly and correctly
(00:06:38)
implementing all the algorithms and by
(00:06:40)
doing by doing so they became the best
(00:06:43)
one of the best student number two
(00:06:46)
thought oh competitive programming is
(00:06:47)
cool maybe they practiced for 100 hours
(00:06:50)
>> much much less and they also did really
(00:06:52)
well which one do you think is going to
(00:06:54)
do better in their career later on
(00:06:56)
>> the second
(00:06:57)
>> right and I think that's basically
(00:06:59)
what's going on. The models are much
(00:07:00)
more like the first student but even
(00:07:01)
more because then we say okay so the
(00:07:04)
model should be good at competitive
(00:07:05)
programming so let's get every single
(00:07:07)
competitive programming problem ever and
(00:07:10)
then let's do some data augmentation so
(00:07:12)
we have even more competitive
(00:07:13)
programming problems
(00:07:14)
>> yes
(00:07:15)
>> and we train on that and so now you got
(00:07:17)
this great competitive programmer and
(00:07:18)
with this analogy I think it's more
(00:07:20)
intuitive I think it's more intuitive
(00:07:22)
with this analogy that yeah okay so if
(00:07:25)
it's so well trained okay it's like all
(00:07:27)
the different algorithms and all the
(00:07:29)
proof techniques are like right at it at
(00:07:31)
its fingertips
(00:07:32)
and it's more intuitive that with this
(00:07:34)
level of preparation it not would not
(00:07:36)
necessarily generalize to other things.
(00:07:40)
>> But then what is the um analogy for what
(00:07:42)
the second student is doing before they
(00:07:45)
do the 100 hours of fine-tuning.
(00:07:48)
>> I think it's like
(00:07:51)
they have it. I think it's the it
(00:07:53)
factor.
(00:07:54)
>> Yeah.
(00:07:54)
>> Right. And like I know like when I was
(00:07:56)
in undergrad, I remember there was there
(00:07:58)
was a student like this that studied
(00:07:59)
with me. So I I know it exists.
(00:08:01)
>> Yeah. I think it's interesting to
(00:08:03)
distinguish it from whatever
(00:08:05)
pre-training does. So one way to
(00:08:07)
understand what you just said about we
(00:08:09)
don't have to choose the data in
(00:08:10)
pre-training is to say actually it's not
(00:08:13)
dissimilar to the 10,000 hours of
(00:08:15)
practice. It's just that you get that
(00:08:16)
10,000 hours of practice for free
(00:08:19)
because it's already somewhere in the
(00:08:21)
pre-training distribution. But it's like
(00:08:23)
maybe you're suggesting actually there's
(00:08:25)
actually not that much generalization
(00:08:26)
for pre-training. There's just so much
(00:08:27)
data in pre-training but it's like it's
(00:08:29)
not necessarily generalizing better than
(00:08:30)
RL.
(00:08:31)
>> Like the main the main strength of
(00:08:33)
pre-training is that there is a so much
(00:08:35)
of it.
(00:08:35)
>> Yeah.
(00:08:36)
>> And b you don't have to think hard about
(00:08:40)
what data to put into pre-training.
(00:08:43)
>> And it's a very kind of natural data and
(00:08:45)
it does include in it a lot of what
(00:08:48)
people do.
(00:08:49)
>> Yeah.
(00:08:50)
people's thoughts and a lot of the
(00:08:54)
features of you know it's like the whole
(00:08:56)
world as projected by people onto text.
(00:08:59)
>> Yeah.
(00:09:00)
>> And pre-training tries to capture that
(00:09:02)
using a huge amount of data.
(00:09:04)
It's it's very the pre-training is very
(00:09:08)
difficult to reason about because it's
(00:09:10)
so hard to understand the manner in
(00:09:14)
which the model relies on pre-training
(00:09:16)
data. And whenever the model makes a
(00:09:19)
mistake, could it be because something
(00:09:22)
by chance is not as supported by the
(00:09:24)
pre-training data? You know, and pre
(00:09:26)
support by pre-training is maybe a loose
(00:09:28)
term.
(00:09:31)
I I don't know if I can add anything
(00:09:33)
more useful on this, but
(00:09:36)
I don't think there is a human analog to
(00:09:38)
pre-training. Um, here's analogies that
(00:09:40)
people have proposed for what the human
(00:09:42)
analogy to pre-training is, and I'm
(00:09:44)
curious to get your thoughts on why
(00:09:46)
they're potentially wrong. One is to
(00:09:49)
think about the first 18 or 15 or 13
(00:09:52)
years of a person's life when they
(00:09:54)
aren't necessarily economically
(00:09:56)
productive, but they are doing something
(00:09:58)
that is making them understand the world
(00:10:02)
better and so forth. And the other is to
(00:10:05)
think about evolution as doing some kind
(00:10:08)
of search for three billion years which
(00:10:09)
then results in a human lifetime
(00:10:13)
instance. And then I'm curious if you
(00:10:16)
think either of these are actually
(00:10:16)
analogous to pre-training or how how
(00:10:18)
would you think about at least what
(00:10:19)
lifetime human learning is like if not
(00:10:22)
pre-training. I think there are some
(00:10:24)
similarities between both of these two
(00:10:27)
pre-training and pre-training tries to
(00:10:29)
play the role of both of these
(00:10:31)
>> but I think there are some big
(00:10:32)
differences as well.
(00:10:35)
The amount of pre-training data is
(00:10:38)
very very staggering.
(00:10:40)
>> Yes.
(00:10:41)
>> And somehow a a human being after even
(00:10:45)
15 years with a tiny fraction of that
(00:10:47)
pre-training data they know much less.
(00:10:49)
>> Yeah. But whatever they do know they
(00:10:51)
know much more deeply somehow and the
(00:10:54)
mistakes like like already at that age
(00:10:57)
you would not make mistakes that are
(00:10:59)
make.
(00:10:59)
>> Yeah. There is another thing you might
(00:11:01)
say could it be something like evolution
(00:11:03)
and the answer is maybe but in this case
(00:11:05)
I think evolution might actually have an
(00:11:07)
edge like there is this I remember
(00:11:11)
reading about this case where some you
(00:11:14)
know that one thing that neuroscientists
(00:11:16)
do or rather one way in which
(00:11:19)
neuroscientists can learn about the
(00:11:20)
brain is by studying people with brain
(00:11:23)
damage to different parts of the brain
(00:11:26)
>> and and so and some people have the most
(00:11:28)
strange symptoms you could imagine. It's
(00:11:30)
actually really really interesting. And
(00:11:32)
there was one case that comes to mind
(00:11:34)
that's relevant.
(00:11:36)
I read about this person who had some
(00:11:39)
kind of brain damage that took out I
(00:11:42)
think a stroke or an accident that took
(00:11:45)
out his emotional
(00:11:48)
processing. So he stopped feeling any
(00:11:50)
emotion
(00:11:52)
and as a result of that you know he
(00:11:55)
still remained very articulate and he
(00:11:57)
could solve little puzzles and on tests
(00:11:59)
he seemed to be just fine but he felt no
(00:12:02)
emotion he didn't feel sad he didn't
(00:12:04)
feel angry he didn't feel animated and
(00:12:07)
he became somehow extremely bad at
(00:12:09)
making any decisions at all it would
(00:12:12)
take him hours to decide on which socks
(00:12:14)
to wear and he would make very bad
(00:12:16)
financial decisions
(00:12:19)
And that's very
(00:12:22)
but does what what does it say about
(00:12:26)
the role of our built-in emotions
(00:12:30)
in making us like a viable agent
(00:12:33)
essentially
(00:12:34)
>> and I guess to connect to your question
(00:12:35)
about pre-training
(00:12:37)
>> it's like maybe pre- like maybe if you
(00:12:40)
are good enough at like getting
(00:12:41)
everything out of pre-training you can
(00:12:43)
get you could get that as well but
(00:12:45)
that's the kind of thing which seems is
(00:12:51)
well it may or may not be possible to
(00:12:53)
get that from pre-training. What is
(00:12:58)
that clearly not just directly emotion?
(00:13:01)
And it seems like some
(00:13:04)
almost value function like thing which
(00:13:06)
is giving telling you which decision to
(00:13:08)
be like what the end reward for any
(00:13:11)
decision should be and you think that
(00:13:13)
doesn't sort of implicitly come from
(00:13:15)
>> I think it could I'm just saying it's
(00:13:17)
not one it's not 100% obvious.
(00:13:19)
>> Yeah. But what is that like what how do
(00:13:23)
you think about emotions and what is the
(00:13:24)
ML analogy for emotions?
(00:13:26)
>> It should be some kind of a value
(00:13:28)
function thing.
(00:13:29)
>> Yeah. But I don't think there is a great
(00:13:31)
ML analogy because right now value
(00:13:32)
functions don't play a very prominent
(00:13:34)
role in uh the things people do.
(00:13:36)
>> It might be worth defining for the
(00:13:38)
audience what a value function is if if
(00:13:39)
you want to do that.
(00:13:40)
>> I mean certainly I I'll be very happy to
(00:13:42)
do that. Right. So
(00:13:49)
so when people do reinforcement learning
(00:13:51)
the way reinforcement learning is done
(00:13:53)
right now how do they do how do people
(00:13:55)
train those agents? So you have your
(00:13:57)
neural net and you give it a problem and
(00:14:00)
then you tell the model go solve it and
(00:14:01)
the model takes maybe thousands hundreds
(00:14:04)
of thousands of actions
(00:14:06)
or thoughts or something and then it
(00:14:08)
produces a solution. The solution is
(00:14:09)
created and then the score
(00:14:13)
is used to provide a training signal for
(00:14:15)
every single action
(00:14:18)
in your trajectory.
(00:14:19)
>> Mhm. So that means that if you are doing
(00:14:23)
something that goes for a long time, if
(00:14:25)
you're training a task that takes a long
(00:14:28)
time to solve, you will do no learning
(00:14:30)
at all until you solve the until you
(00:14:32)
came up with a proposed solution. That's
(00:14:34)
how reinforcement learning is done
(00:14:35)
naively. That's how O1 R1 ostensibly are
(00:14:39)
done.
(00:14:41)
The value function says something like
(00:14:43)
okay look maybe I could sometimes not
(00:14:46)
always could tell you if you're doing
(00:14:49)
well or badly. The notion of a value
(00:14:52)
function is more useful in some domains
(00:14:53)
than others. So for example when you
(00:14:55)
play chess
(00:14:57)
and you lose a piece you know I messed
(00:14:59)
up. You don't need to play the whole
(00:15:01)
game to know that what I just did was
(00:15:03)
bad and therefore whatever um whatever
(00:15:07)
preceded it was also bad. So the value
(00:15:09)
function lets you short circuit the
(00:15:13)
weight until the very end. Like let's
(00:15:15)
suppose that you started to pursue some
(00:15:18)
kind of um okay let's suppose that you
(00:15:20)
are doing some kind of a math thing or a
(00:15:22)
programming thing and you're trying to
(00:15:24)
explore a particular solution direction
(00:15:26)
and after let's say after a thousand uh
(00:15:30)
steps of thinking you concluded that
(00:15:31)
this direction is unpromising.
(00:15:34)
As soon as you conclude this, you could
(00:15:37)
already get a reward signal a thousand
(00:15:40)
time steps previously when you decided
(00:15:42)
to pursue down this path. You say, "Oh,
(00:15:44)
next time I shouldn't pursue this path
(00:15:47)
in a similar situation long before you
(00:15:50)
actually came up with a proposed
(00:15:51)
solution." M this was in the deepcar one
(00:15:53)
paper is that the
(00:15:56)
space of trajectories is so wide that
(00:16:01)
maybe it's hard to learn a mapping from
(00:16:04)
an intermediate trajectory and value and
(00:16:07)
also given that you know in coding for
(00:16:08)
example you'll have the wrong idea then
(00:16:11)
you'll go back then you'll change
(00:16:12)
something
(00:16:13)
>> this sounds like such lack of faith in
(00:16:15)
deep learning
(00:16:16)
>> like I mean sure it might be difficult
(00:16:18)
but
(00:16:19)
>> nothing deep learning can't Yeah.
(00:16:22)
>> So my expectation is that
(00:16:27)
like value functions should be useful
(00:16:29)
and
(00:16:31)
and I fully I fully expect that they
(00:16:32)
will be used in the future if not
(00:16:34)
already. What was I alluding to with the
(00:16:36)
person whose emotional center got
(00:16:40)
>> um damaged is more that
(00:16:45)
maybe what it suggests is that the value
(00:16:48)
function of humans is modulated by
(00:16:50)
emotions in some important way that's
(00:16:53)
hardcoded by evolution
(00:16:55)
and maybe that is important for people
(00:16:57)
to be effective in the world.
(00:17:00)
>> That that's the thing I was actually
(00:17:02)
planning on asking you. There's
(00:17:03)
something really interesting about
(00:17:04)
emotions as a value function, which is
(00:17:05)
that it's impressive that they have this
(00:17:08)
much utility while still being rather
(00:17:13)
um simple to understand.
(00:17:16)
So I have two responses. I do agree that
(00:17:20)
compared to
(00:17:24)
the kind of things that we learn and the
(00:17:25)
things we are talking about, the kind of
(00:17:27)
ads we talking about, emotions are
(00:17:28)
relatively simple.
(00:17:31)
They might even be so simple that maybe
(00:17:33)
you could map them out in a human
(00:17:35)
understandable way. I think it would be
(00:17:36)
cool to do.
(00:17:39)
In terms of utility though, I think
(00:17:41)
there is a thing where you know there is
(00:17:44)
this complexity robustness trade-off
(00:17:48)
where complex things can be very useful
(00:17:53)
but simple things are very useful in a
(00:17:57)
very broad range of situations. And so I
(00:18:00)
think what what one way to interpret
(00:18:01)
what we are seeing is that we've got
(00:18:03)
these emotions that essentially evolved
(00:18:06)
mostly mostly from our mammal ancestors
(00:18:09)
and then fine-tuned a little bit while
(00:18:11)
we were homminids just a bit. We do have
(00:18:14)
like a decent amount of social emotions
(00:18:15)
though which mammals may lack
(00:18:19)
but they're not very sophisticated
(00:18:22)
and because they're not sophisticated
(00:18:23)
they serve us so well in this very
(00:18:25)
different world compared to the one that
(00:18:27)
we've been living in. Actually they they
(00:18:29)
also make mistakes. For example, our
(00:18:31)
emotions well I don't know does hunger
(00:18:33)
count as an emotion
(00:18:36)
debate. It's debatable but I think for
(00:18:37)
example our intuitive feeling of hunger
(00:18:42)
is not succeeding in guiding us
(00:18:46)
correctly in this world with an
(00:18:48)
abundance of food.
(00:18:49)
>> Yeah. People have been talking about
(00:18:51)
scaling data, scaling parameters,
(00:18:54)
scaling compute.
(00:18:56)
Is there a more general way to think
(00:18:58)
about scaling? What are the other
(00:18:59)
scaling axes?
(00:19:01)
So
(00:19:03)
the thing so so here here is a
(00:19:05)
perspective here's a perspective I think
(00:19:07)
might be might be true.
(00:19:10)
So
(00:19:12)
the way ML used to work is that people
(00:19:15)
would just think of it with stuff and
(00:19:17)
try to
(00:19:20)
and try to get interesting results.
(00:19:22)
That's what's been going on in the past.
(00:19:26)
Then
(00:19:28)
the scaling insight arrived, right?
(00:19:31)
Scaling laws, GPT3.
(00:19:34)
And suddenly everyone realized we should
(00:19:37)
scale.
(00:19:39)
And it's just this, this is an example
(00:19:42)
of how language affects thought.
(00:19:46)
Scaling is what just one word, but it's
(00:19:48)
such a powerful word because it informs
(00:19:50)
people what to do. They say, "Okay,
(00:19:52)
let's let's try to scale things." And so
(00:19:54)
you say okay so what are we scaling and
(00:19:57)
pre-training was a thing to scale it was
(00:19:59)
a particular scaling recipe.
(00:20:02)
>> Yes
(00:20:02)
>> the big breakthrough of pre-training is
(00:20:05)
the realization that this recipe is
(00:20:07)
good. So you say hey if you mix some
(00:20:11)
compute with some data into a neural net
(00:20:14)
of a certain size you will get results
(00:20:17)
and you will know that it will be better
(00:20:19)
if you just scale the recipe up. And
(00:20:22)
this is also great. Companies love this
(00:20:24)
because it gives you a very uh lowrisk
(00:20:28)
way of investing.
(00:20:30)
>> Yeah.
(00:20:31)
>> Your resources.
(00:20:32)
>> Yeah.
(00:20:32)
>> Right. It's much harder to invest your
(00:20:34)
resources in research. Compare that. You
(00:20:37)
know, if you research, you need to have
(00:20:38)
like go forth researchers and research
(00:20:40)
and come up with something versus get
(00:20:44)
more data, get more compute. You know,
(00:20:46)
you'll get something from pre-training.
(00:20:49)
And indeed, you know, it looks like I
(00:20:51)
based on various um
(00:20:54)
um things people say on some people say
(00:20:56)
on Twitter, maybe it appears that Gemini
(00:20:59)
have found a way to get more out of
(00:21:01)
pre-training. At some point though,
(00:21:03)
pre-training will run out of data. The
(00:21:04)
data is very clearly finite. And so
(00:21:06)
then, okay, what do you do next? Either
(00:21:08)
you do some kind of a souped-up
(00:21:10)
pre-training, different recipe from the
(00:21:12)
one you've done before, or you're doing
(00:21:14)
a RL or maybe something else. But now
(00:21:16)
that compute is big, computer is now
(00:21:19)
very big. In some sense, we are back to
(00:21:21)
the age of research. So maybe here's
(00:21:23)
another way to put it. Up until 2020,
(00:21:25)
from 201 from 2012 to 2020, it was the
(00:21:29)
age of research. Now from 2020 to 2025,
(00:21:33)
it was the age of scaling or maybe plus
(00:21:36)
minus. Let's add arrow bars to those
(00:21:37)
years because people say this is
(00:21:39)
amazing. You got to scale more. Keep
(00:21:40)
scaling. The one word scaling. But now
(00:21:44)
the scale is so big. Like is is it is
(00:21:46)
the belief really that oh it's so big
(00:21:50)
but if you had 100x more everything
(00:21:52)
would be so different. Like it would be
(00:21:54)
different for sure but like is the
(00:21:56)
belief that if you just 100x the scale
(00:22:00)
everything would be transformed.
(00:22:02)
I don't think that's true. So it's back
(00:22:04)
to the age of research again just with
(00:22:06)
big computers.
(00:22:07)
>> That's a very interesting way to put it.
(00:22:10)
But let me ask you the question you just
(00:22:12)
posed then. What are we scaling and what
(00:22:14)
what is what would it mean to have a
(00:22:16)
recipe? Because I guess I'm not aware of
(00:22:20)
a very clean
(00:22:22)
relationship that almost looks like a
(00:22:24)
law of physics which existed in
(00:22:25)
pre-training that was a power law
(00:22:26)
between data or computer parameters and
(00:22:30)
loss. What is the kind of relationship
(00:22:34)
we should be seeking and how how should
(00:22:36)
we think about what this new recipe
(00:22:37)
might look like? So, we've we've already
(00:22:42)
witnessed a transition from
(00:22:45)
one type of scaling to a different type
(00:22:47)
of scaling, from pre-training to RL.
(00:22:51)
Now, people are scaling RL. Now, based
(00:22:54)
on what people say on Twitter, they
(00:22:57)
spend more compute on RL than on
(00:22:59)
pre-training at this point because RL
(00:23:01)
can actually consume quite a bit of
(00:23:03)
compute. You know, you do very very long
(00:23:06)
rollouts.
(00:23:06)
>> Yes. So it takes a lot of compute to
(00:23:08)
produce those rollouts and then you get
(00:23:10)
relatively small amount of learning per
(00:23:12)
roll out. So you really can spend you
(00:23:14)
really can spend a lot of compute and I
(00:23:17)
could imagine like I wouldn't at this at
(00:23:20)
this st it's it's more like I wouldn't
(00:23:22)
even call it a scale um scaling I would
(00:23:25)
say hey like what are you doing and is
(00:23:28)
the thing you are doing the the the the
(00:23:30)
most productive thing you could be
(00:23:31)
doing? Yeah.
(00:23:32)
>> Can you find a most more productive way
(00:23:34)
of using your compute? We've discussed
(00:23:37)
the value function business earlier and
(00:23:40)
maybe once people get good at value
(00:23:42)
functions they will be using their their
(00:23:45)
um resources more productively
(00:23:47)
and if you find a whole other way of
(00:23:51)
training models you could say is this
(00:23:54)
scaling or is it just using your
(00:23:55)
resources I think it becomes a little
(00:23:57)
bit ambiguous in a sense that when
(00:23:59)
people were in the age of research back
(00:24:01)
then it was like people say hey let's
(00:24:03)
try this and this and this let's try
(00:24:04)
that and that and that oh look something
(00:24:06)
interesting is happening and I think
(00:24:08)
there will be a return to that.
(00:24:10)
>> So if we're back in the era of research,
(00:24:12)
stepping back, what is the part of the
(00:24:14)
recipe that we need to think most about?
(00:24:17)
When you say value function, people are
(00:24:19)
already trying the current recipe, but
(00:24:21)
then having LLM as a judge and so forth,
(00:24:24)
you could say that's a value function,
(00:24:25)
but it sounds like you have something
(00:24:26)
much more fundamental in mind. Do we
(00:24:27)
need do we need to go back to
(00:24:30)
should we even rethink pre-training at
(00:24:33)
all and not just add more steps to the
(00:24:35)
end of that process?
(00:24:36)
>> Yeah. So the the the the discussion
(00:24:39)
about value function I think it was
(00:24:41)
interesting. I want to like emphasize
(00:24:43)
that I think the value function is
(00:24:45)
something like
(00:24:47)
it's going to make RL more efficient
(00:24:50)
and I think that makes a difference but
(00:24:53)
I think that anything you can do with a
(00:24:55)
value function you can do without just
(00:24:57)
more slowly.
(00:24:58)
>> Mhm.
(00:25:00)
>> The thing which I think is the most
(00:25:02)
fundamental is that these models somehow
(00:25:04)
just generalize dramatically worse than
(00:25:06)
people.
(00:25:07)
>> Yes.
(00:25:08)
>> And it's super obvious.
(00:25:10)
That's that seems like a very
(00:25:12)
fundamental thing.
(00:25:13)
>> Okay, so this is the crux of
(00:25:15)
generalization and there's two
(00:25:18)
sub questions.
(00:25:21)
There's one which is about sample
(00:25:22)
efficiency which is why should it take
(00:25:24)
so much more data for these models to
(00:25:25)
learn than humans. There's a second
(00:25:27)
about even separate from the amount of
(00:25:30)
data it takes, there's a question of why
(00:25:32)
is it so hard to teach the thing we want
(00:25:34)
to a model than to a human which is to
(00:25:36)
say for to a human that we don't
(00:25:39)
necessarily need a verifiable reward to
(00:25:41)
be able to
(00:25:43)
you're probably mentoring a bunch of
(00:25:45)
researchers right now and you're you
(00:25:47)
know talking with them you're showing
(00:25:48)
them your code and you're showing them
(00:25:50)
how you think and from that they're
(00:25:52)
picking up your way of thinking and how
(00:25:53)
they should do research. You don't have
(00:25:55)
to set like a verifiable reward for them
(00:25:57)
that's like okay this is the next part
(00:25:58)
of the curriculum and now this is the
(00:25:59)
next part of your curriculum and oh it
(00:26:01)
was this training was unstable and we
(00:26:03)
gota there's not this shleppy bespoke
(00:26:05)
process. So perhaps these two issues are
(00:26:08)
actually related in some way but I'd be
(00:26:10)
curious to
(00:26:12)
explore this this second thing which
(00:26:14)
feels more like continual learning and
(00:26:15)
this first thing which feels just like
(00:26:17)
um sample efficiency.
(00:26:19)
>> Yeah. So you know you could actually
(00:26:21)
wonder one one possible explanation for
(00:26:25)
the human sample efficiency that needs
(00:26:28)
to be considered is evolution
(00:26:31)
and evolution has given us a small
(00:26:34)
amount of the mo the most useful
(00:26:36)
information possible
(00:26:39)
and for things like vision
(00:26:41)
hearing and locomotion
(00:26:44)
I think there's a pretty strong case
(00:26:46)
that evolution actually has given us a
(00:26:48)
lot. Mhm.
(00:26:50)
>> So for example, human dexterity far
(00:26:53)
exceeds I mean robots can become
(00:26:56)
dexterous too if you subject them to
(00:26:58)
like a huge amount of training in
(00:26:59)
simulation. But to train a robot in the
(00:27:02)
real world to quickly like pick up a new
(00:27:04)
skill like a person does seems very out
(00:27:06)
of reach. And here you could say, oh
(00:27:08)
yeah, like locomotion all our ancestors
(00:27:12)
needed great locomotion squirrels like
(00:27:16)
so locomotion maybe like we've got like
(00:27:18)
some unbelievable prior. You could make
(00:27:20)
the same case for vision. You know, I I
(00:27:22)
believe Yan Lakhan made the point, oh,
(00:27:24)
like um children learn to drive after 16
(00:27:28)
hour after like 10 hours of practice,
(00:27:30)
which is true, but our vision is so
(00:27:33)
good. At least for me, when I remember
(00:27:36)
myself being 5 years old, my I was I was
(00:27:38)
very excited about cars back then, and
(00:27:41)
I'm pretty sure my car recognition was
(00:27:44)
more than adequate for self-driving
(00:27:45)
already. As a 5-year-old, you don't get
(00:27:48)
to see that much data as a 5-year-old.
(00:27:49)
You spend most of your time in your
(00:27:50)
parents house, so you have very low data
(00:27:52)
diversity. But you could say maybe
(00:27:54)
that's evolution, too. But then language
(00:27:56)
and math and coding, probably not.
(00:28:00)
>> It still seems better than models. I
(00:28:03)
mean, obviously models are better than
(00:28:04)
the average human at language and math
(00:28:06)
and coding, but are they better at the
(00:28:08)
average human at learning?
(00:28:10)
>> Oh, yeah. Oh, yeah. Absolutely. What I
(00:28:12)
meant to say is that language math and
(00:28:14)
coding and especially math and coding
(00:28:17)
suggests that whatever it is that makes
(00:28:19)
people
(00:28:21)
good at learning
(00:28:23)
is probably not so much a complicated
(00:28:26)
prior but something more some
(00:28:28)
fundamental thing.
(00:28:29)
>> Wait, I'm not sure understood. Why
(00:28:31)
should that be the case? So consider a
(00:28:33)
skill
(00:28:35)
that people exhibit some kind of great
(00:28:37)
reliability or you know um
(00:28:42)
>> if the skill is one that was very useful
(00:28:45)
to our ancestors for many millions of
(00:28:47)
years, hundreds of millions of years,
(00:28:49)
you could say you could argue that maybe
(00:28:52)
humans are good at it because of
(00:28:56)
evolution because we have a prior
(00:28:59)
>> an evolutionary prior that's encoded in
(00:29:02)
some very nonobvious way.
(00:29:04)
>> Yeah.
(00:29:05)
>> That somehow makes us so good at it.
(00:29:06)
>> Yeah.
(00:29:07)
>> But if people exhibit great ability,
(00:29:11)
reliability, robustness, ability to
(00:29:14)
learn in a domain that really did not
(00:29:17)
exist until recently,
(00:29:20)
then this is more an indication that
(00:29:23)
people might have
(00:29:26)
just better machine learning period.
(00:29:28)
>> Mhm. But then how should we think about
(00:29:31)
what that is? Is it a matter of
(00:29:34)
Yeah. What is the ML analogy for what?
(00:29:38)
There's a couple interesting things
(00:29:39)
about it. It takes fewer samples. It's
(00:29:41)
more unsupervised. You don't have to set
(00:29:43)
a ver like a child learning to drive a
(00:29:45)
car. Children are not learning to drive
(00:29:47)
a car. A teenager learning how to drive
(00:29:48)
a car is like not exactly getting
(00:29:53)
some pre-built verifiable reward.
(00:29:56)
it comes from their interaction with the
(00:29:59)
machine and the with the environment. Um
(00:30:02)
and yet it takes much fewer samples. It
(00:30:04)
seems more unsupervised. It seems more
(00:30:06)
robust. Much more robust. The robustness
(00:30:09)
of people is really staggering.
(00:30:12)
>> Yeah. So like Okay. And do you have a
(00:30:15)
unified way of thinking about why are
(00:30:16)
all these things happening at once? What
(00:30:18)
is the ML analogy that would that could
(00:30:22)
be could realize something like this?
(00:30:24)
So, so, so, um, this is where, you know,
(00:30:26)
one of the things that you've been
(00:30:27)
asking about is how can, you know, the
(00:30:30)
teenage driver kind of self-correct and
(00:30:33)
learn from their experience without an
(00:30:36)
external teacher.
(00:30:37)
>> And the answer is well, they have their
(00:30:40)
value function,
(00:30:41)
>> right? They have a general sense which
(00:30:44)
is also by the way extremely robust in
(00:30:46)
people like
(00:30:48)
whatever it is the human value function
(00:30:50)
whatever the human value function is
(00:30:53)
with a few exceptions around addiction
(00:30:56)
it's actually very very robust
(00:30:59)
and so for something like a teenager
(00:31:01)
that's learning to drive they start to
(00:31:03)
drive and they already have a sense of
(00:31:06)
how they're driving immediately how
(00:31:09)
badly they're unconfident and then they
(00:31:11)
See? Okay. And they and then of course
(00:31:13)
the the learning speed of any teenager
(00:31:15)
is so fast after 10 hours you're good to
(00:31:17)
go.
(00:31:17)
>> Yeah. It seems like humans has some
(00:31:19)
solution, but I'm curious about like
(00:31:20)
well how are they doing it and like why
(00:31:22)
is it so hard to like how do we need to
(00:31:24)
reconceptualize the way we're training
(00:31:25)
models to make something like this
(00:31:27)
possible?
(00:31:28)
>> You know that is a great question to ask
(00:31:31)
and it's a question I have a lot of
(00:31:35)
opinions about.
(00:31:37)
But unfortunately
(00:31:39)
we live in a world where not not all
(00:31:42)
machine learning ideas are discussed
(00:31:43)
freely and this is this is one of them.
(00:31:45)
So there's probably a way to do it.
(00:31:49)
I think it can be done. The fact that
(00:31:52)
people are like that I think it's a
(00:31:55)
proof that it can be done. There may be
(00:31:57)
another blocker though which is there is
(00:31:59)
a possibility
(00:32:02)
that the human neurons actually do more
(00:32:05)
compute than we think. And if that is
(00:32:08)
true and if that plays an important role
(00:32:11)
then things might be more difficult. But
(00:32:14)
regardless I do think it points to the
(00:32:16)
existence of some
(00:32:19)
machine learning principle
(00:32:22)
that I have opinions on. But
(00:32:24)
unfortunately, circumstances make it
(00:32:26)
hard to to discuss in detail. Even
(00:32:28)
though
(00:32:28)
>> nobody nobody listens to this podcast,
(00:32:30)
Ilia.
(00:32:32)
>> Yeah.
(00:32:32)
>> So, I have to say that prepping for Ilia
(00:32:35)
was pretty tough because neither I nor
(00:32:37)
anybody else had any idea what he's
(00:32:39)
working on and what SSI is trying to do.
(00:32:42)
I had no basis to come up with my
(00:32:44)
questions and the only thing I could go
(00:32:46)
off honestly was trying to think from
(00:32:48)
first principles about what are the
(00:32:50)
bottlenecks to hi because clearly Ilia
(00:32:52)
is working on them in some way. Part of
(00:32:55)
this question involved thinking about RL
(00:32:56)
scaling because everybody's asking how
(00:32:58)
well RL will generalize and how we can
(00:33:00)
make it generalize better. As part of
(00:33:02)
this I was reading this paper that came
(00:33:03)
out recently on RL scaling and it showed
(00:33:06)
that actually the learning curve on RL
(00:33:08)
looks like a sigmoid. I found this very
(00:33:10)
curious. Why should it be a sigmoid
(00:33:12)
where it learns very little for a long
(00:33:14)
time and then it quickly learns a lot
(00:33:16)
and then it asmmptotes. This is very
(00:33:18)
different from the power law you see in
(00:33:20)
pre-training where the model learns a
(00:33:22)
bunch at the very beginning and then
(00:33:23)
less and less over time. And it actually
(00:33:25)
reminded me of a note that I had written
(00:33:27)
down after I had a conversation with a
(00:33:29)
researcher friend where he pointed out
(00:33:31)
that the number of samples that you need
(00:33:33)
to take in order to find a correct
(00:33:35)
answer scales exponentially with how
(00:33:38)
different your current probability
(00:33:39)
distribution is from the target
(00:33:40)
probability distribution. And I was
(00:33:42)
thinking about how these two ideas are
(00:33:43)
related. I had this vague idea that they
(00:33:45)
should be connected, but I really didn't
(00:33:47)
know how. I don't have a math
(00:33:48)
background, so I couldn't really
(00:33:49)
formalize it. But I wondered if Gemini 3
(00:33:52)
could help me out here. And so I took a
(00:33:54)
picture of my notebook and I took the
(00:33:56)
paper and I put them both in the context
(00:33:57)
of Gemini 3 and I asked it to find the
(00:34:00)
connection. And it thought a bunch and
(00:34:02)
then it realized that the correct way to
(00:34:05)
model the information you gain from a
(00:34:07)
single yes or no outcome in RL is as the
(00:34:10)
entropy of a random binary variable. It
(00:34:12)
made a graph which showed how the bits
(00:34:15)
you gain for a sample in RL versus
(00:34:17)
supervised learning scale as a pass rate
(00:34:20)
increases. And as soon as I saw the
(00:34:21)
graph that Gemini 3 made, immediately a
(00:34:24)
ton of things started making sense to
(00:34:25)
me. Then I wanted to see if there was
(00:34:27)
any empirical basis to this theory. So I
(00:34:29)
asked Gemini to code my experiment to
(00:34:32)
show whether the improvement in loss
(00:34:35)
scales in this way with pass rate. I
(00:34:37)
just took the code that Gemini outputed.
(00:34:39)
I copy pasted it into a Google Collab
(00:34:41)
notebook and I was able to run this toy
(00:34:43)
ML experiment and visualize its results
(00:34:45)
without a single bug. It's interesting
(00:34:47)
because the results look similar but not
(00:34:50)
identical to what we should have
(00:34:51)
expected. And so I downloaded this chart
(00:34:53)
and I put it into Gemini and I asked it
(00:34:55)
what is going on here. I came up with a
(00:34:56)
hypothesis that I think is actually
(00:34:57)
correct which is that we're capping how
(00:35:00)
much supervised learning can improve in
(00:35:02)
the beginning by having a fixed learning
(00:35:04)
rate and in fact we should decrease the
(00:35:06)
learning rate over time. It actually
(00:35:07)
gives us an intuitive understanding for
(00:35:09)
why in practice we have learning rate
(00:35:11)
schedulers that decrease the learning
(00:35:13)
rate over time. I did this entire flow
(00:35:15)
from coming up with this vague initial
(00:35:18)
question to building a theoretical
(00:35:20)
understanding to running some toy ML
(00:35:22)
experiments all with Gemini 3. This
(00:35:24)
feels like the first model where it can
(00:35:27)
actually come up with new connections
(00:35:28)
that I wouldn't have anticipated. It's
(00:35:30)
actually now become the default place I
(00:35:32)
go to when I want to brainstorm new ways
(00:35:35)
to think about a problem. If you want to
(00:35:36)
read more about RL scaling, you can
(00:35:38)
check out the blog post that I wrote
(00:35:39)
with a little help from Gemini 3. And if
(00:35:41)
you want to check out Gemini 3 yourself,
(00:35:43)
go to gemini.google.
(00:35:46)
I am curious if you say we are back in
(00:35:47)
an era of research.
(00:35:50)
You were there from 2012 to 2020
(00:35:53)
and do do you have Yeah. What what is
(00:35:57)
now the vibe going to be if we go back
(00:35:59)
to the era of research? For example,
(00:36:02)
even after Alexet, the amount of compute
(00:36:05)
that was used to run experiments kept
(00:36:07)
increasing and the size of frontier
(00:36:10)
systems kept increasing.
(00:36:12)
And do you think now that this era of
(00:36:16)
research will still require tremendous
(00:36:17)
amounts of compute? Um, do you think it
(00:36:20)
will require going back into the
(00:36:22)
archives and reading old papers? What is
(00:36:26)
maybe what was the vibe of like you were
(00:36:28)
at Google and um OpenAI and Stanford
(00:36:32)
these places when there was like a more
(00:36:33)
of a vibe of research. What what kind of
(00:36:36)
thing should we be expecting in the
(00:36:37)
community?
(00:36:39)
>> So one consequence of um the age of
(00:36:43)
scaling is that there was this
(00:36:47)
um scaling sucked out all the air in the
(00:36:49)
room.
(00:36:50)
>> Yeah.
(00:36:51)
And so
(00:36:53)
because scaling sucked out all the air
(00:36:55)
in the room,
(00:36:57)
everyone started to do the same thing.
(00:37:00)
We got to the point where
(00:37:03)
uh we are in a world where there are
(00:37:06)
more companies than ideas by quite a
(00:37:08)
bit.
(00:37:09)
>> Actually on that you know there is this
(00:37:11)
Silicon Valley saying that says that
(00:37:14)
ideas are cheap, execution is everything
(00:37:18)
and people say that a lot.
(00:37:20)
>> Yeah. And there is truth to that. But
(00:37:22)
then I saw I saw someone say on Twitter
(00:37:25)
um something like if ideas are are so
(00:37:28)
cheap, how come no one's having any
(00:37:30)
ideas?
(00:37:31)
>> And I think it's true too. I think like
(00:37:34)
if you think about um research progress
(00:37:37)
in terms of bottlenecks,
(00:37:40)
there are several bottlenecks. If you go
(00:37:43)
back to the if if you and um one of them
(00:37:45)
is ideas and one of them is your ability
(00:37:47)
to bring them to life.
(00:37:48)
>> Yeah. which might be compute but also
(00:37:50)
engineering.
(00:37:52)
So if you go back to the '9s let's say
(00:37:54)
you had people who had had pretty good
(00:37:56)
ideas and if they had much larger
(00:37:58)
computers maybe they could demonstrate
(00:38:00)
that their ideas were viable but they
(00:38:02)
could not. So they could only have very
(00:38:04)
very small demonstration that did not
(00:38:06)
convince anyone.
(00:38:06)
>> Yeah.
(00:38:08)
>> So the bottleneck was compute. Then in
(00:38:10)
the age of scaling computers increased a
(00:38:13)
lot and of course there is a question of
(00:38:17)
how much comput is needed but compute is
(00:38:20)
large so compute is large enough such
(00:38:24)
that
(00:38:26)
it's like not obvious that you need that
(00:38:28)
much more compute to prove some idea
(00:38:33)
like I'll give you an analogy. Alexet
(00:38:36)
was built on two GPUs. That was the
(00:38:39)
total amount of compute used for it. The
(00:38:41)
transformer
(00:38:43)
was built on 8 to 64 GPUs. No single
(00:38:47)
transformer paper experiment used more
(00:38:49)
than 64 GPUs of 2017 which would be like
(00:38:52)
what two GPUs of today. So the ResNet
(00:38:57)
right many like even even the the um you
(00:39:01)
could argue that the like 01
(00:39:04)
reasoning was not the most comput heavy
(00:39:07)
thing in the world. So there definitely
(00:39:10)
for for research
(00:39:13)
you need like definitely some amount of
(00:39:16)
compute but it's far from obvious that
(00:39:18)
you need the absolutely largest amount
(00:39:20)
of compute ever for research. H
(00:39:22)
>> you might argue and I think it is true
(00:39:25)
that if you want to build the absolutely
(00:39:27)
best system, if you want to build the
(00:39:30)
absolutely best system, then it helps to
(00:39:32)
have much more compute and especially if
(00:39:34)
everyone is within the same paradigm,
(00:39:37)
then compute becomes one of the big
(00:39:40)
differentiators.
(00:39:42)
>> Yeah, I guess while it was possible to
(00:39:45)
develop these ideas, I'm asking you for
(00:39:47)
the history because you were actually
(00:39:48)
there. I'm not sure what actually
(00:39:49)
happened, but it sounds like it was
(00:39:50)
possible to develop these ideas using
(00:39:53)
minimal amounts of compute, but it
(00:39:55)
wasn't the transformer didn't
(00:39:56)
immediately become famous. It became the
(00:39:59)
thing everybody started doing and then
(00:40:00)
started experimenting on top of and
(00:40:02)
building on top of because it was
(00:40:04)
validated at higher and higher levels of
(00:40:06)
compute.
(00:40:06)
>> Correct. And if you at SSI have 50
(00:40:10)
different ideas, how will you know which
(00:40:12)
one is the next transformer and which
(00:40:15)
one is you know brittle without having
(00:40:20)
the kinds of compute that other frontier
(00:40:21)
labs have. So I can I can comment on
(00:40:24)
that which is
(00:40:26)
the short comment is that you know you
(00:40:29)
mentioned SSI specifically for us
(00:40:34)
the amount of compute that SSI has for
(00:40:37)
research is really not that small and I
(00:40:41)
want to explain why like a simple math
(00:40:44)
can explain why the amount of compute
(00:40:45)
that we have is actually a lot more
(00:40:47)
comparable for research than one might
(00:40:50)
think. And I'll explain. So
(00:40:56)
SSI has raised $3 billion which is like
(00:41:01)
not small by it's like a lot by any
(00:41:04)
absolute sense but you could say but
(00:41:05)
look at the other companies raising
(00:41:07)
>> much more
(00:41:09)
but a lot of what their a lot of their
(00:41:11)
compute goes for inference
(00:41:14)
like these big numbers these big loans
(00:41:16)
it's earmarked for inference. That's
(00:41:19)
number one. Number two, you need if you
(00:41:22)
want to have a product on which you do
(00:41:24)
inference, you need to have a big staff
(00:41:26)
of engineers of salespeople. A lot of
(00:41:29)
the research needs to be dedicated for
(00:41:31)
producing all kinds of product related
(00:41:34)
features. So then when you look at
(00:41:36)
what's actually left for research, the
(00:41:39)
difference becomes a lot smaller.
(00:41:42)
Now the other thing is is that if you
(00:41:45)
are doing something different do you
(00:41:47)
really need the absolute maximal scale
(00:41:50)
to prove it? I don't think it's true at
(00:41:52)
all. I think that in our case we have
(00:41:57)
sufficient compute to prove to convince
(00:42:00)
ourselves and anyone else that what
(00:42:01)
we're doing is correct.
(00:42:03)
There's been public estimates that you
(00:42:05)
know companies like OpenAI spend on the
(00:42:06)
order of56
(00:42:08)
billion dollars a year even just so far
(00:42:10)
on experiments.
(00:42:12)
>> This is separate from the amount of
(00:42:14)
money they're spending on inference and
(00:42:16)
so forth. So seems like they're spending
(00:42:18)
more a year running exper like research
(00:42:20)
experiments than you guys have in total
(00:42:22)
funding.
(00:42:23)
>> I think it's a question of what you do
(00:42:24)
with it. It's a question of what you do
(00:42:26)
with it. like they have a like the more
(00:42:29)
I think in in in their case in the case
(00:42:31)
of others I think there's a lot more
(00:42:33)
demand on the training compute there's a
(00:42:36)
lot more different work streams there is
(00:42:38)
there are different modalities there is
(00:42:41)
just more stuff and so it becomes
(00:42:43)
fragmented
(00:42:44)
>> how will SSI make money
(00:42:46)
>> you know
(00:42:48)
my answer to this question is something
(00:42:50)
like
(00:42:53)
we just f right now we just focus on the
(00:42:55)
research and then the answer to that
(00:42:57)
will reveal itself. I think there will
(00:42:59)
be lots of possible answers.
(00:43:01)
>> Is SSI's plan still to straightshot
(00:43:03)
super intelligence?
(00:43:05)
>> Maybe.
(00:43:07)
I think that there is merit to it.
(00:43:09)
>> I think there's a lot of merit because I
(00:43:11)
think that it's very nice to not be
(00:43:13)
affected by the day-to-day market
(00:43:16)
competition.
(00:43:18)
But I think there are two reasons that
(00:43:23)
may cause us to change the plan. one is
(00:43:26)
pragmatic if timelines turn out to be
(00:43:28)
long
(00:43:30)
which they might and second I think
(00:43:32)
there is a lot of value in the best and
(00:43:38)
most powerful AI being out there
(00:43:41)
impacting the world
(00:43:44)
>> I think this is a meaningfully valuable
(00:43:46)
thing
(00:43:46)
>> but then so why is your default plan to
(00:43:48)
straight shot super intelligence because
(00:43:50)
it sounds like you know openai anthropic
(00:43:53)
all these other companies their explicit
(00:43:55)
thinking is look we have weaker and
(00:43:56)
weaker intelligences that the public can
(00:43:58)
get used to and prepare for and why is
(00:44:01)
it potentially better to build a super
(00:44:05)
intelligence directly
(00:44:06)
>> so I'll make the case for and against
(00:44:09)
>> the case for is that you are so one of
(00:44:12)
the challenges
(00:44:14)
that people face when they're in the
(00:44:16)
market is that they have to participate
(00:44:18)
in the rat race and the rat race is
(00:44:21)
quite difficult in that it exposes you
(00:44:23)
to to to difficult trade-offs which you
(00:44:26)
need to make
(00:44:28)
and there is it is it is nice to say
(00:44:31)
we'll insulate ourselves from all this
(00:44:33)
and just focus on the research and come
(00:44:35)
out only when we are ready and not
(00:44:37)
before but the counterpoint is valid too
(00:44:41)
and those those are opposing forces the
(00:44:44)
counterpoint is hey it is useful for the
(00:44:48)
world to see powerful AI it is useful
(00:44:52)
for the world to see powerful AI because
(00:44:53)
that's the only way you and communicate
(00:44:55)
it.
(00:44:55)
>> Well, I guess not even just that you can
(00:44:57)
communicate the idea, but
(00:44:58)
>> communicate the AI, not the idea.
(00:45:01)
Communicate the AI.
(00:45:03)
>> What do you mean communicate the AI?
(00:45:04)
>> So, okay. So, let's suppose you read an
(00:45:06)
essay about AI
(00:45:07)
>> and the essay says AI is going to be
(00:45:09)
this and AI is going to be that and it's
(00:45:11)
going to be this
(00:45:12)
>> and you read it and you say, okay, this
(00:45:14)
is an interesting essay
(00:45:15)
>> right
(00:45:16)
>> now. Suppose you see an AI doing this
(00:45:18)
and AI doing that.
(00:45:20)
>> It is incomparable. Like basically I
(00:45:23)
think I think that there is a big
(00:45:26)
benefit from AI being in the public and
(00:45:31)
that would be a reason for us to not be
(00:45:34)
quite straight shot.
(00:45:36)
>> Yeah. Well, I guess it's not even that
(00:45:38)
which I but I do think that is an
(00:45:39)
important part of it. The other big
(00:45:41)
thing is I can't think of another
(00:45:44)
discipline in human engineering and
(00:45:45)
research where
(00:45:47)
the end artifact was made safer
(00:45:52)
mostly through just thinking about how
(00:45:53)
to make it safe as opposed to why are
(00:45:56)
airplane crashes per mile so much lower
(00:45:58)
today than they were decades ago? Why is
(00:46:00)
it so much harder to find a bug in Linux
(00:46:03)
than it would have been decades ago? And
(00:46:05)
I think it's mostly because these
(00:46:06)
systems were deployed to the world. you
(00:46:09)
noticed failures, those failures were
(00:46:11)
corrected and the systems became more
(00:46:13)
robust. Now, I'm not sure why AGI and
(00:46:16)
superhuman intelligence would be any
(00:46:18)
different, especially given, and I hope
(00:46:20)
we can talk, we're going to get to this.
(00:46:23)
It seems like the harms of super
(00:46:25)
intelligence are not just about like
(00:46:27)
having some malevolent uh paper clipper
(00:46:30)
out there, but it just like this is a
(00:46:31)
really powerful thing and we don't even
(00:46:33)
know how to conceptualize how people
(00:46:34)
interact with it, what people will do
(00:46:35)
with it and having gradual access to it
(00:46:38)
seems like a um better way to maybe
(00:46:42)
spread out the impact of it and to help
(00:46:44)
people prepare for it. Well, I think I
(00:46:46)
think on this point even in the straight
(00:46:49)
shot scenario, you would still do a
(00:46:52)
gradual release of it is how I would
(00:46:55)
imagine it.
(00:46:57)
The the gra gradualism would be an
(00:47:00)
inherent inherent component of any plan.
(00:47:03)
It's just a question of what is the
(00:47:04)
first thing that you get out of the
(00:47:06)
door. That's number one. Number two, I
(00:47:08)
also think you know I believe you have
(00:47:11)
advocated for continual learning more
(00:47:13)
than other people
(00:47:14)
>> and I actually think that this is an
(00:47:16)
important and correct thing and here is
(00:47:20)
why
(00:47:22)
so one of the things so I'll give you
(00:47:24)
another example of how thinking how
(00:47:27)
language affects thinking and in this
(00:47:30)
case it will be two words two words that
(00:47:33)
have shaped everyone's thinking I
(00:47:36)
maintain
(00:47:37)
F first word AGI
(00:47:40)
second word pre-training let me explain.
(00:47:44)
So the word the term AGI,
(00:47:48)
why does this term exist? It's a very
(00:47:50)
particular term. Why does it exist?
(00:47:53)
There's a reason.
(00:47:55)
The reason that the term AGI exists is
(00:47:58)
in my opinion not so much because it's
(00:48:00)
like a very important essential
(00:48:03)
descriptor of of of some end state of
(00:48:05)
intelligence but
(00:48:10)
because it is a reaction to a different
(00:48:14)
term that existed and the term is narrow
(00:48:16)
AI. If you go back to ancient history of
(00:48:20)
gameplay AI, of checkers AI, chess AI,
(00:48:23)
computer games AI, everyone would say,
(00:48:26)
look at this narrow intelligence. Sure,
(00:48:28)
the chess AI can beat Casper off, but it
(00:48:30)
can't do anything else. It is so narrow,
(00:48:32)
artificial narrow intelligence. So in
(00:48:35)
response, as a reaction to this, some
(00:48:38)
people said, well, this is not good. It
(00:48:41)
is so narrow. What we need is general
(00:48:44)
AI.
(00:48:46)
general AI, an AI that can just do all
(00:48:48)
the things.
(00:48:51)
The second and and that term just got a
(00:48:55)
lot of traction.
(00:48:56)
>> Yeah.
(00:48:57)
>> The second thing that got a lot of
(00:48:58)
traction is pre-training.
(00:49:01)
Specifically, the recipe of
(00:49:03)
pre-training. I think the current the
(00:49:05)
way people do RL now is maybe um un is
(00:49:09)
undoing the conceptual imprint of
(00:49:12)
pre-training. But pre-training had the
(00:49:14)
property. you do more pre-training and
(00:49:17)
the model gets better at everything more
(00:49:19)
or less uniformly. Yeah,
(00:49:21)
>> general AI pre-training gives AGI
(00:49:27)
but
(00:49:29)
the thing that happened with AGI and
(00:49:32)
pre-training is that in some sense they
(00:49:34)
overshock the target
(00:49:36)
because by the kind if you think about
(00:49:39)
the term AGI you will realize and
(00:49:42)
especially in the context of
(00:49:43)
pre-training you will realize that a
(00:49:44)
human being is not an AGI
(00:49:48)
because a human being Yes, there is
(00:49:51)
definitely a foundation of skills.
(00:49:53)
A human being,
(00:49:56)
a human being lacks a huge amount of
(00:49:59)
knowledge. Instead, we rely on continual
(00:50:02)
learning. We rely on continual learning.
(00:50:05)
And so then when you think about okay,
(00:50:07)
so let's suppose that we achieve success
(00:50:09)
and we produce a safe super some kind of
(00:50:11)
safe super intelligence. The question is
(00:50:13)
but how do you define it? Where on the
(00:50:15)
curve of continual learning is it going
(00:50:17)
to be? I produce like um a super
(00:50:20)
intelligent 15 year old that's very
(00:50:22)
eager to go and you say okay I'm going
(00:50:23)
to they don't know very much at all the
(00:50:26)
great student very eager you go and be a
(00:50:29)
programmer you go and be a doctor
(00:50:32)
go and learn so you could imagine that
(00:50:34)
the deployment itself will involve some
(00:50:36)
kind of a learning trial and error
(00:50:38)
period
(00:50:39)
>> it's a process as opposed to you drop
(00:50:42)
the finished thing
(00:50:44)
>> okay I I I I see so you're you're
(00:50:46)
suggesting
(00:50:47)
that the thing you're pointing out with
(00:50:49)
super intelligence
(00:50:51)
is not some finished
(00:50:56)
mind which knows how to do every single
(00:50:58)
job in the economy cuz the way say the
(00:51:01)
original I think openi charter or
(00:51:03)
whatever defines AGI is like it can do
(00:51:05)
every single job that a every single
(00:51:07)
thing a human can do. You're proposing
(00:51:09)
instead a mind which can learn to do any
(00:51:13)
single every single job.
(00:51:14)
>> Yes.
(00:51:14)
>> And that is super intelligence. And then
(00:51:16)
but once you have the learning
(00:51:18)
algorithm,
(00:51:20)
it gets deployed into the world the same
(00:51:22)
way a human laborer might join an
(00:51:24)
organization.
(00:51:25)
>> And it seems like one of these two
(00:51:27)
things might happen. Maybe neither of
(00:51:29)
these happens. One, this super efficient
(00:51:33)
learning algorithm
(00:51:35)
becomes superhuman becomes as good as
(00:51:38)
you and potentially even better at the
(00:51:41)
task of ML research. And as a result the
(00:51:46)
algorithm itself becomes more and more
(00:51:47)
superhuman. The other is even if that
(00:51:49)
doesn't happen if you have a single
(00:51:52)
model I mean this this is explicitly
(00:51:53)
your vision. If you have a single model
(00:51:55)
or instances of a model which are
(00:51:57)
deployed through the economy doing
(00:51:59)
different jobs learning how to do those
(00:52:00)
jobs continually learning on the job
(00:52:04)
picking up all the skills that any human
(00:52:05)
could pick up but actually picking them
(00:52:06)
all up at the same time and then
(00:52:08)
amalgamating the learnings.
(00:52:10)
you basically have a model which
(00:52:12)
functionally becomes super intelligent
(00:52:14)
even without any sort of recursive
(00:52:16)
self-improvement in software right
(00:52:19)
because you now have one model that can
(00:52:20)
do every single job in the economy and
(00:52:22)
humans can't merge our minds in the same
(00:52:24)
way and so do you expect some sort of
(00:52:26)
like intelligence explosion from broad
(00:52:28)
deployment
(00:52:29)
>> I think that it is likely that we will
(00:52:33)
have rapid economic growth
(00:52:37)
I think the broad deployment And
(00:52:41)
like there are two arguments you could
(00:52:44)
make which are conflicting.
(00:52:47)
One is that look if indeed you get once
(00:52:50)
indeed you get to a point where you have
(00:52:53)
an AI that can learn to do
(00:52:57)
things quickly
(00:52:59)
and you have many of them then they will
(00:53:02)
then there will be a strong force to
(00:53:05)
deploy them in the economy. Unless there
(00:53:08)
will be some kind of a regulation that
(00:53:10)
stops it, which by the way there might
(00:53:12)
be. But I think the idea of very rapid
(00:53:17)
economic growth for some time, I think
(00:53:19)
it's very possible from broad
(00:53:21)
deployment. The other question is how
(00:53:23)
rapid it's going to be.
(00:53:25)
So I think this is hard to know because
(00:53:27)
on the one hand you have this very
(00:53:29)
efficient worker. on the other hand
(00:53:31)
there is the world is just really big
(00:53:34)
and there's a lot of stuff
(00:53:36)
and that stuff moves at a different
(00:53:38)
speed but then on the other hand now the
(00:53:40)
AI could you know so I think very rapid
(00:53:43)
economic growth is possible and we will
(00:53:45)
see like all kinds of things like
(00:53:48)
different countries with different rules
(00:53:49)
and the ones which have the friendlier
(00:53:51)
rules the economic growth will be faster
(00:53:53)
hard to predict
(00:53:54)
>> some people in our audience like to read
(00:53:56)
the transcripts instead of listening to
(00:53:58)
the episode and so we put a ton of
(00:54:00)
effort into making the transcripts read
(00:54:02)
like they are standalone essays. The
(00:54:04)
problem is that if you just transcribe a
(00:54:07)
conversation verbatim using a speech to
(00:54:09)
text model, it'll be full of all kinds
(00:54:11)
of fits and starts and confusing
(00:54:13)
phrasing. We mentioned this problem to
(00:54:15)
Labelbox and they asked if they could
(00:54:16)
take a stab. Working with them on this
(00:54:18)
is probably the reason that I'm most
(00:54:20)
excited to recommend Labelbox to people.
(00:54:22)
It wasn't just, oh, hey, tell us what
(00:54:24)
kind of data you need and we'll go get
(00:54:25)
it. They walked us through the entire
(00:54:27)
process from helping us identify what
(00:54:29)
kind of data we needed in the first
(00:54:30)
place to assembling a team of expert
(00:54:33)
aligners to generate it. Even after we
(00:54:35)
got all the data back, Labelbox stayed
(00:54:37)
involved. They helped us choose the
(00:54:40)
right base model and set up auto QA on
(00:54:42)
the model's output so that we could
(00:54:44)
tweak and refine it. And now we have a
(00:54:46)
new transcriber tool that we can use for
(00:54:48)
all our episodes moving forward. This is
(00:54:50)
just one example of how Labelbox meets
(00:54:53)
their customers at the ideas level and
(00:54:55)
partners with them through their entire
(00:54:56)
journey. If you want to learn more or if
(00:54:58)
you want to try out the transcriber tool
(00:55:00)
yourself, go to labelbox.com/barcash.
(00:55:08)
It seems to me that this is a very
(00:55:10)
precarious situation to be in where
(00:55:13)
looking the limit we know that this
(00:55:15)
should be possible because if you have
(00:55:17)
something that is as good as a human at
(00:55:19)
learning but which can merge its brains
(00:55:22)
merge there are different instances in a
(00:55:24)
way that humans can't merge already.
(00:55:26)
This seems like a thing that should
(00:55:28)
physically be possible. Humans are
(00:55:29)
possible, digital computers are
(00:55:30)
possible. You just need both of those
(00:55:32)
combined to produce this thing. And it
(00:55:34)
also seems like this kind of thing is
(00:55:36)
extremely um powerful
(00:55:40)
and economic growth is one way to put
(00:55:43)
it. Um I mean Dyson spear is a lot of
(00:55:45)
economic growth but another way to put
(00:55:46)
it is just like you will have
(00:55:49)
potentially a very short period of time
(00:55:50)
because a human on the job can you know
(00:55:52)
you you're hiring people at SSI in six
(00:55:54)
months they're like net productive
(00:55:55)
probably right um a human like learns
(00:55:57)
really fast and so this thing is
(00:55:58)
becoming smarter and smarter very fast
(00:56:01)
what is how do you think about making
(00:56:02)
that go well and why is SSI positioned
(00:56:05)
to do that well or what is SSI's plan
(00:56:07)
there basically is what I'm trying to
(00:56:08)
ask
(00:56:08)
>> yeah
(00:56:10)
so one of the one of the ways in which
(00:56:14)
my thinking has been changing is that
(00:56:19)
I now place more importance on AI
(00:56:25)
being
(00:56:26)
deployed
(00:56:28)
incrementally and in advance. One very
(00:56:32)
difficult thing about AI is that we are
(00:56:35)
talking about systems that don't yet
(00:56:39)
exist
(00:56:41)
and it's hard to imagine them.
(00:56:44)
I think that one of the things that's
(00:56:45)
happening is that in practice it's very
(00:56:50)
hard to feel the AGI.
(00:56:52)
It's very hard to feel the AGI.
(00:56:55)
We can talk about it, but it's like it's
(00:56:58)
like talking about like the long f like
(00:57:01)
imagine like having a conversation about
(00:57:02)
like how is it like to be old when
(00:57:05)
you're like old and and frail and you
(00:57:08)
can have a conversation. You can try to
(00:57:09)
imagine it, but it's just hard and you
(00:57:12)
come back to reality. Well, that's not
(00:57:14)
the case. And I think that a lot of the
(00:57:19)
issues around
(00:57:21)
AGI and its future power stem from the
(00:57:26)
fact that it's very difficult to imagine
(00:57:31)
future AI is going to be diff different.
(00:57:34)
It's going to be powerful. Indeed, the
(00:57:36)
whole problem, what is the problem of AI
(00:57:39)
and AGI? The whole problem is the power.
(00:57:43)
The whole problem is the power.
(00:57:47)
When the power is really big, what's
(00:57:49)
going to happen?
(00:57:51)
And one of the one of the ways in which
(00:57:53)
I've changed my mind over the past year
(00:57:55)
and so that that change of mind may back
(00:58:00)
may I'll say I I'll I'll hedge a little
(00:58:02)
bit may back propagate into into the
(00:58:04)
plans of our of our company is that
(00:58:09)
so if it's hard to imagine
(00:58:12)
what do you do you got to be showing the
(00:58:14)
thing you got to be showing the thing
(00:58:16)
and I maintain that I think I think most
(00:58:19)
people who work Con AI also can't
(00:58:22)
imagine it because it's too different
(00:58:24)
from what people see on a day-to-day
(00:58:26)
basis.
(00:58:29)
I do maintain here is something which I
(00:58:32)
predict will happen. That's a
(00:58:33)
prediction.
(00:58:35)
I maintain
(00:58:37)
that as AI becomes more powerful
(00:58:41)
then people will change their behaviors
(00:58:46)
and we will see all kinds of
(00:58:47)
unprecedented things which are not
(00:58:50)
happening right now and I'll give some
(00:58:52)
examples. I do like I I think I think
(00:58:56)
for better or worse the the frontier
(00:58:58)
companies will play a very important
(00:59:00)
role in what happens as will the
(00:59:02)
government and the kind of things that I
(00:59:04)
think we'll see which you see the
(00:59:07)
beginnings of
(00:59:09)
companies that are fierce competitors
(00:59:12)
starting collaborate to to collaborate
(00:59:14)
on AI safety you may have seen open AI
(00:59:17)
and anthropic event doing a first small
(00:59:21)
step but that did not exist That's
(00:59:23)
actually something which I predicted in
(00:59:25)
one of my talks about three years ago
(00:59:27)
that such a thing will happen. I also
(00:59:30)
maintain that as AI continues to become
(00:59:32)
more powerful, more visibly powerful,
(00:59:36)
there will also be a desire from
(00:59:38)
governments and the public to do
(00:59:40)
something
(00:59:42)
and I think that this is a very
(00:59:44)
important force
(00:59:46)
of showing the AI. That's number one.
(00:59:49)
Number two, okay, so then the AI is
(00:59:51)
being built. what needs to what needs to
(00:59:53)
be done.
(00:59:56)
So one thing that I maintain that will
(00:59:58)
happen is that right now people who are
(01:00:00)
working on AI I maintain that the AI
(01:00:03)
doesn't feel powerful because of its
(01:00:05)
mistakes.
(01:00:07)
I do think that at some point the AI
(01:00:08)
will start to feel powerful actually and
(01:00:11)
I think when that happens we will see a
(01:00:13)
big change in the way
(01:00:16)
all AI companies approach safety.
(01:00:20)
they'll become much more paranoid. I
(01:00:22)
think I I say this as a predict as a as
(01:00:24)
a as a prediction that we will see
(01:00:26)
happen. We'll see if I'm right, but I
(01:00:28)
think this is something that will happen
(01:00:30)
because they will see the AI becoming
(01:00:31)
more powerful. Everything that's
(01:00:33)
happening right now, I maintain is
(01:00:36)
because people look at today's AI and
(01:00:38)
it's hard to imagine the future AI.
(01:00:42)
And there is a third thing which needs
(01:00:44)
to happen. And I think this is this this
(01:00:47)
and I'm talking about it in in broader
(01:00:49)
terms not just from the perspective of
(01:00:51)
SSI
(01:00:53)
because you ask me about our company but
(01:00:55)
the question is okay so then what should
(01:00:56)
what should the companies aspire to
(01:00:58)
build
(01:00:58)
>> what should they aspire to build and
(01:01:00)
there has been one big idea that
(01:01:02)
actually every that um everyone has been
(01:01:05)
locked in locked into which is the the
(01:01:07)
self-improving AI
(01:01:10)
and why why did it happen because there
(01:01:12)
is fewer ideas than companies
(01:01:15)
But I maintain that there is something
(01:01:17)
that's better to build and I think that
(01:01:19)
everyone will actually want that. It's
(01:01:22)
like the AI that's robustly aligned to
(01:01:26)
care about sentient life specifically.
(01:01:30)
I think in particular it will be there's
(01:01:32)
a case to be made that it will be easier
(01:01:35)
to build an AI that cares about sentient
(01:01:37)
life than an AI that cares about human
(01:01:39)
life alone because the AI itself will be
(01:01:42)
sentient.
(01:01:44)
And if you think about things like
(01:01:46)
mirror neurons and human empathy for
(01:01:48)
animals which is you know you might
(01:01:50)
argue it's not big enough but it exists.
(01:01:53)
I think it's an emerging property from
(01:01:55)
the fact that we model others with the
(01:01:58)
same circuit that we used to model
(01:02:00)
ourselves because that's the most
(01:02:02)
efficient thing to do.
(01:02:04)
>> So even if you got an AI to care about
(01:02:06)
sentient beings and it's not actually
(01:02:09)
clear to me that that's what you should
(01:02:10)
try to do if you solved alignment. It
(01:02:12)
would still be the case that most
(01:02:14)
sentient beings will be AIS. There will
(01:02:17)
be trillions eventually quadrillions of
(01:02:19)
AIs. Humans will be a very small
(01:02:21)
fraction of sentient beings.
(01:02:23)
So, it's not clear to me if the goal is
(01:02:26)
some kind of human control over
(01:02:30)
this future civilization
(01:02:32)
that this is the best criterion.
(01:02:35)
>> It's true. I I think that
(01:02:39)
it's possible it's not the best
(01:02:40)
criterion. I'll say two things. I think
(01:02:44)
that thing number one
(01:02:48)
I think that if there so
(01:02:51)
I think that care for sentient life I
(01:02:53)
think there is merit to it. I think it
(01:02:55)
should be considered. I think that it
(01:02:57)
will be helpful if there was some kind
(01:02:59)
of a
(01:03:01)
short list of ideas that then the
(01:03:06)
companies when they are in this
(01:03:07)
situation could use. That's number two.
(01:03:10)
Number three, I think it would be really
(01:03:12)
materially helpful if the power of the
(01:03:16)
most powerful super intelligence was
(01:03:18)
somehow capped
(01:03:20)
because it would address a lot of these
(01:03:22)
concerns.
(01:03:24)
The question of how to do it, I'm not
(01:03:26)
sure, but I think that would be
(01:03:28)
materially helpful when you're talking
(01:03:30)
about really really powerful systems.
(01:03:32)
>> Yeah. Um, before we continue the element
(01:03:35)
discussion, I I want to double click on
(01:03:36)
that. How much room is there at the top?
(01:03:39)
How do you think about super
(01:03:40)
intelligence? Do you think I mean using
(01:03:43)
this learning efficiency idea maybe is
(01:03:45)
just extremely fast at learning new
(01:03:47)
skills or new knowledge and does it just
(01:03:49)
have a bigger pool of strategies? Is
(01:03:51)
there a single cohesive it in the center
(01:03:55)
that's more powerful or bigger? And if
(01:03:58)
so,
(01:04:00)
do you do you imagine that this will be
(01:04:02)
sort of godlike in comparison to the
(01:04:03)
rest of human civilization? or does it
(01:04:04)
just feel like another agent or another
(01:04:07)
cluster of agents?
(01:04:09)
>> So this is an area where different
(01:04:10)
people have different intuitions.
(01:04:12)
>> I think it will be very powerful for
(01:04:14)
sure. I think that what I think is most
(01:04:17)
likely to happen
(01:04:20)
is that there will be multiple
(01:04:22)
such AIS being created roughly at the
(01:04:26)
same time.
(01:04:28)
I think that
(01:04:30)
if the cluster is big enough, like if
(01:04:33)
the cluster is literally continent
(01:04:35)
sized, that thing could be really
(01:04:38)
powerful indeed, right? If you literally
(01:04:40)
have a continentsized cluster, like
(01:04:43)
those those AIs can be very powerful.
(01:04:45)
And I like all I can tell you is that if
(01:04:49)
you're talking about extremely powerful
(01:04:51)
AIs, like truly dramatically powerful,
(01:04:53)
then yeah, it would be nice if they
(01:04:55)
could be restrained in some ways or if
(01:05:00)
there was some kind of an agreement or
(01:05:02)
something
(01:05:04)
because I think that if you are saying
(01:05:05)
hey like if if you really like what what
(01:05:09)
is the the concern of super
(01:05:11)
intelligence? What is one way to explain
(01:05:12)
the concern? If you imagine a system
(01:05:15)
that is sufficiently powerful, like
(01:05:18)
really sufficiently powerful, and you
(01:05:21)
could say, okay, you need to do
(01:05:22)
something sensible like care for
(01:05:24)
sentient life, let's say, in a very
(01:05:26)
single-minded way, we might not like the
(01:05:28)
results. That's really what it is. And
(01:05:30)
so maybe, by the way, the answer is that
(01:05:32)
you do not build a single you do not
(01:05:34)
build an RL agent in the usual sense.
(01:05:37)
And actually, I'll point I'll point
(01:05:38)
several things out. I think human beings
(01:05:41)
are a semi agent. You know, we pursue a
(01:05:44)
reward and then the emotions or whatever
(01:05:47)
make us tire out of the reward. We
(01:05:49)
pursue a different reward.
(01:05:51)
The market is like kind it's like a very
(01:05:54)
shortsighted
(01:05:55)
kind of agent. Evolution is the same.
(01:05:58)
Evolution is very intelligent in some
(01:05:59)
ways but very dumb in other ways. The
(01:06:02)
government has been designed to be a
(01:06:04)
never- ending fight between three parts
(01:06:06)
which has an effect. So I think things
(01:06:09)
like this
(01:06:12)
another thing that makes this discussion
(01:06:13)
difficult is that we are talking about
(01:06:15)
systems that don't exist that we don't
(01:06:17)
know how to build
(01:06:19)
right that's the other thing and that's
(01:06:21)
actually my belief I think what people
(01:06:22)
are doing right now will go some
(01:06:24)
distance and then peter out it will
(01:06:27)
continue to improve but it will also not
(01:06:29)
be it so the it we don't know how to
(01:06:32)
build and I think that a lot h a lot
(01:06:35)
hinges on
(01:06:37)
understanding and in reliable
(01:06:39)
generalization
(01:06:42)
and I'll say another thing which is like
(01:06:45)
you know one of the things that you
(01:06:46)
could say is that cause alignment to be
(01:06:48)
difficult is that human val that it's
(01:06:51)
it's um
(01:06:53)
your ability to learn human values is
(01:06:55)
fragile then your ability to optimize
(01:06:57)
them is fragile will you actually learn
(01:06:59)
to optimize them and then can't you say
(01:07:01)
are these not all instances of
(01:07:03)
unreliable generalization
(01:07:07)
why is it that human beings appear to
(01:07:08)
generalize so much better. What if
(01:07:10)
generalization was much better? What
(01:07:12)
would happen in this case? What would be
(01:07:13)
the effect? But those we can't we can't
(01:07:16)
like those questions are right now still
(01:07:18)
unanswerable.
(01:07:19)
>> Um how does one think about what AI
(01:07:23)
going well looks like because I think
(01:07:25)
you've scoped out how AI might evolve.
(01:07:27)
We'll have these sort of continual
(01:07:28)
learning agents. AI will be very
(01:07:30)
powerful. Maybe there will be many
(01:07:32)
different AIs. How do you think about
(01:07:35)
lots of continent computes size
(01:07:37)
intelligences going around? How
(01:07:40)
dangerous is that? How do we make that
(01:07:43)
less dangerous? And how do we do that in
(01:07:46)
a way that
(01:07:49)
protects a equilibrium where there might
(01:07:52)
be misaligned AIs out there and bad
(01:07:54)
actors out there? So, one reason why I
(01:07:57)
liked the AI that cares for sentient
(01:08:00)
life,
(01:08:01)
>> you know, and we can debate on whether
(01:08:02)
it's good or bad, but
(01:08:05)
if the first N of these dramatic systems
(01:08:11)
actually do care for, you know,
(01:08:14)
love humanity or something, you know,
(01:08:16)
care for sentient life. Obviously, this
(01:08:18)
also needs to be achieved. This needs to
(01:08:21)
be achieved.
(01:08:22)
So if this is achieved by the first n of
(01:08:25)
those systems
(01:08:28)
then then I can see it go well at least
(01:08:31)
for quite some time and then there is
(01:08:33)
the question of what happens in the long
(01:08:35)
run what happens in the long run how do
(01:08:37)
you achieve a long run equilibrium
(01:08:40)
>> and I think that there there is an
(01:08:43)
answer as well and I don't like this
(01:08:46)
answer
(01:08:48)
but it needs to be considered
(01:08:52)
In the long run, you might say, okay, so
(01:08:53)
if you have a world where powerful AI
(01:08:56)
exist. In the short term, you could say,
(01:08:58)
okay, you have universal high income.
(01:09:01)
You have universal high income and we
(01:09:04)
all doing well. But we know that what do
(01:09:07)
the Buddhists say? Change is the only
(01:09:09)
constant. And so things change and there
(01:09:11)
is some kind of government political
(01:09:13)
structure thing and it changes because
(01:09:16)
these things have a shelf life. you know
(01:09:19)
some new new government thing comes up
(01:09:20)
and it functions and then after some
(01:09:22)
time it stops functioning
(01:09:25)
that's something that you see happening
(01:09:26)
all the time and so I think that for the
(01:09:29)
long run equilibrium
(01:09:32)
one approach you could say okay so maybe
(01:09:35)
every person will have an AI that will
(01:09:37)
do their bidding and that's good and if
(01:09:40)
that could be maintained indefinitely
(01:09:42)
that's true but the downside with that
(01:09:45)
is okay so then the AI goes and like
(01:09:49)
earns earn earn you know earns money for
(01:09:51)
for the person and you know advocates
(01:09:53)
for their needs in like the political
(01:09:55)
sphere and maybe then writes a little
(01:09:57)
report saying okay here's what I've done
(01:09:58)
here's the situation and the person says
(01:10:00)
great keep it up but the person is no
(01:10:03)
longer a participant
(01:10:06)
and then you can say that's a precarious
(01:10:07)
place to be in but so I'm going to
(01:10:11)
preface by saying
(01:10:14)
I don't like this solution but it is a
(01:10:17)
solution
(01:10:19)
And the solution is if people become
(01:10:21)
part AI with some kind of neural link++
(01:10:24)
because what will happen as a result is
(01:10:26)
that now the AI understands something
(01:10:28)
and we understand it too like
(01:10:31)
because now the understanding is
(01:10:33)
transmitted wholesale. So now if the AI
(01:10:35)
is in some situation now it's like you
(01:10:39)
are involved in the situation yourself
(01:10:40)
fully
(01:10:42)
and I think this is the answer to the
(01:10:44)
equilibrium. I wonder if uh the fact
(01:10:47)
that emotions which were
(01:10:50)
developed
(01:10:51)
millions or in many cases billions of
(01:10:53)
years ago in a totally different
(01:10:55)
environment are still guiding our
(01:10:58)
actions so strongly is an example of
(01:11:02)
alignment success to maybe spell out
(01:11:04)
what I mean the brain stem has these
(01:11:10)
I don't know if it's more accurate to
(01:11:11)
call it a value function or reward
(01:11:12)
function but the brain stem has a
(01:11:15)
directive where it's saying mate with
(01:11:16)
somebody who's more successful. The
(01:11:18)
cortex is the part that understands what
(01:11:20)
does success mean in the modern context
(01:11:22)
but the brain stem is able to align the
(01:11:25)
cortex and say however you recognize
(01:11:27)
success to be and I I'm not smart enough
(01:11:28)
to understand what that is. You're still
(01:11:30)
going to pursue this directive.
(01:11:32)
>> I think I think there is
(01:11:34)
so I think there's a more general point.
(01:11:37)
I think it's actually really mysterious
(01:11:39)
how the brain encodes high level
(01:11:43)
desires. Sorry, how evolution encodes
(01:11:45)
high level desires.
(01:11:47)
>> Like it's pretty easy to understand how
(01:11:49)
evolution would would endow us with the
(01:11:52)
desire for food that smells good cuz
(01:11:54)
smell is a chemical
(01:11:57)
and so just pursue that chemical. It's
(01:11:59)
very easy to imagine such a me evolution
(01:12:01)
doing such a thing. But evolution also
(01:12:05)
has has endowed us with all these social
(01:12:07)
desires like we we really care about
(01:12:10)
being seen positively by society. We
(01:12:13)
care about being in a good standing. We
(01:12:16)
like all these social intuitions that we
(01:12:19)
have. I feel strongly that they are
(01:12:21)
baked in and I don't know how evolution
(01:12:25)
did it because it's a high level
(01:12:27)
concept. It's represented in the brain.
(01:12:30)
like what people think like let's say
(01:12:32)
you are like you care about
(01:12:35)
some social thing.
(01:12:37)
It's not like a low-level signal like
(01:12:40)
smell. It's not something that for which
(01:12:43)
there's a sensor like the brain needs to
(01:12:45)
do a lot of processing to piece together
(01:12:47)
lots of bits of information to
(01:12:49)
understand what's going on socially and
(01:12:51)
somehow evolution said that's what you
(01:12:53)
should care about.
(01:12:54)
>> Yes.
(01:12:55)
>> How did it do it? And it did it quickly
(01:12:57)
too.
(01:12:57)
>> Yeah. because I think all these
(01:12:59)
sophisticated social things that um we
(01:13:02)
care about I think they evolved pretty
(01:13:04)
recently. So evolution had an easy time
(01:13:06)
hardcoding this high level desire and
(01:13:10)
>> I maintain or you know at least I'll say
(01:13:12)
I'm unaware of good hypothesis for how
(01:13:15)
it's done. I I had some ideas I was
(01:13:18)
kicking around but none of them none of
(01:13:21)
them uh are satisfying.
(01:13:24)
>> Yeah. And what's especially impressive
(01:13:26)
is if it was a desire that you learned
(01:13:27)
in your lifetime, it kind of makes sense
(01:13:30)
because your brain is intelligent. It
(01:13:32)
makes sense why we be able to learn
(01:13:33)
intelligent desires. But your point is
(01:13:35)
that the desire is maybe this is not
(01:13:38)
your point, but one way to understand it
(01:13:39)
is the desire is built into the genome
(01:13:42)
and the genome is not intelligent,
(01:13:44)
right? But it's able to you're somehow
(01:13:46)
able to describe this feature that
(01:13:47)
requires like it's not even clear how
(01:13:49)
you define that feature and you can get
(01:13:51)
it into the you can build it into the
(01:13:53)
genes. Yeah, essentially, or maybe I'll
(01:13:55)
put it differently. If you think about
(01:13:56)
the tools that are available to the
(01:13:59)
genome,
(01:14:01)
it says, okay, here's a recipe for
(01:14:02)
building a brain. And you could say,
(01:14:04)
here is a recipe for connecting the
(01:14:06)
dopamine neurons to like the smell
(01:14:08)
sensor.
(01:14:08)
>> Yeah.
(01:14:09)
>> And if the smell is a certain kind of,
(01:14:11)
you know, good smell, you want to eat
(01:14:12)
that. I could imagine the genome doing
(01:14:14)
that. I'm I'm claiming that it is harder
(01:14:17)
to imagine. It's harder to imagine the
(01:14:20)
genome saying you should care about some
(01:14:24)
complicated computation that your entire
(01:14:26)
brain that like a big chunk of your
(01:14:28)
brain does. That's all I'm claiming. I I
(01:14:30)
can tell you like a speculation. I was
(01:14:32)
wondering how it could be done. And let
(01:14:34)
me offer a speculation and I'll explain
(01:14:36)
why the speculation is probably false.
(01:14:38)
So the speculation is okay. So the brain
(01:14:43)
it's like the brain has those regions.
(01:14:46)
You know the brain regions. We have our
(01:14:48)
cortex, right?
(01:14:49)
>> Yeah.
(01:14:50)
>> It has all those brain regions and the
(01:14:52)
cortex is uniform. But the brain regions
(01:14:55)
and and and the neurons in the cortex,
(01:14:57)
they kind of speak to their neighbors
(01:14:58)
mostly. And that's explains why you get
(01:15:00)
brain regions because if you want to do
(01:15:02)
some kind of speech processing, all the
(01:15:04)
neurons that do speech need to talk to
(01:15:06)
each other and they can and because
(01:15:07)
neurons can only speak to their nearby
(01:15:08)
neighbors for the most part, it has to
(01:15:10)
be a region. All the regions are mostly
(01:15:13)
located in the same place from person to
(01:15:14)
person. So maybe evolution hardcoded
(01:15:17)
literally a location on the brain.
(01:15:21)
So it says, "Oh, like when when like you
(01:15:24)
know the GPS of the brain, GPS
(01:15:26)
coordinates, such and such, when that
(01:15:28)
fires, that's what you should care
(01:15:29)
about." Like maybe that's what evolution
(01:15:30)
did cuz that would be within the toolkit
(01:15:32)
of evolution. Yeah. Although there are
(01:15:35)
examples where for example people who
(01:15:37)
are born blind have that area of their
(01:15:39)
cortex adopted by
(01:15:42)
another sense and I have no idea but I'd
(01:15:47)
be surprised if the desires or the
(01:15:51)
reward functions which require visual
(01:15:54)
signal no longer worked. You know people
(01:15:57)
who have their different areas of their
(01:15:58)
cortex co-opted. For example, if you no
(01:16:00)
longer have vision,
(01:16:03)
can you still feel the sense that I want
(01:16:05)
people around me to like me and so
(01:16:07)
forth, which usually there's also visual
(01:16:09)
cues for.
(01:16:10)
>> So, I actually fully agree with that. I
(01:16:11)
I think there's an even stronger counter
(01:16:13)
argument to this theory,
(01:16:14)
>> which is like if you think about people,
(01:16:17)
so there are people who get half of
(01:16:19)
their brain removed in childhood.
(01:16:23)
>> Yeah. and they still have all their
(01:16:25)
brain regions, but they all somehow move
(01:16:26)
to just one hemisphere, which suggests
(01:16:28)
that the brain regions the the location
(01:16:31)
is not fixed. And so that theory is not
(01:16:33)
true. It would have been cool if it was
(01:16:35)
true, but it's not. And so I think
(01:16:37)
that's a mystery, but it's an
(01:16:38)
interesting mystery. Like the fact is
(01:16:40)
somehow
(01:16:42)
>> evolution was able to endow us to care
(01:16:44)
about social stuff very very reliably.
(01:16:47)
And even people who have like all kinds
(01:16:48)
of strange mental conditions and
(01:16:50)
deficiencies and emotional problems tend
(01:16:53)
to care about this. Also, AI tools like
(01:16:55)
defakes, voice clones, and agents have
(01:16:59)
dramatically increased the
(01:17:00)
sophistication of fraud and abuse. So,
(01:17:03)
it's more important than ever to
(01:17:05)
actually understand the identity and
(01:17:07)
intent of whoever or whatever is using
(01:17:10)
your platform. That's exactly what
(01:17:12)
Sardine helps you do. Sardine brings
(01:17:14)
together thousands of device behavior
(01:17:17)
and identity signals to help you assess
(01:17:19)
risk. Everything from how a user types
(01:17:22)
or moves their mouse or holds their
(01:17:24)
device to whether they're hiding their
(01:17:26)
true location behind a VPN to whether
(01:17:29)
they're injecting a fake camera feed
(01:17:31)
during KYC selfie checks. Sardine
(01:17:33)
combines these signals with insights
(01:17:36)
from their network of almost 4 billion
(01:17:38)
devices. things like a user's history of
(01:17:40)
fraud or their associations with other
(01:17:42)
high-risisk accounts so you can spot bad
(01:17:45)
actors before they do damage. This would
(01:17:48)
literally be impossible if you only use
(01:17:50)
data from your own application. Sardine
(01:17:53)
doesn't stop at detection. They offer a
(01:17:54)
suite of agents to streamline onboarding
(01:17:57)
checks and automate investigations. So,
(01:17:59)
as fraudsters use AI to scale their
(01:18:01)
attacks, you can use AI to scale your
(01:18:04)
defenses. Go to sardine.ai/warcash.
(01:18:07)
AI/Swarcash
(01:18:08)
to learn more and download their guide
(01:18:10)
on AI fraud detection.
(01:18:13)
What is SSI planning on doing
(01:18:15)
differently? So presumably your plan is
(01:18:17)
to be one of the frontier companies when
(01:18:20)
this time arrives
(01:18:22)
and then what is
(01:18:25)
presumably you started SSI because
(01:18:26)
you're like I I think I have a way of
(01:18:28)
approaching how to do this safely in a
(01:18:30)
way that the other companies don't. What
(01:18:32)
what is that difference? So the way I
(01:18:35)
would describe it as
(01:18:38)
there are some ideas that I think are
(01:18:40)
promising and I want to investigate them
(01:18:42)
and see if they are indeed promising or
(01:18:44)
not. It's really that simple. It's an
(01:18:46)
attempt. I think that if the ideas turn
(01:18:49)
out to be correct, these ideas that we
(01:18:51)
discussed around understanding
(01:18:54)
generalization,
(01:18:56)
>> if these ideas turn out to be correct,
(01:19:01)
then I think we will have something
(01:19:02)
worthy. Will it turn out to be correct?
(01:19:05)
We are doing research. We are squarely
(01:19:08)
age of research company. We are making
(01:19:10)
progress. We've actually made quite good
(01:19:12)
progress over the past year. But we need
(01:19:13)
to keep making more progress,
(01:19:15)
>> more research.
(01:19:16)
>> And that's how I see it. I see it as an
(01:19:19)
attempt to be
(01:19:23)
an attempt to be a voice and a
(01:19:25)
participant.
(01:19:27)
Um people have asked uh your co-founder
(01:19:31)
and previous CEO left to go to Meta
(01:19:34)
recently and people have asked well if
(01:19:38)
there was a lot of breakthroughs being
(01:19:40)
made that seems like a thing that should
(01:19:41)
have been unlikely. I wonder how you
(01:19:43)
respond.
(01:19:43)
>> Yeah. So I in for for this I will simply
(01:19:47)
remind a few facts that may have been
(01:19:50)
forgotten and I think this these facts
(01:19:52)
which provide the context I think they
(01:19:54)
explain the situation. So the context
(01:19:56)
was that we were fundraising at a 32
(01:20:00)
billion valuation
(01:20:02)
and then Meta um came in and offered to
(01:20:06)
to acquire us and I said no
(01:20:11)
but my former co-founder
(01:20:14)
like in some sense said yes and as a
(01:20:17)
result he also was able to enjoy from a
(01:20:20)
lot of near-term liquidity and he was
(01:20:23)
the only person from SSI to join Meta.
(01:20:25)
It sounds like SSI's plan is to be a
(01:20:27)
company that is at the frontier when you
(01:20:29)
get to this
(01:20:31)
very important period in human history
(01:20:34)
where you have superhuman intelligence
(01:20:35)
and you have these ideas about how to
(01:20:37)
make superhuman intelligence go well but
(01:20:40)
other companies will be trying their own
(01:20:41)
ideas. What distinguishes SSI's approach
(01:20:45)
to making super intelligence go well?
(01:20:48)
The
(01:20:48)
>> the main thing that distinguishes SSI is
(01:20:52)
its technical approach.
(01:20:55)
So we have a different technical
(01:20:56)
approach that I think is worthy
(01:20:59)
and we are pursuing it.
(01:21:02)
I maintain that in the end there will be
(01:21:04)
a convergence of strategies. So I think
(01:21:06)
there will be a convergence of
(01:21:08)
strategies where
(01:21:10)
at some point as AI becomes more
(01:21:13)
powerful
(01:21:15)
it's going to become more or less
(01:21:17)
clearer to everyone what the strategy
(01:21:18)
should be. And it should be something
(01:21:20)
like, yeah, you need to find some way to
(01:21:23)
talk to each other. And you want your
(01:21:25)
first
(01:21:27)
actual like real super intelligent AI to
(01:21:29)
be aligned and somehow be,
(01:21:36)
you know, care for sentient life, care
(01:21:38)
for people, democratic, one of those,
(01:21:41)
some combination of thereof. And I think
(01:21:45)
this is the condition
(01:21:47)
that everyone should strive for and
(01:21:50)
that's what SSI is striving for and I
(01:21:53)
think that with time if not already all
(01:21:57)
the other companies will realizing that
(01:21:59)
they're striving towards the same thing
(01:22:00)
and we'll see. I think that the world
(01:22:02)
will truly change as AI becomes more
(01:22:03)
powerful.
(01:22:04)
>> Yeah.
(01:22:04)
>> And I think a lot of these forecasts
(01:22:06)
will like I think things will be really
(01:22:09)
different and people will be acting
(01:22:11)
really differently. What speaking of
(01:22:13)
forecast what are your forecasts to this
(01:22:16)
system you're describing which can learn
(01:22:17)
as well as a human and
(01:22:21)
subsequently as a result become
(01:22:22)
superhuman.
(01:22:24)
>> I think like uh 5 to 20
(01:22:27)
>> 5 to 20 years.
(01:22:28)
>> Mhm.
(01:22:28)
>> So I just want to unroll your
(01:22:32)
how you might see the world coming. It's
(01:22:34)
like we have a couple more years where
(01:22:36)
these other companies are continuing the
(01:22:38)
current approach and it stalls out and
(01:22:40)
stalls out here meaning they earn no
(01:22:42)
more than low hundreds of billions in
(01:22:44)
revenue or how do you think about what
(01:22:45)
stalling out means?
(01:22:47)
>> Yeah,
(01:22:49)
I think the re I think it could I think
(01:22:50)
it could stall out and
(01:22:54)
I think stalling out will look like
(01:22:57)
it will all look very similar.
(01:22:58)
>> Yeah.
(01:22:59)
>> Among all the different companies
(01:23:01)
something like this. I'm not sure
(01:23:02)
because I think I think I think even
(01:23:04)
with I think even I think even with
(01:23:06)
stolen out I think these companies could
(01:23:07)
make a stupendous stupendous revenue
(01:23:10)
maybe not profits because they will be
(01:23:12)
it will be they will need to work hard
(01:23:14)
to differentiate each other from
(01:23:16)
themselves but revenue definitely
(01:23:18)
>> but there's something in your model
(01:23:20)
implies that
(01:23:23)
the when the correct solution does
(01:23:25)
emerge there will be convergence between
(01:23:26)
all the companies and I'm curious why
(01:23:29)
you think that's the case
(01:23:30)
>> well I was talking more about converg
(01:23:31)
convergence on their larger strategies.
(01:23:33)
>> I think eventual convergence on the
(01:23:35)
technical approach is probably going to
(01:23:36)
happen as well but I I was alluding to
(01:23:39)
convergence to the larger strategies.
(01:23:41)
What what what exactly is the thing that
(01:23:42)
should be done?
(01:23:43)
>> I I just want to better understand how
(01:23:45)
you see the future on rolling. So
(01:23:47)
currently we have these different
(01:23:48)
companies and you expect their approach
(01:23:49)
to continue generating revenue. Yes.
(01:23:51)
>> But not get to this humanlike learner.
(01:23:53)
>> Yes.
(01:23:54)
>> So now we have these different forks of
(01:23:56)
companies. We have you we have thinking
(01:23:58)
machines. There's a bunch of other labs.
(01:24:00)
>> Yes. and maybe one of them figures out
(01:24:01)
the correct approach
(01:24:03)
>> but then the release of their product
(01:24:05)
makes it clear to other people how to do
(01:24:07)
this thing.
(01:24:08)
>> I think it won't be clear how to do it
(01:24:10)
thing but it will be clear that
(01:24:11)
something different is possible
(01:24:12)
>> right
(01:24:13)
>> and that is information and I think
(01:24:15)
people will will then be trying to
(01:24:18)
figure out how how that's how that
(01:24:20)
works. I do think though that one of the
(01:24:23)
things that's that I think you know not
(01:24:27)
addressed here not discussed is that
(01:24:30)
with each increase in the AI's
(01:24:33)
capabilities I think there will be some
(01:24:35)
kind of changes but I don't know exactly
(01:24:38)
which ones in how things are being done.
(01:24:41)
So like I think it's going to be
(01:24:44)
important yet I can't spell out what
(01:24:46)
that is exactly.
(01:24:47)
>> And how how are the
(01:24:50)
by default you would expect the company
(01:24:52)
that has the model company that has that
(01:24:54)
model to be getting all these gains
(01:24:55)
because they have the model that is
(01:24:57)
learning how to do all has the skills
(01:25:00)
and knowledge that it's building up in
(01:25:01)
the world. What is the reason to think
(01:25:03)
that the benefits of that would be
(01:25:04)
widely distributed and not just end up
(01:25:06)
at whatever model company gets this
(01:25:08)
continuous learning loop going first?
(01:25:11)
>> Like I think that empirically what
(01:25:13)
happen so here here is what I think is
(01:25:16)
going to happen. Number one, I think
(01:25:18)
empirically when
(01:25:23)
let's let's look at let's look at how
(01:25:25)
things have gone so far with um the AIs
(01:25:28)
of the past. So one company produced an
(01:25:30)
advance and the other company scrambled
(01:25:34)
and produced some competi some some
(01:25:37)
similar things after some amount of time
(01:25:40)
and they started to compete in the
(01:25:42)
market and push their push the prices
(01:25:45)
down
(01:25:46)
>> and so I think from the market
(01:25:47)
perspective I think something similar
(01:25:49)
will happen there as well even if
(01:25:51)
someone okay so okay so okay so okay so
(01:25:52)
okay so okay so okay so okay so okay so
(01:25:52)
okay so okay we talking about the good
(01:25:53)
world by the way where
(01:25:57)
what's the good world. What's the good
(01:25:59)
world?
(01:26:01)
Where we have these powerful humanlike
(01:26:05)
learners that are also like and by the
(01:26:08)
way maybe there there's another thing we
(01:26:10)
haven't discussed on the on the the spec
(01:26:13)
of the super intelligent AI that I think
(01:26:15)
is worth considering is that you make it
(01:26:18)
narrow
(01:26:20)
can be useful and narrow at the same
(01:26:21)
time. So you can have lots of narrow
(01:26:23)
super intelligent AIs. But suppose you
(01:26:25)
have many of them
(01:26:29)
and you have some and you have some
(01:26:30)
company that's producing a lot of um
(01:26:33)
profits from it and then you have
(01:26:35)
another company that comes in and starts
(01:26:37)
to compete and the way the competition
(01:26:39)
is going to work is through
(01:26:40)
specialization.
(01:26:42)
I think what's going to happen is that
(01:26:45)
the way
(01:26:48)
competition like competition loves
(01:26:50)
specialization and you see it in the
(01:26:53)
market, you see it in evolution as well.
(01:26:54)
So you're going to have lots of
(01:26:55)
different niches and you're going to
(01:26:56)
have lots of different companies who are
(01:26:58)
occupying different niches in
(01:27:02)
in this kind of world where you might
(01:27:04)
say yeah like one AI company is really
(01:27:06)
quite a bit better at some area of
(01:27:09)
really complicated economic activity and
(01:27:11)
a different company is better at another
(01:27:13)
area and the third company is really
(01:27:14)
good at litigation and that's
(01:27:16)
contradicted by what humanlike learning
(01:27:18)
implies is that like it can learn
(01:27:20)
>> it can but but you have accumulated
(01:27:23)
learning you have a big investment. You
(01:27:25)
spent a lot of compute to become really
(01:27:28)
really really good really phenomenal at
(01:27:30)
this thing and someone else spent a huge
(01:27:33)
amount of comput and a huge amount of
(01:27:34)
experience to get really really good at
(01:27:35)
some other thing
(01:27:36)
>> right
(01:27:36)
>> you apply a lot of human learning to get
(01:27:38)
there but now like you you are at this
(01:27:41)
high point where someone else would say
(01:27:43)
look like I don't want to start learning
(01:27:45)
what you've learned to go
(01:27:46)
>> I guess that would require many
(01:27:47)
different companies to begin at the
(01:27:49)
human like continual learning agent at
(01:27:52)
the same time so that they can start
(01:27:54)
their different research in different
(01:27:57)
branches. But if one company,
(01:28:01)
you know, gets that agent first or gets
(01:28:03)
that learner first,
(01:28:05)
it does then seem like well, you know,
(01:28:08)
they could like if you just think about
(01:28:10)
every single job in the economy,
(01:28:13)
you just have uh instance learning each
(01:28:16)
one seems tractable for a company.
(01:28:18)
>> Yeah, that's that's that's a valid
(01:28:20)
argument. My my strong intuition is that
(01:28:22)
it's not how it's going to go.
(01:28:24)
My strong intuition is that yeah like
(01:28:26)
the argument says it will go this way.
(01:28:28)
>> Yeah.
(01:28:28)
>> But my strong intuition is that it will
(01:28:30)
not go this way that this is the you
(01:28:34)
know in in theory there is no difference
(01:28:36)
between theory and practice. In practice
(01:28:37)
there is and I think that's going to be
(01:28:38)
one of those
(01:28:39)
>> a lot of people's models of recursive
(01:28:41)
self-improvement literally explicitly
(01:28:44)
state we will have a million Ilias in a
(01:28:47)
server that are coming in with different
(01:28:48)
ideas and this will lead to a super
(01:28:50)
intelligence emerging very fast. Do you
(01:28:52)
have some intuition about how
(01:28:53)
parallelizable the thing you are doing
(01:28:55)
is? How how what are the gains from
(01:28:59)
making copies of Ilia?
(01:29:01)
>> I don't know. I think
(01:29:05)
I think there'll definitely be there'll
(01:29:07)
be diminishing returns because you want
(01:29:09)
you want people who think differently
(01:29:10)
rather than the same. I think that if
(01:29:12)
they were literal copies of me, I'm not
(01:29:14)
sure how much more incremental value
(01:29:16)
you'd get. I think that
(01:29:20)
but people who think differently that's
(01:29:22)
what you want.
(01:29:23)
>> Why is it that it's been if you look at
(01:29:26)
different models even released by
(01:29:27)
totally different companies trained on
(01:29:30)
potentially non-over overlapping data
(01:29:32)
sets it's actually crazy how similar
(01:29:35)
LLMs are to each other.
(01:29:36)
>> Maybe the data sets are not as non-over
(01:29:37)
overlapping as it seems. But there's
(01:29:41)
there's some sense there's like even if
(01:29:43)
an individual human might be less
(01:29:44)
productive than the future AI. Maybe
(01:29:45)
there's something to the fact that human
(01:29:46)
teams have more diversity than teams of
(01:29:49)
AIs might have. But how do we elicit
(01:29:51)
meaningful diversity among AI? So I
(01:29:54)
think just raising the temperature just
(01:29:55)
results in gibberish. I think you want
(01:29:57)
something more like
(01:29:58)
>> different scientists have different
(01:30:00)
different prejudices or different ideas.
(01:30:01)
How do you get that kind of diversity
(01:30:03)
among AI agents? So the reason there has
(01:30:06)
been no diversity I believe is because
(01:30:09)
of pre-training.
(01:30:11)
All the pre-trained models are the same
(01:30:13)
pretty much because the pre-train on the
(01:30:16)
same data. Now RL and postraining is
(01:30:19)
where some differentiation starts to
(01:30:21)
emerge because different people come up
(01:30:23)
with different RL training.
(01:30:25)
>> Yeah. And then I've heard you hint in
(01:30:28)
the past about selfplay as a way to
(01:30:31)
either get data or match agents to other
(01:30:34)
agents of equivalent intelligence to
(01:30:36)
kick off learning. How should we think
(01:30:39)
about why there's no public
(01:30:44)
um proposals of this kind of thing and
(01:30:46)
working with LLM?
(01:30:47)
>> I would say there are two things to say.
(01:30:49)
I would say that the reason why I
(01:30:51)
thought selfplayful is interesting
(01:30:54)
is because it offered a way to create
(01:30:57)
models using compute only without data,
(01:31:00)
right? And if you think that data is the
(01:31:02)
ultimate bottleneck, then using compute
(01:31:04)
only is very interesting. So that's what
(01:31:07)
makes it interesting. Now the
(01:31:11)
the thing is
(01:31:14)
that selfplay at least the way it was
(01:31:17)
done in the past when you have agents
(01:31:19)
which are somehow compete with each
(01:31:20)
other it's only good for developing a
(01:31:23)
certain set of skills it is too narrow.
(01:31:26)
It's only good for like negotiation
(01:31:29)
uh conflict
(01:31:31)
certain social skills
(01:31:33)
strategizing that kind of stuff. And so
(01:31:35)
if you care about those skills then
(01:31:37)
selfplay will be useful. Now actually I
(01:31:40)
think that selfplay
(01:31:42)
did
(01:31:43)
find a home but just in a different form
(01:31:47)
in a different form. So things like
(01:31:49)
debate prove a verifier. You have some
(01:31:53)
kind of an LLM as a judge which is also
(01:31:56)
incentivized to find mistakes in your
(01:31:58)
work. You could say this is not exactly
(01:32:00)
selfplay but this is you know a related
(01:32:02)
adversarial setup that people are doing.
(01:32:04)
believe
(01:32:04)
>> and really selfplay is an example of um
(01:32:07)
is a special case of more general like
(01:32:10)
um competition between between agents,
(01:32:13)
>> right? The response the natural response
(01:32:14)
to competition is to try to be
(01:32:16)
different. And so if you were to put
(01:32:17)
multiple agents and you tell them, you
(01:32:19)
know, you all need to work on some
(01:32:21)
problem and you're an agent and you're
(01:32:24)
inspecting what everyone else is
(01:32:25)
working, you're going to say, well, if
(01:32:28)
they already taken this approach, it's
(01:32:30)
not clear I should pursue it. they
(01:32:32)
should pursue something differentiated
(01:32:34)
and so I think that something like this
(01:32:35)
could also create an incentive for um a
(01:32:38)
diversity of approaches.
(01:32:39)
>> Yeah. Um final question,
(01:32:43)
what is research taste? You're obviously
(01:32:47)
the person in the world who is
(01:32:50)
considered to have the best taste in
(01:32:54)
doing research in AI. you were uh the
(01:32:58)
co-author on many of the biggest the
(01:33:01)
biggest things that have happened in the
(01:33:02)
history of deep learning from Alex net
(01:33:03)
to GBT3 to so on what is it that how do
(01:33:06)
you characterize how
(01:33:09)
you come up with these ideas
(01:33:11)
>> I can answer so I can comment on this
(01:33:13)
for myself
(01:33:14)
>> I think different people do it
(01:33:16)
differently
(01:33:18)
>> but one thing that um guides me
(01:33:21)
personally
(01:33:23)
is
(01:33:24)
an aesthetic
(01:33:26)
of how AI should be
(01:33:29)
>> by thinking about how people are but
(01:33:31)
thinking correctly
(01:33:33)
>> like it's very easy to think about how
(01:33:35)
people are incorrectly but what does it
(01:33:37)
mean to think about people correctly
(01:33:39)
>> so I'll give you some examples
(01:33:42)
the idea of the artificial neuron is
(01:33:46)
directly inspired by the brain and it's
(01:33:48)
a great idea why because you say sure
(01:33:50)
the brain has all these different organs
(01:33:52)
has the faults but the faults probably
(01:33:54)
don't matter M
(01:33:55)
>> why do we think that the neurons matter?
(01:33:56)
Because there's many of them. It kind of
(01:33:59)
feels right. So you want the neuron.
(01:34:01)
>> Yeah.
(01:34:01)
>> You want some kind of local learning
(01:34:03)
rule that will change the connections.
(01:34:04)
You want some local learning rule rule
(01:34:06)
that will change the connections between
(01:34:07)
the neurons,
(01:34:10)
>> right? It feels plausible that the brain
(01:34:12)
does it. The idea of the distributed
(01:34:13)
representation,
(01:34:16)
the idea that the brain,
(01:34:18)
you know, the brain responds to
(01:34:19)
experience or neural network should
(01:34:20)
learn from experience, not response. The
(01:34:22)
brain learns from experience.
(01:34:24)
the neural network of experience and you
(01:34:27)
kind of ask yourself is some is
(01:34:29)
something fundamental or not fundamental
(01:34:30)
how things should be
(01:34:32)
>> and I think that's been guiding me a
(01:34:34)
fair bit kind of thinking from multiple
(01:34:37)
angles and looking for almost beauty
(01:34:40)
beauty simplicity ugliness there's no
(01:34:42)
room for ugliness it's just beauty
(01:34:44)
simplicity elegance correct inspiration
(01:34:47)
from the brain and all of those things
(01:34:49)
need to be present at the same time and
(01:34:51)
the more they are present the more
(01:34:53)
confident you can be in a top- down
(01:34:55)
belief. And then the top down belief is
(01:34:58)
the thing that sustains you when the
(01:35:00)
experiments contradict you. Because if
(01:35:02)
you just trust the data all the time,
(01:35:05)
well, sometimes you can be doing a
(01:35:06)
correct thing, but there's a bug.
(01:35:08)
>> But you don't know that there is a bug.
(01:35:09)
How can you tell that there is a bug?
(01:35:11)
>> How do you know if you should keep
(01:35:12)
debugging or you conclude it's the wrong
(01:35:14)
direction? Well, it's the top down.
(01:35:16)
Well, how should you can say the things
(01:35:18)
have to be this way? Something like this
(01:35:20)
has to work. Therefore, we got to keep
(01:35:22)
going. That's the top down. And it's
(01:35:24)
based on this like multifaceted beauty
(01:35:27)
and inspiration by the brain.
(01:35:30)
>> All right, we'll leave it there.
(01:35:32)
>> Thank you so much.
(01:35:32)
>> Thank you so much.
(01:35:35)
>> All right. Appreciate it.
(01:35:36)
>> That was great.
(01:35:36)
>> Yeah, I enjoyed it.
(01:35:38)
>> Yes, me too.
(01:35:39)
>> Hey everybody, I hope you enjoyed that
(01:35:41)
episode. If you did, the most helpful
(01:35:43)
thing you can do is just share it with
(01:35:45)
other people who you think might enjoy
(01:35:46)
it. It's also helpful if you leave a
(01:35:49)
rating or a comment on whatever platform
(01:35:51)
you're listening on. If you're
(01:35:53)
interested in sponsoring the podcast,
(01:35:55)
you can reach out at
(01:35:56)
dwarcash.com/advertise.
(01:36:00)
Otherwise, I'll see you on the next one.
