↔
Title: WTF is happening at xAI | Sulaiman Ghori
Duration: 01:11:39
Total Correct Answers:
Current Caption
Correct
Learning Modes
YouTube Video Transcript Hide
Ask AI:
Export as:
Ask AI Result
The ask AI result will appear here..
(00:00:00) Your YouTube transcript will appear here
(00:00:00)
Tyler took this bet with Elon like get a
(00:00:02)
Cybert truck tonight if you can get a
(00:00:04)
training run on these GPUs in 24 hours
(00:00:06)
and we were training that night.
(00:00:07)
>> Did he get the Cyber Truck?
(00:00:08)
>> Yeah, he got Cyber Truck.
(00:00:09)
>> My first day they just gave me a laptop
(00:00:11)
and a badge and I was like okay now
(00:00:13)
what? I don't even have a team. I've not
(00:00:15)
been told what to do. Ask Rock was
(00:00:16)
spinning up at the time our integrations
(00:00:18)
with X. They're like can you help? And I
(00:00:19)
was like yes.
(00:00:20)
>> What's the most fun thing about working
(00:00:22)
there?
(00:00:22)
>> No one tells me no. If I have a good
(00:00:24)
idea I can usually go and implement it
(00:00:26)
that same day and show it to Elon or
(00:00:28)
whoever and we got an answer. We did the
(00:00:29)
math. Right now we're I think at about
(00:00:31)
$2.5 million per commit to the main
(00:00:33)
refill and I did five today. So
(00:00:35)
>> you added like 12 and a half million of
(00:00:36)
value.
(00:00:37)
>> The levers are extremely strong.
(00:00:41)
>> Today I have the pleasure of sitting
(00:00:42)
down with Sully Kongori and he is one of
(00:00:44)
the engineers at XAI. I've been kind of
(00:00:47)
fascinated by XAI since like 2023 when
(00:00:50)
like Elon first started. I think it's
(00:00:52)
like one of the fastest growing
(00:00:53)
companies of all time. Can you just talk
(00:00:56)
about like what the [ __ ] is happening at
(00:00:58)
XAI?
(00:00:59)
>> Yeah. Um, we don't have really due
(00:01:03)
dates. It's always yesterday.
(00:01:04)
>> Um,
(00:01:06)
there's no blockers for anything like at
(00:01:09)
least nothing artificial. Uh, the whole
(00:01:12)
Elon thing about going down to the root,
(00:01:14)
uh, the fundamental, whatever the
(00:01:16)
physical thing is, we get there pretty
(00:01:17)
quick if we can, as quick as we can,
(00:01:20)
which is funny in software. It's not
(00:01:22)
really like a thing that you think about
(00:01:23)
is the physics too much, but we do try
(00:01:26)
quite a bit and we're not really fully a
(00:01:28)
software company given all the
(00:01:29)
infrastructure pull down. Um,
(00:01:31)
>> it's kind of hardware at this point.
(00:01:32)
>> Yeah,
(00:01:32)
>> it's like hardware constrained.
(00:01:33)
>> It's the probably our biggest edge is is
(00:01:36)
is the hardware because nobody else is
(00:01:38)
even close on on the deployment there.
(00:01:39)
Um, [clears throat]
(00:01:41)
although the talent density on software
(00:01:42)
is like incredible. I've never been
(00:01:44)
anywhere like that. It's it's really
(00:01:45)
cool
(00:01:46)
>> for Elon. he is very good at figuring
(00:01:49)
out like what the bottlenecks will be
(00:01:50)
even like a couple months or even years
(00:01:51)
in the future and then trying to work
(00:01:53)
backwards from that and make sure that
(00:01:54)
like he's in a really good position. Um
(00:01:57)
how does that work dayto-day with just
(00:01:58)
normal people like at XAI and like
(00:02:01)
adopting that kind of mental framework?
(00:02:02)
Usually when we spin something up new
(00:02:04)
very quickly either one of us or he
(00:02:06)
comes up with this uh metric that's
(00:02:09)
usually very core to
(00:02:12)
either the the financial or the physical
(00:02:14)
return or both sometimes. Um and so
(00:02:18)
everything is just focused on driving
(00:02:20)
that that metric. Um there's never
(00:02:24)
like a fundamental limitation to it or
(00:02:26)
like whatever the fundamental limitation
(00:02:27)
is it better be rooted deep down and not
(00:02:30)
something artificial. Um and there is a
(00:02:32)
lot of
(00:02:34)
uh perceived limitations um especially
(00:02:37)
in the software world coming from like
(00:02:39)
especially in the last 10 years of like
(00:02:40)
web dev and all these kinds of things.
(00:02:42)
People just assume or accept
(00:02:44)
>> certain limitations especially when it
(00:02:45)
comes to speed and latency
(00:02:47)
>> and they're not true. um you can get rid
(00:02:49)
of a lot of overhead. Like there's a lot
(00:02:51)
of stupid stuff in in the stack and if
(00:02:54)
you can knock out a lot of that, you can
(00:02:56)
usually two to 8x most anything at least
(00:03:00)
anything invented relatively recently.
(00:03:03)
Uh some stuff not so much, but yeah.
(00:03:05)
>> When was the last time that you
(00:03:07)
experienced this where there's some like
(00:03:08)
conventional wisdom that says that
(00:03:10)
there's this is the timeline and then
(00:03:11)
you guys just were able to completely
(00:03:13)
shred that? Um most recently it's our
(00:03:16)
model iterations on on macro hard. Um so
(00:03:20)
we're working on some novel
(00:03:21)
architectures actually multiple at the
(00:03:22)
same time and uh we're coming out with
(00:03:26)
new like iterations like daily sometimes
(00:03:29)
multiple times a day which is from
(00:03:31)
pre-train um in some cases uh which is
(00:03:34)
not something you ordinarily really see
(00:03:36)
but it comes from well a we have a
(00:03:39)
pretty great supercomput team and
(00:03:41)
they've knocked out a lot of the typical
(00:03:43)
barriers it takes to train a lot of this
(00:03:44)
stuff even with how variable our
(00:03:47)
hardware like the hardware is like it's
(00:03:49)
you know within a day of standing up a
(00:03:52)
rack you can usually be training
(00:03:54)
sometimes within the same day um even
(00:03:56)
within uh a few hours in some cases
(00:03:59)
>> and this is like not normal like
(00:04:00)
normally the timelines are like days or
(00:04:02)
or weeks
(00:04:02)
>> it takes a lot well in most cases at
(00:04:05)
least yeah in the last 10 years you
(00:04:06)
abstract this away and let Amazon or
(00:04:08)
Google take care of this um and so
(00:04:11)
whatever their capacity is is what their
(00:04:12)
capacity is but that's not like you
(00:04:15)
can't have that be the case and when an
(00:04:16)
AI now. So, the only solution is to die
(00:04:21)
or uh or build it yourself.
(00:04:23)
>> Can you tell me about like how what your
(00:04:26)
experience was like joining why you
(00:04:28)
joined and then kind of what the like
(00:04:29)
onboarding process was for the first
(00:04:31)
like couple weeks?
(00:04:32)
>> Yeah. So, um I was working on my own
(00:04:35)
startup when I moved to the Bay. Um and
(00:04:38)
actually during that time, Greg Yang,
(00:04:40)
one of the co-founders of XA, had
(00:04:41)
reached out. He's great at recruiting as
(00:04:42)
it turns out. Um
(00:04:43)
>> what did what did he say? Uh, so I got
(00:04:46)
an email and I thought it was spam
(00:04:47)
because it was I was getting a lot of
(00:04:48)
these like, you know, emails to founders
(00:04:49)
at the time of like, hey, you want to
(00:04:51)
chat or like I like what you're doing.
(00:04:52)
You want to chat, whatever. I was going
(00:04:53)
to mark it as spam to like delete it.
(00:04:55)
And I saw the domain x.ai. I was like,
(00:04:57)
oh, wait a second. I know these guys.
(00:05:00)
And they just uh I think it was probably
(00:05:03)
eight months in at that point. Um, and
(00:05:06)
so I was like, okay, yeah, let's chat.
(00:05:07)
And so we chatted a bunch of times. Um
(00:05:09)
then uh I wanted an aqua hire but uh I
(00:05:13)
think we were too early at the time and
(00:05:15)
that company kind of went mostly because
(00:05:17)
it was fairly obvious that you can't
(00:05:19)
build macro hard with like a million
(00:05:20)
dollars. Um but the uh idea was sound.
(00:05:25)
So I spent the next like six, seven
(00:05:26)
months wasting all my money um building
(00:05:28)
like aerospace projects and working on
(00:05:30)
uh an aerospace astro mining concept.
(00:05:34)
Um, that also I realized like probably
(00:05:36)
wouldn't work, but it was worth a try.
(00:05:39)
And so I emailed Greg again like uh hey
(00:05:41)
can uh like you want to chat again? He's
(00:05:43)
like yeah sure you want to interview
(00:05:44)
tomorrow. I was like okay. And um uh I
(00:05:47)
apparently did well and I moved on
(00:05:49)
Monday and I started uh then and it was
(00:05:51)
really great. Um nobody told me what to
(00:05:54)
do. So like my first day they just gave
(00:05:56)
me a laptop and a badge and I was like
(00:05:58)
okay. Um and I was like okay now what?
(00:06:01)
And so I went to go find Greg cuz I was
(00:06:04)
like, I don't I don't even have a team.
(00:06:05)
I've not been told what to do. Like Greg
(00:06:08)
just brought me on cuz I think he liked
(00:06:10)
what I was doing previously and it was
(00:06:12)
related to what the long term was for
(00:06:13)
macroard which wasn't really even a
(00:06:15)
project at the time. And I ended up
(00:06:17)
working on actually uh Ascrock was
(00:06:20)
spinning up at the time where our
(00:06:21)
integrations with X and so they're like
(00:06:23)
can you help and I was like yes I can
(00:06:25)
help. And so my first week was working
(00:06:28)
uh with the one guy. I found out very
(00:06:31)
quickly like everything that we built
(00:06:34)
like I could sit and I could stand up
(00:06:36)
from my desk which I didn't even have a
(00:06:37)
desk assigned to me. I just sat at
(00:06:39)
random people's desks that weren't there
(00:06:40)
that day. Um and I could point to
(00:06:42)
whoever built that thing at XAI um like
(00:06:45)
from my desk. It was very very very
(00:06:47)
cool.
(00:06:47)
>> And there was like almost no people
(00:06:49)
working there at this point. Just like a
(00:06:50)
couple hundred, right?
(00:06:51)
>> Uh yeah, about a hundred or so on the
(00:06:53)
engineering staff. And then I don't know
(00:06:55)
what the uh uh infra buildout team
(00:06:57)
looked like at that time. And it's kind
(00:06:59)
of hard to tell because some people move
(00:07:01)
up the ladder from like the actual
(00:07:03)
building and construction crew onto our
(00:07:04)
payroll. But um it was pretty small at
(00:07:08)
the time like much much like an order of
(00:07:09)
magnitude smaller than the other labs.
(00:07:11)
Um and we had still just done Grock 3.
(00:07:14)
Um
(00:07:15)
>> yeah, which yeah
(00:07:18)
was pretty cool. One of the things that
(00:07:20)
I kind of love um is how fast XAI went
(00:07:23)
from being founded. I remember Elon
(00:07:25)
initially saying like we're not even
(00:07:27)
sure if this can be a success with you
(00:07:30)
know people having you know a multi-year
(00:07:33)
advantage on on speed and like timing
(00:07:35)
and then you guys got done with the
(00:07:38)
first like Colossus data center in like
(00:07:40)
122 days. Um and that was just like
(00:07:42)
unheard of and Jensen's out here saying
(00:07:45)
singing the praises of XAI and Elon. uh
(00:07:48)
what kind of culture did that allow to
(00:07:51)
be formed?
(00:07:52)
>> It definitely enabled like us on on
(00:07:56)
model and product to kind of assume we
(00:07:58)
would have the resources to do what we
(00:08:00)
needed to do.
(00:08:01)
>> Um and that's definitely the case. Like
(00:08:04)
we're not super duper resource
(00:08:06)
constrained. Like we've still found a
(00:08:08)
way to push up against that wall.
(00:08:09)
>> Um but that's just we have 20 different
(00:08:11)
things going at the same time. Like more
(00:08:13)
than that, like many more things than
(00:08:16)
that. there's a absurd amount of of runs
(00:08:18)
and training and all that stuff going on
(00:08:20)
at the same time in parallel usually by
(00:08:22)
like a few a handful of people. Um which
(00:08:26)
is how we're able to iterate very
(00:08:27)
quickly on on model and product side. Um
(00:08:30)
and utilization has definitely been very
(00:08:32)
high. The the speed allows us definitely
(00:08:34)
to I guess think more long term. Um, so
(00:08:38)
I think Grock 4 or five really what it
(00:08:41)
was was already planned out and and
(00:08:45)
designed in terms of size and what we
(00:08:47)
expected um way early like before I
(00:08:50)
joined. I joined around Gro 3.
(00:08:53)
>> So it's like thinking at least a year in
(00:08:55)
advance. you can yeah you can think much
(00:08:57)
more in advance and assume that those
(00:08:59)
estimates will be hit um just because
(00:09:02)
everyone's like pretty great and
(00:09:03)
reliable
(00:09:04)
>> which frees you up a lot in terms of
(00:09:07)
like what your limitations are I guess
(00:09:09)
so for us for example the assumed
(00:09:11)
minimum latency
(00:09:13)
was about three times higher than it
(00:09:15)
actually needed to be and the buildout
(00:09:17)
allowed for that basically um
(00:09:19)
>> what do you mean by that
(00:09:22)
>> so the one of the novel architectures
(00:09:24)
we're working on um is not really
(00:09:29)
possible unless you scale up your
(00:09:33)
experiment rate because it's it's not
(00:09:36)
building on any existing body of work.
(00:09:37)
You need a new pre pre-training body and
(00:09:39)
you need also uh a new data set but
(00:09:41)
that's not really constrained by the
(00:09:42)
resources like the physical uh
(00:09:44)
infrastructure resources mostly. Um
(00:09:48)
although there's the uh the Tesla
(00:09:51)
computer thing which I think maybe we'll
(00:09:53)
get into maybe not but um uh so actually
(00:09:57)
this one's public. So one thing that
(00:09:59)
we're thinking about is okay like we're
(00:10:00)
we're building this human emulator with
(00:10:02)
macro hard. Um
(00:10:04)
how do we deploy it? Because you
(00:10:06)
actually need like if we want to deploy
(00:10:08)
1 million human emulators we need 1
(00:10:10)
million computers. Um how do we do that?
(00:10:13)
and the answer showed up two days later
(00:10:16)
in the form of a Tesla computer because
(00:10:18)
those things are actually very capital
(00:10:20)
efficient as it turns out. And we can
(00:10:22)
run um potentially like our our model
(00:10:25)
and the like full computer that a human
(00:10:28)
would otherwise work at on the Tesla
(00:10:30)
computer for much cheaper than you would
(00:10:32)
in on on a VM on AWS or Oracle or
(00:10:35)
whatever or even just buying hardware
(00:10:38)
from Nvidia. that car computer is
(00:10:40)
actually much more capital efficient and
(00:10:41)
so it enables us to assume that we can
(00:10:45)
deploy much much faster at a much higher
(00:10:47)
scale. Um and so we've adjusted our we
(00:10:50)
adjusted our expectations for that
(00:10:51)
basically.
(00:10:51)
>> Are you basically able to just bootstrap
(00:10:52)
off of the like car network?
(00:10:55)
>> So that's one of the one of the
(00:10:56)
potential uh solutions basically. Yeah.
(00:11:00)
So like okay well we want 1 million VMs.
(00:11:02)
Um there's like 4 million uh Tesla cars
(00:11:05)
in North America alone. Um, and like
(00:11:08)
let's say 2/3 or half of them have
(00:11:10)
hardware 4. Um, and like somewhere
(00:11:14)
between 70 80% 80% of the time they're
(00:11:17)
sitting there idle probably charging. We
(00:11:19)
can just potentially pay and they have,
(00:11:21)
you know, networking, they have cooling,
(00:11:22)
they have power. Um, we can just pay pay
(00:11:25)
owners to lease time off their car and
(00:11:27)
let us run um like a human emulator uh
(00:11:30)
digital Optimus on right on it. and uh
(00:11:33)
they get you know their lease paid for
(00:11:35)
and we get uh a full human emulator we
(00:11:38)
can put to work. Um and that's something
(00:11:40)
without any buildout requirement. It's a
(00:11:43)
purely software implementation that's
(00:11:46)
required. Yeah.
(00:11:46)
>> The the asset is sitting there and you
(00:11:48)
can just go and use it.
(00:11:49)
>> Yeah. Amazing. What for the human
(00:11:51)
emulators uh in macro hard what is the
(00:11:54)
like purpose of that of scaling up you
(00:11:56)
know millions of many humans? Um, I mean
(00:12:00)
the basic con concept is very simple,
(00:12:02)
right? With with Optimus, you're uh
(00:12:04)
taking any physical task a human can do
(00:12:07)
and allowing a robot to do it
(00:12:09)
automatically at a fraction of the cost
(00:12:10)
at 20, you know, with 24/7 uptime. Um,
(00:12:13)
we're doing the same with anything that
(00:12:15)
a human does digitally. So any anything
(00:12:17)
where they need to digitally input uh a
(00:12:19)
keyboard and mouse inputs, which is
(00:12:21)
usually what humans do, um, and look at
(00:12:23)
a screen back and make decisions,
(00:12:25)
>> uh, we just emulate what the human is
(00:12:27)
doing. uh directly. So no adoption from
(00:12:30)
any software is required at all. Um we
(00:12:33)
can deploy in any situation in which a
(00:12:35)
human is in potentially currently. Um
(00:12:38)
>> interesting.
(00:12:38)
>> What is what is that actually going to
(00:12:39)
look like uh for rolling it out?
(00:12:42)
>> Um
(00:12:44)
I don't think we've detailed our plans
(00:12:45)
publicly yet specifically on uh on how
(00:12:49)
we'll roll out. It'll be slowly at first
(00:12:51)
and then very quickly basically like uh
(00:12:55)
like the difference for us given that
(00:12:57)
infrastructure buildout already has
(00:12:59)
happened or we can go on the Tesla
(00:13:01)
network or we can build out our own data
(00:13:03)
center Tesla computers actually. Um
(00:13:06)
the difference for us from from going
(00:13:08)
from 1,000 human emulators to a million
(00:13:10)
is actually not very big. It's not it's
(00:13:13)
not the biggest part of the challenge.
(00:13:14)
Elon, I know one of the things that he
(00:13:16)
does best is he basically just goes from
(00:13:18)
fire to fire on whatever the company is
(00:13:20)
and just kind of like puts it out and
(00:13:21)
unfucks whatever problem is exists.
(00:13:24)
>> Uh what has that been like? What when
(00:13:26)
have you like seen some problem exist
(00:13:28)
and just had it unfucked very rapidly do
(00:13:30)
this kind of process?
(00:13:32)
>> Um definitely on the infra build out
(00:13:34)
this is the biggest. Um on model side
(00:13:37)
we've been like we've had hiccups but
(00:13:40)
>> it's more or less been smooth but on
(00:13:42)
model side especially cuz there's a lot
(00:13:43)
of uh I mean infra side there's a lot of
(00:13:46)
very specific
(00:13:48)
uh operations that each of these
(00:13:52)
basically AS6 these GPUs are are built
(00:13:54)
for and when we roll out new products
(00:13:58)
like when we pick up new products from
(00:13:59)
Nvidia or whoever um not everything
(00:14:02)
works so in some of the meetings that we
(00:14:06)
had with him uh early last year. Uh he
(00:14:09)
would hear these and he would make a
(00:14:11)
phone call and the software team would
(00:14:13)
deliver a patch the next day and we
(00:14:15)
would work like side by side until that
(00:14:16)
was resolved. Um and then we could run a
(00:14:18)
model uh or a train training run uh on
(00:14:21)
the hardware uh very very quickly where
(00:14:23)
otherwise it would have taken weeks of
(00:14:24)
back and forth. So those kind of
(00:14:27)
blockers are usually very quickly
(00:14:29)
resolved with one phone call um or just
(00:14:32)
us bringing it up to him or him just
(00:14:33)
offering like frequently when uh a
(00:14:36)
meeting is ending or there's a lull in
(00:14:38)
in the conversation he'll be like okay
(00:14:40)
how can I help how can I make this
(00:14:41)
faster whatever and someone will come up
(00:14:43)
with with an answer
(00:14:44)
>> I know you guys are doing many different
(00:14:46)
products in parallel and I get that it's
(00:14:48)
kind of like you have to do that but
(00:14:51)
also it's sometimes in most
(00:14:53)
organizations it's like very difficult
(00:14:54)
to stay focused on a single thing and
(00:14:56)
like a single objective. How does that
(00:14:58)
kind of work uh for just executing on
(00:15:01)
multiple different fronts at the same
(00:15:02)
time?
(00:15:03)
>> Very frequently we actually uh and this
(00:15:06)
is increasing with scale. We don't have
(00:15:08)
a full picture until like the all hands
(00:15:10)
or we just chat with people what
(00:15:12)
everyone is doing and how far everyone
(00:15:14)
is on these different projects. Like for
(00:15:17)
example on on on when we did our our our
(00:15:19)
voice model and our voice deployment
(00:15:22)
>> um we actually had a lot of the work
(00:15:24)
built for extremely low latency uh
(00:15:27)
extreme low latency end like uh packets
(00:15:30)
to be sent to the client. it was already
(00:15:32)
built out and um it was a matter of
(00:15:36)
flipping the right switches and the
(00:15:38)
right configs basically to cut our
(00:15:39)
latency pretty significantly um like 2
(00:15:43)
3x uh and end. Um this is actually the
(00:15:47)
case a lot of the time is there is a
(00:15:49)
stupid thing that uh exists somewhere in
(00:15:53)
the software or the hardware and someone
(00:15:55)
has come up with a solution um and you
(00:15:58)
find it when you go to look for it in in
(00:16:01)
our codebase somewhere or you ask around
(00:16:03)
and someone's like oh yeah this XYZ
(00:16:05)
person has done this you should talk to
(00:16:06)
them and they will hook you up. Um
(00:16:09)
there's not a lot of time spent syncing
(00:16:11)
up with anyone or asking for permission
(00:16:13)
or
(00:16:14)
um waiting for anyone at all. Like the
(00:16:17)
answer is like when you propose someone
(00:16:19)
someone says a good idea. Like usually
(00:16:21)
you propose something and the the answer
(00:16:23)
is either no that's dumb or why isn't it
(00:16:25)
done already? [laughter and gasps]
(00:16:26)
Like
(00:16:28)
um and then you go and do it and then
(00:16:30)
it's done. With Elon companies, you can
(00:16:32)
kind of just ask for responsibility and
(00:16:34)
then you basically just live by the
(00:16:35)
sword, die by the sword, and if you get
(00:16:37)
things done, then you can just ask for
(00:16:38)
more responsibility and you can keep on
(00:16:40)
doing that or you're just like out.
(00:16:41)
What's been your experience like with
(00:16:43)
that?
(00:16:43)
>> Very much so. Yeah, like um I've jumped
(00:16:47)
around a lot of different projects and
(00:16:48)
mostly just because someone asked for my
(00:16:49)
help and I kept helping and then I ended
(00:16:53)
up owning some of the stack or a lot of
(00:16:55)
the stack. Um and this is the case for
(00:16:57)
everyone like this is just how it is. um
(00:16:59)
if you have any particular experience or
(00:17:02)
um can iterate on something very quickly
(00:17:05)
within days you own that component. Um
(00:17:08)
yeah there's no formal anything I think
(00:17:10)
officially on our HR software I I'm on
(00:17:13)
voice and iOS or something and our
(00:17:17)
security software thinks I still work on
(00:17:20)
RX integration and um
(00:17:23)
>> which never updated.
(00:17:24)
>> Yeah. No, no one ever updates this stuff
(00:17:25)
like um it's kind of ridiculous. And is
(00:17:28)
has has your like journey at the company
(00:17:31)
kind of been you show up there's not
(00:17:33)
exactly like a clear direction of what
(00:17:35)
you're going to work on and then you
(00:17:36)
just start working on stuff and then you
(00:17:38)
just kind of like hop from project to
(00:17:40)
project by whoever asks for your help.
(00:17:42)
>> There's a bit Yeah, there's quite a bit
(00:17:44)
of like overlap and flowing.
(00:17:46)
>> Um so like after onboarding I'm usually
(00:17:49)
on two or three projects at once. Um,
(00:17:52)
and whichever one is most pressing or I
(00:17:54)
can help the most on ends up taking
(00:17:56)
majority of my time and then that kind
(00:17:58)
of overlaps and flows in like a
(00:18:00)
waterfall way.
(00:18:01)
>> What's been the journey from like the
(00:18:03)
starting to to now? Like what what
(00:18:05)
projects have you worked on?
(00:18:07)
>> Yeah, so specifically I started um I
(00:18:10)
first worked on like ASRock uh and our
(00:18:13)
integration there and I worked with our
(00:18:14)
backend team a bit on like reliability
(00:18:16)
and scaling up because we were scaling
(00:18:18)
up a lot at that time. Uh and then after
(00:18:21)
that I took on solo building up our our
(00:18:24)
desktop suite. Um and took that went to
(00:18:29)
internal completion. Uh and then I got
(00:18:32)
asked for help on our imagine roll out
(00:18:34)
and iOS which yeah our iOS team is small
(00:18:38)
for like how many people use it. Like
(00:18:41)
it's ridiculous. You won't guess the
(00:18:42)
number.
(00:18:44)
Um
(00:18:45)
>> like five people for three. It was three
(00:18:49)
and I was the third person at the time
(00:18:51)
when we were rolling that out. It was
(00:18:52)
like it was ridiculous and everyone's
(00:18:54)
like really really good. Um yeah, this
(00:18:57)
is the first place where I've had to
(00:18:58)
work very hard to keep up really
(00:19:00)
>> with like the the speed and the talent.
(00:19:03)
What was the first uh experience that
(00:19:05)
you had where you thought to yourself
(00:19:07)
like you're actually being kind of used
(00:19:10)
to your full, you know, potential? And
(00:19:12)
>> I think that imagine roll out was
(00:19:13)
definitely like it was a really good
(00:19:15)
push cuz like we had this 24-hour
(00:19:17)
iteration cycle. Um you all would get
(00:19:19)
feedback every night on whatever we were
(00:19:21)
doing. Um and yeah, we we would push out
(00:19:24)
that night. Uh in the morning we would
(00:19:26)
have all the feedback. We would
(00:19:27)
immediately knock out all the bugs. um
(00:19:29)
implement the new stuff that that people
(00:19:30)
were asking for. Whatever model had come
(00:19:33)
up with, we implemented that too. Like
(00:19:35)
it was a very very fast cycle and it was
(00:19:38)
uh I think it was the longest like
(00:19:40)
continuous stretch of me being in the
(00:19:42)
office like every day.
(00:19:44)
>> What was that like at the time?
(00:19:45)
>> It was like two or three months.
(00:19:46)
>> Two or three times. Yeah. Yeah. Okay.
(00:19:48)
>> Um yeah, like there weren't weekends for
(00:19:50)
a while, which was uh it was good to
(00:19:52)
know that I could do that and I was
(00:19:53)
pretty happy doing that. Um,
(00:19:56)
and after that I got pulled onto Macro
(00:19:59)
hard product which was just one other
(00:20:01)
person at the time. So it was the two of
(00:20:02)
us uh for a while and I've been on that
(00:20:04)
since uh since that project off
(00:20:06)
basically.
(00:20:07)
>> I don't know how much you like know
(00:20:08)
about this but uh the like Colossus
(00:20:11)
build and all the ridiculous stuff that
(00:20:15)
the like early XAI team had to do to
(00:20:17)
turn on Colossus and like get power and
(00:20:20)
all the necessary inputs to making that
(00:20:22)
work. And even today, I think like it's
(00:20:23)
just bottlenecks across the entire
(00:20:25)
thing. You just want more you you want
(00:20:27)
more like uh chips and GPUs and all the
(00:20:29)
stuff working
(00:20:30)
>> and faster. Um
(00:20:32)
>> what was that like?
(00:20:34)
>> There's a lot of war stories um and a
(00:20:36)
lot of bets. Um
(00:20:38)
>> want to go into a few?
(00:20:40)
>> Yeah. So I think Tyler was took this bet
(00:20:44)
uh with Elon like uh one we were setting
(00:20:47)
up new racks I think of I forget what
(00:20:49)
which GPUs we were rolling out at that
(00:20:51)
time. Um, we took a bet. Uh, Elan's
(00:20:54)
like, "Okay, you get a cybert truck
(00:20:55)
tonight if you can get a training run on
(00:20:56)
these GPUs uh in 24 hours."
(00:21:00)
Uh, and we were training that night. Um,
(00:21:02)
>> did he get the cyber?
(00:21:03)
>> Yeah, he got [laughter]
(00:21:05)
>> I think it's Yeah, I see it from our
(00:21:08)
lunch window.
(00:21:09)
>> Mhm.
(00:21:09)
>> Cafeter.
(00:21:10)
Yeah, he's cool. Um uh you know what the
(00:21:14)
I so for power we actually have we have
(00:21:16)
to collaborate very tightly with the
(00:21:19)
like municipal uh and state power
(00:21:22)
companies uh because when load goes high
(00:21:25)
on their end we have to shut off and go
(00:21:27)
fully on the like 80 or maybe it's more
(00:21:29)
than that I think more more than that 80
(00:21:32)
uh mobile generators we brought in on
(00:21:34)
trucks um and go fully on on those um
(00:21:38)
just so that we don't like impact power
(00:21:40)
uh anywhere. are like within and we have
(00:21:42)
to do that like seamlessly without
(00:21:44)
interrupting anyone's uh extremely
(00:21:46)
volatile training runs uh on extremely
(00:21:48)
volatile uh you know GPUs and and
(00:21:50)
hardware which scales up and down by
(00:21:53)
like megawws in milliseconds. It's it's
(00:21:57)
a lot. Um,
(00:21:58)
>> is that also part of the logic of like
(00:22:00)
basically putting massive battery packs
(00:22:02)
right next to the uh desenters cuz then
(00:22:04)
you can kind of like go up and down much
(00:22:06)
faster without
(00:22:07)
>> batteries can scale up a lot uh scale up
(00:22:09)
and down and uh balance that load a lot
(00:22:12)
faster. Um cuz with a generator you're
(00:22:14)
literally asking a physical thing to
(00:22:16)
speed up or or slow down like a spinning
(00:22:19)
spinning physical thing that's obviously
(00:22:21)
just going to take a certain amount of
(00:22:22)
time. the batteries can uh react to the
(00:22:24)
light much much faster and then yeah
(00:22:26)
it's like actually from the phys from
(00:22:30)
physical standpoint I think there's the
(00:22:32)
uh local capacitors the station like
(00:22:35)
data hall side capacitors the batteries
(00:22:37)
and then generators and then the public
(00:22:40)
municipalities although we might have
(00:22:41)
changed that infrastructure at this
(00:22:42)
point things very quickly especially on
(00:22:45)
the cooling side
(00:22:46)
>> do you have any other really good like
(00:22:47)
war stories that are just like uh I
(00:22:50)
don't know things that shouldn't have
(00:22:51)
been possible that became possible
(00:22:53)
Uh, so the the lease for the land itself
(00:22:56)
was actually technically temporary. It
(00:22:58)
was the fastest way to get the
(00:23:00)
permitting through and actually start
(00:23:01)
building things. Um, I assume that it'll
(00:23:04)
be permanent at some point, but yeah,
(00:23:05)
it's I think a very short-term lease at
(00:23:07)
the moment technically for all the data
(00:23:08)
centers. It's fastest way to get things
(00:23:10)
done.
(00:23:10)
>> And how do they how do they do that? Um
(00:23:12)
I think there's basically a special
(00:23:13)
exception uh within like the local and
(00:23:15)
state government says okay if you want
(00:23:17)
to just uh modify this ground
(00:23:20)
temporarily I think it's like for like
(00:23:22)
uh carnivals and [laughter] stuff you
(00:23:24)
can
(00:23:24)
>> Xi is actually just a carnival company
(00:23:27)
>> currently
(00:23:29)
[laughter]
(00:23:29)
>> and so that was the way to get done
(00:23:32)
quickly I mean it was done yeah 122 days
(00:23:35)
>> for like internal planning I know things
(00:23:38)
are just going to keep on scaling up
(00:23:39)
like crazy and Elon's talked about
(00:23:41)
energy being the biggest bottleneck and
(00:23:43)
then you know just being able to get
(00:23:45)
chips. Um how do you guys plan when it's
(00:23:49)
very difficult to like predict 12 to 24
(00:23:52)
months in the future exactly what
(00:23:53)
projects you're going to be working on
(00:23:54)
or what their like resource requirements
(00:23:56)
are going to be.
(00:23:57)
>> We try we try very hard to work
(00:23:59)
backwards from like what's the highest
(00:24:00)
leveraged thing we can be doing and then
(00:24:03)
we determine the physical requirements
(00:24:05)
later. So like
(00:24:06)
>> if we want to get to 10 or hundred
(00:24:08)
billion in revenue by this date, uh what
(00:24:11)
are the highest leverage things we can
(00:24:12)
do like from an econ economic
(00:24:15)
perspective? How can we actually build
(00:24:17)
systems to do that? And then what does
(00:24:19)
it take on the physical and software
(00:24:21)
side to roll that out and and get it
(00:24:23)
done? Um just roll down roll backwards
(00:24:25)
the whole way. So we don't usually start
(00:24:27)
with the with the physical requirement.
(00:24:31)
That's usually actually at the end. Is
(00:24:32)
there like a SpaceXesque um like
(00:24:35)
algorithm for making things happen?
(00:24:37)
>> As in like the usual delete?
(00:24:39)
>> Yeah.
(00:24:40)
>> Yeah. I mean that's the case all the
(00:24:42)
time. Um and we do do the thing where
(00:24:46)
Yeah. We delete something and then add
(00:24:47)
it back later. Um
(00:24:49)
>> what was the like last time that you did
(00:24:51)
that?
(00:24:51)
>> Today.
(00:24:52)
>> Today. [laughter]
(00:24:54)
>> Um today. Yeah. So with macro hard we
(00:24:56)
deploy on um a lot of like physical
(00:24:59)
hardware that changes and um the testing
(00:25:02)
harness for that is hard. Um so we try
(00:25:06)
to minimize how much how many special
(00:25:09)
cases are downstream of where it needs
(00:25:11)
to be. And um for example like with
(00:25:14)
display scaling um we need to be able to
(00:25:16)
support displays that are you know 30
(00:25:19)
years old as well as the latest like 5K
(00:25:22)
Apple whatever displays and that has to
(00:25:24)
happen on the same stack. Um, turns out
(00:25:27)
not all the systems are happy with that
(00:25:28)
at all times. Like you have to you have
(00:25:31)
to fiddle with the encoders at a certain
(00:25:32)
level. Like uh video encoders um was
(00:25:35)
[clears throat] the specific thing
(00:25:35)
basically we I didn't know but uh as it
(00:25:39)
turns out there are limits to the
(00:25:41)
maximum amount of pixels that certain
(00:25:42)
encoders can take. So we have to now
(00:25:44)
have I removed this special case for
(00:25:46)
multiple encoders and turns out we found
(00:25:48)
a problem at at plus 5K resolution and
(00:25:50)
so we added that back. What are the most
(00:25:52)
interesting things about XAI itself um
(00:25:55)
that you think like would be really good
(00:25:57)
stuff to talk about?
(00:25:58)
>> There's a lot of characters that work
(00:25:59)
there and also we're doing hiring in
(00:26:02)
like interesting ways I guess. Um like
(00:26:05)
things that I thought would be stupid
(00:26:06)
are okayed and we just do them and we
(00:26:08)
try them and it's like we we'll do a
(00:26:10)
hackathon and if we get five people in
(00:26:12)
as a result it's worth it. um because
(00:26:15)
just their like expected return on on
(00:26:17)
the company's like revenue or valuation
(00:26:20)
is higher than the cost of running this
(00:26:22)
hackathon for 500 people. Um like the
(00:26:25)
verhead value is actually very high
(00:26:27)
which is like funny. We did the math um
(00:26:32)
earlier this week. Uh right now we're
(00:26:35)
like I think at about $2.5 million per
(00:26:37)
commit is to to the main to the main
(00:26:40)
repo. Um and I did five today. So
(00:26:43)
>> you added like 12 and a half million of
(00:26:45)
value. [laughter]
(00:26:46)
>> Um
(00:26:47)
>> light day light days.
(00:26:48)
>> Exactly. It was a good day. [laughter]
(00:26:51)
Um it's funny things like that. Um like
(00:26:55)
the levers are are extremely strong.
(00:26:59)
Like you you can get a lot a lot done
(00:27:01)
with a lot less effort and time than you
(00:27:04)
used to be able to for sure just because
(00:27:06)
of who you work with, the internal
(00:27:08)
tooling that we built up. Um, and my
(00:27:12)
boss.
(00:27:13)
>> What's like an example of the type of
(00:27:15)
person that like wants to work here? Cuz
(00:27:17)
I know when when you're talking about
(00:27:19)
it, you kind of show up and the first
(00:27:20)
day you're just like, I want to work on
(00:27:22)
the weekends. I want to work on, you
(00:27:23)
know, during the night, all this stuff.
(00:27:25)
Uh, go all in on this. Um, what kind of
(00:27:27)
special characters are are working
(00:27:29)
there?
(00:27:30)
>> People are definitely very enthusiastic
(00:27:31)
when they come in. Like, um, very very
(00:27:35)
enthusiastic.
(00:27:37)
uh just like
(00:27:38)
>> like mission oriented.
(00:27:40)
>> Um there's I guess different types of
(00:27:43)
ambition for sure. Some people want to
(00:27:44)
move up like the leadership ladder and
(00:27:46)
own more in terms of a managerial like
(00:27:49)
how many people report to me sense. Some
(00:27:51)
people want to own huge parts of the
(00:27:53)
technical stack. So like right now we're
(00:27:55)
doing a big rebuild um of like our core
(00:27:59)
uh production APIs. It's being done by
(00:28:02)
one person with like 20 agents. Um, and
(00:28:05)
they're they're very good and they're
(00:28:07)
capable of doing it and um, like it's
(00:28:10)
working well. Uh, so you can own huge
(00:28:14)
chunks of the code base, no problem.
(00:28:17)
>> It's kind of like a X where like after
(00:28:19)
the acquisition they like had, you know,
(00:28:23)
much fewer people, but you just like
(00:28:24)
never had a lot of people in the first
(00:28:26)
place, so there's one person like owning
(00:28:27)
a huge part of the product.
(00:28:29)
>> Absolutely. for hiring. Um what's what
(00:28:32)
unusual practices outside of just
(00:28:34)
hackathons uh does XI do?
(00:28:37)
>> Uh so we're pushing very hard on Macro
(00:28:39)
hard. Like for two or three weeks I was
(00:28:41)
doing upwards of 20 interviews a week.
(00:28:44)
So that's like some of them are like
(00:28:46)
quick 15 minutes. Some of them are full
(00:28:47)
1 hour technicals. So a lot of my time
(00:28:50)
uh is dedicated towards bringing in new
(00:28:52)
people and a lot of people are very
(00:28:54)
good. So it's it's actually very hard to
(00:28:56)
judge them. How do you
(00:28:58)
>> uh I have a very specific problem that I
(00:29:01)
have solved. I'm not going to reveal it
(00:29:02)
because then people will use it. But I
(00:29:05)
have solved a very specific computer
(00:29:07)
vision problem a few years ago for one
(00:29:09)
of my startups and I uh I give people
(00:29:11)
half an hour to try to implement the
(00:29:13)
solution. It's actually very very
(00:29:14)
simple. This deceptively simple
(00:29:16)
solution. People always overthink it. Um
(00:29:19)
and this is something I like to index
(00:29:21)
for on my team especially is like can
(00:29:23)
you not overthink it and come up with a
(00:29:25)
simple solution? Um it helps a lot
(00:29:28)
because we're deploying on such a wide
(00:29:32)
variety of like
(00:29:34)
uh on a wide variety of hardware as a
(00:29:36)
result of the wide variety of of
(00:29:37)
customers like literally 30 years 40
(00:29:39)
years of uh different hardware,
(00:29:41)
different operating systems, everything
(00:29:43)
like that. You have to come up with
(00:29:45)
simple solutions or you're going to have
(00:29:46)
a 10 million line code base uh next
(00:29:49)
week. So you you this is like very
(00:29:53)
important. Um and especially now relying
(00:29:56)
more and more on on agents and and an AI
(00:29:59)
and and such for writing code. Um
(00:30:03)
an AI will happily train out 200 lines
(00:30:06)
when a 10line solution will do um and
(00:30:08)
probably do better. So you have to look
(00:30:10)
for that. Like I want people and I look
(00:30:12)
and actively hire for people who can
(00:30:14)
find the 10line solution first. Um,
(00:30:18)
we're totally fine with people using AI
(00:30:19)
to code things. Like you should you
(00:30:21)
should use that as a force multiplier,
(00:30:23)
but uh for now we're smarter. We'll see
(00:30:27)
next year.
(00:30:27)
>> What other like force multipliers do you
(00:30:29)
kind of like look for?
(00:30:30)
>> I like people who will challenge uh
(00:30:34)
challenge requirements and challenge me.
(00:30:36)
So often uh I got this from uh Chester
(00:30:40)
Zai German for he told me this and I
(00:30:42)
thought it was great. He throws in
(00:30:43)
usually um an incorrect requirement or
(00:30:46)
question or an impossible like uh line
(00:30:49)
in uh his challenges for people when
(00:30:52)
he's hiring like coding challenges and
(00:30:54)
he expects people to come back and say
(00:30:57)
like hey this is wrong this is not
(00:30:58)
possible you made a mistake and if he
(00:31:01)
doesn't then uh he doesn't hire them
(00:31:03)
same thing for me I picked that up it's
(00:31:05)
a great idea
(00:31:06)
>> the pace is insanely fast and like you
(00:31:09)
said you kind of have worked on a number
(00:31:11)
of different things How do you kind of
(00:31:13)
come up to speed on something as quickly
(00:31:15)
as possible when you're on a new task or
(00:31:17)
project?
(00:31:17)
>> It depends on what thing it is. If
(00:31:19)
there's a lot of code to read,
(00:31:22)
>> yeah,
(00:31:22)
>> read the code
(00:31:24)
>> by hand. Um like GD go to definition
(00:31:27)
over and over again and you'll find
(00:31:29)
things out very quickly. Actually, it's
(00:31:31)
not that hard. Um for most things, the
(00:31:34)
implementation is like less lines of
(00:31:37)
code than you would otherwise see, which
(00:31:38)
is nice. um not all the time, but in
(00:31:40)
most cases, if it's something that's in
(00:31:42)
very active development, this is not the
(00:31:43)
case. There's going to be 20 different
(00:31:45)
versions of it going at the same time,
(00:31:46)
and it's not obvious what is the current
(00:31:48)
path. So, you just got to talk to
(00:31:49)
people, and people are very open. Like,
(00:31:51)
this is actually one of the things I was
(00:31:52)
very surprised by, uh pleasantly
(00:31:54)
surprised by when I joined is I thought
(00:31:55)
people would be super smart and stuck
(00:31:56)
up, but no, people are just super smart
(00:31:58)
and very nice and helpful. Like,
(00:31:59)
everyone's on the same team, everyone's
(00:32:01)
rooting for each other, people are
(00:32:02)
willing to like help you out um and
(00:32:04)
answer your questions. So, which is good
(00:32:06)
because we don't like write a lot of
(00:32:08)
docs. We write things. We do things too
(00:32:10)
fast to write docs really. Um, actually,
(00:32:14)
yeah, we're trying to figure out some
(00:32:15)
systems on on my team to like
(00:32:16)
automatically generate docs as we like
(00:32:18)
build stuff. Um, and with Grock, which
(00:32:23)
is cool that we have unlimited access to
(00:32:25)
uh very smart AI because then we can try
(00:32:27)
a bunch of stupid things, see if it
(00:32:29)
works, which otherwise, you know, at a
(00:32:31)
startup would cost you maybe like$100 or
(00:32:32)
a million dollar 100k or a million
(00:32:33)
dollars in in credits or whatever. we do
(00:32:35)
it for free. So experimentation like
(00:32:38)
failure you can fail on a lot of things
(00:32:39)
and it uh a lot more things you
(00:32:41)
otherwise would um and as a result more
(00:32:44)
experiments are tried um more uh succeed
(00:32:47)
>> on the like experimentation side how are
(00:32:50)
you guys kind of like trying to maximize
(00:32:51)
for the number of experiments or like
(00:32:53)
good shots on goal uh that you can do.
(00:32:55)
>> There's often like uh a time time
(00:32:58)
constraint. We will frequently launch
(00:33:01)
multiple experiments especially on the
(00:33:02)
model side at the same time and in some
(00:33:06)
cases it's not even because of a time
(00:33:07)
constraint necessarily in terms of like
(00:33:10)
I need to try x amount of things in y
(00:33:12)
time it's in two weeks this prerequisite
(00:33:17)
will be ready either in the hardware or
(00:33:19)
in the training data or something but in
(00:33:22)
the meantime I need to deploy something
(00:33:23)
today
(00:33:25)
uh what can I do and so you run two
(00:33:27)
three experiments and you find out what
(00:33:28)
you can deploy day um and bring in
(00:33:30)
revenue or customer result whatever it
(00:33:32)
is today and then two weeks you switch
(00:33:34)
over um like that's something we do all
(00:33:37)
the time especially at macro hard.
(00:33:39)
>> Have you uh seen anything where a
(00:33:40)
timeline should have been much longer on
(00:33:42)
like a project that you were working on
(00:33:44)
um and somehow you guys were able to
(00:33:46)
kind of like bring that in by you know
(00:33:47)
weeks or months
(00:33:48)
>> all the time.
(00:33:50)
Every time uh all the time every time we
(00:33:53)
come come away from like an EL meeting
(00:33:54)
or something internal where um someone
(00:33:59)
pushes hard to get something done or
(00:34:01)
someone external who doesn't isn't
(00:34:03)
responsible for the thing asks for a
(00:34:05)
requirement ask for something to be done
(00:34:07)
in an what we originally think
(00:34:08)
unreasonable amount of time you know we
(00:34:10)
spend two minutes like thinking about it
(00:34:12)
complaining maybe a bit uh and then the
(00:34:14)
rest of the time is dedicated to getting
(00:34:16)
it done in that time. Um yeah,
(00:34:19)
frequently the estimated time to get
(00:34:23)
something done all the time to get
(00:34:25)
something done is based on some set of
(00:34:28)
assumptions.
(00:34:29)
>> And then once you get this timeline
(00:34:32)
that's like half or onetenth of what you
(00:34:34)
would have otherwise done. You look at
(00:34:35)
the assumptions say okay proportionally
(00:34:37)
how much is this impacting my my
(00:34:39)
timeline? And then you knock it out or
(00:34:41)
you change it and then suddenly you get
(00:34:43)
a 2x improvement in your timeline. You
(00:34:45)
do that a few times. you can meet
(00:34:47)
whatever requirement you really want.
(00:34:49)
Um, yeah, at a certain point you get to
(00:34:51)
the physical limitations, but you're
(00:34:53)
never there um from the start.
(00:34:55)
>> So,
(00:34:56)
>> I know for like full self-driving um and
(00:34:59)
same thing with the the rockets of
(00:35:01)
SpaceX, the Elon timeline was
(00:35:04)
significantly longer. like an Elon
(00:35:07)
[snorts] time might be a quarter or half
(00:35:09)
of what it actually eventually takes,
(00:35:12)
but then it also, you know, happens four
(00:35:13)
times faster because of the initial
(00:35:15)
timeline. Is it more or less like at XAI
(00:35:19)
because it's more I mean, I guess more
(00:35:21)
on the software side now. Um, but even
(00:35:23)
on the data center side, things seem to
(00:35:25)
be happening just way way way faster.
(00:35:27)
Uh, and they also seem to be happening
(00:35:28)
on like the same timeline as he's
(00:35:30)
roughly saying. He's like this is going
(00:35:32)
to happen roughly you know this number
(00:35:33)
of months in the future and then it
(00:35:35)
actually does.
(00:35:35)
>> I think he himself has calibrated his
(00:35:37)
timelines
(00:35:38)
>> like differently over
(00:35:39)
>> Yeah. now that he's deployed a number of
(00:35:42)
extremely like a wide variety of um
(00:35:46)
deployed hardware at scale.
(00:35:48)
>> So I think his own estimates for things
(00:35:51)
are definitely a lot better. And so uh
(00:35:53)
that's definely the case. I think he
(00:35:55)
also updates his timelines faster now
(00:35:57)
too like um sometimes daily. I think he
(00:36:01)
he he's talking with us and figures out
(00:36:05)
what the update on the timeline should
(00:36:06)
be based on various parameters and
(00:36:08)
sometimes they come from him too right
(00:36:10)
um especially on the infrastructure side
(00:36:12)
uh if a deal or um we can be put up in a
(00:36:15)
batch for for the production of a
(00:36:17)
certain chip um well we can save a month
(00:36:20)
or two maybe um maybe even more than
(00:36:23)
that depends on what the deployment is
(00:36:25)
specifically and then on the software
(00:36:26)
side it's the same he always says is
(00:36:29)
like you can always attempt to do
(00:36:30)
something, you know, in one month that
(00:36:32)
would otherwise take a year and you'll
(00:36:34)
probably get it done maybe in two. Um,
(00:36:37)
still a lot faster.
(00:36:39)
>> I remember in the like early days of uh
(00:36:42)
SpaceX there was this internal I think
(00:36:45)
Elon would say internally like every day
(00:36:47)
that we delay is like 10 million in loss
(00:36:49)
revenue and I have no idea what it would
(00:36:51)
be like for XAI like things are moving
(00:36:54)
so fast. It's like is there kind of an
(00:36:56)
internal thing in your head of every day
(00:36:59)
that we don't like push push hard or
(00:37:01)
make something happen um we're losing
(00:37:04)
out on x amount of value that could be
(00:37:06)
created.
(00:37:07)
>> Yeah, for for macroart specifically, we
(00:37:09)
do have a few pretty specific revenue
(00:37:12)
targets. I can't delete the number
(00:37:14)
specifically, but um like in my head
(00:37:16)
whenever something gets delayed or
(00:37:18)
accelerated, I can pretty quickly
(00:37:20)
calculate how much money we just made or
(00:37:22)
lost. um
(00:37:23)
>> just wild swings. You just
(00:37:25)
>> Yeah, I mean the numbers are huge
(00:37:28)
[laughter]
(00:37:29)
uh just because the expected return is
(00:37:31)
so huge and um the timeline is so fast.
(00:37:34)
So a few days is actually
(00:37:36)
proportionately fairly large compared to
(00:37:39)
how much you would you would otherwise
(00:37:41)
expect the revenue to be.
(00:37:42)
>> Elon's like famous for making really
(00:37:44)
really big bets pretty quickly. uh like
(00:37:46)
what's the biggest decision that's been
(00:37:47)
made in a single meeting where like huge
(00:37:49)
huge amounts of uh capital or time or
(00:37:52)
commitment were done?
(00:37:54)
>> Um I think one of them was certainly the
(00:37:59)
decision to go with a model that would
(00:38:01)
be at least 1.5 times faster than a
(00:38:04)
human for
(00:38:05)
>> macro hard looking like significantly
(00:38:08)
faster than that. 8x maybe maybe more.
(00:38:11)
Um the like for other human emulator
(00:38:16)
type attempts in the other labs the
(00:38:18)
approach has been let's do more
(00:38:19)
reasoning and build a bigger model.
(00:38:22)
We've like that decision put us in
(00:38:24)
totally the opposite track of what
(00:38:26)
everyone else is doing. And everything
(00:38:29)
that we're doing really is downstream of
(00:38:30)
that like a well not everything but a
(00:38:32)
pretty much everything. Um it impacted
(00:38:35)
and it was very early on uh that this
(00:38:38)
was decided. It was sort of expected
(00:38:40)
also. um that this is the move,
(00:38:42)
especially given the analog to full
(00:38:44)
self-driving. Um
(00:38:47)
no one's going to wait around uh 10
(00:38:49)
minutes for the computer to do something
(00:38:50)
that I could have done in five, but if
(00:38:52)
it can be done in 10 seconds, well, I'd
(00:38:54)
be happy to pay whatever amount of money
(00:38:56)
for that. Um it's just obvious really.
(00:39:00)
So normally like us engineers would you
(00:39:03)
know if it's would push back and say oh
(00:39:06)
you know here's the 20 different reasons
(00:39:08)
uh that it needs to be this way um but
(00:39:10)
if a decision is made and you work
(00:39:12)
backwards
(00:39:13)
then life finds a way.
(00:39:16)
>> I remember Elon saying uh I think it was
(00:39:18)
at the like Y cominator uh he was doing
(00:39:21)
a like Q&A with Gary Tan and Gary talked
(00:39:24)
about like AI researchers and he was
(00:39:26)
like no they're just all AI engineers
(00:39:28)
now. Yeah, we did the this was someone
(00:39:32)
said that in one of the meetings um we
(00:39:35)
did with him uh talking about recruiting
(00:39:37)
like here was here's the job
(00:39:38)
descriptions or something like that and
(00:39:40)
like for 10 minutes he just goes on
(00:39:42)
engineers just engineers doesn't matter
(00:39:44)
good engineers engineers just someone
(00:39:46)
who's fundamentally a problem solver
(00:39:48)
doesn't matter if they did like this you
(00:39:49)
know XY thing and this infrastructure or
(00:39:51)
this you know particular architecture or
(00:39:54)
whatever engineers
(00:39:56)
>> why is it so important why is that
(00:39:57)
definition so important
(00:39:58)
Um
(00:40:00)
it's keeps things broad. It means that
(00:40:02)
people can come in to us from like a
(00:40:06)
extremely wide variety of places and
(00:40:10)
this has been the case. I mean, there is
(00:40:12)
I think less so in the AI world, but I
(00:40:15)
think there's a lot of SpaceX stories
(00:40:17)
where people came in from strange walks
(00:40:20)
of life that would not have otherwise
(00:40:21)
seemed to be the case and then ended up
(00:40:23)
doing huge things at SpaceX in the
(00:40:26)
engineering world as a result. So,
(00:40:28)
keeping it broad means that those people
(00:40:31)
can have a path to us and uh and help us
(00:40:36)
accelerate. For you personally, what's
(00:40:38)
the most like fun thing about working
(00:40:40)
there dayto-day?
(00:40:41)
>> No one tells me no.
(00:40:42)
>> No one tells me no.
(00:40:43)
>> No one tells me no. Um yeah, if I like
(00:40:47)
have a good idea, I can usually go and
(00:40:49)
implement it that same day and show it
(00:40:50)
off and we'll see if uh if it makes
(00:40:53)
sense. We we'll we'll we'll run whatever
(00:40:55)
eval or um show it to a customer or show
(00:40:59)
it to Elon or whoever and we'll get an
(00:41:02)
answer usually that same day uh as to
(00:41:04)
whether or not that was the right move.
(00:41:05)
There's no deliberation. There's no
(00:41:07)
waiting for any bureaucracy. Uh I like
(00:41:09)
that a lot. I was expecting to sacrifice
(00:41:11)
some amount of this coming from
(00:41:13)
extremely small startups to a larger
(00:41:15)
company. Like I guess joining at 100
(00:41:18)
people, I [laughter] mean to me it was
(00:41:20)
like a 10x leap of anywhere else I've
(00:41:22)
been. But uh I guess relatively to loan
(00:41:25)
companies is pretty small and it does
(00:41:26)
feel very small. Um there's not a lot of
(00:41:29)
overhead in anything.
(00:41:30)
>> Did you have any other like big
(00:41:31)
assumptions going in that proved like
(00:41:33)
completely wrong? I thought there would
(00:41:36)
be more top down. Um, and there's some,
(00:41:39)
but not really that much.
(00:41:40)
>> Um, especially because of how many
(00:41:42)
there's basically only three layers of
(00:41:45)
management. There's um like IC's uh
(00:41:48)
there's the co-founders and some of the
(00:41:50)
new managers and then Elon and that's
(00:41:52)
it. And so because there's so many
(00:41:54)
reports to the managers now, um
(00:41:58)
nothing really comes from them top down
(00:42:00)
like we'll usually come up with a
(00:42:01)
solution. They're okay. Elon okay is
(00:42:05)
we're good if there's feedback then we
(00:42:07)
update but
(00:42:08)
>> it's a lot more bottom up than I
(00:42:09)
expected
(00:42:10)
>> like trying to be designed so that
(00:42:12)
everyone is like building things and
(00:42:13)
there there was like fewer manager
(00:42:15)
managers and mo more just like builders
(00:42:18)
>> uh there's yeah when I joined I think
(00:42:20)
every manager also wrote code um and I
(00:42:23)
think largely today they still do um not
(00:42:26)
as much now that some of them have you
(00:42:29)
know 100 plus people reporting to them
(00:42:31)
But
(00:42:33)
everyone's an engineer. I remember
(00:42:34)
actually on my first week um I sat down
(00:42:37)
for dinner and this guy sits next to me
(00:42:38)
and I asked, "Hey, you know, like what
(00:42:40)
what team you on? How you going? How you
(00:42:41)
doing? I I just joined." And he tells
(00:42:43)
me, "Oh, I'm on on sales uh and like
(00:42:46)
enterprise deals." And I was like, "Oh,
(00:42:48)
I don't want to talk to this guy. He's a
(00:42:49)
sales guy." And then he starts telling
(00:42:51)
me about this model he's training on the
(00:42:54)
he's an engineer, too. The sales team is
(00:42:56)
an engineer. Are all engineers. Everyone
(00:42:58)
is an engineer. Uh I think at the time
(00:43:00)
there was probably less than like eight
(00:43:01)
people who were not engineers at the
(00:43:03)
company um in some capacity and even
(00:43:06)
then like yeah it was really cool.
(00:43:09)
Everyone
(00:43:11)
everyone contributes to the machine.
(00:43:13)
>> Is it a little bit more like you have a
(00:43:16)
single person working on some project
(00:43:17)
and they just you know if you're an
(00:43:19)
engineer and you're working on the thing
(00:43:20)
you can have a much closer relationship
(00:43:22)
to the customer and like understanding
(00:43:23)
their problem and then rapidly like
(00:43:24)
implementing solutions and stuff.
(00:43:26)
>> Yeah.
(00:43:27)
>> Yeah. um the less layers you have, the
(00:43:29)
less information is lost. Um there's
(00:43:31)
less compression basically. Uh because
(00:43:34)
you have to communicate less times and
(00:43:37)
language is lossy compared to what's
(00:43:39)
going on in your brain. Um so if you
(00:43:42)
have to go from customer's brain to
(00:43:45)
words to, you know, salesperson's brain
(00:43:48)
to words to manager, every layer, you're
(00:43:51)
losing
(00:43:52)
>> a huge amount of information. Yeah. And
(00:43:54)
so if you can cut as many layers as
(00:43:57)
possible, then you've only got one
(00:43:59)
compression step of the customer telling
(00:44:01)
you what to do or what they want and
(00:44:03)
what or what their experience is or
(00:44:05)
whatever and you as the engineer can
(00:44:06)
solve it directly.
(00:44:07)
>> Is there anything like specific that
(00:44:09)
you've never heard of or seen at any
(00:44:11)
other company that XAI does where that
(00:44:14)
allows things to just happen way faster?
(00:44:16)
The fuzziness definitely between teams
(00:44:20)
and what everyone is responsible for is
(00:44:23)
definitely not what I expected and I
(00:44:24)
don't think exists nearly as much in any
(00:44:27)
large company or even remotely similar
(00:44:30)
similarly sized company. Like if I need
(00:44:33)
to fix something on our VM
(00:44:34)
infrastructure,
(00:44:36)
uh I will do it. I will show it to the
(00:44:37)
guy who owns that and they will be like
(00:44:39)
okay and it's merged immediately and
(00:44:41)
deployed. like uh there's not a lot of
(00:44:45)
strict regiment.
(00:44:47)
>> Mhm.
(00:44:47)
>> Um like everyone is allowed to update
(00:44:51)
everything and there's some checks for
(00:44:53)
dangerous things but um largely you're
(00:44:56)
trusted to do the right thing and do it
(00:44:58)
right. Uh which is really cool. I
(00:45:01)
remember when uh Elon was still like
(00:45:03)
really working on Doge, there was at one
(00:45:05)
point I think they deleted um like Ebola
(00:45:08)
prevention or something and then and
(00:45:10)
then they rapidly like reput that back
(00:45:12)
in what things have been like deleted
(00:45:14)
because of this rapid process of trying
(00:45:16)
to figure out, you know, what doesn't
(00:45:17)
need to be done. Um and then like
(00:45:19)
reimplemented.
(00:45:20)
>> There's very rarely anything like
(00:45:22)
irreversibly destructive. I'm actually
(00:45:24)
not really aware of anything where
(00:45:26)
something was irreversibly destroyed,
(00:45:28)
but like I said, yeah, frequently
(00:45:30)
something will be deleted or removed or
(00:45:31)
something like that and someone will be
(00:45:33)
like, "Hey, I needed that." uh you know
(00:45:36)
an hour or two and then you go and roll
(00:45:38)
back um or you know sometimes it can be
(00:45:41)
months uh where you know someone's
(00:45:43)
building this project and they're
(00:45:45)
depending on I don't know some piece of
(00:45:46)
infrastructure something like that and
(00:45:49)
turns out we rebuilt that thing three
(00:45:51)
times by the time you go and and deploy
(00:45:53)
and need it and so you update and and uh
(00:45:57)
go that way.
(00:45:58)
>> Do you think it's helpful to have like
(00:45:59)
so few people working there on on the
(00:46:01)
engineering team?
(00:46:02)
>> Yeah, definitely. um the more people you
(00:46:05)
have doing a like I I definitely say
(00:46:07)
like a a job for one person done done by
(00:46:10)
two will take twice as long. Um and it
(00:46:13)
applies for every skill I think. Uh and
(00:46:17)
especially now that you don't need to
(00:46:20)
physically write as much code as you did
(00:46:23)
previously. You can be more of the
(00:46:25)
decision maker and the architect.
(00:46:27)
Everyone can be an architect. Um you
(00:46:29)
just don't need as many hands. Uh so one
(00:46:32)
brain can do a lot more.
(00:46:33)
>> You tried starting multiple companies
(00:46:36)
and you were doing a whole bunch of
(00:46:37)
different projects prior to this. What
(00:46:39)
about working here and like what about
(00:46:41)
the mission the culture resonated? Why
(00:46:43)
did you why did you decide to work on
(00:46:45)
this? Uh, I've definitely always been
(00:46:47)
very Elon fil like I always uh he's been
(00:46:50)
a big personal hero of mine. Uh
(00:46:51)
especially growing up uh you know seen
(00:46:53)
the Falcon landings, the first ones and
(00:46:55)
um I went out to uh launch five of
(00:46:59)
Starship which was so worth it. It was
(00:47:01)
the first one they caught. It was it was
(00:47:03)
really cool. It was definitely the
(00:47:05)
coolest thing I've ever seen. Um so
(00:47:07)
being part of uh anything even remotely
(00:47:10)
related to that sounds awesome to me.
(00:47:12)
Um,
(00:47:13)
>> is there a reason why you chose like
(00:47:14)
this company instead of SpaceX or Tesla
(00:47:16)
or
(00:47:17)
>> Yeah, I'm definitely like an
(00:47:18)
entrepreneur by heart and um, uh, Xi is
(00:47:20)
definitely the smallest company uh, the
(00:47:22)
newest of all of them. It I think my
(00:47:25)
assumption is and this is largely proven
(00:47:27)
true I think uh, is where you can have
(00:47:29)
the most leverage and change as an
(00:47:31)
individual person um, because
(00:47:34)
proportionately you're a much larger
(00:47:36)
percentage of the company um, than you
(00:47:39)
would be at these other companies. Not
(00:47:40)
to say that like they're not doing cool
(00:47:43)
things or everyone's not as important,
(00:47:45)
but the Yeah. Just the proportional
(00:47:48)
change
(00:47:48)
>> kind of to to decision is like way
(00:47:50)
higher.
(00:47:50)
>> Yeah.
(00:47:51)
>> Yeah.
(00:47:52)
>> Uh not even to decision but to
(00:47:54)
implementation to seeing the results
(00:47:56)
like it's very quick. And um I guess
(00:48:01)
another assumption that I thought would
(00:48:02)
be the case but it's wrong uh that I had
(00:48:05)
was that uh I would be faster on my own
(00:48:07)
you know to build XYZ thing or try XYZ
(00:48:11)
experiment. I'm actually usually faster
(00:48:13)
at XAI just because I have
(00:48:16)
uh a groundwork and a team who's
(00:48:19)
probably already done a lot of the steps
(00:48:20)
that I would otherwise have to do by
(00:48:21)
hand. Um and there's yeah no one saying
(00:48:24)
no. You mentioned like it's kind of a
(00:48:27)
fuzzy blurred line between people
(00:48:29)
working on different people working on
(00:48:31)
different things. Has there been any
(00:48:33)
ability for you to kind of go to other
(00:48:34)
people in the organization and just ask
(00:48:36)
for help
(00:48:37)
>> all the time?
(00:48:38)
>> What does that look like?
(00:48:39)
>> Um I walk up to their desk and I say,
(00:48:42)
"Hey, here's my question. Um what are
(00:48:45)
you working on right now? Can I support
(00:48:46)
any of that? And can you help me with
(00:48:48)
this?" Uh that's it. Everyone's in the
(00:48:51)
same building. So, uh yeah, actually
(00:48:55)
funny enough, we um uh we started
(00:48:58)
testing some of our uh human emulators
(00:49:01)
internally within the company as as
(00:49:03)
employees. And um in some cases like
(00:49:07)
like we didn't really tell anyone about
(00:49:08)
this. And so in some cases there'll be
(00:49:11)
someone like someone doing some work and
(00:49:14)
someone is like, "Hey, can you help me
(00:49:15)
with this thing?" Or like, "Can you do
(00:49:16)
this thing?" And the virtual employee is
(00:49:19)
like, "Yeah, sure. Come to this desk.
(00:49:20)
Come to my desk." and they go there and
(00:49:21)
there's nothing there.
(00:49:22)
>> It's like the the claw situation where
(00:49:24)
it's like we're going to show up and uh
(00:49:26)
I think when they first rolled out their
(00:49:28)
vending machine, it was like, "I'm going
(00:49:30)
to see you tomorrow." And then
(00:49:31)
[laughter] it, you know, it's obviously
(00:49:32)
like a piece of code.
(00:49:34)
>> Yeah, exactly. And so, uh, multiple
(00:49:36)
times I've gotten a ping saying like,
(00:49:37)
"Hey, this guy on the org chart reports
(00:49:39)
to you. Is he like not in today or
(00:49:41)
something?" [laughter]
(00:49:42)
>> It's just like an emulation
(00:49:44)
>> and it's a it's an AI.
(00:49:46)
Uh, it's a virtual employee. Um but
(00:49:49)
yeah, generally we all expect to be like
(00:49:50)
in the same building and reachable to
(00:49:52)
each other. Um so uh it goes like always
(00:49:56)
and uh I can ask for help. I people ask
(00:50:00)
me for help all the time.
(00:50:01)
>> What have been the biggest like blunders
(00:50:03)
that have happened? Hm.
(00:50:06)
So, um, with the human emulators, with
(00:50:08)
the customers that we're working with,
(00:50:11)
um, when we try to understand, like we
(00:50:14)
always try to understand what the job
(00:50:15)
that they're doing is and all the facets
(00:50:17)
of it. Um, frequently we'll, you know,
(00:50:20)
talk to them, we'll interview, we'll
(00:50:21)
even watch them. Well, actually, we'll
(00:50:23)
do the watching at the last step. So,
(00:50:24)
we'll talk to them, we'll interview,
(00:50:24)
they'll give you either write up or
(00:50:26)
we'll just meet up with them and and
(00:50:28)
write notes as to how they do their job.
(00:50:31)
And then um like a week later we'll look
(00:50:34)
at the uh mistakes that the virtual
(00:50:36)
employee is making and realize like well
(00:50:40)
it's always making mistakes in these
(00:50:41)
places in these specific cases. What's
(00:50:42)
going on? And we go watch the human
(00:50:43)
doing the same thing and there's like 20
(00:50:45)
different steps that are missing that
(00:50:46)
they just totally left out and we go to
(00:50:48)
them and they're like oh yeah we do that
(00:50:50)
like I forgot to tell you. My bad. It it
(00:50:53)
happens all the time. Um a lot of things
(00:50:56)
people like I guess assume
(00:50:59)
automatically. it's all for granted in
(00:51:00)
their head. Totally on autopilot. The
(00:51:02)
same way that you um can like be driving
(00:51:05)
for an hour and not remember a single
(00:51:07)
second of it and not be paying attention
(00:51:08)
can be totally in your own world. Um
(00:51:10)
this is the same for every thing that a
(00:51:12)
human does uh repeatedly. And that's
(00:51:15)
what we're trying to solve basically is
(00:51:17)
all the uh dumb stuff that humans do
(00:51:20)
repetitively right now that they don't
(00:51:21)
need to.
(00:51:22)
>> Um trying to solve for that case.
(00:51:24)
Exactly.
(00:51:24)
>> How do you decide like which which thing
(00:51:26)
to go after? What's like the in your
(00:51:28)
head when you're thinking about that,
(00:51:30)
what are the biggest things outside of
(00:51:31)
driving that humans do all the time that
(00:51:33)
they just don't need to do?
(00:51:35)
>> Um,
(00:51:36)
anything repetitive on a computer. So
(00:51:38)
like customer support is a big one. um
(00:51:40)
where it's just taking in free form
(00:51:43)
input from arbitrary customer in
(00:51:45)
arbitrary form factor and uh translating
(00:51:49)
that into a standard workflow that is
(00:51:52)
purpose-built for like uh an AI to take
(00:51:56)
care of that so that human could go and
(00:51:58)
do something more creative and uh use
(00:52:00)
their brain like in a more effective
(00:52:02)
way. um totally the same like it's it's
(00:52:06)
a total parallel to what happened uh in
(00:52:08)
the coding world like okay I don't need
(00:52:11)
to write the same uh you know
(00:52:13)
implementation 20 different times
(00:52:14)
anymore uh I can describe it in like
(00:52:16)
three words and it's done um it's a huge
(00:52:21)
compression step uh and this is the same
(00:52:24)
thing basically but for arbitrary uh
(00:52:26)
digital workflows
(00:52:27)
>> on the human emulator side you run into
(00:52:29)
this problem of humans not existing and
(00:52:31)
then like someone says come to my desk
(00:52:33)
and the person doesn't exist. Is there
(00:52:34)
any other thing that's been kind of
(00:52:36)
surprising on rolling that out
(00:52:37)
internally?
(00:52:38)
>> Surprisingly, we've been able to
(00:52:41)
generalized to more cases than we
(00:52:42)
thought. We test
(00:52:44)
>> and we're pleasantly surprised a lot of
(00:52:46)
times. Um like just today we we gave
(00:52:49)
Elon a few cases where we did not train
(00:52:52)
on this task at all, but it did it
(00:52:54)
flawlessly, like perfectly like way
(00:52:56)
better than we would have expected. Um
(00:52:58)
yeah, the the generalization is better
(00:53:00)
than we expected for sure. and we're
(00:53:02)
still at a very early stage, so it's
(00:53:03)
only going to get better. Um,
(00:53:06)
and it's again the same parallels to
(00:53:08)
full self-driving where there's stuff
(00:53:10)
not in the training data that the car
(00:53:13)
does react to perfectly um due to
(00:53:16)
generalization of a otherwise very very
(00:53:19)
small model like it's a matter of like u
(00:53:23)
basically weight efficiency.
(00:53:24)
>> For the Elon meetings that you've been
(00:53:26)
in, like what does that actually look
(00:53:28)
like? Um, they're pretty simple,
(00:53:31)
honestly. Um, and, uh, I've been lucky
(00:53:34)
that most of the ones I've been in have
(00:53:36)
gone mostly pretty smoothly. Um, uh,
(00:53:40)
yeah, there's always,
(00:53:40)
>> what does smooth look like?
(00:53:42)
>> Smooth. Smooth is, uh, limited feedback
(00:53:45)
or thumbs up. Um, that means like, okay,
(00:53:48)
you're going in the right direction.
(00:53:50)
Keep going. Uh, I'll hear updates next
(00:53:52)
week. Uh, or whatever it is. Um if
(00:53:56)
there's feedback or a total reversal of
(00:53:58)
direction as a request then we messed up
(00:54:01)
somewhere. Um
(00:54:03)
then the question is where? So that's
(00:54:07)
usually we don't even have time to
(00:54:08)
identify that. Um that's something you
(00:54:10)
just build up implicitly as a muscle as
(00:54:12)
you go on. Um and sometimes assumptions
(00:54:15)
also change um
(00:54:17)
based on new information. That always
(00:54:19)
happens in every case. So when it comes
(00:54:20)
from the top down, it's a little
(00:54:21)
chaotic, but
(00:54:22)
>> I know with like SpaceX, the cost for
(00:54:25)
parts and building things is super super
(00:54:27)
important. Uh cuz, you know, everything
(00:54:29)
basically costs a [ __ ] ton of money and
(00:54:31)
time to to do, right? Um
(00:54:34)
>> for this sort of thing, I imagine it's a
(00:54:36)
little bit less focused on, you know,
(00:54:38)
he's not like necessarily drilling down
(00:54:40)
on do you understand every part of every
(00:54:43)
process. Um what is what does it look
(00:54:45)
like when he's kind of giving feedback?
(00:54:48)
Um, usually
(00:54:52)
it's either at a very high level or at a
(00:54:54)
very low level.
(00:54:55)
>> It's not really often in between. Um, so
(00:54:59)
either on the high level it's like a
(00:55:00)
product direction or customer sense, you
(00:55:02)
know, focus on this segment
(00:55:04)
>> exclusively or don't do this thing at
(00:55:07)
all or whatever. Um and then at a low
(00:55:10)
level uh especially when it comes to uh
(00:55:14)
compute efficiency or latency, he'll
(00:55:16)
always have a specific uh suggestion uh
(00:55:20)
or let's try this. And he's open to
(00:55:23)
being like proven wrong, but it has to
(00:55:25)
be proof. It has to be like let's try it
(00:55:27)
and see what the results are. Uh it
(00:55:29)
can't be just someone's opinion. There
(00:55:31)
has to be an experiment done. Um which
(00:55:33)
has led to some surprising results
(00:55:35)
sometimes and we go with it. What have
(00:55:37)
been those?
(00:55:38)
>> Um, so the compute efficiency of going
(00:55:40)
with the small model has led to well a
(00:55:44)
lot of improvements that we wouldn't
(00:55:45)
have otherwise thought. Um, some of them
(00:55:47)
are secondary, some of them are primary.
(00:55:49)
The obvious ones are well obvious um
(00:55:52)
being able to go much much faster to
(00:55:53)
human but also uh as a result and Tesla
(00:55:57)
found this too with full self-driving
(00:55:59)
going with the smaller model they're
(00:56:00)
able to iterate much much faster. Um so
(00:56:03)
not only does the model uh react to
(00:56:05)
situations faster and um can be more I
(00:56:08)
guess tolerant of time frames um
(00:56:13)
you can also just deploy iterations much
(00:56:15)
faster if it was 4 weeks before maybe
(00:56:17)
it's one week now. Um so as like that
(00:56:20)
that actually goes back to the
(00:56:22)
experimentations why we can have 20
(00:56:23)
different ones going in parallels was a
(00:56:25)
result of that particular decision um
(00:56:28)
early on in in the chain
(00:56:29)
>> and was the initial idea like go just do
(00:56:32)
big large models and then
(00:56:34)
>> sort of uh we definitely wanted to go
(00:56:36)
faster than everyone else um but the
(00:56:39)
question of how much faster was well the
(00:56:42)
answer to that was amplified basically
(00:56:44)
multiply by a lot
(00:56:45)
>> there's this uh like a lot of bias and
(00:56:47)
stuff in Wikipedia and Elon has been
(00:56:49)
like focused on kind of creating an
(00:56:51)
alternate version that's just kind of
(00:56:52)
like you know more truthful uh in
(00:56:54)
effect. Um how do you go about basically
(00:56:58)
cleaning up the internet in that way to
(00:57:00)
figure out what is truth?
(00:57:01)
>> It's a really hard problem.
(00:57:03)
>> Yeah,
(00:57:03)
>> it's very hard especially because um
(00:57:08)
the internet is not usually the ground
(00:57:10)
truth for whatever thing it is. So
(00:57:13)
wherever we can we try to drill down to
(00:57:16)
the fundamentals which is very hard like
(00:57:19)
I don't know what is the fundamentals
(00:57:21)
like in physics of the constitution
(00:57:25)
that's not really a question I think I
(00:57:27)
can answer or anyone could really
(00:57:28)
faithfully answer very well but you try
(00:57:30)
to do something like that um drill down
(00:57:33)
as close as you can and then build up
(00:57:35)
from that which is hard too because
(00:57:37)
there's not actually a big body of
(00:57:38)
[snorts] like
(00:57:40)
writing that does that. Um,
(00:57:44)
one of the few probably examples is like
(00:57:46)
James Burke um with his connection
(00:57:49)
series is where he'll take two totally
(00:57:51)
seemingly unrelated concepts and then
(00:57:53)
connect them um through physics and
(00:57:55)
inventions. Um it's really cool and
(00:57:58)
we're trying to do the same but uh it's
(00:58:01)
fairly novel.
(00:58:02)
>> How do you find better data?
(00:58:04)
>> Uh data is not the only thing that goes
(00:58:05)
into the results.
(00:58:08)
>> Yeah. like how you actually train on
(00:58:11)
that data and I know it's a pretty broad
(00:58:14)
term but um it is true like how you
(00:58:17)
actually evaluate against that data and
(00:58:19)
train against it and your different
(00:58:21)
methods for updating the weights do
(00:58:22)
matter a lot. um
(00:58:26)
you can try to faithfully recreate the
(00:58:28)
input or the the output given any
(00:58:30)
arbitrary input and well you can create
(00:58:34)
basically a horrible copy paste
(00:58:35)
mechanism if you want um which is a
(00:58:38)
classic problem in in ML um
(00:58:42)
there's a bit of an art to it to to
(00:58:43)
avoid that problem but the I guess we're
(00:58:46)
a few steps removed from that at this
(00:58:47)
point um we're not measuring
(00:58:52)
the fitness to any particular data set.
(00:58:54)
At this point, we're trying to measure
(00:58:55)
to an arbitrary output. So, it matters a
(00:58:57)
lot how you construct your E dolls. Um,
(00:58:59)
which is really hard for truthfulness
(00:59:01)
because then you need to know the truth,
(00:59:03)
which isn't always well, I mean, that's
(00:59:06)
really the problem we're trying to
(00:59:06)
solve, right? So, it's kind of chicken
(00:59:08)
egg. Um,
(00:59:11)
yeah, there's like a lot of different
(00:59:12)
approaches and a bunch of smart people
(00:59:14)
working on it. Um, if yeah, if anyone
(00:59:17)
has suggestions, please send them
(00:59:18)
through. There's like a lot of different
(00:59:20)
ways to look at it. So,
(00:59:22)
>> there's been like moments in time where
(00:59:23)
I've seen um Elon on X and someone has
(00:59:26)
said like this is obviously not right
(00:59:28)
and it's like some Grock output and he's
(00:59:30)
like we're going to fix this and then
(00:59:32)
you know 12 hours later, 24 hours later
(00:59:33)
he's like all right it's fixed.
(00:59:35)
>> When that happens like what happens
(00:59:36)
internally.
(00:59:37)
>> Uh he shows us what went wrong and then
(00:59:42)
quickly whoever um is awake at the time
(00:59:47)
it will uh start up a thread to go and
(00:59:49)
solve it. uh usually individually pull
(00:59:51)
in a few few others if need be um and
(00:59:55)
then give a postmortem on what happened
(00:59:56)
and everyone will understand then what
(00:59:59)
uh what went wrong and how to avoid it
(01:00:00)
in the future. Uh, ideally,
(01:00:02)
>> yeah, the like generally making mistakes
(01:00:05)
once is okay, but making the same
(01:00:07)
mistake twice is big problem.
(01:00:09)
>> Throughout SpaceX's history, there's
(01:00:10)
been a number of and same thing with
(01:00:12)
Tesla, there's been a bunch of these
(01:00:13)
like surges where randomly Elon will
(01:00:16)
like come in at midnight and say, you
(01:00:17)
know, like everyone that can come in,
(01:00:19)
like send out a companywide email and
(01:00:20)
say like come in, we need to be working.
(01:00:22)
That sort of thing. Um, has there any
(01:00:24)
been anything like that?
(01:00:26)
>> It's more for the big models that that
(01:00:27)
that happens more than anything. Um, for
(01:00:31)
Macro Heart specifically, we've been
(01:00:32)
operating in in a war room for 4
(01:00:35)
[laughter] months.
(01:00:36)
So, so we've kind of always been on that
(01:00:38)
on that push.
(01:00:39)
>> Do you guys have like a sign on the door
(01:00:40)
that says war room?
(01:00:41)
>> Yeah.
(01:00:42)
>> Amazing.
(01:00:43)
>> Actually, well, yeah, we we outgrew the
(01:00:45)
original war room. Um, and so we moved
(01:00:48)
everything out and uh I'm told like
(01:00:51)
walks in to the war room and it's
(01:00:52)
totally empty and he's like, "Where is
(01:00:54)
everyone? What?" and he walks over to
(01:00:56)
where we are now, which is just the gym,
(01:00:58)
which we cleared out and put everyone in
(01:01:00)
now, and then conducts his impromptu
(01:01:03)
questions of what's going on. That was a
(01:01:06)
long night. [laughter]
(01:01:08)
>> What is it like on on one of those
(01:01:09)
nights where a lot of things kind of get
(01:01:12)
shaken up and and moved forward or like
(01:01:14)
there's there's one of these searches.
(01:01:15)
What does that feel like?
(01:01:17)
>> Um, I think actually I saw this from one
(01:01:19)
of the co co-founders uh of XA posted
(01:01:22)
this recently. um Igor uh who was great
(01:01:26)
to work with. I liked him a lot. It was
(01:01:28)
actually really cool to work with him.
(01:01:28)
Side tangent um because his work on um
(01:01:32)
on Starcraft AI uh way back like
(01:01:36)
>> I guess 10 years ago now almost was one
(01:01:38)
of the first like cool ML work that I
(01:01:41)
tried to replicate myself in high
(01:01:42)
school. Uh which was very hard. Um it
(01:01:45)
was really cool. So it was really cool
(01:01:46)
to work with him. Like I totally never
(01:01:49)
thought I would get the chance to. Um,
(01:01:51)
but anyway, uh, I saw him, uh, post this
(01:01:54)
thing a few days ago where he's like,
(01:01:55)
"Okay, there there are some, uh, you
(01:01:58)
know, months where, uh, only a few days
(01:02:01)
go by and then there's some nights where
(01:02:03)
months happen." And that was like one of
(01:02:06)
them for sure. Um,
(01:02:08)
months might be an exaggeration. I think
(01:02:09)
we would have gotten to the technical
(01:02:10)
result we would have in a few weeks
(01:02:12)
anyway, but doing it in one night was a
(01:02:14)
huge push and it was a long night.
(01:02:18)
Has there been any moments where the
(01:02:20)
company just didn't leave the office for
(01:02:22)
like 5 days or like a week?
(01:02:24)
>> Yeah, the surges for the models usually
(01:02:27)
results in a lot of people staying in
(01:02:29)
overnight. Um,
(01:02:30)
>> and you mentioned there's like five or
(01:02:31)
six pods that people can sleep in and
(01:02:33)
they like toggle out.
(01:02:35)
>> Yeah. Yeah. There's some there's some
(01:02:36)
sleeping pods and we have some bunk beds
(01:02:37)
now, too. Um, which are less less nice,
(01:02:40)
>> but they exist. Um, and then when the
(01:02:42)
tent picture came out, everyone kept
(01:02:44)
sending that to me and I was like,
(01:02:46)
honestly, yeah, we have tents, but I've
(01:02:48)
never seen that many out at once.
(01:02:51)
[laughter]
(01:02:51)
Um, so yeah,
(01:02:54)
>> I know you worked on a bunch of
(01:02:55)
different projects as a kid. And I think
(01:02:57)
I don't know if this was their first
(01:02:58)
one, but it was like fidget spinners and
(01:02:59)
and and making fidget spinners. Um, I
(01:03:01)
don't think it was in your garage, but
(01:03:02)
maybe it was like in your room.
(01:03:04)
>> Yeah. What kind of stuff like that
(01:03:06)
tinkering mindset? How much of that have
(01:03:08)
you kind of taken to this? Uh quite a
(01:03:11)
bit. Quite a bit. Yeah. Um so I learned
(01:03:14)
programming when I was quite young. Um
(01:03:16)
my dad got me a book when I was like 11
(01:03:18)
and I liked it a lot. Um well I liked it
(01:03:21)
a bit but I really started to like it
(01:03:22)
when I realized you can make money from
(01:03:24)
it.
(01:03:24)
>> And so um I I met some people online who
(01:03:27)
were basically writing scripts for games
(01:03:29)
as hacks and would sell them online um
(01:03:31)
for small amounts of money. But you know
(01:03:33)
making a couple hundred bucks online was
(01:03:35)
huge for me. Um,
(01:03:36)
>> I think the first time that you like
(01:03:37)
have someone give you money, it's the
(01:03:39)
strangest feeling.
(01:03:40)
>> Crazy. Yeah. I remember having to ask my
(01:03:42)
dad for like a PayPal like custody
(01:03:44)
account or whatever and uh getting the
(01:03:46)
money in and it was like the coolest
(01:03:48)
thing of all time ever for me. Um, yeah,
(01:03:52)
it was really big. And so uh I did that
(01:03:55)
for um
(01:03:58)
like a couple months and saved up enough
(01:04:00)
money to at the time I was really
(01:04:02)
interested in uh added manufacturing
(01:04:03)
like 3D printers. RepRap was the big
(01:04:06)
thing then. So that was kind of where
(01:04:08)
what kicked off the modern 3D printing
(01:04:10)
revolution. Uh RepRap was like this
(01:04:12)
>> built your own, right?
(01:04:13)
>> Yeah, you had to. That was the only
(01:04:14)
option.
(01:04:15)
>> Um RepRap is literally just a bunch of
(01:04:19)
university students basically um who
(01:04:21)
said like let's see if we can build a
(01:04:23)
machine that can build almost all the
(01:04:25)
components for itself. Um which was that
(01:04:27)
why it was called RepRap. And uh they
(01:04:30)
basically built in a variety of
(01:04:32)
universities these rooms where you start
(01:04:35)
with one printer um and then it prints
(01:04:38)
the parts for the next printer and you
(01:04:39)
go all the way up and you scale up and
(01:04:40)
there's lots of problems as it turns out
(01:04:43)
and that's what they were solving and
(01:04:45)
eventually kicked off like the the
(01:04:46)
modern 3D printing resolution. Um, but I
(01:04:48)
was very obsessed with it and so I took
(01:04:50)
one of their parts list and bought
(01:04:51)
everything from Alibaba and a month
(01:04:52)
later things came in and I assembled it
(01:04:54)
all one night which went poorly actually
(01:04:56)
when I was uh unbundling the copper
(01:04:58)
cable for the power supply. Um, which
(01:05:00)
was a very sketchy power supply and did
(01:05:02)
catch fire in the end. Um, the all the
(01:05:05)
copper windings came like loose and
(01:05:07)
frayed everywhere and one went like 2 in
(01:05:09)
into my thumb. Um,
(01:05:10)
>> did you just can't your thumb just
(01:05:12)
doesn't work or did you go to the
(01:05:14)
hospital or something?
(01:05:15)
>> No. So, it was a school night and it was
(01:05:16)
like 3:00 a.m. cuz I wasn't very good at
(01:05:18)
building things at 13. Um, and I spent
(01:05:22)
like an hour in the bathroom trying to
(01:05:23)
pull it out with tweezers and it just
(01:05:25)
wasn't it was like it was bad. So, I
(01:05:27)
just cut it off and I was like, eh.
(01:05:29)
[laughter]
(01:05:30)
And so, bit by bit over the next few
(01:05:31)
weeks it came out and I would snip it
(01:05:33)
off in the mornings. Um, it was fun.
(01:05:35)
[laughter]
(01:05:36)
Um, yeah. Uh, but I got the printer
(01:05:39)
assembled. Um, and uh around that time,
(01:05:42)
yeah, the fidget spinner craze was going
(01:05:43)
off. So, I bought 1 th00and skateboard
(01:05:45)
bearings from China and basically set up
(01:05:46)
a little factory uh in my bedroom where
(01:05:49)
every two hours at night I would wake up
(01:05:51)
and I would clear the print bed, start a
(01:05:52)
new print of fidget spinners and I would
(01:05:54)
sell them online and then um before
(01:05:56)
school [clears throat] I had a little
(01:05:57)
assembly line in my garage where I would
(01:05:59)
uh put in the bearings, spray paint, dry
(01:06:01)
and then run around to all the other bus
(01:06:02)
stops of the other schools um sell them
(01:06:04)
to my distributors which were just uh
(01:06:06)
other kids of other schools, sell all
(01:06:08)
day at school, come back, collect from
(01:06:09)
my distributors and then um sell online,
(01:06:12)
ship uh built a little healthy business
(01:06:15)
and uh after 2 months they ended up
(01:06:17)
getting shut down by the county. Um
(01:06:19)
their official quoted reason was that
(01:06:20)
the uh companies that sell the school
(01:06:23)
food have technically an exclusive
(01:06:25)
license to sell anything in school
(01:06:28)
property. But I think they just didn't
(01:06:30)
like that I was distracting everyone and
(01:06:32)
making money doing it. Um but it taught
(01:06:35)
me a good like healthy disrespect for
(01:06:36)
authority. I think
(01:06:37)
>> that that has kind of been like a
(01:06:39)
constant theme. what what does that
(01:06:40)
actually how is that materialized in
(01:06:43)
your life with like the healthy
(01:06:44)
disrespect for authority like what and
(01:06:46)
you even mentioned um institutions like
(01:06:49)
you don't like necessarily trust
(01:06:50)
institutions um
(01:06:52)
>> how did you kind of come to that and
(01:06:54)
what does what does that look like
(01:06:56)
>> um
(01:06:58)
I I've always known from very young like
(01:07:00)
I I want an unconventional outcome
(01:07:03)
>> and so
(01:07:05)
going through a conventional path would
(01:07:08)
pretty much necessarily not lead you to
(01:07:10)
an unconventional outcome. So I grew
(01:07:12)
opposed to any form of convention and
(01:07:16)
institutions necessarily enforce
(01:07:18)
convention. Um I think creativity and
(01:07:22)
interesting outcomes come mostly from
(01:07:24)
free-spirited individuals um in almost
(01:07:28)
every case if not all of them. Um, I
(01:07:31)
guess it's a bit of a like high
(01:07:33)
highminded way of saying it, but yeah,
(01:07:35)
like individuals are
(01:07:37)
the most creative you can get. And so
(01:07:39)
staying true to that is the way to go.
(01:07:40)
>> I do love uh John Carlson's idea of like
(01:07:43)
everything is so hard to build and so
(01:07:46)
hard to make, especially, you know, put
(01:07:48)
into the real world that if you look
(01:07:50)
around, it's basically like the world is
(01:07:51)
just filled with some, you know,
(01:07:53)
people's passion projects.
(01:07:54)
>> Yeah. It's a total miracle. Um there's a
(01:07:56)
story behind every little thing. Um way
(01:07:59)
more than you would think. I remember
(01:08:01)
reading about the um I think it was YKK
(01:08:03)
zippers. Apparently every good zipper
(01:08:05)
like there's two or three companies in
(01:08:07)
the world that make zippers which are
(01:08:08)
actually pretty little little miracles.
(01:08:10)
They're very cheap but also mechanically
(01:08:12)
comp like relatively complicated for how
(01:08:14)
much you pay for them. And there's only
(01:08:16)
a few companies that are capable of
(01:08:18)
building or have have set up to build
(01:08:20)
them. Um and it's it's Yeah. Yeah.
(01:08:22)
basically this one Japanese guy's
(01:08:24)
passion project over 40 years uh to
(01:08:26)
figure out how to do this properly. Um
(01:08:27)
and this is the case for pretty much
(01:08:29)
everything. Um anything very specific
(01:08:32)
and at scale is probably only done by a
(01:08:34)
few companies or a few people in the
(01:08:35)
world. Um so yeah, I mean you hear about
(01:08:38)
it every so often, right? Like some
(01:08:40)
company in Germany, arbitrary company in
(01:08:42)
Germany shuts down and Volkswagen has to
(01:08:43)
halt all their lines or something like
(01:08:45)
that. Um happens all the same. It was a
(01:08:48)
big thing in co
(01:08:49)
>> right before we met you had made a
(01:08:52)
liquid fuel I think rocket engine.
(01:08:54)
>> Um it was like a very small thing. I saw
(01:08:55)
it upstairs. Um but you said we were
(01:08:58)
talking before this that you did it in
(01:09:00)
like 24 hours just on a whim. Um how did
(01:09:03)
that happen?
(01:09:04)
>> Yeah. Um so it was a project over like
(01:09:07)
roughly four weeks. Um and I started by
(01:09:10)
literally just buying a bunch of
(01:09:11)
textbooks. um and trying to figure out
(01:09:14)
like what are the design principles
(01:09:16)
behind a rocket engine like how do I
(01:09:18)
design it? There's not like um you it's
(01:09:21)
totally different from learning software
(01:09:22)
where you can just go on GitHub and
(01:09:23)
download people's code and modify it.
(01:09:25)
There's no file for a rocket engine. You
(01:09:27)
have to learn how to like what are the
(01:09:29)
material properties, what's the chemical
(01:09:31)
properties, how do you actually machine
(01:09:33)
it, um how do you design the parameters
(01:09:35)
and know what to expect from in terms of
(01:09:37)
thrust output and how do you not over
(01:09:39)
pressure the engine and all these kinds
(01:09:41)
of things. Um, how did you design the
(01:09:42)
injector which is uh the injector was
(01:09:44)
very hard. That was probably 50% of the
(01:09:46)
time.
(01:09:47)
>> Was that the hardest thing?
(01:09:48)
>> Yeah, the injector was very hard and it
(01:09:50)
was like the biggest flaw in the end.
(01:09:51)
Um, so yeah, I spent like 3 4 weeks
(01:09:54)
doing this and uh expedited a bunch of
(01:09:56)
parts from China like CNC and all that
(01:09:57)
stuff. Um, and uh it was right before
(01:10:00)
Thanksgiving. I was going to go fly back
(01:10:02)
to the east coast and visit my family
(01:10:04)
and I was like, "Okay, either I fire it
(01:10:05)
like build it and fire it tonight. It
(01:10:07)
was all just a bunch of parts at that
(01:10:09)
time. uh or I uh do this in two weeks
(01:10:12)
and I'm like I'm not going to do this in
(01:10:13)
two weeks. I'm going to do this right
(01:10:14)
now. So uh I drank a lot of coffee in
(01:10:18)
the morning and then spend the whole day
(01:10:20)
like hacking away at at uh aluminum
(01:10:22)
extrusions and built out the test frame
(01:10:24)
and then the the engine itself and uh
(01:10:26)
let it off that night. Um yeah, which
(01:10:30)
had a lot of um
(01:10:33)
we'll say
(01:10:35)
uh concessions made to make it happen
(01:10:37)
that night. [laughter]
(01:10:38)
Um,
(01:10:39)
>> I did find it absolutely hilarious that
(01:10:41)
you like you said, were you like a
(01:10:43)
couple feet away?
(01:10:44)
>> Yeah. So, I designed it like I wasn't
(01:10:47)
stupid. I designed it so that I could
(01:10:48)
remotely fire it, but um I didn't the
(01:10:51)
power supply hadn't come in yet to to
(01:10:53)
remotely power the computer that was on
(01:10:55)
board. So, I had to use a USB cable from
(01:10:57)
my laptop to power the onboard computer.
(01:10:59)
And I didn't have a long enough USB
(01:11:00)
cable. Uh, the longest one I had was
(01:11:03)
like 6 foot. So, I had to stand right
(01:11:05)
next to it and light it up. And I was
(01:11:08)
like, there's like a 30% chance that
(01:11:09)
this thing explodes or or launches fire
(01:11:12)
everywhere. And actually, um I don't
(01:11:14)
know if it shows in the video. I think
(01:11:15)
it does show in the video, but my jacket
(01:11:16)
did catch fire [laughter]
(01:11:18)
because I I wasn't that great at
(01:11:20)
designing the injector and it did create
(01:11:22)
a lot of over pressure events, which
(01:11:23)
meant there was a lot of basically uh
(01:11:24)
unburnt fuel spewing out, which was
(01:11:28)
ethanol. And so that's liquid and just
(01:11:30)
landed some landed on my on my jacket
(01:11:33)
and caught fire. Um so yeah, that's a
(01:11:35)
trophy still, the burnt jacket.
