↔
Title: Dylan Patel: NVIDIA’s New Moat & Why China is “Semiconductor Pilled”
Duration: 01:16:51
Total Correct Answers:
Current Caption
Correct
Learning Modes
YouTube Video Transcript Hide
Ask AI:
Export as:
Ask AI Result
The ask AI result will appear here..
(00:00:00) Your YouTube transcript will appear here
(00:00:00)
This is the biggest change in human
(00:00:02)
history maybe ever. What's about to
(00:00:04)
happen with AI? This is the biggest
(00:00:06)
revolution bigger than industrial
(00:00:07)
revolution. Jensen is very paranoid
(00:00:09)
about losing. If he just kept making his
(00:00:10)
mainline chip, people crush him on cost
(00:00:12)
and performance. Acquiring Grock is how
(00:00:14)
you get those resources to make more
(00:00:15)
solutions for different parts of the
(00:00:17)
market to stay king. At the end of the
(00:00:18)
day, this is an economic war. If the US
(00:00:20)
and the West win in AI, China will not
(00:00:24)
rise to be the global hedgeimony. But
(00:00:25)
without AI, China definitely will rise.
(00:00:28)
They're just going to outrun America.
(00:00:29)
Hi, I'm Matt Turk. Welcome back to the
(00:00:31)
Matt podcast. Today I'm joined by the
(00:00:33)
one person Wall Street and Silicon
(00:00:35)
Valley turn to when they need to cut
(00:00:37)
through the hardware hype, Dylan Patel
(00:00:39)
of Semi analysis. We dove into many of
(00:00:41)
the most important topics [music] of
(00:00:43)
today. Nvidia's massive move to acquire
(00:00:45)
Grock, the truth about the capex bubble,
(00:00:47)
whether the US power grid can actually
(00:00:49)
handle the AI boom, and the geopolitical
(00:00:52)
chess match [music] playing out between
(00:00:53)
the US and China. But I have to warn
(00:00:55)
you, this conversation went off the
(00:00:57)
rails in the best possible way. And we
(00:00:59)
ended up going into all sorts of fun
(00:01:00)
tangents like the strange phenomenon of
(00:01:03)
Chinese romance dramas set inside
(00:01:05)
semiconductor factories and what's
(00:01:07)
really like when three AI famous
(00:01:08)
roommates live together in SF. Please
(00:01:10)
enjoy this fantastic conversation with
(00:01:12)
Dylan.
(00:01:15)
>> Hey Dylan, welcome.
(00:01:16)
>> Hello. How are you?
(00:01:17)
>> I'm great. I'd love to start with Grock
(00:01:19)
and Nvidia since it's still fresh. So,
(00:01:22)
not so long ago, Nvidia was saying that
(00:01:24)
uh one GPU could do it all, and now
(00:01:26)
they're doing this acquisition
(00:01:28)
non-exclusive deal with Grock. What does
(00:01:31)
that mean from your perspective?
(00:01:33)
>> It's very clear. We're not sure where AI
(00:01:36)
models are headed in terms of, you know,
(00:01:38)
over the next few years, what happens to
(00:01:40)
the architecture, but you know, the
(00:01:42)
thing that I think everyone is sort of
(00:01:43)
like agreed on is models are pretty auto
(00:01:45)
reggressive, right? Next token
(00:01:46)
generation is like the thing but beyond
(00:01:48)
that right attention mechanisms changed
(00:01:49)
the how how it works everything changes
(00:01:51)
right could could change and so what's
(00:01:52)
interesting is the reason Nvidia one is
(00:01:55)
because they just took like the widest
(00:01:56)
surface area bet and then people kept
(00:01:57)
developing models on that and that kind
(00:02:00)
of shape worked but now the workload is
(00:02:02)
so large that there is room for
(00:02:04)
specialization that will give you 10x
(00:02:06)
increases in certain domains right in a
(00:02:08)
general purpose workload grock doesn't
(00:02:10)
work right you know it can't train it
(00:02:12)
can't you know it can't inference really
(00:02:13)
really large models um cost efficiently,
(00:02:16)
right? You can't serve many many many
(00:02:17)
users, but what it can do is it can go
(00:02:19)
bl screamingly fast, right? Same with
(00:02:21)
the cerebrous open AI deal, but that's
(00:02:23)
like one workload, right? Uh very decode
(00:02:26)
focused, right? Gener doing auto
(00:02:27)
reggressive tokens in a in a single
(00:02:29)
stream super fast. Another direction AI
(00:02:31)
models could head, right? We don't know
(00:02:33)
are models going to think in one token
(00:02:35)
stream or is it actually they're
(00:02:38)
constantly context switching, right? and
(00:02:40)
they're going from they have this
(00:02:41)
humongous humongous context and they're
(00:02:44)
generating in multiple parallel streams
(00:02:47)
right and so Google and openi have both
(00:02:49)
released mechanisms of this with their
(00:02:51)
pro models where the model actually
(00:02:53)
doesn't just have one single chain of
(00:02:55)
thought for reasoning it has multiple
(00:02:57)
right and then I don't exactly like you
(00:02:59)
know and and and how they choose which
(00:03:00)
one and what the final answer to you
(00:03:02)
delivers is is an area of research um
(00:03:05)
but there there is room for that kind of
(00:03:07)
chip right something that works on very
(00:03:08)
parallel a lot lot of streams of chain
(00:03:11)
of thought and maybe the latency
(00:03:12)
requirements are not as crazy, right?
(00:03:13)
Maybe you don't want to go blindingly
(00:03:15)
fast, right? Maybe you're okay with it
(00:03:17)
being, you know, because I can spin up
(00:03:19)
100 parallel, you know, streams of
(00:03:21)
thought or agents or whatever you want
(00:03:22)
to call them. Maybe I I care a lot about
(00:03:24)
cost there. And because it's 100 in
(00:03:26)
parallel instead of one going super
(00:03:27)
super fast, it's not as deep, right? The
(00:03:29)
tree search or the depth of the
(00:03:31)
inference is not as deep, but it is much
(00:03:32)
wider. You know, there's other parts of
(00:03:34)
inference. Hey, process do creating the
(00:03:35)
KV cache. So, Nvidia has a chip for
(00:03:37)
that, right? That's the CPX. So they
(00:03:39)
they've made the CPX, they bought Grock
(00:03:41)
for decode, and then they still have
(00:03:42)
their general purpose GPU. So they've
(00:03:43)
they're kind of trying to cover their
(00:03:45)
bases because unlike the first wave of
(00:03:47)
AI chip companies where they sort of
(00:03:49)
just made chips and then tried to figure
(00:03:50)
out where it would work, right? They had
(00:03:51)
a thesis, Grock and Cerebrus, both as
(00:03:53)
well as Samanova, right, which was put a
(00:03:55)
lot of memory on the chip and not
(00:03:57)
necessarily in the case of Cerebrus and
(00:03:59)
Grock, no memory off chip. And in the
(00:04:01)
case of Samanova, less memory offchip or
(00:04:03)
slower memory offchip with higher
(00:04:04)
capacity. You know, they they sort of
(00:04:06)
all made similar bets in that direction.
(00:04:08)
And it didn't work for a while until it
(00:04:11)
kind of did, right? Um because there's a
(00:04:13)
workload that now necessitates it.
(00:04:14)
Nvidia recognizes they're they're the
(00:04:16)
leader. They're at the tent pole. Hey,
(00:04:18)
in one respect they can just run faster
(00:04:21)
than everyone, but it's kind of hard to
(00:04:23)
be 2x better than Google or or OpenAI or
(00:04:27)
whoever else's internal chip, right? To
(00:04:29)
justify their, you know, 75% plus
(00:04:31)
margins, right? And then they have to be
(00:04:33)
2x to 4x better to justify 4x better to
(00:04:36)
justify their margins because that's
(00:04:37)
what they're charging above COGS. You
(00:04:38)
know, the question is what what
(00:04:40)
architecture will deliver that? Well,
(00:04:42)
yes, keep the programmability of their
(00:04:43)
GPUs is great for training and for a lot
(00:04:47)
of workloads, but you know, guess what?
(00:04:48)
I think I think a lot of people will
(00:04:50)
just be downloading an open source
(00:04:52)
model, downloading an inference
(00:04:53)
framework and pressing go, right? A
(00:04:55)
little bit more complicated than that,
(00:04:56)
but that's that's going to be the
(00:04:58)
consumption method for a lot of
(00:04:59)
enterprises, a lot of uh startups, a lot
(00:05:02)
of tech companies is they're just going
(00:05:03)
to do that or they're going to rent the
(00:05:05)
G GPUs or or rent the chips and then
(00:05:07)
download an open source framework and
(00:05:08)
model and go, right? And Nvidia
(00:05:10)
recognizes this and hey, there is room
(00:05:11)
for products that aren't general
(00:05:13)
purpose, right? The general purpose GPU
(00:05:15)
will still probably be the main line for
(00:05:17)
training and for a lot of inference and
(00:05:19)
for costefficient inference, but maybe
(00:05:21)
blindingly fast or workloads that have a
(00:05:23)
ton of prefill, i.e. creating the the KV
(00:05:25)
cache. Maybe that those workloads could
(00:05:28)
be different chips, right? And the CPX
(00:05:29)
chip they announced, right? They say
(00:05:30)
it's for the context processing,
(00:05:32)
creating the KV cache. It's also really
(00:05:33)
useful for video models because video
(00:05:35)
models don't care about memory bandwidth
(00:05:36)
and so you know why pay for the
(00:05:37)
expensive memory that the general
(00:05:39)
purpose chip has or why do what Grock is
(00:05:40)
doing which is tying hundreds or
(00:05:42)
thousands of chips together and not
(00:05:44)
having memory but keeping the entire
(00:05:46)
model on chip. The trade-off for that of
(00:05:48)
course is you need thousands of chips
(00:05:49)
and you have less compute per chip and
(00:05:51)
so like Nvidia's trying to capture the
(00:05:53)
whole surface area because again you
(00:05:54)
don't know where models are headed and
(00:05:56)
it's hard to say where the research is
(00:05:57)
headed.
(00:05:57)
>> And do you think it's a good thing for
(00:05:58)
the market? Yet another one of those
(00:06:00)
deals that's structured as a as a
(00:06:02)
license but really an acquisition.
(00:06:04)
>> I certainly think it's not good from an
(00:06:08)
anti-competitive sense, right? I don't
(00:06:10)
think people should just be able to buy
(00:06:12)
companies without like any antitrust
(00:06:15)
like process at all. Now, in the case of
(00:06:17)
like a large company buying a startup,
(00:06:19)
I'm completely fine with it. The flip
(00:06:20)
side is like, hey, we know the deal is
(00:06:22)
happening, right? Uh this happened for a
(00:06:24)
company I was an adviser for Nvidia
(00:06:26)
acquired in fabrica just maybe a few
(00:06:27)
months before they did Grock and similar
(00:06:30)
style of deal right if someone wanted to
(00:06:32)
strike it down that's the biggest limbo
(00:06:34)
right we've seen this happen in venture
(00:06:35)
and you probably know more stories of
(00:06:37)
this but like a company trying to get
(00:06:38)
acquired they get stuck in limbo for
(00:06:40)
like a year
(00:06:41)
>> and then it falls apart
(00:06:43)
>> stories
(00:06:43)
>> yeah it falls apart the deal did because
(00:06:46)
some regulatory BS and now the company
(00:06:49)
was and the founders were focused on
(00:06:50)
getting the deal done instead of like
(00:06:52)
making the product better for a year and
(00:06:53)
now they're like behind or you know they
(00:06:54)
they they weren't focused on growth as
(00:06:56)
much right you know you only have so
(00:06:57)
much time as a founder so in that sense
(00:06:59)
I like the license deals right
(00:07:00)
>> so now is uh Nvidia also dominating the
(00:07:04)
the inference market is there any world
(00:07:06)
where Nvidia is no longer the king or
(00:07:08)
they seem to be getting stronger
(00:07:10)
>> I think the thing about Nvidia is they
(00:07:13)
take the Andy Grove mentality like more
(00:07:16)
serious than anyone else right like okay
(00:07:18)
fine Google like implemented OKRs
(00:07:20)
because Intel did it but that's like you
(00:07:21)
know management stuff, right? Only the
(00:07:23)
paranoid survive, right? This is like
(00:07:25)
core to the Bay Area, um core to Nvidia.
(00:07:28)
Um Jensen is very paranoid about losing,
(00:07:31)
right? These specializations, if he just
(00:07:32)
kept making his mainline chip, would
(00:07:34)
mean people could, you know, point point
(00:07:36)
solutions for specific parts of the
(00:07:38)
market would crush him on cost and
(00:07:39)
performance. Then he can't justify his
(00:07:40)
margin. That's a threat to Nvidia's
(00:07:42)
business model as a whole, especially if
(00:07:44)
the best model only changes every 3
(00:07:46)
months or the model you want to roll
(00:07:47)
out. Okay, well then you're going to
(00:07:48)
have three months to figure out how to
(00:07:49)
make a model work on one chip
(00:07:51)
architecture for that point solution and
(00:07:53)
you know it's fine. Software software
(00:07:54)
advantage of Nvidia is not that
(00:07:55)
important. Then Jensen's super paranoid
(00:07:57)
about losing and frankly it's really
(00:07:59)
hard to hire enough talented chip
(00:08:01)
people. When you look across the market,
(00:08:03)
there is only a few companies who have
(00:08:05)
successfully created a chip architecture
(00:08:08)
software to run the models accurately,
(00:08:11)
run the run the models accurately,
(00:08:12)
right? Like cuz you can look at random
(00:08:14)
APIs of say an Alibaba Quen model and
(00:08:17)
different people are doing all sorts of
(00:08:18)
tricks like quantizing it, but also many
(00:08:20)
other tricks which then end up like
(00:08:22)
making the model quality lower. You
(00:08:23)
know, building a rack scale solution,
(00:08:25)
networking thousands of chips together
(00:08:26)
and then deploying an API and Grock did
(00:08:28)
the whole thing with frankly not that
(00:08:30)
many people. So now it's like okay well
(00:08:32)
I'm Nvidia I want to make four different
(00:08:33)
chip architectures and actually four
(00:08:35)
different point solutions maybe the
(00:08:36)
general purpose and then one here one
(00:08:37)
here one here and in addition my general
(00:08:40)
purpose thing is actually not just like
(00:08:41)
a GPU chip it's like GPU chips CPU chips
(00:08:44)
networking chips NV switch nicks like
(00:08:47)
you know there's many many chips and
(00:08:48)
each of those chips has many chiplets
(00:08:50)
you don't have enough engineering
(00:08:51)
resources right and so like acquiring
(00:08:53)
gro is like how you get those resources
(00:08:54)
to make more solutions for different
(00:08:56)
parts of the market as far as like are
(00:08:58)
they threatened like I think I think
(00:08:59)
like obviously There's some cool
(00:09:01)
startups out there, right, that are
(00:09:02)
raising a lot, right, currently or have
(00:09:05)
raised such as Etched, Maddx, uh,
(00:09:07)
positron, these new age of AI companies.
(00:09:09)
There's also the prior age of like
(00:09:10)
Cerebrris is is out there still, right?
(00:09:13)
You know, Tenstor, etc. And there's so
(00:09:14)
there's a lot of AI chip companies on
(00:09:15)
the startup side, but then there's also,
(00:09:16)
you know, Google's TPU, AMD GPUs, uh,
(00:09:20)
Amazon Tranium, uh, who are all really
(00:09:22)
credible competitors. And then, you
(00:09:24)
know, Meta's MTIA is somewhat credible.
(00:09:26)
and then you know Microsoft Somaya is
(00:09:28)
not credible but like you know maybe it
(00:09:30)
will be one day right so you sort of
(00:09:31)
have like a lot of competition they've
(00:09:33)
got to hold the gates back and so I
(00:09:34)
think
(00:09:35)
>> is there a risk to them being I mean
(00:09:37)
like there's there's risk from all of
(00:09:38)
those companies that I mentioned and and
(00:09:40)
you know effectively California/ Seattle
(00:09:43)
right only two two places there's
(00:09:45)
there's also chips from other parts of
(00:09:46)
the world right obviously China has a
(00:09:48)
number of different AI chip companies
(00:09:49)
that are doing cool things anyone would
(00:09:51)
have told you Grock was you know their
(00:09:53)
business revenue their revenue was not
(00:09:55)
like stellar right in fact they missed
(00:09:57)
revenue last year significantly and yet
(00:09:58)
they got bought right because the value
(00:10:00)
of the IP was there and the value of the
(00:10:01)
team anyone else would have been like
(00:10:02)
well why the heck would I buy this right
(00:10:04)
uh makes no sense there's definitely a
(00:10:06)
credible threat
(00:10:06)
>> yeah and do you think uh CUDA is going
(00:10:09)
to remain that mode I guess a
(00:10:11)
combination of CUDA and whatever came
(00:10:13)
out of the Melanox acquisition like do
(00:10:15)
do those persist as long-lasting
(00:10:17)
advantages
(00:10:18)
>> I think they do I think networking is
(00:10:20)
super important I think uh the CUDA
(00:10:22)
software mode is very important but it's
(00:10:24)
also like changing rapidly right It's an
(00:10:26)
incredible amount of the software that
(00:10:28)
Nvidia GPUs run on is not from Nvidia.
(00:10:30)
It's it's the developer ecosystem that's
(00:10:32)
open sourcing it. When you look at, for
(00:10:34)
example, VLM and SGLANG, right? These
(00:10:37)
support AMD GPUs almost as first class
(00:10:39)
citizens now. And VLM is getting
(00:10:42)
significant support for TPUs for tranium
(00:10:46)
and there will be other chips coming out
(00:10:48)
from startups that also support VLMs
(00:10:50)
SGLang. Now like how difficult is it?
(00:10:52)
You know the the the reason why CUDA is
(00:10:54)
so important is like okay I can do
(00:10:55)
whatever I need to do right programming
(00:10:57)
a GPU.
(00:10:58)
>> I think most AI chips will not be
(00:11:01)
consumed by people programming anything
(00:11:03)
for it.
(00:11:04)
>> They will download an open source
(00:11:06)
inference engine. and they will download
(00:11:07)
an open source model and then they will
(00:11:09)
put it on the and it's really simple to
(00:11:11)
download VLM and like make it work like
(00:11:13)
it's not that hard to set up uh you know
(00:11:15)
a server and Nvidia's putting out a lot
(00:11:16)
of open source software like Triton
(00:11:18)
inference server and and uh Dynamo and
(00:11:21)
all these things to to make it easy
(00:11:22)
because that is the consumption model
(00:11:24)
ultimately for the majority of AI right
(00:11:27)
is and it might be like oh it's my own
(00:11:29)
inference engine but most servers will
(00:11:31)
not run code besides the inference
(00:11:33)
engine and the model it's like not like
(00:11:35)
people are actually like researchers are
(00:11:37)
like writing code for GPUs to see ideas
(00:11:39)
if they'll work and train models and all
(00:11:40)
these things or just mess around with
(00:11:42)
them to figure out you know infra
(00:11:44)
performance or whatever it is but most
(00:11:45)
of it won't be there and so CUDA as a
(00:11:47)
mode CUDA language is like you know like
(00:11:49)
it's like fine right like you know no
(00:11:50)
one actually writes CUDA right like most
(00:11:52)
people write PyTorch and then like torch
(00:11:54)
compile and then they just run it on the
(00:11:55)
GPU they don't write CUDA but a lot of
(00:11:57)
this CUDA mode is like how does PyTorch
(00:11:59)
translate into high performance GPUs and
(00:12:01)
that surface area from when people were
(00:12:04)
just writing like hardcore when people
(00:12:05)
are hardcore writing CUDA kernels to
(00:12:07)
like hey they're writing PyTorch and
(00:12:09)
then it's compiling down to GPUs versus
(00:12:11)
oh I'm just downloading VLM is it is a
(00:12:14)
it is a curve of like not a ton of
(00:12:15)
people that can do CUDA kernels a whole
(00:12:17)
lot more people can do PyTorch right
(00:12:19)
random you know PhDs and random people
(00:12:21)
it's very simple right a crapload of
(00:12:23)
people can do VLM download it run it on
(00:12:26)
a server well if it now supports other
(00:12:28)
chips what is the CUDA mode's recognized
(00:12:30)
this and they've been building software
(00:12:32)
that is not necessarily the CUDA remote
(00:12:34)
and I I can give some examples All
(00:12:35)
right. So the name of the game is fast
(00:12:37)
tokens and lowest cost tokens, right?
(00:12:40)
And lowest cost tokens happens by your
(00:12:42)
chip being fast. But there's also
(00:12:43)
tricks, right? One example, right? Like
(00:12:45)
I mentioned with, you know, the CPX
(00:12:46)
versus Grock, right? Is processing your
(00:12:48)
prefill contacts, right? Super cheap
(00:12:51)
CPX, right? If I'm if I'm care a lot
(00:12:53)
about speed, then Grock. These are
(00:12:54)
optimizations on the hardware side.
(00:12:56)
There's optimizations on the software
(00:12:57)
side as well, right? And so one example
(00:12:59)
is when I'm doing for example if I look
(00:13:02)
at a cloud code or a cursor type
(00:13:05)
application right the workload is like
(00:13:08)
it takes your repo takes the relevant
(00:13:10)
parts of your repo puts it in the
(00:13:12)
context of the LLM it prompts it
(00:13:14)
generates right and if it's an agent
(00:13:15)
mode it it it circulates the context a
(00:13:18)
couple times it'll collapse put things
(00:13:19)
off to the side access different
(00:13:20)
contexts but what's you know especially
(00:13:22)
when you think about an agent for
(00:13:23)
software and you can see this in codeex
(00:13:25)
you know Codex Codex actually not as
(00:13:27)
good as cloud code, but it can do work
(00:13:29)
on time horizons of like 9 10 hours. Um,
(00:13:32)
and do like a big refactor better than
(00:13:34)
cloud code can, even though most of the
(00:13:35)
times cloud code is better. And and
(00:13:37)
what's interesting about Codeex does is
(00:13:39)
it'll like take your repo, it'll
(00:13:41)
identify parts if you're asking it to
(00:13:42)
refactor it, identify parts, write
(00:13:44)
stuff, you know, make like these notes
(00:13:46)
for itself everywhere, collapse the
(00:13:47)
context, switch from this part of the
(00:13:49)
repo to that part of the repo to this
(00:13:50)
part of the repo. But when you think
(00:13:51)
about it, it's like, oh, if this thing
(00:13:52)
is just generating tokens all the time,
(00:13:54)
plus it's switching what my context is
(00:13:57)
constantly, that's really expensive,
(00:14:00)
right? If you think about like what's
(00:14:01)
the cost of inference, um, I want to say
(00:14:04)
it's like it's it's $10 per million
(00:14:06)
tokens of output and or and $3 for
(00:14:10)
decode or 10 for decode and three for
(00:14:12)
prefill. Uh, and so if you think about,
(00:14:14)
oh, it just worked for nine hours on one
(00:14:16)
task, one refactor, huge value. But if
(00:14:18)
it changed context a ton of times and
(00:14:20)
your context is like 30k usually or 50k
(00:14:23)
or you know heading to hundreds of
(00:14:24)
thousands you know how long your how big
(00:14:26)
your repository is and how much context
(00:14:28)
switch now you're spending all this
(00:14:29)
money on on prefill right not the decode
(00:14:31)
tokens but actually why am I like
(00:14:33)
regenerating the KV cache I can actually
(00:14:36)
just like store the KV cache elsewhere
(00:14:38)
and then when I need it again I can pull
(00:14:39)
it and and plop it into CPU memory or
(00:14:41)
into GPU memory. And so Nvidia's got
(00:14:43)
this like KV cache manager and they've
(00:14:45)
been working really hard on like making
(00:14:47)
it so they can interface SSDs and stick
(00:14:50)
the KV cache on there and pull it out
(00:14:51)
whenever they want. So for this kind of
(00:14:53)
workload and then if you do this and you
(00:14:55)
look at like coding as an application
(00:14:57)
and you like look at these coding
(00:14:58)
companies and how much they're paying
(00:14:59)
for prefill versus decode actually
(00:15:01)
majority of their cost is pre-fill
(00:15:02)
tokens not decode tokens because their
(00:15:04)
context is just so large and it's
(00:15:06)
switching all the time even in agent
(00:15:08)
modes. You know, if you can now not have
(00:15:10)
to do the pre-fill, your costs go down
(00:15:11)
dramatically. But that's a very
(00:15:13)
complicated thing to do from a software
(00:15:15)
perspective. You know, companies like
(00:15:16)
Enthropic, Google, OpenAI have already
(00:15:18)
done it. But what about the wide world,
(00:15:20)
right? And so Nvidia is trying to make
(00:15:21)
the open source software for this. And
(00:15:23)
that's like CUDA mode, but it's like
(00:15:24)
actually no, none of this is CUDA,
(00:15:26)
right? like it's like memory management
(00:15:27)
and like you know storage management and
(00:15:30)
when do you call what and how do you
(00:15:31)
transfer it and how do you like spread
(00:15:32)
the KV cache across a bunch of different
(00:15:34)
storage nodes and what happens when you
(00:15:36)
read it and the network congestion just
(00:15:37)
like all these things yeah it's like
(00:15:39)
Nvidia's wheelhouse but it's not CUDA
(00:15:41)
and I think like the easy way to say it
(00:15:42)
is it is the CUDA mode right and so
(00:15:44)
things like this KV cache manager and
(00:15:46)
many other things they're trying to do
(00:15:48)
to reduce the cost of inference like is
(00:15:50)
how they build the new CUDA mode because
(00:15:52)
again today it's it's you know it is
(00:15:55)
quite I mean AMD is like not fully there
(00:15:57)
yet and TPU is being added right now and
(00:15:59)
tranium is being added soon as well to
(00:16:01)
VLM but all of them will have a very
(00:16:03)
good UX for download model run model on
(00:16:06)
VLM by the middle of the year I think
(00:16:09)
right certainly AMD is already there by
(00:16:11)
the end of this quarter we have
(00:16:12)
something that like tests this right
(00:16:13)
it's called inferencemaxa it's open
(00:16:15)
source all the code is and the results
(00:16:16)
are uh but we run across I think $60
(00:16:19)
million of GPUs which are donated to us
(00:16:21)
by companies like Nvidia AMD openai
(00:16:24)
Microsoft Amazon on Crusoe, Core Weave,
(00:16:27)
Together AI, uh all these companies are
(00:16:30)
sponsoring GPUs for us to run this.
(00:16:31)
We're running VLM and SDG Lang every
(00:16:33)
night on, you know, nine different kinds
(00:16:35)
of GPUs on a variety of different models
(00:16:37)
and different work uh context lens and
(00:16:39)
all these things, right? To see the
(00:16:40)
performance and you can see the
(00:16:41)
performance moving every day or pretty
(00:16:42)
often because the software changes all
(00:16:44)
the time. And so like the fact that this
(00:16:46)
exists is the cuda boat, right? It's not
(00:16:48)
that like AMD you can do this on their
(00:16:50)
chips, Nvidia can do this on their
(00:16:51)
chips. It's oh when the new model comes
(00:16:52)
out, how fast does it get to peak
(00:16:54)
performance? because you know it's it's
(00:16:55)
a moving target or hey can I implement
(00:16:57)
this KV cache management thing how hard
(00:16:59)
is it how many engineers do I need oh
(00:17:01)
just one great like or 10 great if I
(00:17:03)
need a hundred people to develop it like
(00:17:05)
Google and you know so on and so forth
(00:17:06)
did then that's much harder
(00:17:08)
>> do you think AMD can uh catch up
(00:17:09)
>> I think AMD will be caught up at times
(00:17:12)
and very behind at other times like
(00:17:14)
currently they're super far behind right
(00:17:16)
because Blackwell is just way better
(00:17:17)
than MI355 um and then you know Rubin
(00:17:20)
comes out and they'll be way way behind
(00:17:21)
but then AMD's new chip comes out and
(00:17:22)
AMD will be caught up or evenlight ly
(00:17:24)
ahead on a hardware perspective.
(00:17:26)
Software's behind, right? And you have
(00:17:27)
this like leaprogging and and AMD is a
(00:17:29)
very credible second competitor. I don't
(00:17:31)
think they'll go beyond like I think
(00:17:33)
they'll stay in single digits market
(00:17:34)
share. Single digit percentage market
(00:17:36)
share.
(00:17:37)
>> Single digit percentage market share is
(00:17:38)
>> still [laughter] pretty good.
(00:17:40)
>> Yeah. I mean, Nvidia's revenue this year
(00:17:41)
is going to be like
(00:17:43)
>> it's a lot.
(00:17:44)
>> The three gajillion
(00:17:45)
dollars.
(00:17:46)
>> I think it's actually four gajillion.
(00:17:48)
[laughter]
(00:17:49)
[gasps]
(00:17:49)
>> What about all the startups? You
(00:17:51)
mentioned a few. So there's a cerebrus
(00:17:54)
on the one end of the spectrum and then
(00:17:56)
newer ones edged and and others if if
(00:17:59)
AMD has a you know uphill battle in
(00:18:02)
front of them like do you think those
(00:18:03)
guys can take significant market share?
(00:18:06)
you sort of the whole specialization
(00:18:07)
game, right? You you have to specialize
(00:18:09)
because you're never going to beat
(00:18:10)
Nvidia at their own game, right? They're
(00:18:12)
going to have the supply chain unlock.
(00:18:14)
They're going to get to the newest
(00:18:15)
memory technology or process technology
(00:18:17)
or whatever packaging technology,
(00:18:18)
whatever it is, sooner than you and
(00:18:20)
they're just going to crush you, right?
(00:18:21)
If you play their game, you have to AMD
(00:18:23)
is trying to play Nvidia's game, but AMD
(00:18:26)
is like extremely good at engineering
(00:18:28)
silicon, right? Everyone else has to has
(00:18:31)
to has to try something weird or
(00:18:33)
different, right? And so when you look
(00:18:34)
at Etched or Maddx or Posatron or
(00:18:37)
Cerebrris or Tenstor, you go to look at
(00:18:39)
all these companies, right? There are
(00:18:41)
unique things about what they're doing
(00:18:44)
and it's not clear if AI models will
(00:18:47)
still be within that realm when that
(00:18:49)
comes out, right? Uh does oh now people
(00:18:52)
use like engrams and other sparse
(00:18:54)
attention techniques. Is that like is
(00:18:56)
does that change like some of the
(00:18:58)
specializations people are doing or hey
(00:19:00)
people are now doing like you know
(00:19:02)
models are now sparse instead of being
(00:19:04)
dense models does that change things
(00:19:06)
there's so many optimizations and
(00:19:08)
changes on the model side and you can't
(00:19:10)
predict what's going to happen with the
(00:19:12)
ML research easily at least you can't
(00:19:14)
the thing you're optimizing for today
(00:19:16)
has to be a vision of where AI will be
(00:19:18)
in 2 years and Nvidia's fully accepted
(00:19:20)
they don't know where that's going to be
(00:19:21)
that's why they have a portfolio of
(00:19:24)
chips now not just one GPU line, right?
(00:19:26)
It's not just Hopper, Blackwell, Reuben.
(00:19:29)
Now, it's going to be, you know, it's
(00:19:30)
not Ampure, Hopper, you know, you know,
(00:19:31)
it's not that line. It's like there's a
(00:19:33)
variety of chips to serve the different
(00:19:34)
markets um and different possible
(00:19:36)
scenarios. They think each of them has
(00:19:38)
this vision today, but oh, it might turn
(00:19:39)
out the general purpose one sucks and
(00:19:41)
and actually AI models have developed in
(00:19:42)
a way where CPX or Grock style chips are
(00:19:45)
the best, right? Well, okay, now we have
(00:19:46)
a solution for that market. And so, I
(00:19:48)
think that's the challenge with the
(00:19:49)
startups. With that said, I think
(00:19:51)
they're all taking very interesting
(00:19:52)
bets. I think it's I think it's much
(00:19:54)
more exciting than the first wave of AI
(00:19:57)
hardware uh bets graph course rebringing
(00:20:05)
the memory on the chip they sort of just
(00:20:07)
made a bet and they optimized for a
(00:20:08)
certain kind of model all similar kinds
(00:20:10)
of model and it didn't end up working
(00:20:11)
out for a long time right they had to
(00:20:13)
pivot and they had to work on a lot of
(00:20:14)
things and it took a long time I think
(00:20:16)
these companies have like a really clear
(00:20:18)
vision of what they think models will
(00:20:20)
look like right like Etch does Maddx
(00:20:22)
does, Posatron does, and that's what's
(00:20:24)
really cool about it between the three
(00:20:25)
of them, uh, these new age. So, I mean,
(00:20:27)
I'm I'm excited for them. I'm very very
(00:20:29)
skeptical. I don't know what uh what a
(00:20:31)
venture capitalist views as likely
(00:20:33)
chances of succeeding, but I think all
(00:20:35)
of them are less than 1%. Right?
(00:20:38)
>> But, you know, that's that's that's a
(00:20:40)
>> but the world where they win is a
(00:20:42)
multi-silicon kind of world where any
(00:20:44)
given customer uses a range of different
(00:20:48)
GPUs. It could it could or it could be
(00:20:50)
any given customer has like one workload
(00:20:52)
they care a lot about. Anthropic clearly
(00:20:55)
does not give a crap about videogen
(00:20:56)
image gen right they just don't care. Um
(00:20:59)
on the flip side, company like
(00:21:00)
midjourney cares a lot about image and
(00:21:02)
videogen, right? Image and videogen is
(00:21:05)
very very like like I mentioned like
(00:21:07)
it's a very like it's not very memory
(00:21:08)
bandwidth heavy. It loves loves loves
(00:21:11)
compute, right? Whereas inference of
(00:21:13)
large language models in the style of
(00:21:15)
like you know this these you know say
(00:21:17)
for example coding agents cares a lot
(00:21:19)
about decoding for long streams of time.
(00:21:21)
Um and that's very memory bandwidth
(00:21:23)
heavy right? And so there's like that's
(00:21:24)
like a simple example, but there's a lot
(00:21:26)
more nuance there in terms of like even
(00:21:28)
like the size of like the matrix
(00:21:30)
multiply, you know, the tensor cores
(00:21:31)
that you you know the systolic arrays
(00:21:32)
that you use or the ratios of networking
(00:21:34)
and memory memory and like what's that
(00:21:36)
memory hierarchy look like and you know
(00:21:37)
what are you doing for different kinds
(00:21:38)
of attention and like oh like all these
(00:21:40)
sorts of things like there's a lot of
(00:21:42)
specialization here and so some people
(00:21:44)
are betting big on on different types of
(00:21:45)
specialization and I I think like you
(00:21:47)
could clearly see a world where
(00:21:49)
companies do care about different stuff
(00:21:51)
right like like if for example a chip
(00:21:53)
optimized for video and image generation
(00:21:56)
existed today and it was better than
(00:21:58)
Nvidia or Nvidia made it. I think
(00:22:00)
Midjourney would absolutely only use
(00:22:02)
that for inference. I think for training
(00:22:03)
they'd still use the general purpose
(00:22:04)
thing and as would like Meta and Google
(00:22:06)
would like they should do that, right?
(00:22:08)
And hey, Meta actually has two lines of
(00:22:10)
AI chips there. MTIA there's a line
(00:22:13)
that's focused on recommendation systems
(00:22:15)
and then there's a line that's focused
(00:22:16)
on Gen AI. The GI one is a new line, but
(00:22:19)
that recommendation systems ch line is
(00:22:20)
still continuing, right? It's not sexy.
(00:22:22)
No one cares because there's no and bite
(00:22:24)
dance also has a recommendation system
(00:22:26)
line of chips and it's not really
(00:22:27)
focused on Jedi which is fine because
(00:22:30)
you know this is a $200 billion business
(00:22:31)
or something which is just deciding what
(00:22:33)
ad to serve me right and what order to
(00:22:35)
put my friends stories and you know
(00:22:36)
things like this so so I think like it's
(00:22:38)
perfectly fine for there to be
(00:22:39)
specialized AI chips given the target
(00:22:41)
market is big enough and you have to
(00:22:42)
have vision to know what that target
(00:22:44)
market is unless you're hyperscaler then
(00:22:46)
you can like just like you can just use
(00:22:47)
general purpose until you've like it's
(00:22:49)
clearly there and then you can make your
(00:22:50)
asich right
(00:22:51)
>> fascinating turning to the geopolitical
(00:22:53)
aspect of of uh all of this which is
(00:22:56)
always fun. Huawei and Nvidia in China
(00:23:00)
last year that was like 10 or 12% of
(00:23:02)
their overall revenue and this year they
(00:23:04)
they were saying that their market share
(00:23:06)
but has basically dropped to not very
(00:23:08)
much. Is that Huawei chips? Is that
(00:23:10)
restrictions? Is that tariffs? Uh what's
(00:23:12)
happening?
(00:23:12)
>> It's a variety of things actually in in
(00:23:14)
some in some quarters last year. uh it
(00:23:16)
was even north of 20 I think but I don't
(00:23:18)
remember exactly but anyways you know if
(00:23:20)
you look at 2022 China was almost the
(00:23:23)
size of the US in terms of buying server
(00:23:24)
hardware right almost not quite but
(00:23:26)
getting there um and it looked like they
(00:23:28)
were going to be the same size as
(00:23:30)
America in like a year or two after that
(00:23:31)
right and if you look at like global
(00:23:32)
data center capacity global cloud
(00:23:35)
capacity etc etc etc it's American
(00:23:37)
companies and Chinese companies right
(00:23:38)
that dominate the world American
(00:23:39)
companies obviously doing a lot better
(00:23:40)
here but both of those dominate the
(00:23:42)
world and if you look at like every
(00:23:44)
industry right you know it's It's it's
(00:23:46)
very clear that like China wants to
(00:23:48)
insource stuff, right? So in 2015, they
(00:23:50)
made these 5-year plans for two 2020 and
(00:23:52)
2020 uh five where they set the
(00:23:55)
percentage of semiconductors they wanted
(00:23:57)
uh domestically produced and they've
(00:23:59)
missed the goal both times which is
(00:24:01)
fine, right? They set really aggressive
(00:24:02)
goals and even you know shoot for the uh
(00:24:04)
moon even if you miss you hit the stars,
(00:24:06)
right? And that's sort of what's
(00:24:07)
happened, right? Like look, China is not
(00:24:09)
caught up on, you know, leading edge
(00:24:11)
semiconductors, but microcontrollers
(00:24:13)
from China are almost as good as the
(00:24:15)
microcontrollers are as good and cheaper
(00:24:17)
than the ones from Texas Instruments or
(00:24:19)
ST Micro or, you know, etc., right? Or
(00:24:21)
like this power random power chip is
(00:24:22)
better than or the same as the one from
(00:24:24)
like another company, right? And so
(00:24:25)
they've really built up a semiconductor
(00:24:27)
industry and started insourcing a lot
(00:24:28)
more. I don't see why China wouldn't be
(00:24:30)
buying you know 30 40% of the world's AI
(00:24:33)
chips and the US like 50 60% and then
(00:24:35)
the rest of the world like you know and
(00:24:37)
when I say US I mean US origin companies
(00:24:39)
that seems like a more natural state for
(00:24:41)
the world but there are restrictions and
(00:24:43)
and hey this is the biggest change in
(00:24:46)
human history maybe ever knowledge work
(00:24:48)
and you know everything that's going to
(00:24:50)
happen there and and then eventually
(00:24:51)
like robotics and all these things like
(00:24:53)
you know obviously there's there's a lot
(00:24:54)
of geopolitical stuff and so there are
(00:24:55)
restrictions Nvidia's been handcapped
(00:24:58)
hand handicapped from selling their best
(00:24:59)
chips to China. And so that's obviously
(00:25:01)
impacted the sales a lot because like
(00:25:03)
why would you do that? And so when you
(00:25:05)
look at who rents the most GPUs in the
(00:25:07)
world, it's three companies, right? So
(00:25:09)
one of them is obviously OpenAI. Second
(00:25:10)
one, actually they were bigger than
(00:25:12)
OpenAI. They are bigger than OpenAI
(00:25:13)
today or no, they were bigger than
(00:25:15)
OpenAI than OpenI. Eclipsed them
(00:25:16)
recently is Bite Dance. Bite Dance runs
(00:25:18)
rents tons of chips from Oracle and
(00:25:21)
Google and and you know many other cloud
(00:25:23)
companies because they couldn't get the
(00:25:25)
chips they need in in China. They're
(00:25:27)
mostly just serving Tik Tok, right?
(00:25:29)
Okay. Well, they they're not allowed to
(00:25:31)
buy them and that sucks, but you know,
(00:25:32)
they're they're allowed to rent them.
(00:25:33)
And so, okay, if I'm not allowed to get
(00:25:34)
the best ones, I'm going to rent
(00:25:35)
externally. And if Bite Dance is the
(00:25:37)
second biggest renter of GPUs in the
(00:25:38)
world, that's substituting demand that
(00:25:39)
would have been built in China in many
(00:25:41)
cases. It's instead being built in
(00:25:42)
Malaysia. And Oracle has over a gigawatt
(00:25:44)
of capacity in Malaysia that Bite Dance
(00:25:46)
is going to take, right? So, things like
(00:25:48)
this are, you know, you know, hundreds
(00:25:50)
of thousands, if not millions of chips,
(00:25:51)
tens of billions of dollars of cap
(00:25:53)
capacity that would go to China, but
(00:25:54)
it's not. that it's going to Malaysia
(00:25:55)
instead as an example. Another sort of
(00:25:57)
point around this is China's like you
(00:25:59)
know they've had these 5-year plans. So
(00:26:01)
and and you know the way these
(00:26:02)
initiatives work from China is there is
(00:26:04)
like some top down ordering but then
(00:26:06)
they just kind of whip the whole like
(00:26:07)
everyone just kind of gets into it and
(00:26:08)
it's really cool like I don't think it's
(00:26:10)
as top down as many people think. Like I
(00:26:12)
think the entire country is like
(00:26:13)
semiconductor pill right there are
(00:26:16)
dramas where people fall in love in the
(00:26:18)
fab or dramas where people fall in love
(00:26:22)
and they're photovoltaic like solar cell
(00:26:24)
researchers and engineers and it's like
(00:26:26)
it's like this is just the backdrop and
(00:26:28)
it's like actually this is it's like
(00:26:29)
super cool for your like significant
(00:26:32)
other to be that semiconductor engineer
(00:26:34)
or to be that photovoltaic you know uh
(00:26:37)
solar panel researcher
(00:26:38)
>> as opposed to an influencer
(00:26:40)
>> as opposed to an influencer. Right. Like
(00:26:41)
I'm sorry. Love Island is I I I watched
(00:26:44)
like for 10 minutes cuz I was forced to.
(00:26:45)
I was like this is freaking terrible.
(00:26:47)
[laughter] Um but you know like um
(00:26:50)
>> we are so cooked.
(00:26:51)
>> No, you know [laughter] seriously we're
(00:26:52)
cooked. We're cooked. And I think I
(00:26:53)
think like when you think about like
(00:26:55)
this happens it's like it's diffused
(00:26:56)
into drama even people like like there's
(00:26:59)
multiple dramas like taking place about
(00:27:01)
semiconductor industry and and they're
(00:27:03)
like romance comedy like the entire
(00:27:06)
spectrum, right? Drama like it's like
(00:27:08)
it's like what the heck is going on?
(00:27:09)
Anyways, you have all these provinces,
(00:27:11)
you have all these local cities studying
(00:27:14)
out ordinances and giving out subsidies
(00:27:17)
and all sorts of stuff, right? It's
(00:27:19)
truly like crazy. Like there's some
(00:27:21)
national level stuff like, "Oh, no taxes
(00:27:23)
on uh this. Oh, we're going to ban a few
(00:27:25)
things." But as far as I understand, the
(00:27:27)
national government has not banned
(00:27:29)
Nvidia's H20 or H200. But the local ones
(00:27:33)
have, right? A lot of local ones have
(00:27:35)
said, "No, you know, you must use China
(00:27:37)
manufactured chips." And it's like, who
(00:27:38)
told you that, you know, you're here to
(00:27:40)
uphold this? It's like, does it matter,
(00:27:41)
right? I mean, like, it's it's it's cool
(00:27:43)
because then you have this like survival
(00:27:44)
of the fittest, all these all these
(00:27:46)
provinces and cities are trying to
(00:27:48)
attract different companies with
(00:27:49)
different types of subsidies and grants
(00:27:52)
and industrial parks and like all these
(00:27:54)
different things
(00:27:55)
>> and then like the ones who succeed
(00:27:57)
actually develop an industry and they
(00:27:58)
take over.
(00:27:59)
>> This how one thinks of of China, right?
(00:28:01)
It almost sounds like more like the US
(00:28:02)
or like with the federal government and
(00:28:04)
states where the provinces have
(00:28:05)
authority over their purchasing. It's
(00:28:07)
It's actually like uh great. There's
(00:28:09)
this one um Tik Tok or not Tik Tok, Tik
(00:28:12)
Tok and Instagram like uh person and
(00:28:14)
they're like they they like sing it.
(00:28:15)
They're like if you want to if you want
(00:28:16)
to buy things in China, make sure you go
(00:28:18)
to the right place. And then they just
(00:28:19)
say the most random [ __ ] and name the
(00:28:21)
city. And then you look into it and
(00:28:22)
you're like wow this city has the entire
(00:28:24)
supply chain for this. Um and it's like
(00:28:26)
lampshades and then it names the city.
(00:28:27)
It's like what the [ __ ] There's a city
(00:28:29)
that specializes in lampshades. Like
(00:28:31)
it's like and it's like microphone arms
(00:28:33)
like microphones. It's like it's like
(00:28:34)
literally there's a city in China that
(00:28:36)
specializes in
(00:28:36)
>> guitars as well, right? This one one
(00:28:38)
city that became the guitar capital of
(00:28:39)
the world.
(00:28:39)
>> It's literally everything.
(00:28:41)
>> Literally everything. There's a city and
(00:28:43)
it's not like hey specifically for uh
(00:28:46)
camera arms for example, there's ball
(00:28:48)
bearings in this and the ball bearings
(00:28:49)
are like there's ball bearings. There's
(00:28:50)
multiple manufacturers of ball bearings
(00:28:52)
for camera arms
(00:28:53)
>> and then like most of the camera arms in
(00:28:55)
the world come from that one city. It's
(00:28:56)
like what the hell is going on? Um and
(00:28:58)
and so like the semiconductor industry I
(00:29:00)
think people don't realize is absurdly
(00:29:01)
specialized. I'm not answering your
(00:29:03)
question. I'm just going a little bit of
(00:29:04)
a rant because I think people don't
(00:29:05)
understand China semiconductors. It's
(00:29:07)
really sick or semiconductors in
(00:29:08)
general. But like you know like in Japan
(00:29:11)
they like focus on a few different types
(00:29:13)
of chemicals and they're the best at it
(00:29:15)
and it's like almost a cultural thing,
(00:29:16)
right? Japanese people were so precise
(00:29:18)
like with sushi and like it's all about
(00:29:20)
the trade and the craft and like you
(00:29:21)
know the French food in Japan is better
(00:29:23)
than the French food in France because
(00:29:24)
the f the Japanese chefs went there and
(00:29:26)
then come back and they perfected it in
(00:29:27)
Japan and like cuz they're so precise
(00:29:29)
and and there's so many different like
(00:29:30)
things that like Japan is so good at
(00:29:32)
because they're so precise and like
(00:29:34)
dedicated to the craft and it comes out
(00:29:35)
of like I don't know like samurai
(00:29:37)
culture or something I don't know right
(00:29:38)
like I don't exactly know how that
(00:29:40)
culture came up and so when you look at
(00:29:41)
like and it's like across the world
(00:29:43)
there's different places where things
(00:29:44)
like this happen right like Oh, like the
(00:29:46)
Netherlands makes EUV tools. Cool. I
(00:29:49)
guess so. And you look across the
(00:29:50)
semiconductor industry. There's a famous
(00:29:51)
economic essay called I pencil or
(00:29:54)
something like that. Or talking about
(00:29:55)
how the pencil like a simple pencil
(00:29:57)
comes from like oh the rubber comes from
(00:29:59)
like Indonesia for the eraser and the
(00:30:02)
graphite comes from this mine here and
(00:30:03)
and the wood comes from these aspen
(00:30:05)
trees in Canada and like you actually
(00:30:06)
can't make a pencil without aggregating
(00:30:08)
this entire supply chain. semiconductor
(00:30:10)
industry is like way crazier because
(00:30:11)
like I would say there's like 15 or 20
(00:30:13)
countries that could shut down the
(00:30:14)
entire semiconductor industry, right?
(00:30:16)
Even like Austria could, right? And and
(00:30:17)
it's like what? And it's like well yeah,
(00:30:18)
there's two different companies there
(00:30:19)
who have like 90% share in like some
(00:30:22)
random niche stuff.
(00:30:23)
>> And it's like okay, cool. I guess
(00:30:25)
Austria can and oh yeah, those two
(00:30:27)
companies only like have less than a
(00:30:28)
billion of revenue, but they just happen
(00:30:30)
to have lynchpin critical things. And
(00:30:32)
there's lynch pin critical things
(00:30:33)
everywhere because the process is so
(00:30:34)
complicated. And so China's been trying
(00:30:36)
to replicate this. Um,
(00:30:37)
>> is there one thing they're missing that
(00:30:38)
they don't have yet?
(00:30:39)
>> I think there's a lot of things. I think
(00:30:41)
if you were to close your eyes and say
(00:30:42)
or if you were to cut off every country
(00:30:43)
and say there's no more globalism, China
(00:30:45)
has the most vertical stack in
(00:30:47)
semiconductors today and they're the
(00:30:48)
best at semiconductors in the world
(00:30:50)
because their fabs could still run
(00:30:51)
somewhat on a lot of things because they
(00:30:53)
have built some of these chemical supply
(00:30:54)
chains, right? Like TSMC for certain
(00:30:57)
kinds of chemicals 100% share from
(00:30:58)
Japan, right? Or Intel same thing,
(00:31:00)
right? or you know for certain kinds of
(00:31:02)
tools 100% share from Netherlands or
(00:31:03)
100% share from you know this American
(00:31:05)
company or that you know Austrian
(00:31:06)
company or this or that right like
(00:31:07)
there's just all these like you know
(00:31:09)
this Swiss company like there's just all
(00:31:10)
these different places have 100% share
(00:31:12)
it might be one company might be three
(00:31:14)
companies but geographically or in the
(00:31:16)
same area and China's built that up
(00:31:18)
right because they've created this made
(00:31:19)
in China initiatives which just plowed
(00:31:22)
money into it and they've got this
(00:31:23)
culture of like the diffused like you
(00:31:25)
know these provinces like yeah I just
(00:31:27)
decided I'm going to [ __ ] focus on or
(00:31:29)
might not even be might not even be the
(00:31:30)
Right. It may be the like, you know,
(00:31:32)
someone brought it there and decided and
(00:31:34)
then people were like, "Oh, wow. You're
(00:31:35)
doing that?" Me, too. Like, I'm a Patel
(00:31:37)
and I grew up in a motel and guess what?
(00:31:38)
We like almost all the Patels I know
(00:31:40)
grew up in a motel and it's because some
(00:31:43)
random Patel immigrated to America and
(00:31:46)
like worked at a hotel motel and then
(00:31:48)
bought a motel and then like it just
(00:31:49)
started happening, right? Like you sort
(00:31:50)
of like these things are serendipitous
(00:31:52)
of sorts and like I don't know like and
(00:31:53)
it's like I I view it as the same kind
(00:31:55)
of specialization, right? Chinese cities
(00:31:57)
are like starting to do the these
(00:31:58)
things. China's missing a lot of things,
(00:32:00)
right? I would say like if you say minus
(00:32:01)
10 years tech, China's complete and no
(00:32:04)
one else is complete, right? Taiwan is
(00:32:06)
not complete. Their the fabs would shut
(00:32:07)
down without foreign supply, you know,
(00:32:09)
and you go down or you go across the
(00:32:10)
stack. Uh but if you go to 10ear tech,
(00:32:12)
maybe maybe more like 20-year tech, you
(00:32:14)
could get a fully vertical supply chain
(00:32:15)
in China, which I do not think any
(00:32:17)
country could do. Like America could not
(00:32:19)
build a fully vertical fab without stuff
(00:32:20)
from elsewhere, even if it's 20-y old
(00:32:22)
tech.
(00:32:23)
>> Um probably not even 40-y old tech. And
(00:32:25)
so, so that's interesting. But then when
(00:32:27)
the flip side is like well like you kind
(00:32:28)
of do need specialization. That's how
(00:32:31)
that chemical gets the purest best, you
(00:32:33)
know, most engineered, you know, or that
(00:32:35)
that slurry of chemicals or that, you
(00:32:38)
know, that gas or like that tool because
(00:32:40)
every smart person or a lot of them in
(00:32:43)
that country grew up around that culture
(00:32:45)
and like every the supply chain is there
(00:32:47)
and like everyone kind of knows and like
(00:32:48)
it's like a a driveaway and like sort of
(00:32:51)
like this is what makes supply chains
(00:32:53)
work is that there is this
(00:32:54)
specialization and the best of the best
(00:32:56)
only comes when you have that hyper
(00:32:57)
specialization. So, China doesn't have
(00:32:58)
lithography. their lithography is like
(00:33:00)
10 years behind and I think it'll be 5
(00:33:02)
years behind in a couple years, right?
(00:33:03)
They're catching up fast. I don't think
(00:33:05)
they'll be as good as ASML for a long
(00:33:08)
time. You know, maybe I don't know,
(00:33:10)
maybe they will be, you know, China. You
(00:33:11)
shouldn't ever underestimate China, but
(00:33:13)
like and Chinese engineers or, you know,
(00:33:14)
but like for a while, right? Or like,
(00:33:17)
you know, I don't think they'll be able
(00:33:18)
to make leading edge chemicals like many
(00:33:20)
chi uh Japanese companies or many
(00:33:21)
American companies and their tools and
(00:33:23)
like you just go across the supply
(00:33:24)
chain. They're not hey forefront on
(00:33:28)
really anything in the manufacturing
(00:33:30)
supply chain on the design supply chain.
(00:33:32)
There's some things that they're
(00:33:34)
starting to be similar par but like
(00:33:36)
cheaper or like a year or two behind but
(00:33:39)
cheaper and that's like fine for a lot
(00:33:40)
of stuff. An example of that is Huawei,
(00:33:42)
right? Huawei in mobile phones was on
(00:33:45)
par with Apple like entirely. Yeah. And
(00:33:47)
they had become Apple TSMC's biggest
(00:33:49)
customer and they were designing the
(00:33:50)
best thing and they are number one in
(00:33:52)
telecom and their tech is just literally
(00:33:53)
better. And so when you think what
(00:33:56)
happens, you know, is is is China
(00:33:59)
missing anything? It's like they're they
(00:34:01)
don't they don't they don't have the
(00:34:02)
best of much, you know, today in the AI
(00:34:05)
supply chain. they have a complete
(00:34:06)
package and a couple years behind and
(00:34:08)
they'll figure out how to make it
(00:34:09)
cheaper slash do more slashcatch up and
(00:34:11)
and create a robust industry. But
(00:34:14)
there's a reason like I don't think that
(00:34:16)
like Jensen is scared of AMD really.
(00:34:18)
He's paranoid. I mentioned he's
(00:34:19)
paranoid. I'm sure he's a little bit
(00:34:20)
scared of them, right? Like I think some
(00:34:22)
of the things that they've done are
(00:34:23)
reactions and competitive dynamics with
(00:34:25)
AMD or Google's TPUs or whatever. Right.
(00:34:27)
There was a Core Weave deal today and I
(00:34:29)
think that's directly the result of what
(00:34:30)
Google's been doing.
(00:34:31)
>> Yeah. the two billion pipe that Nvidia
(00:34:34)
announced into
(00:34:35)
>> Nvidia invested two billion in core but
(00:34:37)
what's more important is that that's
(00:34:38)
like sort of just like the sticker
(00:34:39)
what's really relevant is Nvidia is
(00:34:41)
going to work with core reef to uh
(00:34:43)
acquire um and and back stop and all
(00:34:46)
these things the the land the power the
(00:34:49)
energy the transmission that help build
(00:34:51)
the data center all this capital side
(00:34:53)
stuff that because Nvidia has so much
(00:34:54)
money they can backs stop corore weave
(00:34:56)
doing it because corore reweave then can
(00:34:58)
be the one who generates demand anyways
(00:35:00)
there's like because Google was doing
(00:35:01)
And they did that with like a couple
(00:35:02)
companies such as Fluid Stack and
(00:35:04)
Terowolf and Cipher. These are some
(00:35:06)
public deals that have been announced.
(00:35:07)
And so Google is doing that with TPUs
(00:35:08)
and Nvidia reacted, right? Um, and so in
(00:35:11)
the same way, I think Nvidia's reacted
(00:35:12)
to AMD. And in the same way, I think the
(00:35:15)
thing is Nvidia is like deathly
(00:35:16)
terrified of Huawei
(00:35:18)
>> because Huawei has caught up to Apple
(00:35:20)
and actually surpassed them as TSMC's
(00:35:22)
biggest customer before they got banned,
(00:35:24)
right? They did just crush Nokia, Sony,
(00:35:26)
Sony, Ericson, etc., right? Like the
(00:35:28)
entire telecom supply chain. they just
(00:35:30)
like completely destroyed them. And
(00:35:32)
there's so many other areas like they
(00:35:33)
straight up made a folding phone, right?
(00:35:35)
You know, I have a Samsung folding
(00:35:37)
phone. They have a folding phone that's
(00:35:38)
better than Samsung's folding phone.
(00:35:40)
>> And it's like, bro, what? Like, you
(00:35:42)
know, you know, Huawei's really really
(00:35:44)
cracked. And so, of course, they're
(00:35:46)
terrified of uh and and Huawei is the
(00:35:48)
most vertical company in the world. No
(00:35:49)
company is more verticalized than
(00:35:51)
Huawei, which then leads to huge
(00:35:53)
innovations. It's something that we
(00:35:55)
don't fully appreciate in the US, but
(00:35:56)
like when you travel in Europe, you see
(00:35:57)
everybody who's like honors honor phones
(00:36:00)
and it's like the the footprint of
(00:36:01)
Huawei is huge in in phones in a way
(00:36:03)
that people
(00:36:05)
>> not just phones um you know security
(00:36:07)
cameras actually they think they have
(00:36:08)
like you know
(00:36:09)
>> a lot of training on the [laughter]
(00:36:12)
that a captive group of testers.
(00:36:15)
>> Exactly. Exactly. Um I think I think
(00:36:17)
Huawei is terrifying right and and so
(00:36:19)
like yes their chips are not as good
(00:36:20)
today
(00:36:20)
>> and is that is is that already
(00:36:22)
happening? I mean obviously the US and
(00:36:24)
China are the two biggest markets but
(00:36:25)
like for other markets I don't know UAE,
(00:36:27)
Middle East, Europe are Nvidia and
(00:36:30)
Huawei already uh headto-head in
(00:36:32)
>> well they shipped a little bit but like
(00:36:35)
mostly just like sticker capacity like
(00:36:37)
there's nothing like no no like I would
(00:36:38)
say like a little bit as in like a few
(00:36:39)
servers not like a billion dollars worth
(00:36:41)
of stuff right the thing is China's
(00:36:44)
supply chain has to ramp up right um
(00:36:46)
China China's express goal is to have
(00:36:48)
all inter internalized but then like a
(00:36:50)
company like Alibaba's like I I don't
(00:36:52)
want to use Huawei, right? Like I want
(00:36:54)
to make I want to use Nvidia and just
(00:36:56)
make the best freaking models, right?
(00:36:57)
Because that's my business. My business
(00:36:58)
is not, you know, using a Huawei thing,
(00:37:00)
but it's like, okay, it's being pushed
(00:37:01)
upon me. There's other companies too,
(00:37:02)
like Cameron Con and so on and so forth.
(00:37:04)
And so the sort of like supply chain,
(00:37:06)
you know, companies in China don't want
(00:37:07)
to use they're kind of encouraged
(00:37:09)
obviously and pushed, you know, you must
(00:37:11)
some local provincial government be
(00:37:13)
like, well, you're doing this much
(00:37:14)
business here. You got to do this,
(00:37:15)
right? Like there's all sorts of like
(00:37:16)
crazy stuff that, you know, pushing of
(00:37:18)
of companies to use Huawei. Um the
(00:37:20)
challenge is probably can't manufacture
(00:37:22)
enough, right? We've like done a lot of
(00:37:24)
work on this. Um and we've just put it
(00:37:26)
for free, you know, instead of like to
(00:37:28)
our customers because it's like
(00:37:29)
something that's like national security,
(00:37:30)
which is how was Huawei actually
(00:37:32)
building chips? Well, actually they were
(00:37:34)
uh using shell companies to get chips
(00:37:36)
from TSMC and using different methods of
(00:37:38)
like sneaking HBM, which is memory, from
(00:37:41)
you know, Korea through Taiwan to China,
(00:37:43)
right? Like all sorts of crazy stuff
(00:37:44)
we've reported on and and people it's
(00:37:46)
like a whack-a-ole, right? they shut it
(00:37:48)
down or like tools that get shipped to
(00:37:50)
China and they shouldn't be for you know
(00:37:51)
making leading edge chips but they
(00:37:53)
actually are um and all these sorts of
(00:37:55)
things are happening because they can't
(00:37:56)
make everything and if they want to make
(00:37:58)
the leading edge stuff they do need to
(00:37:59)
rely on the foreign supply chain quite a
(00:38:00)
bit in terms of the upstream supply
(00:38:02)
chain right uh memory logic chips uh
(00:38:05)
tools for fabs chemicals for fabs etc
(00:38:07)
Huawei cannot satisfy the market um
(00:38:09)
because there's not enough advanced
(00:38:11)
leading edge capacity in memory logic
(00:38:14)
you know and all all these other things
(00:38:16)
uh domestically in and they're trying to
(00:38:18)
build it as fast as they can, but that
(00:38:19)
means there's just not enough to satisfy
(00:38:20)
the market. So, Nvidia has a market. I
(00:38:22)
think they'll figure out how to sell
(00:38:23)
chips to China. And Jensen's in China, I
(00:38:25)
think, like right now or was yesterday.
(00:38:26)
And so, like he's clearly like wheeling
(00:38:28)
and dealing to try and get his chips
(00:38:30)
into China because, you know, I think
(00:38:32)
Nvidia's argument is if we sell them
(00:38:34)
chips, then they won't, you know, there
(00:38:35)
won't be enough of as much of a domestic
(00:38:37)
market. The feedback loop for software
(00:38:38)
and everything else won't be there. That
(00:38:40)
will sort of like really challenge it,
(00:38:42)
right? Like most of the open source
(00:38:44)
software for AI has a lot of Chinese
(00:38:47)
contributors, right? VLM and PyTorch, SG
(00:38:51)
Lang and like all of these other like
(00:38:54)
libraries and things that are just like
(00:38:55)
you know and and and it goes to
(00:38:56)
low-level software especially right like
(00:38:58)
a lot of the best open source stuff is
(00:39:00)
actually just from like a Chinese
(00:39:01)
company who decided to open source it
(00:39:02)
and same with models right and so like
(00:39:04)
it's like okay well if they can't use
(00:39:05)
Nvidia chips anymore then this open
(00:39:07)
source stuff won't be designed for
(00:39:08)
Nvidia chips it'll be designed for
(00:39:09)
Huawei chips and now does that like
(00:39:10)
weaken the CUDA mode and now like not
(00:39:12)
only is China domestic now they have
(00:39:14)
like a feedback loop internally and then
(00:39:16)
they can externalize across the rest of
(00:39:17)
the world right so This is the like
(00:39:18)
argument Nvidia makes. I'm not sure if I
(00:39:21)
am like I'm like you know I think I
(00:39:23)
think my AI timelines are so fast. I'm
(00:39:26)
not that fast like not in terms of like
(00:39:27)
AGI but like hey AI is hundred billion
(00:39:29)
dollars of revenue uh across the
(00:39:32)
industry. I think the industry could hit
(00:39:34)
100 billion ARR by the end of this year
(00:39:36)
like 4550 for open AI like 3540 for
(00:39:40)
anthropic and then you know vertex deep
(00:39:42)
minds uh models at Google Gemini right
(00:39:45)
um and then vertex API for anthropic
(00:39:47)
models and uh bedrock APIs and Azure
(00:39:51)
foundry APIs like I think hundred
(00:39:53)
billion dollars like end of this year
(00:39:54)
>> that's a lot and then what's the
(00:39:56)
economic value of that hundred billion
(00:39:57)
dollars now how much of that is in China
(00:39:59)
right like China's number is probably
(00:40:00)
10x lower right? Because they just
(00:40:04)
haven't been able to pervasively push
(00:40:05)
AI, right? Chat GPT has a billion users
(00:40:08)
roughly and you know, then you add on
(00:40:10)
Gemini and Meta claims they have 500
(00:40:12)
million users. I don't know. I think
(00:40:13)
people just accidentally click like
(00:40:14)
generative sticker or something. Um,
(00:40:16)
[laughter] but like anyways, like
(00:40:18)
there's like there's like a lot of usage
(00:40:19)
of AI in the west already and it's going
(00:40:22)
to climb. It's going to keep climbing
(00:40:23)
and like you kind of have to get used to
(00:40:24)
it and so like the question is like do
(00:40:26)
you you know what's what's the economic
(00:40:28)
benefit to the world, right? And at the
(00:40:30)
end of the day, this is an economic war,
(00:40:31)
right? If the US and the West win in AI
(00:40:35)
and control, you know, more powerful AI
(00:40:37)
systems that have this feedback loop
(00:40:39)
that improved economic growth and
(00:40:41)
weapons systems and whatever else,
(00:40:42)
right? Engineering of grids and cyber
(00:40:45)
attacks and all these sorts of things.
(00:40:46)
They have this like advantage over
(00:40:48)
China, then China will not rise to be
(00:40:50)
the global hedgeimony. But without AI,
(00:40:52)
China definitely will rise to be the
(00:40:54)
global hedgeimony. They're just going to
(00:40:56)
outrun America. And so the question is
(00:40:57)
like you know that's I think like the
(00:40:58)
other view right and how fast are super
(00:41:00)
powerful AI systems versus you know
(00:41:02)
China building a domestic ecosystem for
(00:41:05)
chips and models and everything that is
(00:41:06)
a few years behind like what's what's
(00:41:09)
actually the value right like that's
(00:41:10)
sort of like
(00:41:12)
>> around restrictions and regulations
(00:41:13)
>> where where do the uh US onshoring
(00:41:16)
efforts fall in that category what do
(00:41:18)
you make of them from the chipsack to
(00:41:20)
like all the thing that is being built
(00:41:22)
everything looks like it's massively
(00:41:23)
delayed by the way which perhaps is not
(00:41:25)
surprising
(00:41:26)
>> I think TSM MC's manufacturing wafers
(00:41:27)
and they're like building real wafers
(00:41:29)
and there's real fabs and like you know
(00:41:31)
there's some other fabs that have been
(00:41:32)
announced and like they're doing well
(00:41:33)
and there's like a bunch of like
(00:41:34)
different kinds of plants like a Korean
(00:41:35)
company making a random gas plant in
(00:41:37)
Texas for you know their chips right
(00:41:39)
like uh for chips and all these like
(00:41:40)
sort of things are happening. Um I think
(00:41:42)
the chips act did really well with its
(00:41:44)
$50 billion. It's just I don't think
(00:41:46)
people understand the scale of the
(00:41:48)
semiconductor industry. is the most
(00:41:50)
complicated supply chain in the world,
(00:41:52)
right? It's much bigger than, you know,
(00:41:55)
say manufacturing airplanes. It's much
(00:41:57)
bigger than like, you know, really
(00:41:58)
anything else, right? If you look at the
(00:42:00)
top 10 companies like of the world, I
(00:42:02)
think eight of them designed
(00:42:03)
semiconductors, right? Now, obviously
(00:42:05)
like Google designed semiconductors, but
(00:42:06)
it's like, oh wait, no, but their cost
(00:42:08)
of search would be like 10x higher if
(00:42:10)
they didn't have TPUs and TPUs were
(00:42:11)
super optimized for search, right? Or
(00:42:13)
like, you know, you you you go down the
(00:42:15)
list, right? Like Meta serves
(00:42:16)
recommendation systems with their chips,
(00:42:18)
right? Like you go down the list, it's
(00:42:19)
everyone is making their own chips.
(00:42:21)
Apple devices would be materially worse
(00:42:23)
if they didn't have their own chips,
(00:42:24)
right? Um and you just go down the list,
(00:42:26)
it's like it's the most complicated
(00:42:28)
supply chain and they they're spending
(00:42:29)
something on the order of like $150
(00:42:32)
billion roughly in subsidies a year to
(00:42:34)
the chip industry.
(00:42:36)
>> We are doing 50 over like a decade.
(00:42:38)
>> Yeah,
(00:42:39)
>> there's a difference in scale here,
(00:42:40)
right? The collective total amount of
(00:42:42)
like capex that has been spent in Taiwan
(00:42:45)
is like 500 billion plus, right? across
(00:42:48)
the industry, across all the companies
(00:42:49)
that are making semiconductors in Taiwan
(00:42:51)
and Taiwan doesn't have a domestic
(00:42:52)
industry. How is $50 billion of
(00:42:54)
subsidies going to change America's
(00:42:56)
needle? Right? It does move it a little
(00:42:58)
bit, right? I I want to be clear like
(00:43:00)
the chips act is awesome. I don't
(00:43:02)
understand why like EVs or like solar
(00:43:05)
was given this massive massive like
(00:43:08)
trillion dollar package. Semiconductors
(00:43:10)
were only given 50. Like semiconductors
(00:43:12)
need a lot bigger package to actually
(00:43:13)
incentivize onoring. I think what's
(00:43:15)
happened so far has proven that it's
(00:43:17)
working well. TSMC is literally making
(00:43:19)
chips for Nvidia and Apple and AMD and
(00:43:22)
others in Arizona today, right? And I
(00:43:24)
think that's really great.
(00:43:26)
>> Is is your sense that the broad American
(00:43:28)
government is just uh aware of of all of
(00:43:31)
this that it's uh I wouldn't say only
(00:43:33)
passed because the automotive like
(00:43:35)
prices went up because car manufacturers
(00:43:37)
are like the worst because they do just
(00:43:39)
in time inventory, right? Or not worse,
(00:43:40)
but like this is just like a thing,
(00:43:41)
right? Just in time inventory systems.
(00:43:44)
COVID happens, sales plummet fabs that
(00:43:46)
were making, you know, random power IC's
(00:43:48)
or random microcontrollers for engines
(00:43:49)
got repurposed to the boom from COVID,
(00:43:52)
which is which was data centers and PCs
(00:43:54)
and smartphones. So, that stuff was
(00:43:55)
booming. And then when people were like,
(00:43:56)
"Oh, wait. Actually, like, you know, I
(00:43:58)
have some money. I stayed at home. I
(00:43:59)
didn't go out. I didn't drink. I have a
(00:44:00)
lot of I have some cash, right? Let me
(00:44:02)
buy a car." They went out and bought
(00:44:03)
cars and cars started skyrocketing in
(00:44:04)
prices. Oh, let's restart and let's
(00:44:06)
let's Oh, yeah. Can I can you sell me
(00:44:07)
that microcontroller for the engine
(00:44:08)
again? It's like, "No, I I'm making a
(00:44:11)
slightly different microcontroller that
(00:44:12)
works for, you know, uh, let's say a
(00:44:15)
keyboard or a mouse, right, or
(00:44:17)
whatever." And it's like, and and and
(00:44:18)
they actually didn't just leave me
(00:44:21)
flatfooted and they were like a partner
(00:44:22)
through co, right? You know, versus you
(00:44:24)
just left me. Screw you Ford or whoever,
(00:44:27)
Toyota, um, or automotive OEM, you know,
(00:44:29)
you up that supply chain. And so, Chips
(00:44:31)
Act did not get passed, only got passed
(00:44:33)
because that happened. And people are
(00:44:35)
like, "Oh my god, the semiconductors are
(00:44:36)
why cars can't be made." If that didn't
(00:44:38)
happen, we wouldn't even have the chips
(00:44:40)
act. It's like it's like silly. So like
(00:44:41)
I don't know like I think you know
(00:44:43)
whereas like and and even though that's
(00:44:44)
what was pitched to all the senators
(00:44:46)
like I know people who were running
(00:44:47)
around Capitol Hill just pushing that
(00:44:49)
narrative and story and that's why it
(00:44:50)
finally got passed in reality it was all
(00:44:52)
for advanced leading edge chips, right?
(00:44:54)
Nothing that goes in a car, right? And
(00:44:56)
so it's like this like funny thing. So,
(00:44:57)
in other words, do you think my words my
(00:44:59)
words not yours, but is it is it
(00:45:01)
hopeless that the US is going to I'm
(00:45:03)
very optimistic. Okay. I mean, do you
(00:45:04)
think there's a world where the US just
(00:45:06)
decides to invest in semiconductor at
(00:45:09)
the scale that
(00:45:10)
>> you know, I thought we just needed a
(00:45:11)
bigger chips act, but
(00:45:14)
>> look, Trump's kind of gotten TSMC to
(00:45:16)
promise to invest a fuckload more
(00:45:18)
[laughter] and they're moving on it,
(00:45:20)
right? They're like actually like just
(00:45:22)
building it. It's like, I'm going to
(00:45:23)
tariff the [ __ ] out of you unless you
(00:45:24)
build a fab. But it's like we'll build a
(00:45:26)
fab [laughter] and they're building it
(00:45:28)
right now. The timelines for fabs just
(00:45:30)
takes forever cuz again it's the most
(00:45:31)
complicated thing in the world. The
(00:45:33)
cleanest space in the place in the world
(00:45:34)
is not like a hospital or a biotech lab
(00:45:36)
or whatever. It's a semiconductor fab.
(00:45:37)
And the most expensive tools in the
(00:45:38)
world are not you know any of these
(00:45:40)
medical tools or whatever. It's it's
(00:45:41)
semiconductor tools or it's not a
(00:45:43)
rocket. It's a semiconductor tool,
(00:45:44)
right? Like everything you know I
(00:45:46)
describe it as um I remember when I was
(00:45:47)
a kid I was like I want to be a rocket
(00:45:49)
scientist and then I was like oh I want
(00:45:50)
to be a surgeon. And I'm like wait chips
(00:45:52)
are like rocket surgery but even cooler
(00:45:54)
right? Like I think anyways like sort of
(00:45:56)
like there there are fabs being built in
(00:45:58)
America.
(00:45:59)
>> They won't take America to
(00:46:00)
self-sufficiency. I don't think that's a
(00:46:01)
relevant. I don't think that's a goal
(00:46:03)
relevant like that's relevant, right?
(00:46:04)
Like globalism is generally just good.
(00:46:07)
Hot take [laughter]
(00:46:10)
like in terms of economics.
(00:46:11)
>> We'll turn this into a short a YouTube
(00:46:12)
short.
(00:46:13)
>> Globalism.
(00:46:13)
>> Globalism is good. [laughter]
(00:46:14)
>> Dude, you're gonna get me like
(00:46:15)
cancelled.
(00:46:18)
>> [gasps]
(00:46:19)
>> I tweeted about ice and it was a
(00:46:20)
complete joke, but so many people got
(00:46:22)
mad at me because I can't be, you know,
(00:46:23)
I'm too I'm too much of a joker. You
(00:46:25)
know, these are serious things.
(00:46:26)
>> Yeah. Yeah. No, I know the I know the
(00:46:28)
feeling. Yes. [laughter]
(00:46:29)
>> Anyways, um I think I think you know I
(00:46:33)
think we are building fabs and I think
(00:46:34)
it's like going to move and now even
(00:46:36)
Elon's talking about building fabs now
(00:46:37)
because he sees the shortages in the
(00:46:38)
world, right? Uh there's a lot of
(00:46:40)
semiconductor related shortages for
(00:46:41)
building out AI and and so I don't think
(00:46:44)
it's hopeless. I think I'm like very
(00:46:45)
optimistic that we're going to do more
(00:46:46)
and more and more. And maybe this
(00:46:48)
administration threatens tariffs and
(00:46:49)
they get the deals and the next
(00:46:50)
administration comes back with the
(00:46:51)
carrot. If it is the Democrats, whatever
(00:46:53)
happens, I don't know. Like I was at a
(00:46:55)
comedy club on Sunday night and like
(00:46:57)
he's like, "Oh, I'm I use Chad GPT." And
(00:46:59)
then like there were a couple people who
(00:47:00)
booed and he's like, "Yeah, I'm one of
(00:47:01)
those guys. I know." And like it's like,
(00:47:03)
"Wow, people hate AI."
(00:47:05)
>> And that has has not even started,
(00:47:06)
right? Like the actual impact of AI
(00:47:08)
>> or like New Jersey power prices are up,
(00:47:10)
right? Uh is it because of a data
(00:47:12)
center? New Jersey, the governor's
(00:47:14)
election like I think literally fl like
(00:47:16)
there's like an election that changed
(00:47:17)
recently in New Jersey because power
(00:47:19)
prices were up and people blamed a
(00:47:22)
Microsoft Nebius data center in New
(00:47:24)
Jersey for that reason. But in reality
(00:47:27)
that data center has nothing to do with
(00:47:29)
power prices going up. It's super storm
(00:47:31)
standy like five years ago knocking or
(00:47:34)
whatever how many years ago knocking
(00:47:35)
down the state's electrical
(00:47:36)
infrastructure and then the then
(00:47:37)
improving all these improvements and
(00:47:39)
then those improvements have to be paid
(00:47:41)
by someone and it turns out the consumer
(00:47:42)
has to pay for them with higher power
(00:47:44)
prices, right? And so like you know like
(00:47:45)
there's like there's a lot like going on
(00:47:47)
in that regard, right? Um that kind of
(00:47:50)
is uh
(00:47:51)
>> sad. Um, and and people hate AI and
(00:47:54)
they're blaming AI on it and artists
(00:47:55)
hate AI and like you know you see all
(00:47:57)
this deep fake stuff and like I think I
(00:47:59)
think it'll be the hottest button issue
(00:48:00)
especially as like we're really getting
(00:48:02)
into like I think last year Google spent
(00:48:04)
$3 billion on Whimo and we're waiting
(00:48:06)
for their guide for this year $3 billion
(00:48:08)
on Whimo taxis but their t their Whimos
(00:48:11)
went from like 300k to like 100k or 90k
(00:48:14)
the new Whimo car and they're going to
(00:48:16)
spend more than three because they've
(00:48:17)
just launched in like four cities now
(00:48:19)
right or five cities and and they're
(00:48:20)
testing it a lot and the same a robo
(00:48:22)
taxi like people are going to hate AI
(00:48:23)
for that reason people are going to hate
(00:48:24)
AI because the slop on the internet
(00:48:26)
people are going to hate AI because you
(00:48:27)
know the perceived job replacement
(00:48:29)
people are going to hate AI for all
(00:48:30)
these reasons and so yeah it's going to
(00:48:32)
be a hot button political issue don't
(00:48:33)
you think
(00:48:33)
>> yeah talking about that so um capex is
(00:48:37)
there a capex bubble are we u investing
(00:48:41)
too much or actually are we investing
(00:48:44)
not enough given what you were saying
(00:48:46)
earlier about uh the the the rate of
(00:48:49)
revenue increase and and therefore
(00:48:50)
implied demand
(00:48:51)
that you expect for this year.
(00:48:53)
>> I'm obviously a maxi. I think we're
(00:48:55)
going to need a lot of infra and I think
(00:48:57)
I'm literally paid to like analyze the
(00:48:59)
supply chain and do consulting. Like
(00:49:01)
that's what my company does. So like
(00:49:03)
obviously I'm very [laughter] biased.
(00:49:05)
I think I think we're pretty good at
(00:49:07)
calling when when things go down though,
(00:49:08)
right? Before like a part of the supply
(00:49:10)
chain reb. Anyways, you know, again,
(00:49:12)
going back to the economics of it, it's
(00:49:13)
north of hundred billion dollars of
(00:49:14)
revenue exiting this year for AI from a
(00:49:16)
base of, you know, sub1 billion gen AI
(00:49:20)
from a base because ads and stuff is
(00:49:22)
like already a multiundred billion
(00:49:23)
dollar AI industry, right? You know, go
(00:49:25)
back to 2023 it was like less than a
(00:49:26)
billion, right? And 2024, I don't know
(00:49:28)
exactly what number, maybe let's call it
(00:49:31)
10 and 25 was maybe like 30 40. It'll be
(00:49:34)
north of 100 easily. If you're talking
(00:49:36)
about hundred billion of revenue, let's
(00:49:39)
say at a 50% gross margin. So that's $50
(00:49:42)
billion of gross profit um and $50
(00:49:45)
billion of COGS. That $50 billion of
(00:49:46)
COGS needs to run on infra, which cost
(00:49:49)
roughly if a five, if you're talking
(00:49:51)
about fiveyear depreciation, call it
(00:49:53)
$250 billion, right, of infra
(00:49:55)
>> for hundred billion of revenue.
(00:49:57)
>> Mhm.
(00:49:58)
>> Okay. What is what is the actual spend
(00:49:59)
on AI infra this year? It's going to be
(00:50:00)
like it's I mean it depends on what
(00:50:02)
layer. If you're talking about energy,
(00:50:04)
those are longer lived assets and all
(00:50:05)
these other things, right? Um data
(00:50:06)
centers are longer lived assets. The
(00:50:08)
chips are not as much. People are
(00:50:09)
putting capex down. Um and the
(00:50:11)
hyperscalers capex is going to be like
(00:50:13)
$500 billion this year or something like
(00:50:14)
this. And then besides them, there's
(00:50:16)
also a lot more hyp uh capex elsewhere.
(00:50:19)
Um and so, you know, is it a bubble? I
(00:50:21)
mean theoretically like you know it's
(00:50:23)
twice as much as it should be but it's
(00:50:24)
also like well no there's an R&D
(00:50:26)
component to this and the excess spent
(00:50:30)
that wasn't revenue generating last year
(00:50:31)
is what led to models being so good this
(00:50:34)
year um and led to like everyone who can
(00:50:37)
using cloud code and like that changing
(00:50:39)
their life. This is like it's not a
(00:50:41)
bubble, right? I don't think it's a
(00:50:42)
bubble yet. Um I think if AI model
(00:50:44)
progress stops and that's the main
(00:50:45)
thing, right? The moment model progress
(00:50:47)
stops all the spending is for not. But
(00:50:49)
so far we've had consistent improvement.
(00:50:52)
As you put in more compute, you get more
(00:50:54)
performance and better models.
(00:50:56)
>> Yeah. Model performance being the
(00:50:57)
lagging indicator of hardware progress
(00:51:00)
or data center.
(00:51:01)
>> Yeah. of of capex, right? Yeah.
(00:51:03)
>> Ultimately, the capex that Microsoft
(00:51:05)
spent in 2024 for OpenAI is what results
(00:51:08)
in in 2025 for OpenAI Cory or whoever is
(00:51:10)
what results in their models being so
(00:51:12)
good this year. Same with Enthropic and
(00:51:14)
Amazon Google and their models now being
(00:51:16)
so good now is is that capex and
(00:51:18)
actually they still haven't paid for
(00:51:19)
those chips yet because those chips are
(00:51:21)
still have a useful life for another few
(00:51:22)
years right I think model progress is
(00:51:24)
very clear um the moment that stops
(00:51:27)
happening right if we hit a wall there's
(00:51:29)
no new research directions um then then
(00:51:32)
it's cooked yeah right
(00:51:34)
>> and that assumes that better model leads
(00:51:36)
to more demand which is a reasonable
(00:51:39)
assumption
(00:51:40)
>> yeah for sure
(00:51:41)
>> but um yeah I mean there scale the
(00:51:43)
adoption curve regardless of how good
(00:51:45)
the model is uh in the enterprise
(00:51:47)
>> like 2% of GitHub commits today are
(00:51:48)
cloud code
(00:51:50)
>> as in committed by cloud code you can
(00:51:51)
disable that where it's not
(00:51:52)
automatically committed but 2% of GitHub
(00:51:54)
commits today are cloud code $2 trillion
(00:51:56)
of software wages paid in the world
(00:51:58)
>> if it was 2% then you like you're like
(00:52:01)
wait a second
(00:52:02)
>> this is this is an insane amount um AI
(00:52:05)
is under earning the value that it's
(00:52:07)
producing in the world right by a
(00:52:09)
significant margin already today
(00:52:11)
>> Bor's journey from Cloud code who had
(00:52:13)
who we had on the pod was saying that
(00:52:16)
what he's written all of Claude what is
(00:52:19)
it called co-work like the new product
(00:52:20)
entirely with cloud code yet so we're
(00:52:23)
very much in that world. Yes.
(00:52:24)
>> Yeah. My uh one of my roommates I was
(00:52:26)
asking him because he's like always been
(00:52:28)
a really low-level good programmer and
(00:52:30)
he started you know I was like he's like
(00:52:32)
he had this um holiday obsession right I
(00:52:36)
mean he was using cloud code for work
(00:52:37)
already right like whatever. Um but he
(00:52:39)
had this holiday obsession. We got into
(00:52:41)
playing Age of Empires 2. Myself, you
(00:52:44)
know, my roommate, a handful of people
(00:52:45)
from like Open Eye, GDM, Anthropic. We
(00:52:48)
just would do land parties of AoE 2 over
(00:52:51)
the holidays a bit. Not not like
(00:52:52)
Christmas, but like a little bit before,
(00:52:53)
a little bit after, you know, cuz most
(00:52:54)
of us went home for Christmas. Um, but
(00:52:56)
like we'd do these lands. My roommate
(00:52:58)
got so obsessed with like the game that
(00:53:01)
during Christmas week, cuz he didn't go
(00:53:03)
home, he just stayed in San Francisco.
(00:53:05)
Um, he just worked on an RTS game and he
(00:53:07)
built an entire RTS game. And I think I
(00:53:10)
kid you not, I think he he used like
(00:53:12)
$10,000 of Claude in one week and built
(00:53:15)
an entire RTS from scratch uh about a
(00:53:18)
like but instead of like being a
(00:53:19)
standard RTS where it's like oh Age of
(00:53:21)
Empires for advance through ages or
(00:53:22)
Starcraft, it is it is an RTS where it's
(00:53:24)
China versus the US and you're in the AI
(00:53:26)
race and you go from the start of the
(00:53:28)
information age all the way through to
(00:53:31)
you know AGI and like robots and
(00:53:33)
humanoids and and and like all like
(00:53:35)
space fairing civil like it's crazy. He
(00:53:37)
built it in a week
(00:53:39)
>> and he didn't type a single line of
(00:53:40)
code, right? He can only dictate it to
(00:53:42)
the model. And he told me, yeah, like we
(00:53:44)
have an indicator internally at
(00:53:45)
Enthropic where you see how many people
(00:53:47)
actually write code now. There's only a
(00:53:49)
few hold outs left.
(00:53:50)
>> But I guess the question to the bubble
(00:53:52)
is is really a question of uh timing as
(00:53:54)
well, right? Uh it's it's whether the
(00:53:57)
build which is supply side and the
(00:54:00)
demand side are going to land sort of at
(00:54:03)
the same time. Is that is that fair?
(00:54:04)
>> Yeah. But also the economics of like say
(00:54:06)
you you spend let's say you spend you
(00:54:08)
build a gigawatt you put down roughly
(00:54:10)
$50 billion across you know the data
(00:54:13)
center the chips the networking blah
(00:54:14)
blah blah blah blah right let's say it
(00:54:16)
has a 5year useful life so it's $10
(00:54:18)
billion a year is it a bubble if the
(00:54:20)
first year you have you didn't make any
(00:54:22)
money it's zero the second year it's
(00:54:23)
zero and then third fourth fifth year
(00:54:25)
you're at 50% gross margins and so you
(00:54:27)
make 20 2020 now you've made $60 billion
(00:54:30)
off of this $50 billion investment it's
(00:54:33)
not the best return on invested capital,
(00:54:34)
but it did pay for itself.
(00:54:36)
>> Yeah.
(00:54:36)
>> Um, is that is that a bubble? Well,
(00:54:38)
that's what's happening today is that
(00:54:39)
people are spending all this infra money
(00:54:41)
on infra and there's no return for a lot
(00:54:42)
of it, right? A lot of it is just doing
(00:54:44)
research and like trying to get adoption
(00:54:45)
and is free users and like what does
(00:54:47)
that mean?
(00:54:48)
>> Yeah.
(00:54:48)
>> Um,
(00:54:49)
>> depends a bit on
(00:54:50)
>> the timing. That's the timing though.
(00:54:51)
Yeah. But oh, that $50 billion capex was
(00:54:53)
spent in year one.
(00:54:54)
>> What about energy? In the in the data
(00:54:56)
center world, you had this fun post
(00:54:58)
about the gas replacement for for
(00:55:00)
energy. So, is uh is AI basically uh uh
(00:55:04)
destroying the grid?
(00:55:05)
>> It would if the utilities were willing
(00:55:08)
to let it, but I think the utilities are
(00:55:10)
so slow and dumb that they don't want
(00:55:12)
to. Not destroy, but like expanding the
(00:55:14)
grid. Yeah.
(00:55:15)
>> Um I think the US could have a way
(00:55:16)
better grid, but we just don't want to.
(00:55:18)
Like, no one's made the effort or
(00:55:20)
initiative. You know, there's not enough
(00:55:21)
power. America's not built power for 50
(00:55:23)
years really, right? It's like converted
(00:55:24)
from coal to gas and like things like
(00:55:26)
this but like really just have not built
(00:55:28)
wholesale new power on a large scale and
(00:55:31)
there have been a lot of times where the
(00:55:32)
industry blew up right independent power
(00:55:34)
producers IPs have blown up multiple
(00:55:36)
times in the 2010s when uh Korean and
(00:55:39)
Japanese investors like flooded the
(00:55:40)
market with because they saw such a good
(00:55:42)
return there or before in the early
(00:55:45)
2000s power was growing a little bit for
(00:55:47)
a little bit and so people overbuilt on
(00:55:48)
power so power industry has been burned
(00:55:50)
a couple times but no one really built
(00:55:51)
power and then you've got data centers
(00:55:52)
now all of a sudden coming online and
(00:55:54)
going from 2% to 10% of the US grid in
(00:55:56)
just a handful of years. And so you've
(00:55:58)
got this humongous humongous change in
(00:55:59)
the industry. We don't have the labor,
(00:56:01)
right? I think ultimately that's the
(00:56:02)
biggest problem is the equipment and the
(00:56:03)
labor and equipment is basically you
(00:56:05)
know again labor and time takes time to
(00:56:07)
build a factory so you can build the
(00:56:09)
things. I think the equipment side of
(00:56:10)
things will be solved like more
(00:56:12)
reasonably. And one one example was like
(00:56:13)
gas, right? People initially thought,
(00:56:15)
oh, you can only use like the two
(00:56:17)
vendors, right? Uh Seammens or G Vernova
(00:56:20)
for gas turbines, but they have the they
(00:56:21)
have the best ones, the most efficient
(00:56:22)
ones. It's like, okay, well, like, okay,
(00:56:24)
also Mitsubishi exists and they're
(00:56:26)
ramping up production fast. Oh, Ducson
(00:56:28)
and Korea exist and they're ramping up
(00:56:29)
production fast. Oh, actually, I can
(00:56:31)
just take Cumins engines, right? Like,
(00:56:33)
you know, if you've ever like ridden a
(00:56:34)
pickup truck or like you know, like
(00:56:35)
diesel trucks, like everyone loves
(00:56:36)
Cumins, right? You know, you see the Ram
(00:56:38)
on the street and has the Cumins like
(00:56:39)
badge. It's like it's like a that's like
(00:56:40)
an aura symbol for a certain kind of
(00:56:42)
redneck from South Georgia, which I have
(00:56:44)
a little bit of. Anyways, I I don't have
(00:56:45)
a I don't have a truck. [laughter]
(00:56:47)
I have though. Um but anyways, like the
(00:56:49)
there's like all these engines like
(00:56:51)
people are figuring out how to make the
(00:56:52)
equipment. You know, solar sucks. It's
(00:56:53)
too intermittent. Wind sucks. It's too
(00:56:55)
intermittent. Nuclear sucks. It takes
(00:56:56)
forever to build. Coal sucks. It's way
(00:56:58)
too dirty. How do you make power for
(00:57:00)
data centers besides gas? And like,
(00:57:02)
okay, the grid's not willing to put the
(00:57:04)
gas on your site, right? That's what
(00:57:05)
Elon did. Now everyone's doing it,
(00:57:06)
right?
(00:57:07)
>> This other cool post just uh last week
(00:57:09)
or two weeks ago that was about water
(00:57:11)
consumption. Uh did you want to talk to
(00:57:13)
that?
(00:57:14)
>> Yeah. Yeah. So there's this annoying
(00:57:15)
thing where everyone's like, "Oh, AI is
(00:57:17)
using all the water. Oh wow, AI and data
(00:57:21)
centers are going to like use up all the
(00:57:23)
water and now we don't have any water."
(00:57:24)
And it's like that's so silly. Uh water
(00:57:26)
is a distribution problem, not a like we
(00:57:29)
don't have enough problem, right? Like
(00:57:30)
you look at California. So California
(00:57:31)
has shitloads of water. But people
(00:57:33)
decide to make oat milk which consumes
(00:57:36)
like 1,000x the water of like anything
(00:57:40)
else like regular milk even and and cows
(00:57:43)
obviously eat a you know consume a lot
(00:57:44)
of water. Um but anyways like you know
(00:57:46)
data centers consume very little water
(00:57:48)
actually right. So the US grid will get
(00:57:50)
to like 10% of power by like 28 27 is
(00:57:52)
data centers. For water consumption it's
(00:57:54)
not even going to crack 1%.
(00:57:56)
>> Yeah.
(00:57:56)
>> By the end of the decade.
(00:57:57)
>> And what was the metric? Um and so so
(00:57:59)
the the comparison we made is because
(00:58:01)
like you know it was a bit of a [ __ ]
(00:58:02)
post but it was like serious research.
(00:58:04)
Yeah. Basically like we were doing
(00:58:05)
serious research because we keep getting
(00:58:07)
this like question and debunking it and
(00:58:09)
we would do it seriously but then I was
(00:58:11)
like no no no this is like too like
(00:58:12)
complicated like let's make it very
(00:58:14)
simple. So I was like, "Guys, why don't
(00:58:15)
we just compare it to like hamburgers,
(00:58:17)
right? Cuz cuz you know, I've heard that
(00:58:19)
argument from some like vegetarian
(00:58:20)
people before or some Hindus or like I'm
(00:58:23)
Hindu myself, although you know, and I I
(00:58:25)
I do eat beef sometimes, but you know,
(00:58:28)
like I I'm Hindu, but like you know, so
(00:58:30)
so we made this comparison to
(00:58:31)
hamburgers, right? Hamburgers require a
(00:58:34)
shitload of water cuz cows, you know,
(00:58:36)
when to for them they require a ton of
(00:58:39)
water and when a cow's taking a lot of
(00:58:40)
water, it's not the cow itself, it's all
(00:58:41)
the feed you're feeding them, right?
(00:58:42)
Because no one grass feeds their cows,
(00:58:45)
you know, and just lets the rain take
(00:58:46)
care of the grass. They like either rain
(00:58:48)
the the grass or most likely they do
(00:58:51)
mass industrial farming of corn,
(00:58:53)
soybean, alalfa, etc., which uses
(00:58:56)
shitloads of water, right? Like, you
(00:58:59)
know, or like almond milk like uses tons
(00:59:01)
and tons of water. Like produce is like
(00:59:03)
the main user of water. I think the uh
(00:59:05)
metric was the entirety of Elon Musk's
(00:59:09)
Colossus data center, right? Uses as
(00:59:11)
much water as two and a half in-n-outs.
(00:59:14)
Um because that's, you know, you do the
(00:59:16)
calculation on how many how many b
(00:59:17)
what's the average revenue per in-n-out
(00:59:19)
and how many hamburgers does that
(00:59:20)
translate to, right? If everyone's
(00:59:21)
ordering like a combo, right? Okay,
(00:59:23)
let's ignore the drink, let's ignore the
(00:59:25)
fries, let's just talk about the
(00:59:26)
hamburger, let's ignore the bread, which
(00:59:28)
does use have grain, let's just do the
(00:59:30)
meat
(00:59:31)
>> and the cheese. And all of a sudden all
(00:59:33)
this water is there's so much water,
(00:59:36)
right? Like a single query like all of
(00:59:39)
your AI usage from chat GBT of the
(00:59:41)
average user is like a hamburger, right?
(00:59:43)
Like it's like okay, this is nothing,
(00:59:46)
right? You know, because these things
(00:59:47)
are the data centers actually are like
(00:59:49)
they're mostly closed loops and like
(00:59:50)
sure they evaporate some water for like
(00:59:52)
cooling reasons, but like by doing
(00:59:54)
evaporative cooling, they're using less
(00:59:55)
power, right? And that's actually better
(00:59:56)
for the environment than uh than not
(00:59:59)
using evaporative cool. There's all all
(01:00:00)
these reasons why this myth or hoax of
(01:00:03)
AI of AI using all the water is just
(01:00:05)
nonsense, right? Like Meta's data center
(01:00:07)
in Louisiana is getting protested
(01:00:08)
because the water it's it's going to be
(01:00:10)
the largest data center in the world.
(01:00:11)
It's going to be like four or five
(01:00:12)
gigawatts at least announced so far.
(01:00:13)
We're tracking some other ones that are
(01:00:15)
that may be as big or bigger. Uh but
(01:00:17)
Meta is getting protested because the
(01:00:20)
local population around that area is
(01:00:21)
like, "Oh, the water's dirty. It's
(01:00:22)
because of this meta data center." And
(01:00:24)
like there's these trucks on these big
(01:00:26)
trucks on these back roads that used to
(01:00:27)
be empty completely. They're just like
(01:00:29)
mad and annoyed about that, right? But
(01:00:30)
at the end of the day, what actually
(01:00:32)
made the water dirty is that that's an
(01:00:34)
area where you go fracking. Like
(01:00:37)
>> fracking is absurdly worse and almost
(01:00:39)
all of that gas is being shipped to an
(01:00:41)
LG terminal and being shipped to Asia.
(01:00:44)
Like you know, you know, like Japan or
(01:00:46)
Taiwan or China or Korea and some Europe
(01:00:48)
as well, right? Like like actually all
(01:00:50)
of this water is dirty because of
(01:00:52)
regulation fracking. Like I support
(01:00:54)
fracking by the way, but you know that's
(01:00:55)
that's an insane take too maybe. Um but
(01:00:58)
like water usage is is is like not a
(01:01:00)
relevant argument.
(01:01:02)
>> Are you bullish on the sort of energy uh
(01:01:05)
companies I'm thinking constellation for
(01:01:08)
nuclear or Vistra I guess is an
(01:01:11)
independent power producer.
(01:01:14)
>> I think IPS will do well. I think IPS
(01:01:17)
can secure contracts at premiums to what
(01:01:21)
they've previously been able to for new
(01:01:22)
power plants that are either uh
(01:01:24)
dedicated or grid connected but come
(01:01:26)
with a pairing of a grid load right for
(01:01:29)
example utilities won't let you just do
(01:01:30)
data centers now but if you come with a
(01:01:32)
a pair right you're like hey I'm going
(01:01:33)
to build this massive data center but
(01:01:34)
we're also going to have this massive uh
(01:01:36)
power generating asset right say you
(01:01:38)
know whatever it is right some IP
(01:01:40)
they're going to partner with and
(01:01:41)
they'll build the load and the uh
(01:01:43)
consumption even if it's connected
(01:01:44)
through the grid for better stability
(01:01:46)
and more reliability. Um or it's not
(01:01:48)
it's behind the meter i.e. not connected
(01:01:50)
to the grid at all. Um like some part
(01:01:52)
some data centers like partially like
(01:01:54)
Colossus from Elon uh the original one
(01:01:56)
or part of Abene's Texas OpenAI right
(01:01:59)
like Cruso there's a lot of room for
(01:02:01)
power producers to get outsized returns.
(01:02:04)
I'm not necessarily bullish nuclear. Um
(01:02:06)
existing nuclear fine yeah it'll it'll
(01:02:08)
it can find a higher buyer higher priced
(01:02:10)
buyer but majority of it will be gas but
(01:02:13)
like you can do like renewables backed
(01:02:14)
by gas and then just turn off the gas
(01:02:16)
and like it's cost more but whatever
(01:02:17)
right or you can do wind backed by gas
(01:02:19)
>> and why not nuclear
(01:02:20)
>> takes too long
(01:02:21)
>> takes too long
(01:02:21)
>> no one can build nuclear fast
(01:02:24)
>> even China takes like 5 years to build
(01:02:25)
nuclear right like it's it's complicated
(01:02:28)
it's unsafe right you know I love
(01:02:31)
nuclear I wish it would work it's just
(01:02:32)
not relevant in the time scale that like
(01:02:34)
AI's power is going crazy. Um, but yeah,
(01:02:37)
there's a lot of interesting stuff like
(01:02:39)
have clients would like had a client buy
(01:02:41)
a coal plant and we were advising them
(01:02:43)
on the transaction based on they just
(01:02:44)
like showed up and they're like, "Yeah,
(01:02:45)
we want to buy we want to buy power
(01:02:46)
assets. We believe in this power story."
(01:02:48)
It's like, "Okay, great." So, yeah. So,
(01:02:49)
here's all of the like power plants that
(01:02:51)
we know of like you can get some of it
(01:02:52)
from EIA blah blah blah. um which are
(01:02:54)
these like and then we like worked
(01:02:56)
through the economics and we looked at
(01:02:57)
the new data centers being built in the
(01:02:58)
region and all this and then they
(01:02:59)
decided to buy a coal plant and they
(01:03:01)
restarted it and they're like making
(01:03:02)
tons of money now because now someone a
(01:03:05)
certain hyperscaler wants to buy the
(01:03:06)
entire pipeline of power and put a load
(01:03:08)
load near it right instead of just being
(01:03:10)
a grid connected asset. So it's like a
(01:03:12)
super awesome investment. So like you
(01:03:13)
know power is power is going to do
(01:03:15)
great.
(01:03:15)
>> Yeah. I was going to talk about peace
(01:03:17)
dividends of the whole AI boom. Uh
(01:03:19)
generally yes right like hyperscalers
(01:03:21)
are paying for uh transmission grid
(01:03:23)
upgrades which people will benefit from
(01:03:26)
right or like you know investors are
(01:03:27)
obviously going to benefit people who
(01:03:29)
work in the industry electricians wages
(01:03:31)
are skyrocketing you know etc right like
(01:03:33)
plumbers wages are skyrocketing so
(01:03:35)
there's like a lot of trades that are
(01:03:36)
doing really well too I think that's
(01:03:38)
definitely also um part of it yeah
(01:03:40)
>> I wanted to come back quickly to uh that
(01:03:44)
um Nvidian core wave deal that you
(01:03:45)
mentioned as we sort of close the
(01:03:47)
discussion on uh on capex and a and a
(01:03:50)
bubble. It seems like there is circular
(01:03:52)
deals but also a lot of debt kind of
(01:03:54)
like flushing around. So I don't know
(01:03:56)
the specifics of of that deal but like I
(01:03:58)
did hear variations of this where
(01:04:01)
effectively you have a large player
(01:04:02)
guaranteeing the debt being the last
(01:04:04)
recourse uh for a lot of infrastructure
(01:04:08)
build is sort of uh this plus the whole
(01:04:11)
like oracle commitment.
(01:04:14)
there there is a fragility into this
(01:04:16)
whole thing that can be a little
(01:04:18)
unnerving. What do you make of it?
(01:04:20)
>> I think it's like completely fine and I
(01:04:22)
think like people are like freaking out
(01:04:23)
and making narratives where there really
(01:04:25)
is shouldn't be one. It's like well okay
(01:04:27)
Google doesn't have enough data center
(01:04:28)
capacity. They need people to build data
(01:04:29)
centers, but no one can build a data
(01:04:32)
center because they don't have the
(01:04:33)
capital. Like don't have, you know, many
(01:04:34)
cases capital is not the, you know, they
(01:04:36)
don't have capital, right? Or like no
(01:04:38)
one will give them a loan because they
(01:04:39)
don't trust some random [ __ ] company.
(01:04:40)
And it's like, but then Google's like,
(01:04:41)
well, no, we've due diligence to them.
(01:04:42)
We think they can build it here. We'll
(01:04:44)
like even guarantee we'll buy the thing
(01:04:46)
or start using it once they build it.
(01:04:47)
You know, just having a customer alone
(01:04:50)
spoken for it was enough, right? Um, in
(01:04:52)
the case of Cororewave, they were
(01:04:53)
actually able to no backs stop, right?
(01:04:54)
Right? They were able to just say, "Hey,
(01:04:55)
hey look, here's our Microsoft contract
(01:04:57)
for this many GPUs. I want to put in
(01:04:59)
that data center, that data center, that
(01:05:00)
data center. Here's the contract for
(01:05:01)
renting those GPUs. I want to hire these
(01:05:02)
people. I want to do this." No one will
(01:05:04)
like they don't have any money, but then
(01:05:05)
they were able to like have it work out
(01:05:06)
because they were able to get people to
(01:05:07)
lend to them. I think like Cororeweave
(01:05:09)
did that and there was no circular
(01:05:10)
financing. But that was when there was
(01:05:11)
like the scale of investment was like
(01:05:13)
single digit billions or less than a
(01:05:14)
billion. Right? Now the scale of
(01:05:16)
investment is hundreds of billions.
(01:05:18)
>> Yeah.
(01:05:18)
>> Um and so the question is like, oh well,
(01:05:20)
if I want data center capacity, how do I
(01:05:22)
how do I get data center capacity? I
(01:05:23)
just go to everyone who's going to build
(01:05:25)
it looks smart is smart enough to do it
(01:05:27)
but can't afford to do it and tell them
(01:05:29)
I'll I'll take it and in fact I won't
(01:05:30)
just take it. I'll go to your debtor and
(01:05:32)
be like, I'll guarantee you. Yeah.
(01:05:33)
>> Because, you know, obviously you're a
(01:05:34)
new company. I've vetted you, but the
(01:05:36)
debtor hasn't. And so, you know, like,
(01:05:38)
you know, you know, they don't want me
(01:05:39)
to just be able to walk away because
(01:05:41)
like in the Microsoft Corwave deals,
(01:05:42)
Microsoft could have walked away if
(01:05:43)
Corwe [ __ ] it up,
(01:05:45)
>> right?
(01:05:45)
>> Yeah.
(01:05:46)
>> There's no I mean, yeah, there's there's
(01:05:47)
always like uh sort of like cancellation
(01:05:48)
or whatever possibilities. And so, this
(01:05:50)
is just a further form of guarantee um
(01:05:52)
as far as on like a lot of these back
(01:05:54)
stops as far as on like Oracle getting
(01:05:56)
the money and then OpenAI getting money
(01:05:57)
and Nvidia, you know, paying and it's a
(01:05:59)
whole circular. It's kind of nonsense
(01:06:01)
because it's like Nvidia's getting
(01:06:03)
equity in OpenAI. They're basically
(01:06:04)
saying, "Hey, every gigawatt you buy,
(01:06:06)
we'll also buy some equity."
(01:06:07)
>> Yeah.
(01:06:08)
>> Right. Okay. Well, cool. Now, Nvidia
(01:06:09)
owns an asset which they think is
(01:06:11)
valuable. OpenAI, right? Open AAI is
(01:06:13)
turning around and is like trying to
(01:06:14)
rent those uh use the equity they buy.
(01:06:17)
What do they what was their use of
(01:06:18)
equity? People's cash pay isn't that
(01:06:20)
great, right? It's mostly just 99 plus%
(01:06:23)
of their spend at the company is
(01:06:24)
probably just compute.
(01:06:25)
>> Yeah.
(01:06:26)
>> Uh so so sort of like it's like, okay,
(01:06:27)
well then I I raise this money. I'm
(01:06:29)
going to do the the whole thing I
(01:06:30)
explained earlier, right? Year one and
(01:06:31)
two I lose money. Year three, four,
(01:06:32)
five, I hope to make money on it, right?
(01:06:34)
Um, and open has been doing that, right?
(01:06:35)
So, I'm going to Okay, I'm going to go
(01:06:36)
out there. I've raised $50 billion. I've
(01:06:39)
raised $10 billion. I'm going to raise
(01:06:40)
it. I'm going to rent a cluster for five
(01:06:43)
years for $65 billion. And I've rented
(01:06:48)
that contract and now I only have enough
(01:06:50)
to pay for the first year to be clear.
(01:06:51)
But I think, you know, you trust me,
(01:06:52)
Oracle, you think I'm going to grow and
(01:06:54)
you think I'll be able to pay for it.
(01:06:55)
Oracle's like, "Yeah, or if you're not,
(01:06:56)
I think I'll be able to sell it to
(01:06:57)
someone else." So like, okay, cool. I'm
(01:06:59)
going to spend $50 billion this year.
(01:07:00)
>> Yep.
(01:07:01)
>> To build that data center. And and and
(01:07:03)
this these this is like for a gigawatt.
(01:07:05)
Um and so is it like circular that
(01:07:07)
OpenAI is every amount of GPUs they
(01:07:08)
consume and gives an investment that
(01:07:11)
investment is turned around to pay for
(01:07:12)
the first year of the rent to the
(01:07:13)
cluster. Um or second year then first
(01:07:15)
two years go, you know, it's sort of
(01:07:17)
like it's fine.
(01:07:18)
>> Yeah. Yeah.
(01:07:18)
>> Like it's like it's like it is a little
(01:07:21)
bit funky, but like I don't think it's a
(01:07:23)
big deal.
(01:07:23)
>> Yeah. Love it. Contrary intake. Maybe
(01:07:25)
let's finish with the models and the
(01:07:28)
software side of things. We talked
(01:07:30)
extensively about hardware and supply
(01:07:31)
chain and all the things. I get a sense
(01:07:33)
that you are super super bullish on uh
(01:07:35)
what's happening next in in AI. Your
(01:07:38)
roommate Schulto I assume was the
(01:07:40)
roommate that you were talking about
(01:07:41)
earlier on this pod effectively making
(01:07:43)
the point that we're just starting to
(01:07:45)
scratch the surface and there was so
(01:07:47)
much low hanging fruit around you know
(01:07:49)
RL and all the things you were in
(01:07:51)
Silicon Valley circles. Is that is that
(01:07:53)
your sense as well and what are you
(01:07:55)
tracking on the model side?
(01:07:57)
>> One thing is like you know simple stuff
(01:07:58)
like uh GitHub commits other things are
(01:08:01)
like what's the amount of usage how much
(01:08:03)
are people using like all these sorts of
(01:08:05)
things. I think there's so many
(01:08:06)
different alternative data sources for
(01:08:08)
tracking AI model progress area
(01:08:10)
tokconomics uh token economics
(01:08:12)
tokconomics and so that's like an entire
(01:08:13)
practice for us.
(01:08:14)
>> Are you rebranding the term from crypto?
(01:08:17)
>> I yeah I don't believe in crypto people
(01:08:19)
like I've always hated them. [laughter]
(01:08:21)
Um,
(01:08:21)
>> so now you're taking the term.
(01:08:22)
>> Yeah. Yeah. And Jensen's used it now. So
(01:08:24)
I've like I've convinced him to use the
(01:08:26)
word. He's used it as sovereigns and so
(01:08:27)
I think I think we've won.
(01:08:28)
>> That's awesome. Congratulations.
(01:08:30)
>> I've said it to him. We've written it in
(01:08:31)
articles. It's an entire practice of
(01:08:33)
consulting that I just I started in like
(01:08:34)
23 2023 uh was token economics and we've
(01:08:38)
been trying to build out these like you
(01:08:39)
know but basically I think the main
(01:08:40)
things are like people who don't code
(01:08:41)
can use cloud code now right? I think
(01:08:43)
people don't understand that like even
(01:08:44)
if you don't code you've never had any
(01:08:46)
training in software development, you've
(01:08:47)
never take had a job as a software
(01:08:49)
developer you can code. Let's take an an
(01:08:51)
example of what one of the one of the
(01:08:52)
analysts at my company did right comes
(01:08:54)
from a engineering background but on
(01:08:56)
like semiconductor systems right uh like
(01:08:59)
worked on mechanical systems worked on
(01:09:02)
these sorts of things and they coded
(01:09:03)
this thing which was they wanted to do
(01:09:05)
an analysis of area of clean rooms right
(01:09:08)
clean rooms are the building that you
(01:09:10)
the fab has all the tools in the most
(01:09:11)
complicated kind of building in the
(01:09:12)
world has every all sorts of chemical
(01:09:14)
systems and all this area of that a
(01:09:16)
company who builds systems builds these
(01:09:19)
systems
(01:09:20)
and revenue of that company, right? And
(01:09:22)
so it was like, okay, uh we have this
(01:09:24)
fab data set. Pointed it at it was like,
(01:09:26)
hey, here's this fab data set. What's
(01:09:27)
the square footage of all of them? And
(01:09:29)
we have this like thing that we built
(01:09:30)
which uh just pulls with cloud code
(01:09:32)
separately which for data centers and
(01:09:34)
and and fabs and everything else just
(01:09:36)
calculates the area of something from a
(01:09:37)
from a satellite image, right? Very
(01:09:39)
simple. So we have the square footage of
(01:09:40)
all these things. Points at that. Here's
(01:09:42)
the company name. Okay, go find the
(01:09:43)
filings. So it dig dug through all these
(01:09:45)
filings. It it pulled the data, right?
(01:09:47)
Okay, great. now told it to um compare
(01:09:50)
these two. Make a chart. Great. Oh,
(01:09:52)
wait. There's this like weird
(01:09:52)
inflection. Oh, that's because they
(01:09:53)
bought a company five years ago. Can you
(01:09:55)
do a proform of this analysis without
(01:09:57)
those financials of that of that company
(01:09:58)
they acquired? Okay, great. And then
(01:10:00)
like we were able to like like figure
(01:10:02)
out an investment case for our clients
(01:10:05)
as well as like you know some other
(01:10:07)
interesting details from someone who's
(01:10:08)
never really coded just using clawed
(01:10:11)
code and it like doing this all and this
(01:10:13)
is like not even their and it wrote the
(01:10:14)
note and they just like they didn't even
(01:10:16)
like work on this full-time for like 3
(01:10:18)
hours right they just told the model and
(01:10:20)
would go work on other things and told
(01:10:21)
the model and worked on other things
(01:10:22)
they just did this people don't
(01:10:24)
understand that like the skill sets that
(01:10:25)
like I think like if you go talk to an
(01:10:27)
analyst right a very junior analyst at
(01:10:29)
any right? Whether it's venture or
(01:10:31)
especially growth venture or public
(01:10:33)
markets or private equity, their their
(01:10:35)
job is like finding data, cleaning it,
(01:10:37)
making charts. It's like this is cloud
(01:10:38)
code now. You don't need junior
(01:10:40)
analysts. Just like a lot of companies
(01:10:42)
have stopped hiring L4 engineers because
(01:10:43)
it's useless. Why would I hire an L4
(01:10:46)
engineer? I just tell Claude to do it.
(01:10:49)
You you sort of like have this has
(01:10:50)
happened and this is a really big like
(01:10:53)
shift I guess like is that like
(01:10:55)
low-level knowledge work just doesn't
(01:10:57)
matter, right? Why would I why would I
(01:10:59)
use Excel when I can just tell Claude to
(01:11:01)
manipulate CSVs? Why would I use Word
(01:11:03)
when Claude will just generate the
(01:11:05)
markdown and I can copy and paste the
(01:11:06)
markdown directly into our WordPress and
(01:11:08)
then you know and that WordPress is
(01:11:09)
fully formatted now and it's like oh my
(01:11:10)
god like what's the point of Word,
(01:11:12)
right? Um and what's the point of doing
(01:11:13)
all sorts of stuff? I think when we look
(01:11:15)
at model progress that's just for Opus
(01:11:17)
4.5. Open's new model I think will be
(01:11:19)
better than Opus 4.5 and it's coming
(01:11:20)
like somewhat soon in Marchish um time
(01:11:23)
frame. I maybe February, Marchish, but
(01:11:25)
yeah. Um because OpenAI has a better RL
(01:11:27)
stack than Enthropic today. It's just
(01:11:29)
their pre-trained models suck compared
(01:11:31)
to Enthropic's pre-training, right? And
(01:11:33)
so like if they catch up a lot on
(01:11:34)
pre-training and keep their better RL
(01:11:36)
stack, they would actually have a model
(01:11:38)
that's much better, right? Flip side,
(01:11:39)
Google has a better pre-trained model
(01:11:41)
than Anthropic or OpenAI, but their RL
(01:11:43)
stack sucks. So if they catch up on RL,
(01:11:45)
like these models are going to get
(01:11:47)
ridiculously and then Anthropic is
(01:11:48)
obviously advancing as well, right? And
(01:11:49)
so and then and then you look across the
(01:11:51)
ecosystem, everyone's advancing really
(01:11:53)
fast progress. These moments are
(01:11:55)
happening, right? You know, chat GPT was
(01:11:57)
a moment. Gibbly was a moment. Those
(01:11:58)
were more consumer. Those were less like
(01:11:59)
I mean there chat GBT is everyone using
(01:12:01)
it for work too. But like I think cloud
(01:12:02)
code is like a new moment right 4.5 on
(01:12:05)
cloud code is a new moment where the way
(01:12:07)
you work has forever changed. And so now
(01:12:09)
we're trying to force everyone in my
(01:12:10)
company. There's 54 people here. I think
(01:12:12)
like half of them have coded. The other
(01:12:14)
half we're trying to force them to use
(01:12:15)
like cloud code. And it could be like oh
(01:12:17)
well actually you come from a consult a
(01:12:19)
semiconductor consulting background. Oh,
(01:12:20)
you come from like a semiconductor like
(01:12:22)
engineering of like package. Oh, you
(01:12:24)
worked in a fab, right? Like these kind
(01:12:25)
of people, they're using cloud code now,
(01:12:27)
right? And and their productivity is
(01:12:29)
being boosted.
(01:12:30)
>> And it's like,
(01:12:31)
>> you know, workspace, cloud workspace is
(01:12:33)
new. It sucks compared to cloud code,
(01:12:34)
but it'll get there, right? He he he
(01:12:36)
said he coded it entirely in cloud code.
(01:12:37)
You know that, right? Or that was on
(01:12:38)
your pod, right? Yeah. Yeah. So, like um
(01:12:41)
you I've heard that and I think maybe
(01:12:42)
that might have been from your pod uh
(01:12:44)
original uh disclosure.
(01:12:46)
>> My pod was before that, but yes. Oh,
(01:12:48)
okay. Okay. It was
(01:12:49)
>> I had as the guy on my pod subsequently
(01:12:50)
said that.
(01:12:51)
>> Okay. I think it's like a brand new age
(01:12:53)
and and like there's so much low hanging
(01:12:54)
fruit as Shto said on the episode when
(01:12:56)
he was here. There's so much low hanging
(01:12:57)
fruit. Yeah. I mean for for the models
(01:12:59)
progressing and I think model progress
(01:13:00)
will translate to revenue. Adoption is
(01:13:03)
difficult but like actually the UX of
(01:13:05)
cloud code sucks but like give it 6
(01:13:07)
months the models will be good enough
(01:13:08)
that the UX can be like talking to it.
(01:13:10)
Yep.
(01:13:10)
>> And you don't even have to have like you
(01:13:12)
know CLI integration, right? It's
(01:13:14)
something even easier. or like cloud for
(01:13:16)
XL was released recently and it's like
(01:13:18)
not bad you know building models and
(01:13:19)
like all these sorts of things are just
(01:13:20)
going to be like tell someone right like
(01:13:22)
why tell a junior analyst right when you
(01:13:24)
can just do it yourself I think it's a
(01:13:25)
whole new world and it's a $2 trillion
(01:13:27)
of software work but also of wages but
(01:13:29)
it's also we have more north of 2% 2% is
(01:13:32)
claw and then you know there's codeex
(01:13:33)
and cursor and all these other guys so
(01:13:35)
probably like 5% of code committed today
(01:13:36)
is AI generated if not higher marked as
(01:13:39)
AI generated what's going to happen when
(01:13:41)
normal workers who do spreadsheets and
(01:13:44)
office processing start automating their
(01:13:45)
workflows. I think it's a whole new
(01:13:47)
world.
(01:13:47)
>> And speaking of Schultoe, we both agreed
(01:13:49)
that he was a a perfect specimen.
(01:13:52)
>> Dude, [laughter] I' I've been I'm
(01:13:54)
straight, but I've been accused of being
(01:13:56)
uh homosexual, which is perfectly fine
(01:13:58)
for for how much I like praise this man
(01:14:01)
because like, think about it, right?
(01:14:02)
He's like 6'4. He's like really
(01:14:05)
good-looking. He's like Australian
(01:14:07)
accent. Sounds amazing. Like you've
(01:14:09)
heard his I I have like a annoying voice
(01:14:11)
probably. His voice sounds amazing. He's
(01:14:13)
absurdly good at coding. He was an
(01:14:15)
Olympian level fencer. Like like he
(01:14:18)
picks up any sport, he's really good at
(01:14:20)
it, right? Because he's athletic. It's
(01:14:21)
like, "Holy crap, you're a specimen."
(01:14:23)
>> Yeah. Yeah.
(01:14:24)
>> This clip and sent him [laughter] for
(01:14:26)
sure.
(01:14:28)
>> Yeah. It must be uh you know I guess uh
(01:14:31)
may maybe some people don't follow the
(01:14:33)
playbyplay on on Twitter and like don't
(01:14:36)
haven't haven't heard of like the fact
(01:14:37)
that all of you guys are roommates or
(01:14:39)
you roommate with Scholto and then with
(01:14:41)
Dwarish and Darkish is like the
(01:14:43)
podcasters podcaster. So it must be
(01:14:46)
absolutely
(01:14:47)
>> What's a podcasters podcaster mean?
(01:14:49)
>> Uh the podcaster that other podcasters
(01:14:52)
uh aspire to to to become or learn from.
(01:14:56)
>> Yeah. Yeah. his his when he's preparing,
(01:14:58)
you know, it's like he's he's he's he's
(01:15:00)
so locked in and he prepares so hard for
(01:15:02)
interviews. It's great.
(01:15:03)
>> No, he's he's just uh incredible.
(01:15:05)
>> And then and then he might only say like
(01:15:07)
a hundred words on the episode,
(01:15:09)
>> but he's prepared so hard and then like
(01:15:10)
I think people just realized, oh wow,
(01:15:12)
he's not just like, you know, it's like,
(01:15:13)
oh, he just has good guests. No, no, no.
(01:15:15)
Like he's preparing really hard, but you
(01:15:16)
can't tell if you're not like realizing
(01:15:18)
that. And then once he started writing
(01:15:20)
more and he started writing more, people
(01:15:21)
like, oh wow, he's actually really
(01:15:22)
really smart. It's like, yeah, cuz he's
(01:15:24)
studying like crazy. Like it's like,
(01:15:26)
"Oh, I'm interviewing an AI researcher
(01:15:27)
who worked on this. I'm gonna try and
(01:15:28)
train a freaking model." Yeah.
(01:15:29)
>> Right. It's like that's the level of
(01:15:31)
like commitment he goes to when he
(01:15:32)
records this stuff.
(01:15:33)
>> What do you guys talk about when you
(01:15:34)
bump into each other? Is that is that AI
(01:15:36)
non-stop or you talk about everything
(01:15:37)
but AI
(01:15:38)
>> with Shoto? It's like the Age of Empires
(01:15:40)
game, you know, because we we got super
(01:15:41)
into it for a bit. We talked only about
(01:15:43)
that in his RTS that he made. Uh with
(01:15:45)
with with Dwarash, it's I mean, it's all
(01:15:47)
sorts. It's like normal roommate stuff.
(01:15:49)
It's like, [laughter] "How's your dating
(01:15:50)
life?" "Oh, okay. You went on a date. It
(01:15:52)
wasn't well. It didn't go well." "Okay,
(01:15:53)
well, okay." Yeah. you know, like, oh,
(01:15:55)
you know, like that's me. That's me. You
(01:15:56)
know, my days don't go [laughter] well.
(01:15:58)
No, I'm just kidding. Um, or like it's
(01:16:00)
like, oh, you want to like have dinner?
(01:16:02)
We can invite a few friends. Like, yeah,
(01:16:03)
great. Or like, you know, it's like all
(01:16:04)
sorts of like normal stuff, too. Um, al
(01:16:06)
obviously we also do talk about a lot
(01:16:08)
about tech, right? Like we are like this
(01:16:10)
is our lives. Um, and tech is the most
(01:16:12)
fun thing.
(01:16:13)
>> Awesome. Well, great. Great San
(01:16:14)
Francisco lore. Uh, Dylan, thank you so
(01:16:17)
much. Uh, that was absolutely fabulous.
(01:16:18)
Really enjoyed it. Learned a lot. So,
(01:16:20)
really appreciate uh your coming on the
(01:16:22)
pub.
(01:16:22)
>> Thank you so much.
(01:16:25)
Hi, it's Matt Turk again. Thanks for
(01:16:26)
listening to this episode of the Mad
(01:16:28)
Podcast. If you enjoyed it, we'd be very
(01:16:30)
grateful if you would consider
(01:16:31)
subscribing if you haven't already or
(01:16:33)
leaving a positive review or comment on
(01:16:35)
whichever platform you're watching this
(01:16:37)
or listening to this episode from. This
(01:16:39)
really helps us build a podcast and get
(01:16:41)
great guests. Thanks, and see you at the
(01:16:44)
next episode.
