Dylan Patel: NVIDIA's New Moat & Why China is "Semiconductor Pilled” (YouTube Video Transcript)

↔

Title: Dylan Patel: NVIDIA’s New Moat & Why China is “Semiconductor Pilled”

Duration: 01:16:51

Total Correct Answers:

Dictation Mode:

Current Caption

Correct

Show All Captions

Learning Modes

Dictation

YouTube Video Transcript Hide

Ask AI Result

The ask AI result will appear here..

Show timestamps

Display as lines

(00:00:00) Your YouTube transcript will appear here (00:00:00) This is the biggest change in human (00:00:02) history maybe ever. What's about to (00:00:04) happen with AI? This is the biggest (00:00:06) revolution bigger than industrial (00:00:07) revolution. Jensen is very paranoid (00:00:09) about losing. If he just kept making his (00:00:10) mainline chip, people crush him on cost (00:00:12) and performance. Acquiring Grock is how (00:00:14) you get those resources to make more (00:00:15) solutions for different parts of the (00:00:17) market to stay king. At the end of the (00:00:18) day, this is an economic war. If the US (00:00:20) and the West win in AI, China will not (00:00:24) rise to be the global hedgeimony. But (00:00:25) without AI, China definitely will rise. (00:00:28) They're just going to outrun America. (00:00:29) Hi, I'm Matt Turk. Welcome back to the (00:00:31) Matt podcast. Today I'm joined by the (00:00:33) one person Wall Street and Silicon (00:00:35) Valley turn to when they need to cut (00:00:37) through the hardware hype, Dylan Patel (00:00:39) of Semi analysis. We dove into many of (00:00:41) the most important topics [music] of (00:00:43) today. Nvidia's massive move to acquire (00:00:45) Grock, the truth about the capex bubble, (00:00:47) whether the US power grid can actually (00:00:49) handle the AI boom, and the geopolitical (00:00:52) chess match [music] playing out between (00:00:53) the US and China. But I have to warn (00:00:55) you, this conversation went off the (00:00:57) rails in the best possible way. And we (00:00:59) ended up going into all sorts of fun (00:01:00) tangents like the strange phenomenon of (00:01:03) Chinese romance dramas set inside (00:01:05) semiconductor factories and what's (00:01:07) really like when three AI famous (00:01:08) roommates live together in SF. Please (00:01:10) enjoy this fantastic conversation with (00:01:12) Dylan. (00:01:15) >> Hey Dylan, welcome. (00:01:16) >> Hello. How are you? (00:01:17) >> I'm great. I'd love to start with Grock (00:01:19) and Nvidia since it's still fresh. So, (00:01:22) not so long ago, Nvidia was saying that (00:01:24) uh one GPU could do it all, and now (00:01:26) they're doing this acquisition (00:01:28) non-exclusive deal with Grock. What does (00:01:31) that mean from your perspective? (00:01:33) >> It's very clear. We're not sure where AI (00:01:36) models are headed in terms of, you know, (00:01:38) over the next few years, what happens to (00:01:40) the architecture, but you know, the (00:01:42) thing that I think everyone is sort of (00:01:43) like agreed on is models are pretty auto (00:01:45) reggressive, right? Next token (00:01:46) generation is like the thing but beyond (00:01:48) that right attention mechanisms changed (00:01:49) the how how it works everything changes (00:01:51) right could could change and so what's (00:01:52) interesting is the reason Nvidia one is (00:01:55) because they just took like the widest (00:01:56) surface area bet and then people kept (00:01:57) developing models on that and that kind (00:02:00) of shape worked but now the workload is (00:02:02) so large that there is room for (00:02:04) specialization that will give you 10x (00:02:06) increases in certain domains right in a (00:02:08) general purpose workload grock doesn't (00:02:10) work right you know it can't train it (00:02:12) can't you know it can't inference really (00:02:13) really large models um cost efficiently, (00:02:16) right? You can't serve many many many (00:02:17) users, but what it can do is it can go (00:02:19) bl screamingly fast, right? Same with (00:02:21) the cerebrous open AI deal, but that's (00:02:23) like one workload, right? Uh very decode (00:02:26) focused, right? Gener doing auto (00:02:27) reggressive tokens in a in a single (00:02:29) stream super fast. Another direction AI (00:02:31) models could head, right? We don't know (00:02:33) are models going to think in one token (00:02:35) stream or is it actually they're (00:02:38) constantly context switching, right? and (00:02:40) they're going from they have this (00:02:41) humongous humongous context and they're (00:02:44) generating in multiple parallel streams (00:02:47) right and so Google and openi have both (00:02:49) released mechanisms of this with their (00:02:51) pro models where the model actually (00:02:53) doesn't just have one single chain of (00:02:55) thought for reasoning it has multiple (00:02:57) right and then I don't exactly like you (00:02:59) know and and and how they choose which (00:03:00) one and what the final answer to you (00:03:02) delivers is is an area of research um (00:03:05) but there there is room for that kind of (00:03:07) chip right something that works on very (00:03:08) parallel a lot lot of streams of chain (00:03:11) of thought and maybe the latency (00:03:12) requirements are not as crazy, right? (00:03:13) Maybe you don't want to go blindingly (00:03:15) fast, right? Maybe you're okay with it (00:03:17) being, you know, because I can spin up (00:03:19) 100 parallel, you know, streams of (00:03:21) thought or agents or whatever you want (00:03:22) to call them. Maybe I I care a lot about (00:03:24) cost there. And because it's 100 in (00:03:26) parallel instead of one going super (00:03:27) super fast, it's not as deep, right? The (00:03:29) tree search or the depth of the (00:03:31) inference is not as deep, but it is much (00:03:32) wider. You know, there's other parts of (00:03:34) inference. Hey, process do creating the (00:03:35) KV cache. So, Nvidia has a chip for (00:03:37) that, right? That's the CPX. So they (00:03:39) they've made the CPX, they bought Grock (00:03:41) for decode, and then they still have (00:03:42) their general purpose GPU. So they've (00:03:43) they're kind of trying to cover their (00:03:45) bases because unlike the first wave of (00:03:47) AI chip companies where they sort of (00:03:49) just made chips and then tried to figure (00:03:50) out where it would work, right? They had (00:03:51) a thesis, Grock and Cerebrus, both as (00:03:53) well as Samanova, right, which was put a (00:03:55) lot of memory on the chip and not (00:03:57) necessarily in the case of Cerebrus and (00:03:59) Grock, no memory off chip. And in the (00:04:01) case of Samanova, less memory offchip or (00:04:03) slower memory offchip with higher (00:04:04) capacity. You know, they they sort of (00:04:06) all made similar bets in that direction. (00:04:08) And it didn't work for a while until it (00:04:11) kind of did, right? Um because there's a (00:04:13) workload that now necessitates it. (00:04:14) Nvidia recognizes they're they're the (00:04:16) leader. They're at the tent pole. Hey, (00:04:18) in one respect they can just run faster (00:04:21) than everyone, but it's kind of hard to (00:04:23) be 2x better than Google or or OpenAI or (00:04:27) whoever else's internal chip, right? To (00:04:29) justify their, you know, 75% plus (00:04:31) margins, right? And then they have to be (00:04:33) 2x to 4x better to justify 4x better to (00:04:36) justify their margins because that's (00:04:37) what they're charging above COGS. You (00:04:38) know, the question is what what (00:04:40) architecture will deliver that? Well, (00:04:42) yes, keep the programmability of their (00:04:43) GPUs is great for training and for a lot (00:04:47) of workloads, but you know, guess what? (00:04:48) I think I think a lot of people will (00:04:50) just be downloading an open source (00:04:52) model, downloading an inference (00:04:53) framework and pressing go, right? A (00:04:55) little bit more complicated than that, (00:04:56) but that's that's going to be the (00:04:58) consumption method for a lot of (00:04:59) enterprises, a lot of uh startups, a lot (00:05:02) of tech companies is they're just going (00:05:03) to do that or they're going to rent the (00:05:05) G GPUs or or rent the chips and then (00:05:07) download an open source framework and (00:05:08) model and go, right? And Nvidia (00:05:10) recognizes this and hey, there is room (00:05:11) for products that aren't general (00:05:13) purpose, right? The general purpose GPU (00:05:15) will still probably be the main line for (00:05:17) training and for a lot of inference and (00:05:19) for costefficient inference, but maybe (00:05:21) blindingly fast or workloads that have a (00:05:23) ton of prefill, i.e. creating the the KV (00:05:25) cache. Maybe that those workloads could (00:05:28) be different chips, right? And the CPX (00:05:29) chip they announced, right? They say (00:05:30) it's for the context processing, (00:05:32) creating the KV cache. It's also really (00:05:33) useful for video models because video (00:05:35) models don't care about memory bandwidth (00:05:36) and so you know why pay for the (00:05:37) expensive memory that the general (00:05:39) purpose chip has or why do what Grock is (00:05:40) doing which is tying hundreds or (00:05:42) thousands of chips together and not (00:05:44) having memory but keeping the entire (00:05:46) model on chip. The trade-off for that of (00:05:48) course is you need thousands of chips (00:05:49) and you have less compute per chip and (00:05:51) so like Nvidia's trying to capture the (00:05:53) whole surface area because again you (00:05:54) don't know where models are headed and (00:05:56) it's hard to say where the research is (00:05:57) headed. (00:05:57) >> And do you think it's a good thing for (00:05:58) the market? Yet another one of those (00:06:00) deals that's structured as a as a (00:06:02) license but really an acquisition. (00:06:04) >> I certainly think it's not good from an (00:06:08) anti-competitive sense, right? I don't (00:06:10) think people should just be able to buy (00:06:12) companies without like any antitrust (00:06:15) like process at all. Now, in the case of (00:06:17) like a large company buying a startup, (00:06:19) I'm completely fine with it. The flip (00:06:20) side is like, hey, we know the deal is (00:06:22) happening, right? Uh this happened for a (00:06:24) company I was an adviser for Nvidia (00:06:26) acquired in fabrica just maybe a few (00:06:27) months before they did Grock and similar (00:06:30) style of deal right if someone wanted to (00:06:32) strike it down that's the biggest limbo (00:06:34) right we've seen this happen in venture (00:06:35) and you probably know more stories of (00:06:37) this but like a company trying to get (00:06:38) acquired they get stuck in limbo for (00:06:40) like a year (00:06:41) >> and then it falls apart (00:06:43) >> stories (00:06:43) >> yeah it falls apart the deal did because (00:06:46) some regulatory BS and now the company (00:06:49) was and the founders were focused on (00:06:50) getting the deal done instead of like (00:06:52) making the product better for a year and (00:06:53) now they're like behind or you know they (00:06:54) they they weren't focused on growth as (00:06:56) much right you know you only have so (00:06:57) much time as a founder so in that sense (00:06:59) I like the license deals right (00:07:00) >> so now is uh Nvidia also dominating the (00:07:04) the inference market is there any world (00:07:06) where Nvidia is no longer the king or (00:07:08) they seem to be getting stronger (00:07:10) >> I think the thing about Nvidia is they (00:07:13) take the Andy Grove mentality like more (00:07:16) serious than anyone else right like okay (00:07:18) fine Google like implemented OKRs (00:07:20) because Intel did it but that's like you (00:07:21) know management stuff, right? Only the (00:07:23) paranoid survive, right? This is like (00:07:25) core to the Bay Area, um core to Nvidia. (00:07:28) Um Jensen is very paranoid about losing, (00:07:31) right? These specializations, if he just (00:07:32) kept making his mainline chip, would (00:07:34) mean people could, you know, point point (00:07:36) solutions for specific parts of the (00:07:38) market would crush him on cost and (00:07:39) performance. Then he can't justify his (00:07:40) margin. That's a threat to Nvidia's (00:07:42) business model as a whole, especially if (00:07:44) the best model only changes every 3 (00:07:46) months or the model you want to roll (00:07:47) out. Okay, well then you're going to (00:07:48) have three months to figure out how to (00:07:49) make a model work on one chip (00:07:51) architecture for that point solution and (00:07:53) you know it's fine. Software software (00:07:54) advantage of Nvidia is not that (00:07:55) important. Then Jensen's super paranoid (00:07:57) about losing and frankly it's really (00:07:59) hard to hire enough talented chip (00:08:01) people. When you look across the market, (00:08:03) there is only a few companies who have (00:08:05) successfully created a chip architecture (00:08:08) software to run the models accurately, (00:08:11) run the run the models accurately, (00:08:12) right? Like cuz you can look at random (00:08:14) APIs of say an Alibaba Quen model and (00:08:17) different people are doing all sorts of (00:08:18) tricks like quantizing it, but also many (00:08:20) other tricks which then end up like (00:08:22) making the model quality lower. You (00:08:23) know, building a rack scale solution, (00:08:25) networking thousands of chips together (00:08:26) and then deploying an API and Grock did (00:08:28) the whole thing with frankly not that (00:08:30) many people. So now it's like okay well (00:08:32) I'm Nvidia I want to make four different (00:08:33) chip architectures and actually four (00:08:35) different point solutions maybe the (00:08:36) general purpose and then one here one (00:08:37) here one here and in addition my general (00:08:40) purpose thing is actually not just like (00:08:41) a GPU chip it's like GPU chips CPU chips (00:08:44) networking chips NV switch nicks like (00:08:47) you know there's many many chips and (00:08:48) each of those chips has many chiplets (00:08:50) you don't have enough engineering (00:08:51) resources right and so like acquiring (00:08:53) gro is like how you get those resources (00:08:54) to make more solutions for different (00:08:56) parts of the market as far as like are (00:08:58) they threatened like I think I think (00:08:59) like obviously There's some cool (00:09:01) startups out there, right, that are (00:09:02) raising a lot, right, currently or have (00:09:05) raised such as Etched, Maddx, uh, (00:09:07) positron, these new age of AI companies. (00:09:09) There's also the prior age of like (00:09:10) Cerebrris is is out there still, right? (00:09:13) You know, Tenstor, etc. And there's so (00:09:14) there's a lot of AI chip companies on (00:09:15) the startup side, but then there's also, (00:09:16) you know, Google's TPU, AMD GPUs, uh, (00:09:20) Amazon Tranium, uh, who are all really (00:09:22) credible competitors. And then, you (00:09:24) know, Meta's MTIA is somewhat credible. (00:09:26) and then you know Microsoft Somaya is (00:09:28) not credible but like you know maybe it (00:09:30) will be one day right so you sort of (00:09:31) have like a lot of competition they've (00:09:33) got to hold the gates back and so I (00:09:34) think (00:09:35) >> is there a risk to them being I mean (00:09:37) like there's there's risk from all of (00:09:38) those companies that I mentioned and and (00:09:40) you know effectively California/ Seattle (00:09:43) right only two two places there's (00:09:45) there's also chips from other parts of (00:09:46) the world right obviously China has a (00:09:48) number of different AI chip companies (00:09:49) that are doing cool things anyone would (00:09:51) have told you Grock was you know their (00:09:53) business revenue their revenue was not (00:09:55) like stellar right in fact they missed (00:09:57) revenue last year significantly and yet (00:09:58) they got bought right because the value (00:10:00) of the IP was there and the value of the (00:10:01) team anyone else would have been like (00:10:02) well why the heck would I buy this right (00:10:04) uh makes no sense there's definitely a (00:10:06) credible threat (00:10:06) >> yeah and do you think uh CUDA is going (00:10:09) to remain that mode I guess a (00:10:11) combination of CUDA and whatever came (00:10:13) out of the Melanox acquisition like do (00:10:15) do those persist as long-lasting (00:10:17) advantages (00:10:18) >> I think they do I think networking is (00:10:20) super important I think uh the CUDA (00:10:22) software mode is very important but it's (00:10:24) also like changing rapidly right It's an (00:10:26) incredible amount of the software that (00:10:28) Nvidia GPUs run on is not from Nvidia. (00:10:30) It's it's the developer ecosystem that's (00:10:32) open sourcing it. When you look at, for (00:10:34) example, VLM and SGLANG, right? These (00:10:37) support AMD GPUs almost as first class (00:10:39) citizens now. And VLM is getting (00:10:42) significant support for TPUs for tranium (00:10:46) and there will be other chips coming out (00:10:48) from startups that also support VLMs (00:10:50) SGLang. Now like how difficult is it? (00:10:52) You know the the the reason why CUDA is (00:10:54) so important is like okay I can do (00:10:55) whatever I need to do right programming (00:10:57) a GPU. (00:10:58) >> I think most AI chips will not be (00:11:01) consumed by people programming anything (00:11:03) for it. (00:11:04) >> They will download an open source (00:11:06) inference engine. and they will download (00:11:07) an open source model and then they will (00:11:09) put it on the and it's really simple to (00:11:11) download VLM and like make it work like (00:11:13) it's not that hard to set up uh you know (00:11:15) a server and Nvidia's putting out a lot (00:11:16) of open source software like Triton (00:11:18) inference server and and uh Dynamo and (00:11:21) all these things to to make it easy (00:11:22) because that is the consumption model (00:11:24) ultimately for the majority of AI right (00:11:27) is and it might be like oh it's my own (00:11:29) inference engine but most servers will (00:11:31) not run code besides the inference (00:11:33) engine and the model it's like not like (00:11:35) people are actually like researchers are (00:11:37) like writing code for GPUs to see ideas (00:11:39) if they'll work and train models and all (00:11:40) these things or just mess around with (00:11:42) them to figure out you know infra (00:11:44) performance or whatever it is but most (00:11:45) of it won't be there and so CUDA as a (00:11:47) mode CUDA language is like you know like (00:11:49) it's like fine right like you know no (00:11:50) one actually writes CUDA right like most (00:11:52) people write PyTorch and then like torch (00:11:54) compile and then they just run it on the (00:11:55) GPU they don't write CUDA but a lot of (00:11:57) this CUDA mode is like how does PyTorch (00:11:59) translate into high performance GPUs and (00:12:01) that surface area from when people were (00:12:04) just writing like hardcore when people (00:12:05) are hardcore writing CUDA kernels to (00:12:07) like hey they're writing PyTorch and (00:12:09) then it's compiling down to GPUs versus (00:12:11) oh I'm just downloading VLM is it is a (00:12:14) it is a curve of like not a ton of (00:12:15) people that can do CUDA kernels a whole (00:12:17) lot more people can do PyTorch right (00:12:19) random you know PhDs and random people (00:12:21) it's very simple right a crapload of (00:12:23) people can do VLM download it run it on (00:12:26) a server well if it now supports other (00:12:28) chips what is the CUDA mode's recognized (00:12:30) this and they've been building software (00:12:32) that is not necessarily the CUDA remote (00:12:34) and I I can give some examples All (00:12:35) right. So the name of the game is fast (00:12:37) tokens and lowest cost tokens, right? (00:12:40) And lowest cost tokens happens by your (00:12:42) chip being fast. But there's also (00:12:43) tricks, right? One example, right? Like (00:12:45) I mentioned with, you know, the CPX (00:12:46) versus Grock, right? Is processing your (00:12:48) prefill contacts, right? Super cheap (00:12:51) CPX, right? If I'm if I'm care a lot (00:12:53) about speed, then Grock. These are (00:12:54) optimizations on the hardware side. (00:12:56) There's optimizations on the software (00:12:57) side as well, right? And so one example (00:12:59) is when I'm doing for example if I look (00:13:02) at a cloud code or a cursor type (00:13:05) application right the workload is like (00:13:08) it takes your repo takes the relevant (00:13:10) parts of your repo puts it in the (00:13:12) context of the LLM it prompts it (00:13:14) generates right and if it's an agent (00:13:15) mode it it it circulates the context a (00:13:18) couple times it'll collapse put things (00:13:19) off to the side access different (00:13:20) contexts but what's you know especially (00:13:22) when you think about an agent for (00:13:23) software and you can see this in codeex (00:13:25) you know Codex Codex actually not as (00:13:27) good as cloud code, but it can do work (00:13:29) on time horizons of like 9 10 hours. Um, (00:13:32) and do like a big refactor better than (00:13:34) cloud code can, even though most of the (00:13:35) times cloud code is better. And and (00:13:37) what's interesting about Codeex does is (00:13:39) it'll like take your repo, it'll (00:13:41) identify parts if you're asking it to (00:13:42) refactor it, identify parts, write (00:13:44) stuff, you know, make like these notes (00:13:46) for itself everywhere, collapse the (00:13:47) context, switch from this part of the (00:13:49) repo to that part of the repo to this (00:13:50) part of the repo. But when you think (00:13:51) about it, it's like, oh, if this thing (00:13:52) is just generating tokens all the time, (00:13:54) plus it's switching what my context is (00:13:57) constantly, that's really expensive, (00:14:00) right? If you think about like what's (00:14:01) the cost of inference, um, I want to say (00:14:04) it's like it's it's $10 per million (00:14:06) tokens of output and or and $3 for (00:14:10) decode or 10 for decode and three for (00:14:12) prefill. Uh, and so if you think about, (00:14:14) oh, it just worked for nine hours on one (00:14:16) task, one refactor, huge value. But if (00:14:18) it changed context a ton of times and (00:14:20) your context is like 30k usually or 50k (00:14:23) or you know heading to hundreds of (00:14:24) thousands you know how long your how big (00:14:26) your repository is and how much context (00:14:28) switch now you're spending all this (00:14:29) money on on prefill right not the decode (00:14:31) tokens but actually why am I like (00:14:33) regenerating the KV cache I can actually (00:14:36) just like store the KV cache elsewhere (00:14:38) and then when I need it again I can pull (00:14:39) it and and plop it into CPU memory or (00:14:41) into GPU memory. And so Nvidia's got (00:14:43) this like KV cache manager and they've (00:14:45) been working really hard on like making (00:14:47) it so they can interface SSDs and stick (00:14:50) the KV cache on there and pull it out (00:14:51) whenever they want. So for this kind of (00:14:53) workload and then if you do this and you (00:14:55) look at like coding as an application (00:14:57) and you like look at these coding (00:14:58) companies and how much they're paying (00:14:59) for prefill versus decode actually (00:15:01) majority of their cost is pre-fill (00:15:02) tokens not decode tokens because their (00:15:04) context is just so large and it's (00:15:06) switching all the time even in agent (00:15:08) modes. You know, if you can now not have (00:15:10) to do the pre-fill, your costs go down (00:15:11) dramatically. But that's a very (00:15:13) complicated thing to do from a software (00:15:15) perspective. You know, companies like (00:15:16) Enthropic, Google, OpenAI have already (00:15:18) done it. But what about the wide world, (00:15:20) right? And so Nvidia is trying to make (00:15:21) the open source software for this. And (00:15:23) that's like CUDA mode, but it's like (00:15:24) actually no, none of this is CUDA, (00:15:26) right? like it's like memory management (00:15:27) and like you know storage management and (00:15:30) when do you call what and how do you (00:15:31) transfer it and how do you like spread (00:15:32) the KV cache across a bunch of different (00:15:34) storage nodes and what happens when you (00:15:36) read it and the network congestion just (00:15:37) like all these things yeah it's like (00:15:39) Nvidia's wheelhouse but it's not CUDA (00:15:41) and I think like the easy way to say it (00:15:42) is it is the CUDA mode right and so (00:15:44) things like this KV cache manager and (00:15:46) many other things they're trying to do (00:15:48) to reduce the cost of inference like is (00:15:50) how they build the new CUDA mode because (00:15:52) again today it's it's you know it is (00:15:55) quite I mean AMD is like not fully there (00:15:57) yet and TPU is being added right now and (00:15:59) tranium is being added soon as well to (00:16:01) VLM but all of them will have a very (00:16:03) good UX for download model run model on (00:16:06) VLM by the middle of the year I think (00:16:09) right certainly AMD is already there by (00:16:11) the end of this quarter we have (00:16:12) something that like tests this right (00:16:13) it's called inferencemaxa it's open (00:16:15) source all the code is and the results (00:16:16) are uh but we run across I think $60 (00:16:19) million of GPUs which are donated to us (00:16:21) by companies like Nvidia AMD openai (00:16:24) Microsoft Amazon on Crusoe, Core Weave, (00:16:27) Together AI, uh all these companies are (00:16:30) sponsoring GPUs for us to run this. (00:16:31) We're running VLM and SDG Lang every (00:16:33) night on, you know, nine different kinds (00:16:35) of GPUs on a variety of different models (00:16:37) and different work uh context lens and (00:16:39) all these things, right? To see the (00:16:40) performance and you can see the (00:16:41) performance moving every day or pretty (00:16:42) often because the software changes all (00:16:44) the time. And so like the fact that this (00:16:46) exists is the cuda boat, right? It's not (00:16:48) that like AMD you can do this on their (00:16:50) chips, Nvidia can do this on their (00:16:51) chips. It's oh when the new model comes (00:16:52) out, how fast does it get to peak (00:16:54) performance? because you know it's it's (00:16:55) a moving target or hey can I implement (00:16:57) this KV cache management thing how hard (00:16:59) is it how many engineers do I need oh (00:17:01) just one great like or 10 great if I (00:17:03) need a hundred people to develop it like (00:17:05) Google and you know so on and so forth (00:17:06) did then that's much harder (00:17:08) >> do you think AMD can uh catch up (00:17:09) >> I think AMD will be caught up at times (00:17:12) and very behind at other times like (00:17:14) currently they're super far behind right (00:17:16) because Blackwell is just way better (00:17:17) than MI355 um and then you know Rubin (00:17:20) comes out and they'll be way way behind (00:17:21) but then AMD's new chip comes out and (00:17:22) AMD will be caught up or evenlight ly (00:17:24) ahead on a hardware perspective. (00:17:26) Software's behind, right? And you have (00:17:27) this like leaprogging and and AMD is a (00:17:29) very credible second competitor. I don't (00:17:31) think they'll go beyond like I think (00:17:33) they'll stay in single digits market (00:17:34) share. Single digit percentage market (00:17:36) share. (00:17:37) >> Single digit percentage market share is (00:17:38) >> still [laughter] pretty good. (00:17:40) >> Yeah. I mean, Nvidia's revenue this year (00:17:41) is going to be like (00:17:43) >> it's a lot. (00:17:44) >> The three gajillion (00:17:45) dollars. (00:17:46) >> I think it's actually four gajillion. (00:17:48) [laughter] (00:17:49) [gasps] (00:17:49) >> What about all the startups? You (00:17:51) mentioned a few. So there's a cerebrus (00:17:54) on the one end of the spectrum and then (00:17:56) newer ones edged and and others if if (00:17:59) AMD has a you know uphill battle in (00:18:02) front of them like do you think those (00:18:03) guys can take significant market share? (00:18:06) you sort of the whole specialization (00:18:07) game, right? You you have to specialize (00:18:09) because you're never going to beat (00:18:10) Nvidia at their own game, right? They're (00:18:12) going to have the supply chain unlock. (00:18:14) They're going to get to the newest (00:18:15) memory technology or process technology (00:18:17) or whatever packaging technology, (00:18:18) whatever it is, sooner than you and (00:18:20) they're just going to crush you, right? (00:18:21) If you play their game, you have to AMD (00:18:23) is trying to play Nvidia's game, but AMD (00:18:26) is like extremely good at engineering (00:18:28) silicon, right? Everyone else has to has (00:18:31) to has to try something weird or (00:18:33) different, right? And so when you look (00:18:34) at Etched or Maddx or Posatron or (00:18:37) Cerebrris or Tenstor, you go to look at (00:18:39) all these companies, right? There are (00:18:41) unique things about what they're doing (00:18:44) and it's not clear if AI models will (00:18:47) still be within that realm when that (00:18:49) comes out, right? Uh does oh now people (00:18:52) use like engrams and other sparse (00:18:54) attention techniques. Is that like is (00:18:56) does that change like some of the (00:18:58) specializations people are doing or hey (00:19:00) people are now doing like you know (00:19:02) models are now sparse instead of being (00:19:04) dense models does that change things (00:19:06) there's so many optimizations and (00:19:08) changes on the model side and you can't (00:19:10) predict what's going to happen with the (00:19:12) ML research easily at least you can't (00:19:14) the thing you're optimizing for today (00:19:16) has to be a vision of where AI will be (00:19:18) in 2 years and Nvidia's fully accepted (00:19:20) they don't know where that's going to be (00:19:21) that's why they have a portfolio of (00:19:24) chips now not just one GPU line, right? (00:19:26) It's not just Hopper, Blackwell, Reuben. (00:19:29) Now, it's going to be, you know, it's (00:19:30) not Ampure, Hopper, you know, you know, (00:19:31) it's not that line. It's like there's a (00:19:33) variety of chips to serve the different (00:19:34) markets um and different possible (00:19:36) scenarios. They think each of them has (00:19:38) this vision today, but oh, it might turn (00:19:39) out the general purpose one sucks and (00:19:41) and actually AI models have developed in (00:19:42) a way where CPX or Grock style chips are (00:19:45) the best, right? Well, okay, now we have (00:19:46) a solution for that market. And so, I (00:19:48) think that's the challenge with the (00:19:49) startups. With that said, I think (00:19:51) they're all taking very interesting (00:19:52) bets. I think it's I think it's much (00:19:54) more exciting than the first wave of AI (00:19:57) hardware uh bets graph course rebringing (00:20:05) the memory on the chip they sort of just (00:20:07) made a bet and they optimized for a (00:20:08) certain kind of model all similar kinds (00:20:10) of model and it didn't end up working (00:20:11) out for a long time right they had to (00:20:13) pivot and they had to work on a lot of (00:20:14) things and it took a long time I think (00:20:16) these companies have like a really clear (00:20:18) vision of what they think models will (00:20:20) look like right like Etch does Maddx (00:20:22) does, Posatron does, and that's what's (00:20:24) really cool about it between the three (00:20:25) of them, uh, these new age. So, I mean, (00:20:27) I'm I'm excited for them. I'm very very (00:20:29) skeptical. I don't know what uh what a (00:20:31) venture capitalist views as likely (00:20:33) chances of succeeding, but I think all (00:20:35) of them are less than 1%. Right? (00:20:38) >> But, you know, that's that's that's a (00:20:40) >> but the world where they win is a (00:20:42) multi-silicon kind of world where any (00:20:44) given customer uses a range of different (00:20:48) GPUs. It could it could or it could be (00:20:50) any given customer has like one workload (00:20:52) they care a lot about. Anthropic clearly (00:20:55) does not give a crap about videogen (00:20:56) image gen right they just don't care. Um (00:20:59) on the flip side, company like (00:21:00) midjourney cares a lot about image and (00:21:02) videogen, right? Image and videogen is (00:21:05) very very like like I mentioned like (00:21:07) it's a very like it's not very memory (00:21:08) bandwidth heavy. It loves loves loves (00:21:11) compute, right? Whereas inference of (00:21:13) large language models in the style of (00:21:15) like you know this these you know say (00:21:17) for example coding agents cares a lot (00:21:19) about decoding for long streams of time. (00:21:21) Um and that's very memory bandwidth (00:21:23) heavy right? And so there's like that's (00:21:24) like a simple example, but there's a lot (00:21:26) more nuance there in terms of like even (00:21:28) like the size of like the matrix (00:21:30) multiply, you know, the tensor cores (00:21:31) that you you know the systolic arrays (00:21:32) that you use or the ratios of networking (00:21:34) and memory memory and like what's that (00:21:36) memory hierarchy look like and you know (00:21:37) what are you doing for different kinds (00:21:38) of attention and like oh like all these (00:21:40) sorts of things like there's a lot of (00:21:42) specialization here and so some people (00:21:44) are betting big on on different types of (00:21:45) specialization and I I think like you (00:21:47) could clearly see a world where (00:21:49) companies do care about different stuff (00:21:51) right like like if for example a chip (00:21:53) optimized for video and image generation (00:21:56) existed today and it was better than (00:21:58) Nvidia or Nvidia made it. I think (00:22:00) Midjourney would absolutely only use (00:22:02) that for inference. I think for training (00:22:03) they'd still use the general purpose (00:22:04) thing and as would like Meta and Google (00:22:06) would like they should do that, right? (00:22:08) And hey, Meta actually has two lines of (00:22:10) AI chips there. MTIA there's a line (00:22:13) that's focused on recommendation systems (00:22:15) and then there's a line that's focused (00:22:16) on Gen AI. The GI one is a new line, but (00:22:19) that recommendation systems ch line is (00:22:20) still continuing, right? It's not sexy. (00:22:22) No one cares because there's no and bite (00:22:24) dance also has a recommendation system (00:22:26) line of chips and it's not really (00:22:27) focused on Jedi which is fine because (00:22:30) you know this is a $200 billion business (00:22:31) or something which is just deciding what (00:22:33) ad to serve me right and what order to (00:22:35) put my friends stories and you know (00:22:36) things like this so so I think like it's (00:22:38) perfectly fine for there to be (00:22:39) specialized AI chips given the target (00:22:41) market is big enough and you have to (00:22:42) have vision to know what that target (00:22:44) market is unless you're hyperscaler then (00:22:46) you can like just like you can just use (00:22:47) general purpose until you've like it's (00:22:49) clearly there and then you can make your (00:22:50) asich right (00:22:51) >> fascinating turning to the geopolitical (00:22:53) aspect of of uh all of this which is (00:22:56) always fun. Huawei and Nvidia in China (00:23:00) last year that was like 10 or 12% of (00:23:02) their overall revenue and this year they (00:23:04) they were saying that their market share (00:23:06) but has basically dropped to not very (00:23:08) much. Is that Huawei chips? Is that (00:23:10) restrictions? Is that tariffs? Uh what's (00:23:12) happening? (00:23:12) >> It's a variety of things actually in in (00:23:14) some in some quarters last year. uh it (00:23:16) was even north of 20 I think but I don't (00:23:18) remember exactly but anyways you know if (00:23:20) you look at 2022 China was almost the (00:23:23) size of the US in terms of buying server (00:23:24) hardware right almost not quite but (00:23:26) getting there um and it looked like they (00:23:28) were going to be the same size as (00:23:30) America in like a year or two after that (00:23:31) right and if you look at like global (00:23:32) data center capacity global cloud (00:23:35) capacity etc etc etc it's American (00:23:37) companies and Chinese companies right (00:23:38) that dominate the world American (00:23:39) companies obviously doing a lot better (00:23:40) here but both of those dominate the (00:23:42) world and if you look at like every (00:23:44) industry right you know it's It's it's (00:23:46) very clear that like China wants to (00:23:48) insource stuff, right? So in 2015, they (00:23:50) made these 5-year plans for two 2020 and (00:23:52) 2020 uh five where they set the (00:23:55) percentage of semiconductors they wanted (00:23:57) uh domestically produced and they've (00:23:59) missed the goal both times which is (00:24:01) fine, right? They set really aggressive (00:24:02) goals and even you know shoot for the uh (00:24:04) moon even if you miss you hit the stars, (00:24:06) right? And that's sort of what's (00:24:07) happened, right? Like look, China is not (00:24:09) caught up on, you know, leading edge (00:24:11) semiconductors, but microcontrollers (00:24:13) from China are almost as good as the (00:24:15) microcontrollers are as good and cheaper (00:24:17) than the ones from Texas Instruments or (00:24:19) ST Micro or, you know, etc., right? Or (00:24:21) like this power random power chip is (00:24:22) better than or the same as the one from (00:24:24) like another company, right? And so (00:24:25) they've really built up a semiconductor (00:24:27) industry and started insourcing a lot (00:24:28) more. I don't see why China wouldn't be (00:24:30) buying you know 30 40% of the world's AI (00:24:33) chips and the US like 50 60% and then (00:24:35) the rest of the world like you know and (00:24:37) when I say US I mean US origin companies (00:24:39) that seems like a more natural state for (00:24:41) the world but there are restrictions and (00:24:43) and hey this is the biggest change in (00:24:46) human history maybe ever knowledge work (00:24:48) and you know everything that's going to (00:24:50) happen there and and then eventually (00:24:51) like robotics and all these things like (00:24:53) you know obviously there's there's a lot (00:24:54) of geopolitical stuff and so there are (00:24:55) restrictions Nvidia's been handcapped (00:24:58) hand handicapped from selling their best (00:24:59) chips to China. And so that's obviously (00:25:01) impacted the sales a lot because like (00:25:03) why would you do that? And so when you (00:25:05) look at who rents the most GPUs in the (00:25:07) world, it's three companies, right? So (00:25:09) one of them is obviously OpenAI. Second (00:25:10) one, actually they were bigger than (00:25:12) OpenAI. They are bigger than OpenAI (00:25:13) today or no, they were bigger than (00:25:15) OpenAI than OpenI. Eclipsed them (00:25:16) recently is Bite Dance. Bite Dance runs (00:25:18) rents tons of chips from Oracle and (00:25:21) Google and and you know many other cloud (00:25:23) companies because they couldn't get the (00:25:25) chips they need in in China. They're (00:25:27) mostly just serving Tik Tok, right? (00:25:29) Okay. Well, they they're not allowed to (00:25:31) buy them and that sucks, but you know, (00:25:32) they're they're allowed to rent them. (00:25:33) And so, okay, if I'm not allowed to get (00:25:34) the best ones, I'm going to rent (00:25:35) externally. And if Bite Dance is the (00:25:37) second biggest renter of GPUs in the (00:25:38) world, that's substituting demand that (00:25:39) would have been built in China in many (00:25:41) cases. It's instead being built in (00:25:42) Malaysia. And Oracle has over a gigawatt (00:25:44) of capacity in Malaysia that Bite Dance (00:25:46) is going to take, right? So, things like (00:25:48) this are, you know, you know, hundreds (00:25:50) of thousands, if not millions of chips, (00:25:51) tens of billions of dollars of cap (00:25:53) capacity that would go to China, but (00:25:54) it's not. that it's going to Malaysia (00:25:55) instead as an example. Another sort of (00:25:57) point around this is China's like you (00:25:59) know they've had these 5-year plans. So (00:26:01) and and you know the way these (00:26:02) initiatives work from China is there is (00:26:04) like some top down ordering but then (00:26:06) they just kind of whip the whole like (00:26:07) everyone just kind of gets into it and (00:26:08) it's really cool like I don't think it's (00:26:10) as top down as many people think. Like I (00:26:12) think the entire country is like (00:26:13) semiconductor pill right there are (00:26:16) dramas where people fall in love in the (00:26:18) fab or dramas where people fall in love (00:26:22) and they're photovoltaic like solar cell (00:26:24) researchers and engineers and it's like (00:26:26) it's like this is just the backdrop and (00:26:28) it's like actually this is it's like (00:26:29) super cool for your like significant (00:26:32) other to be that semiconductor engineer (00:26:34) or to be that photovoltaic you know uh (00:26:37) solar panel researcher (00:26:38) >> as opposed to an influencer (00:26:40) >> as opposed to an influencer. Right. Like (00:26:41) I'm sorry. Love Island is I I I watched (00:26:44) like for 10 minutes cuz I was forced to. (00:26:45) I was like this is freaking terrible. (00:26:47) [laughter] Um but you know like um (00:26:50) >> we are so cooked. (00:26:51) >> No, you know [laughter] seriously we're (00:26:52) cooked. We're cooked. And I think I (00:26:53) think like when you think about like (00:26:55) this happens it's like it's diffused (00:26:56) into drama even people like like there's (00:26:59) multiple dramas like taking place about (00:27:01) semiconductor industry and and they're (00:27:03) like romance comedy like the entire (00:27:06) spectrum, right? Drama like it's like (00:27:08) it's like what the heck is going on? (00:27:09) Anyways, you have all these provinces, (00:27:11) you have all these local cities studying (00:27:14) out ordinances and giving out subsidies (00:27:17) and all sorts of stuff, right? It's (00:27:19) truly like crazy. Like there's some (00:27:21) national level stuff like, "Oh, no taxes (00:27:23) on uh this. Oh, we're going to ban a few (00:27:25) things." But as far as I understand, the (00:27:27) national government has not banned (00:27:29) Nvidia's H20 or H200. But the local ones (00:27:33) have, right? A lot of local ones have (00:27:35) said, "No, you know, you must use China (00:27:37) manufactured chips." And it's like, who (00:27:38) told you that, you know, you're here to (00:27:40) uphold this? It's like, does it matter, (00:27:41) right? I mean, like, it's it's it's cool (00:27:43) because then you have this like survival (00:27:44) of the fittest, all these all these (00:27:46) provinces and cities are trying to (00:27:48) attract different companies with (00:27:49) different types of subsidies and grants (00:27:52) and industrial parks and like all these (00:27:54) different things (00:27:55) >> and then like the ones who succeed (00:27:57) actually develop an industry and they (00:27:58) take over. (00:27:59) >> This how one thinks of of China, right? (00:28:01) It almost sounds like more like the US (00:28:02) or like with the federal government and (00:28:04) states where the provinces have (00:28:05) authority over their purchasing. It's (00:28:07) It's actually like uh great. There's (00:28:09) this one um Tik Tok or not Tik Tok, Tik (00:28:12) Tok and Instagram like uh person and (00:28:14) they're like they they like sing it. (00:28:15) They're like if you want to if you want (00:28:16) to buy things in China, make sure you go (00:28:18) to the right place. And then they just (00:28:19) say the most random [ __ ] and name the (00:28:21) city. And then you look into it and (00:28:22) you're like wow this city has the entire (00:28:24) supply chain for this. Um and it's like (00:28:26) lampshades and then it names the city. (00:28:27) It's like what the [ __ ] There's a city (00:28:29) that specializes in lampshades. Like (00:28:31) it's like and it's like microphone arms (00:28:33) like microphones. It's like it's like (00:28:34) literally there's a city in China that (00:28:36) specializes in (00:28:36) >> guitars as well, right? This one one (00:28:38) city that became the guitar capital of (00:28:39) the world. (00:28:39) >> It's literally everything. (00:28:41) >> Literally everything. There's a city and (00:28:43) it's not like hey specifically for uh (00:28:46) camera arms for example, there's ball (00:28:48) bearings in this and the ball bearings (00:28:49) are like there's ball bearings. There's (00:28:50) multiple manufacturers of ball bearings (00:28:52) for camera arms (00:28:53) >> and then like most of the camera arms in (00:28:55) the world come from that one city. It's (00:28:56) like what the hell is going on? Um and (00:28:58) and so like the semiconductor industry I (00:29:00) think people don't realize is absurdly (00:29:01) specialized. I'm not answering your (00:29:03) question. I'm just going a little bit of (00:29:04) a rant because I think people don't (00:29:05) understand China semiconductors. It's (00:29:07) really sick or semiconductors in (00:29:08) general. But like you know like in Japan (00:29:11) they like focus on a few different types (00:29:13) of chemicals and they're the best at it (00:29:15) and it's like almost a cultural thing, (00:29:16) right? Japanese people were so precise (00:29:18) like with sushi and like it's all about (00:29:20) the trade and the craft and like you (00:29:21) know the French food in Japan is better (00:29:23) than the French food in France because (00:29:24) the f the Japanese chefs went there and (00:29:26) then come back and they perfected it in (00:29:27) Japan and like cuz they're so precise (00:29:29) and and there's so many different like (00:29:30) things that like Japan is so good at (00:29:32) because they're so precise and like (00:29:34) dedicated to the craft and it comes out (00:29:35) of like I don't know like samurai (00:29:37) culture or something I don't know right (00:29:38) like I don't exactly know how that (00:29:40) culture came up and so when you look at (00:29:41) like and it's like across the world (00:29:43) there's different places where things (00:29:44) like this happen right like Oh, like the (00:29:46) Netherlands makes EUV tools. Cool. I (00:29:49) guess so. And you look across the (00:29:50) semiconductor industry. There's a famous (00:29:51) economic essay called I pencil or (00:29:54) something like that. Or talking about (00:29:55) how the pencil like a simple pencil (00:29:57) comes from like oh the rubber comes from (00:29:59) like Indonesia for the eraser and the (00:30:02) graphite comes from this mine here and (00:30:03) and the wood comes from these aspen (00:30:05) trees in Canada and like you actually (00:30:06) can't make a pencil without aggregating (00:30:08) this entire supply chain. semiconductor (00:30:10) industry is like way crazier because (00:30:11) like I would say there's like 15 or 20 (00:30:13) countries that could shut down the (00:30:14) entire semiconductor industry, right? (00:30:16) Even like Austria could, right? And and (00:30:17) it's like what? And it's like well yeah, (00:30:18) there's two different companies there (00:30:19) who have like 90% share in like some (00:30:22) random niche stuff. (00:30:23) >> And it's like okay, cool. I guess (00:30:25) Austria can and oh yeah, those two (00:30:27) companies only like have less than a (00:30:28) billion of revenue, but they just happen (00:30:30) to have lynchpin critical things. And (00:30:32) there's lynch pin critical things (00:30:33) everywhere because the process is so (00:30:34) complicated. And so China's been trying (00:30:36) to replicate this. Um, (00:30:37) >> is there one thing they're missing that (00:30:38) they don't have yet? (00:30:39) >> I think there's a lot of things. I think (00:30:41) if you were to close your eyes and say (00:30:42) or if you were to cut off every country (00:30:43) and say there's no more globalism, China (00:30:45) has the most vertical stack in (00:30:47) semiconductors today and they're the (00:30:48) best at semiconductors in the world (00:30:50) because their fabs could still run (00:30:51) somewhat on a lot of things because they (00:30:53) have built some of these chemical supply (00:30:54) chains, right? Like TSMC for certain (00:30:57) kinds of chemicals 100% share from (00:30:58) Japan, right? Or Intel same thing, (00:31:00) right? or you know for certain kinds of (00:31:02) tools 100% share from Netherlands or (00:31:03) 100% share from you know this American (00:31:05) company or that you know Austrian (00:31:06) company or this or that right like (00:31:07) there's just all these like you know (00:31:09) this Swiss company like there's just all (00:31:10) these different places have 100% share (00:31:12) it might be one company might be three (00:31:14) companies but geographically or in the (00:31:16) same area and China's built that up (00:31:18) right because they've created this made (00:31:19) in China initiatives which just plowed (00:31:22) money into it and they've got this (00:31:23) culture of like the diffused like you (00:31:25) know these provinces like yeah I just (00:31:27) decided I'm going to [ __ ] focus on or (00:31:29) might not even be might not even be the (00:31:30) Right. It may be the like, you know, (00:31:32) someone brought it there and decided and (00:31:34) then people were like, "Oh, wow. You're (00:31:35) doing that?" Me, too. Like, I'm a Patel (00:31:37) and I grew up in a motel and guess what? (00:31:38) We like almost all the Patels I know (00:31:40) grew up in a motel and it's because some (00:31:43) random Patel immigrated to America and (00:31:46) like worked at a hotel motel and then (00:31:48) bought a motel and then like it just (00:31:49) started happening, right? Like you sort (00:31:50) of like these things are serendipitous (00:31:52) of sorts and like I don't know like and (00:31:53) it's like I I view it as the same kind (00:31:55) of specialization, right? Chinese cities (00:31:57) are like starting to do the these (00:31:58) things. China's missing a lot of things, (00:32:00) right? I would say like if you say minus (00:32:01) 10 years tech, China's complete and no (00:32:04) one else is complete, right? Taiwan is (00:32:06) not complete. Their the fabs would shut (00:32:07) down without foreign supply, you know, (00:32:09) and you go down or you go across the (00:32:10) stack. Uh but if you go to 10ear tech, (00:32:12) maybe maybe more like 20-year tech, you (00:32:14) could get a fully vertical supply chain (00:32:15) in China, which I do not think any (00:32:17) country could do. Like America could not (00:32:19) build a fully vertical fab without stuff (00:32:20) from elsewhere, even if it's 20-y old (00:32:22) tech. (00:32:23) >> Um probably not even 40-y old tech. And (00:32:25) so, so that's interesting. But then when (00:32:27) the flip side is like well like you kind (00:32:28) of do need specialization. That's how (00:32:31) that chemical gets the purest best, you (00:32:33) know, most engineered, you know, or that (00:32:35) that slurry of chemicals or that, you (00:32:38) know, that gas or like that tool because (00:32:40) every smart person or a lot of them in (00:32:43) that country grew up around that culture (00:32:45) and like every the supply chain is there (00:32:47) and like everyone kind of knows and like (00:32:48) it's like a a driveaway and like sort of (00:32:51) like this is what makes supply chains (00:32:53) work is that there is this (00:32:54) specialization and the best of the best (00:32:56) only comes when you have that hyper (00:32:57) specialization. So, China doesn't have (00:32:58) lithography. their lithography is like (00:33:00) 10 years behind and I think it'll be 5 (00:33:02) years behind in a couple years, right? (00:33:03) They're catching up fast. I don't think (00:33:05) they'll be as good as ASML for a long (00:33:08) time. You know, maybe I don't know, (00:33:10) maybe they will be, you know, China. You (00:33:11) shouldn't ever underestimate China, but (00:33:13) like and Chinese engineers or, you know, (00:33:14) but like for a while, right? Or like, (00:33:17) you know, I don't think they'll be able (00:33:18) to make leading edge chemicals like many (00:33:20) chi uh Japanese companies or many (00:33:21) American companies and their tools and (00:33:23) like you just go across the supply (00:33:24) chain. They're not hey forefront on (00:33:28) really anything in the manufacturing (00:33:30) supply chain on the design supply chain. (00:33:32) There's some things that they're (00:33:34) starting to be similar par but like (00:33:36) cheaper or like a year or two behind but (00:33:39) cheaper and that's like fine for a lot (00:33:40) of stuff. An example of that is Huawei, (00:33:42) right? Huawei in mobile phones was on (00:33:45) par with Apple like entirely. Yeah. And (00:33:47) they had become Apple TSMC's biggest (00:33:49) customer and they were designing the (00:33:50) best thing and they are number one in (00:33:52) telecom and their tech is just literally (00:33:53) better. And so when you think what (00:33:56) happens, you know, is is is China (00:33:59) missing anything? It's like they're they (00:34:01) don't they don't they don't have the (00:34:02) best of much, you know, today in the AI (00:34:05) supply chain. they have a complete (00:34:06) package and a couple years behind and (00:34:08) they'll figure out how to make it (00:34:09) cheaper slash do more slashcatch up and (00:34:11) and create a robust industry. But (00:34:14) there's a reason like I don't think that (00:34:16) like Jensen is scared of AMD really. (00:34:18) He's paranoid. I mentioned he's (00:34:19) paranoid. I'm sure he's a little bit (00:34:20) scared of them, right? Like I think some (00:34:22) of the things that they've done are (00:34:23) reactions and competitive dynamics with (00:34:25) AMD or Google's TPUs or whatever. Right. (00:34:27) There was a Core Weave deal today and I (00:34:29) think that's directly the result of what (00:34:30) Google's been doing. (00:34:31) >> Yeah. the two billion pipe that Nvidia (00:34:34) announced into (00:34:35) >> Nvidia invested two billion in core but (00:34:37) what's more important is that that's (00:34:38) like sort of just like the sticker (00:34:39) what's really relevant is Nvidia is (00:34:41) going to work with core reef to uh (00:34:43) acquire um and and back stop and all (00:34:46) these things the the land the power the (00:34:49) energy the transmission that help build (00:34:51) the data center all this capital side (00:34:53) stuff that because Nvidia has so much (00:34:54) money they can backs stop corore weave (00:34:56) doing it because corore reweave then can (00:34:58) be the one who generates demand anyways (00:35:00) there's like because Google was doing (00:35:01) And they did that with like a couple (00:35:02) companies such as Fluid Stack and (00:35:04) Terowolf and Cipher. These are some (00:35:06) public deals that have been announced. (00:35:07) And so Google is doing that with TPUs (00:35:08) and Nvidia reacted, right? Um, and so in (00:35:11) the same way, I think Nvidia's reacted (00:35:12) to AMD. And in the same way, I think the (00:35:15) thing is Nvidia is like deathly (00:35:16) terrified of Huawei (00:35:18) >> because Huawei has caught up to Apple (00:35:20) and actually surpassed them as TSMC's (00:35:22) biggest customer before they got banned, (00:35:24) right? They did just crush Nokia, Sony, (00:35:26) Sony, Ericson, etc., right? Like the (00:35:28) entire telecom supply chain. they just (00:35:30) like completely destroyed them. And (00:35:32) there's so many other areas like they (00:35:33) straight up made a folding phone, right? (00:35:35) You know, I have a Samsung folding (00:35:37) phone. They have a folding phone that's (00:35:38) better than Samsung's folding phone. (00:35:40) >> And it's like, bro, what? Like, you (00:35:42) know, you know, Huawei's really really (00:35:44) cracked. And so, of course, they're (00:35:46) terrified of uh and and Huawei is the (00:35:48) most vertical company in the world. No (00:35:49) company is more verticalized than (00:35:51) Huawei, which then leads to huge (00:35:53) innovations. It's something that we (00:35:55) don't fully appreciate in the US, but (00:35:56) like when you travel in Europe, you see (00:35:57) everybody who's like honors honor phones (00:36:00) and it's like the the footprint of (00:36:01) Huawei is huge in in phones in a way (00:36:03) that people (00:36:05) >> not just phones um you know security (00:36:07) cameras actually they think they have (00:36:08) like you know (00:36:09) >> a lot of training on the [laughter] (00:36:12) that a captive group of testers. (00:36:15) >> Exactly. Exactly. Um I think I think (00:36:17) Huawei is terrifying right and and so (00:36:19) like yes their chips are not as good (00:36:20) today (00:36:20) >> and is that is is that already (00:36:22) happening? I mean obviously the US and (00:36:24) China are the two biggest markets but (00:36:25) like for other markets I don't know UAE, (00:36:27) Middle East, Europe are Nvidia and (00:36:30) Huawei already uh headto-head in (00:36:32) >> well they shipped a little bit but like (00:36:35) mostly just like sticker capacity like (00:36:37) there's nothing like no no like I would (00:36:38) say like a little bit as in like a few (00:36:39) servers not like a billion dollars worth (00:36:41) of stuff right the thing is China's (00:36:44) supply chain has to ramp up right um (00:36:46) China China's express goal is to have (00:36:48) all inter internalized but then like a (00:36:50) company like Alibaba's like I I don't (00:36:52) want to use Huawei, right? Like I want (00:36:54) to make I want to use Nvidia and just (00:36:56) make the best freaking models, right? (00:36:57) Because that's my business. My business (00:36:58) is not, you know, using a Huawei thing, (00:37:00) but it's like, okay, it's being pushed (00:37:01) upon me. There's other companies too, (00:37:02) like Cameron Con and so on and so forth. (00:37:04) And so the sort of like supply chain, (00:37:06) you know, companies in China don't want (00:37:07) to use they're kind of encouraged (00:37:09) obviously and pushed, you know, you must (00:37:11) some local provincial government be (00:37:13) like, well, you're doing this much (00:37:14) business here. You got to do this, (00:37:15) right? Like there's all sorts of like (00:37:16) crazy stuff that, you know, pushing of (00:37:18) of companies to use Huawei. Um the (00:37:20) challenge is probably can't manufacture (00:37:22) enough, right? We've like done a lot of (00:37:24) work on this. Um and we've just put it (00:37:26) for free, you know, instead of like to (00:37:28) our customers because it's like (00:37:29) something that's like national security, (00:37:30) which is how was Huawei actually (00:37:32) building chips? Well, actually they were (00:37:34) uh using shell companies to get chips (00:37:36) from TSMC and using different methods of (00:37:38) like sneaking HBM, which is memory, from (00:37:41) you know, Korea through Taiwan to China, (00:37:43) right? Like all sorts of crazy stuff (00:37:44) we've reported on and and people it's (00:37:46) like a whack-a-ole, right? they shut it (00:37:48) down or like tools that get shipped to (00:37:50) China and they shouldn't be for you know (00:37:51) making leading edge chips but they (00:37:53) actually are um and all these sorts of (00:37:55) things are happening because they can't (00:37:56) make everything and if they want to make (00:37:58) the leading edge stuff they do need to (00:37:59) rely on the foreign supply chain quite a (00:38:00) bit in terms of the upstream supply (00:38:02) chain right uh memory logic chips uh (00:38:05) tools for fabs chemicals for fabs etc (00:38:07) Huawei cannot satisfy the market um (00:38:09) because there's not enough advanced (00:38:11) leading edge capacity in memory logic (00:38:14) you know and all all these other things (00:38:16) uh domestically in and they're trying to (00:38:18) build it as fast as they can, but that (00:38:19) means there's just not enough to satisfy (00:38:20) the market. So, Nvidia has a market. I (00:38:22) think they'll figure out how to sell (00:38:23) chips to China. And Jensen's in China, I (00:38:25) think, like right now or was yesterday. (00:38:26) And so, like he's clearly like wheeling (00:38:28) and dealing to try and get his chips (00:38:30) into China because, you know, I think (00:38:32) Nvidia's argument is if we sell them (00:38:34) chips, then they won't, you know, there (00:38:35) won't be enough of as much of a domestic (00:38:37) market. The feedback loop for software (00:38:38) and everything else won't be there. That (00:38:40) will sort of like really challenge it, (00:38:42) right? Like most of the open source (00:38:44) software for AI has a lot of Chinese (00:38:47) contributors, right? VLM and PyTorch, SG (00:38:51) Lang and like all of these other like (00:38:54) libraries and things that are just like (00:38:55) you know and and and it goes to (00:38:56) low-level software especially right like (00:38:58) a lot of the best open source stuff is (00:39:00) actually just from like a Chinese (00:39:01) company who decided to open source it (00:39:02) and same with models right and so like (00:39:04) it's like okay well if they can't use (00:39:05) Nvidia chips anymore then this open (00:39:07) source stuff won't be designed for (00:39:08) Nvidia chips it'll be designed for (00:39:09) Huawei chips and now does that like (00:39:10) weaken the CUDA mode and now like not (00:39:12) only is China domestic now they have (00:39:14) like a feedback loop internally and then (00:39:16) they can externalize across the rest of (00:39:17) the world right so This is the like (00:39:18) argument Nvidia makes. I'm not sure if I (00:39:21) am like I'm like you know I think I (00:39:23) think my AI timelines are so fast. I'm (00:39:26) not that fast like not in terms of like (00:39:27) AGI but like hey AI is hundred billion (00:39:29) dollars of revenue uh across the (00:39:32) industry. I think the industry could hit (00:39:34) 100 billion ARR by the end of this year (00:39:36) like 4550 for open AI like 3540 for (00:39:40) anthropic and then you know vertex deep (00:39:42) minds uh models at Google Gemini right (00:39:45) um and then vertex API for anthropic (00:39:47) models and uh bedrock APIs and Azure (00:39:51) foundry APIs like I think hundred (00:39:53) billion dollars like end of this year (00:39:54) >> that's a lot and then what's the (00:39:56) economic value of that hundred billion (00:39:57) dollars now how much of that is in China (00:39:59) right like China's number is probably (00:40:00) 10x lower right? Because they just (00:40:04) haven't been able to pervasively push (00:40:05) AI, right? Chat GPT has a billion users (00:40:08) roughly and you know, then you add on (00:40:10) Gemini and Meta claims they have 500 (00:40:12) million users. I don't know. I think (00:40:13) people just accidentally click like (00:40:14) generative sticker or something. Um, (00:40:16) [laughter] but like anyways, like (00:40:18) there's like there's like a lot of usage (00:40:19) of AI in the west already and it's going (00:40:22) to climb. It's going to keep climbing (00:40:23) and like you kind of have to get used to (00:40:24) it and so like the question is like do (00:40:26) you you know what's what's the economic (00:40:28) benefit to the world, right? And at the (00:40:30) end of the day, this is an economic war, (00:40:31) right? If the US and the West win in AI (00:40:35) and control, you know, more powerful AI (00:40:37) systems that have this feedback loop (00:40:39) that improved economic growth and (00:40:41) weapons systems and whatever else, (00:40:42) right? Engineering of grids and cyber (00:40:45) attacks and all these sorts of things. (00:40:46) They have this like advantage over (00:40:48) China, then China will not rise to be (00:40:50) the global hedgeimony. But without AI, (00:40:52) China definitely will rise to be the (00:40:54) global hedgeimony. They're just going to (00:40:56) outrun America. And so the question is (00:40:57) like you know that's I think like the (00:40:58) other view right and how fast are super (00:41:00) powerful AI systems versus you know (00:41:02) China building a domestic ecosystem for (00:41:05) chips and models and everything that is (00:41:06) a few years behind like what's what's (00:41:09) actually the value right like that's (00:41:10) sort of like (00:41:12) >> around restrictions and regulations (00:41:13) >> where where do the uh US onshoring (00:41:16) efforts fall in that category what do (00:41:18) you make of them from the chipsack to (00:41:20) like all the thing that is being built (00:41:22) everything looks like it's massively (00:41:23) delayed by the way which perhaps is not (00:41:25) surprising (00:41:26) >> I think TSM MC's manufacturing wafers (00:41:27) and they're like building real wafers (00:41:29) and there's real fabs and like you know (00:41:31) there's some other fabs that have been (00:41:32) announced and like they're doing well (00:41:33) and there's like a bunch of like (00:41:34) different kinds of plants like a Korean (00:41:35) company making a random gas plant in (00:41:37) Texas for you know their chips right (00:41:39) like uh for chips and all these like (00:41:40) sort of things are happening. Um I think (00:41:42) the chips act did really well with its (00:41:44) $50 billion. It's just I don't think (00:41:46) people understand the scale of the (00:41:48) semiconductor industry. is the most (00:41:50) complicated supply chain in the world, (00:41:52) right? It's much bigger than, you know, (00:41:55) say manufacturing airplanes. It's much (00:41:57) bigger than like, you know, really (00:41:58) anything else, right? If you look at the (00:42:00) top 10 companies like of the world, I (00:42:02) think eight of them designed (00:42:03) semiconductors, right? Now, obviously (00:42:05) like Google designed semiconductors, but (00:42:06) it's like, oh wait, no, but their cost (00:42:08) of search would be like 10x higher if (00:42:10) they didn't have TPUs and TPUs were (00:42:11) super optimized for search, right? Or (00:42:13) like, you know, you you you go down the (00:42:15) list, right? Like Meta serves (00:42:16) recommendation systems with their chips, (00:42:18) right? Like you go down the list, it's (00:42:19) everyone is making their own chips. (00:42:21) Apple devices would be materially worse (00:42:23) if they didn't have their own chips, (00:42:24) right? Um and you just go down the list, (00:42:26) it's like it's the most complicated (00:42:28) supply chain and they they're spending (00:42:29) something on the order of like $150 (00:42:32) billion roughly in subsidies a year to (00:42:34) the chip industry. (00:42:36) >> We are doing 50 over like a decade. (00:42:38) >> Yeah, (00:42:39) >> there's a difference in scale here, (00:42:40) right? The collective total amount of (00:42:42) like capex that has been spent in Taiwan (00:42:45) is like 500 billion plus, right? across (00:42:48) the industry, across all the companies (00:42:49) that are making semiconductors in Taiwan (00:42:51) and Taiwan doesn't have a domestic (00:42:52) industry. How is $50 billion of (00:42:54) subsidies going to change America's (00:42:56) needle? Right? It does move it a little (00:42:58) bit, right? I I want to be clear like (00:43:00) the chips act is awesome. I don't (00:43:02) understand why like EVs or like solar (00:43:05) was given this massive massive like (00:43:08) trillion dollar package. Semiconductors (00:43:10) were only given 50. Like semiconductors (00:43:12) need a lot bigger package to actually (00:43:13) incentivize onoring. I think what's (00:43:15) happened so far has proven that it's (00:43:17) working well. TSMC is literally making (00:43:19) chips for Nvidia and Apple and AMD and (00:43:22) others in Arizona today, right? And I (00:43:24) think that's really great. (00:43:26) >> Is is your sense that the broad American (00:43:28) government is just uh aware of of all of (00:43:31) this that it's uh I wouldn't say only (00:43:33) passed because the automotive like (00:43:35) prices went up because car manufacturers (00:43:37) are like the worst because they do just (00:43:39) in time inventory, right? Or not worse, (00:43:40) but like this is just like a thing, (00:43:41) right? Just in time inventory systems. (00:43:44) COVID happens, sales plummet fabs that (00:43:46) were making, you know, random power IC's (00:43:48) or random microcontrollers for engines (00:43:49) got repurposed to the boom from COVID, (00:43:52) which is which was data centers and PCs (00:43:54) and smartphones. So, that stuff was (00:43:55) booming. And then when people were like, (00:43:56) "Oh, wait. Actually, like, you know, I (00:43:58) have some money. I stayed at home. I (00:43:59) didn't go out. I didn't drink. I have a (00:44:00) lot of I have some cash, right? Let me (00:44:02) buy a car." They went out and bought (00:44:03) cars and cars started skyrocketing in (00:44:04) prices. Oh, let's restart and let's (00:44:06) let's Oh, yeah. Can I can you sell me (00:44:07) that microcontroller for the engine (00:44:08) again? It's like, "No, I I'm making a (00:44:11) slightly different microcontroller that (00:44:12) works for, you know, uh, let's say a (00:44:15) keyboard or a mouse, right, or (00:44:17) whatever." And it's like, and and and (00:44:18) they actually didn't just leave me (00:44:21) flatfooted and they were like a partner (00:44:22) through co, right? You know, versus you (00:44:24) just left me. Screw you Ford or whoever, (00:44:27) Toyota, um, or automotive OEM, you know, (00:44:29) you up that supply chain. And so, Chips (00:44:31) Act did not get passed, only got passed (00:44:33) because that happened. And people are (00:44:35) like, "Oh my god, the semiconductors are (00:44:36) why cars can't be made." If that didn't (00:44:38) happen, we wouldn't even have the chips (00:44:40) act. It's like it's like silly. So like (00:44:41) I don't know like I think you know (00:44:43) whereas like and and even though that's (00:44:44) what was pitched to all the senators (00:44:46) like I know people who were running (00:44:47) around Capitol Hill just pushing that (00:44:49) narrative and story and that's why it (00:44:50) finally got passed in reality it was all (00:44:52) for advanced leading edge chips, right? (00:44:54) Nothing that goes in a car, right? And (00:44:56) so it's like this like funny thing. So, (00:44:57) in other words, do you think my words my (00:44:59) words not yours, but is it is it (00:45:01) hopeless that the US is going to I'm (00:45:03) very optimistic. Okay. I mean, do you (00:45:04) think there's a world where the US just (00:45:06) decides to invest in semiconductor at (00:45:09) the scale that (00:45:10) >> you know, I thought we just needed a (00:45:11) bigger chips act, but (00:45:14) >> look, Trump's kind of gotten TSMC to (00:45:16) promise to invest a fuckload more (00:45:18) [laughter] and they're moving on it, (00:45:20) right? They're like actually like just (00:45:22) building it. It's like, I'm going to (00:45:23) tariff the [ __ ] out of you unless you (00:45:24) build a fab. But it's like we'll build a (00:45:26) fab [laughter] and they're building it (00:45:28) right now. The timelines for fabs just (00:45:30) takes forever cuz again it's the most (00:45:31) complicated thing in the world. The (00:45:33) cleanest space in the place in the world (00:45:34) is not like a hospital or a biotech lab (00:45:36) or whatever. It's a semiconductor fab. (00:45:37) And the most expensive tools in the (00:45:38) world are not you know any of these (00:45:40) medical tools or whatever. It's it's (00:45:41) semiconductor tools or it's not a (00:45:43) rocket. It's a semiconductor tool, (00:45:44) right? Like everything you know I (00:45:46) describe it as um I remember when I was (00:45:47) a kid I was like I want to be a rocket (00:45:49) scientist and then I was like oh I want (00:45:50) to be a surgeon. And I'm like wait chips (00:45:52) are like rocket surgery but even cooler (00:45:54) right? Like I think anyways like sort of (00:45:56) like there there are fabs being built in (00:45:58) America. (00:45:59) >> They won't take America to (00:46:00) self-sufficiency. I don't think that's a (00:46:01) relevant. I don't think that's a goal (00:46:03) relevant like that's relevant, right? (00:46:04) Like globalism is generally just good. (00:46:07) Hot take [laughter] (00:46:10) like in terms of economics. (00:46:11) >> We'll turn this into a short a YouTube (00:46:12) short. (00:46:13) >> Globalism. (00:46:13) >> Globalism is good. [laughter] (00:46:14) >> Dude, you're gonna get me like (00:46:15) cancelled. (00:46:18) >> [gasps] (00:46:19) >> I tweeted about ice and it was a (00:46:20) complete joke, but so many people got (00:46:22) mad at me because I can't be, you know, (00:46:23) I'm too I'm too much of a joker. You (00:46:25) know, these are serious things. (00:46:26) >> Yeah. Yeah. No, I know the I know the (00:46:28) feeling. Yes. [laughter] (00:46:29) >> Anyways, um I think I think you know I (00:46:33) think we are building fabs and I think (00:46:34) it's like going to move and now even (00:46:36) Elon's talking about building fabs now (00:46:37) because he sees the shortages in the (00:46:38) world, right? Uh there's a lot of (00:46:40) semiconductor related shortages for (00:46:41) building out AI and and so I don't think (00:46:44) it's hopeless. I think I'm like very (00:46:45) optimistic that we're going to do more (00:46:46) and more and more. And maybe this (00:46:48) administration threatens tariffs and (00:46:49) they get the deals and the next (00:46:50) administration comes back with the (00:46:51) carrot. If it is the Democrats, whatever (00:46:53) happens, I don't know. Like I was at a (00:46:55) comedy club on Sunday night and like (00:46:57) he's like, "Oh, I'm I use Chad GPT." And (00:46:59) then like there were a couple people who (00:47:00) booed and he's like, "Yeah, I'm one of (00:47:01) those guys. I know." And like it's like, (00:47:03) "Wow, people hate AI." (00:47:05) >> And that has has not even started, (00:47:06) right? Like the actual impact of AI (00:47:08) >> or like New Jersey power prices are up, (00:47:10) right? Uh is it because of a data (00:47:12) center? New Jersey, the governor's (00:47:14) election like I think literally fl like (00:47:16) there's like an election that changed (00:47:17) recently in New Jersey because power (00:47:19) prices were up and people blamed a (00:47:22) Microsoft Nebius data center in New (00:47:24) Jersey for that reason. But in reality (00:47:27) that data center has nothing to do with (00:47:29) power prices going up. It's super storm (00:47:31) standy like five years ago knocking or (00:47:34) whatever how many years ago knocking (00:47:35) down the state's electrical (00:47:36) infrastructure and then the then (00:47:37) improving all these improvements and (00:47:39) then those improvements have to be paid (00:47:41) by someone and it turns out the consumer (00:47:42) has to pay for them with higher power (00:47:44) prices, right? And so like you know like (00:47:45) there's like there's a lot like going on (00:47:47) in that regard, right? Um that kind of (00:47:50) is uh (00:47:51) >> sad. Um, and and people hate AI and (00:47:54) they're blaming AI on it and artists (00:47:55) hate AI and like you know you see all (00:47:57) this deep fake stuff and like I think I (00:47:59) think it'll be the hottest button issue (00:48:00) especially as like we're really getting (00:48:02) into like I think last year Google spent (00:48:04) $3 billion on Whimo and we're waiting (00:48:06) for their guide for this year $3 billion (00:48:08) on Whimo taxis but their t their Whimos (00:48:11) went from like 300k to like 100k or 90k (00:48:14) the new Whimo car and they're going to (00:48:16) spend more than three because they've (00:48:17) just launched in like four cities now (00:48:19) right or five cities and and they're (00:48:20) testing it a lot and the same a robo (00:48:22) taxi like people are going to hate AI (00:48:23) for that reason people are going to hate (00:48:24) AI because the slop on the internet (00:48:26) people are going to hate AI because you (00:48:27) know the perceived job replacement (00:48:29) people are going to hate AI for all (00:48:30) these reasons and so yeah it's going to (00:48:32) be a hot button political issue don't (00:48:33) you think (00:48:33) >> yeah talking about that so um capex is (00:48:37) there a capex bubble are we u investing (00:48:41) too much or actually are we investing (00:48:44) not enough given what you were saying (00:48:46) earlier about uh the the the rate of (00:48:49) revenue increase and and therefore (00:48:50) implied demand (00:48:51) that you expect for this year. (00:48:53) >> I'm obviously a maxi. I think we're (00:48:55) going to need a lot of infra and I think (00:48:57) I'm literally paid to like analyze the (00:48:59) supply chain and do consulting. Like (00:49:01) that's what my company does. So like (00:49:03) obviously I'm very [laughter] biased. (00:49:05) I think I think we're pretty good at (00:49:07) calling when when things go down though, (00:49:08) right? Before like a part of the supply (00:49:10) chain reb. Anyways, you know, again, (00:49:12) going back to the economics of it, it's (00:49:13) north of hundred billion dollars of (00:49:14) revenue exiting this year for AI from a (00:49:16) base of, you know, sub1 billion gen AI (00:49:20) from a base because ads and stuff is (00:49:22) like already a multiundred billion (00:49:23) dollar AI industry, right? You know, go (00:49:25) back to 2023 it was like less than a (00:49:26) billion, right? And 2024, I don't know (00:49:28) exactly what number, maybe let's call it (00:49:31) 10 and 25 was maybe like 30 40. It'll be (00:49:34) north of 100 easily. If you're talking (00:49:36) about hundred billion of revenue, let's (00:49:39) say at a 50% gross margin. So that's $50 (00:49:42) billion of gross profit um and $50 (00:49:45) billion of COGS. That $50 billion of (00:49:46) COGS needs to run on infra, which cost (00:49:49) roughly if a five, if you're talking (00:49:51) about fiveyear depreciation, call it (00:49:53) $250 billion, right, of infra (00:49:55) >> for hundred billion of revenue. (00:49:57) >> Mhm. (00:49:58) >> Okay. What is what is the actual spend (00:49:59) on AI infra this year? It's going to be (00:50:00) like it's I mean it depends on what (00:50:02) layer. If you're talking about energy, (00:50:04) those are longer lived assets and all (00:50:05) these other things, right? Um data (00:50:06) centers are longer lived assets. The (00:50:08) chips are not as much. People are (00:50:09) putting capex down. Um and the (00:50:11) hyperscalers capex is going to be like (00:50:13) $500 billion this year or something like (00:50:14) this. And then besides them, there's (00:50:16) also a lot more hyp uh capex elsewhere. (00:50:19) Um and so, you know, is it a bubble? I (00:50:21) mean theoretically like you know it's (00:50:23) twice as much as it should be but it's (00:50:24) also like well no there's an R&D (00:50:26) component to this and the excess spent (00:50:30) that wasn't revenue generating last year (00:50:31) is what led to models being so good this (00:50:34) year um and led to like everyone who can (00:50:37) using cloud code and like that changing (00:50:39) their life. This is like it's not a (00:50:41) bubble, right? I don't think it's a (00:50:42) bubble yet. Um I think if AI model (00:50:44) progress stops and that's the main (00:50:45) thing, right? The moment model progress (00:50:47) stops all the spending is for not. But (00:50:49) so far we've had consistent improvement. (00:50:52) As you put in more compute, you get more (00:50:54) performance and better models. (00:50:56) >> Yeah. Model performance being the (00:50:57) lagging indicator of hardware progress (00:51:00) or data center. (00:51:01) >> Yeah. of of capex, right? Yeah. (00:51:03) >> Ultimately, the capex that Microsoft (00:51:05) spent in 2024 for OpenAI is what results (00:51:08) in in 2025 for OpenAI Cory or whoever is (00:51:10) what results in their models being so (00:51:12) good this year. Same with Enthropic and (00:51:14) Amazon Google and their models now being (00:51:16) so good now is is that capex and (00:51:18) actually they still haven't paid for (00:51:19) those chips yet because those chips are (00:51:21) still have a useful life for another few (00:51:22) years right I think model progress is (00:51:24) very clear um the moment that stops (00:51:27) happening right if we hit a wall there's (00:51:29) no new research directions um then then (00:51:32) it's cooked yeah right (00:51:34) >> and that assumes that better model leads (00:51:36) to more demand which is a reasonable (00:51:39) assumption (00:51:40) >> yeah for sure (00:51:41) >> but um yeah I mean there scale the (00:51:43) adoption curve regardless of how good (00:51:45) the model is uh in the enterprise (00:51:47) >> like 2% of GitHub commits today are (00:51:48) cloud code (00:51:50) >> as in committed by cloud code you can (00:51:51) disable that where it's not (00:51:52) automatically committed but 2% of GitHub (00:51:54) commits today are cloud code $2 trillion (00:51:56) of software wages paid in the world (00:51:58) >> if it was 2% then you like you're like (00:52:01) wait a second (00:52:02) >> this is this is an insane amount um AI (00:52:05) is under earning the value that it's (00:52:07) producing in the world right by a (00:52:09) significant margin already today (00:52:11) >> Bor's journey from Cloud code who had (00:52:13) who we had on the pod was saying that (00:52:16) what he's written all of Claude what is (00:52:19) it called co-work like the new product (00:52:20) entirely with cloud code yet so we're (00:52:23) very much in that world. Yes. (00:52:24) >> Yeah. My uh one of my roommates I was (00:52:26) asking him because he's like always been (00:52:28) a really low-level good programmer and (00:52:30) he started you know I was like he's like (00:52:32) he had this um holiday obsession right I (00:52:36) mean he was using cloud code for work (00:52:37) already right like whatever. Um but he (00:52:39) had this holiday obsession. We got into (00:52:41) playing Age of Empires 2. Myself, you (00:52:44) know, my roommate, a handful of people (00:52:45) from like Open Eye, GDM, Anthropic. We (00:52:48) just would do land parties of AoE 2 over (00:52:51) the holidays a bit. Not not like (00:52:52) Christmas, but like a little bit before, (00:52:53) a little bit after, you know, cuz most (00:52:54) of us went home for Christmas. Um, but (00:52:56) like we'd do these lands. My roommate (00:52:58) got so obsessed with like the game that (00:53:01) during Christmas week, cuz he didn't go (00:53:03) home, he just stayed in San Francisco. (00:53:05) Um, he just worked on an RTS game and he (00:53:07) built an entire RTS game. And I think I (00:53:10) kid you not, I think he he used like (00:53:12) $10,000 of Claude in one week and built (00:53:15) an entire RTS from scratch uh about a (00:53:18) like but instead of like being a (00:53:19) standard RTS where it's like oh Age of (00:53:21) Empires for advance through ages or (00:53:22) Starcraft, it is it is an RTS where it's (00:53:24) China versus the US and you're in the AI (00:53:26) race and you go from the start of the (00:53:28) information age all the way through to (00:53:31) you know AGI and like robots and (00:53:33) humanoids and and and like all like (00:53:35) space fairing civil like it's crazy. He (00:53:37) built it in a week (00:53:39) >> and he didn't type a single line of (00:53:40) code, right? He can only dictate it to (00:53:42) the model. And he told me, yeah, like we (00:53:44) have an indicator internally at (00:53:45) Enthropic where you see how many people (00:53:47) actually write code now. There's only a (00:53:49) few hold outs left. (00:53:50) >> But I guess the question to the bubble (00:53:52) is is really a question of uh timing as (00:53:54) well, right? Uh it's it's whether the (00:53:57) build which is supply side and the (00:54:00) demand side are going to land sort of at (00:54:03) the same time. Is that is that fair? (00:54:04) >> Yeah. But also the economics of like say (00:54:06) you you spend let's say you spend you (00:54:08) build a gigawatt you put down roughly (00:54:10) $50 billion across you know the data (00:54:13) center the chips the networking blah (00:54:14) blah blah blah blah right let's say it (00:54:16) has a 5year useful life so it's $10 (00:54:18) billion a year is it a bubble if the (00:54:20) first year you have you didn't make any (00:54:22) money it's zero the second year it's (00:54:23) zero and then third fourth fifth year (00:54:25) you're at 50% gross margins and so you (00:54:27) make 20 2020 now you've made $60 billion (00:54:30) off of this $50 billion investment it's (00:54:33) not the best return on invested capital, (00:54:34) but it did pay for itself. (00:54:36) >> Yeah. (00:54:36) >> Um, is that is that a bubble? Well, (00:54:38) that's what's happening today is that (00:54:39) people are spending all this infra money (00:54:41) on infra and there's no return for a lot (00:54:42) of it, right? A lot of it is just doing (00:54:44) research and like trying to get adoption (00:54:45) and is free users and like what does (00:54:47) that mean? (00:54:48) >> Yeah. (00:54:48) >> Um, (00:54:49) >> depends a bit on (00:54:50) >> the timing. That's the timing though. (00:54:51) Yeah. But oh, that $50 billion capex was (00:54:53) spent in year one. (00:54:54) >> What about energy? In the in the data (00:54:56) center world, you had this fun post (00:54:58) about the gas replacement for for (00:55:00) energy. So, is uh is AI basically uh uh (00:55:04) destroying the grid? (00:55:05) >> It would if the utilities were willing (00:55:08) to let it, but I think the utilities are (00:55:10) so slow and dumb that they don't want (00:55:12) to. Not destroy, but like expanding the (00:55:14) grid. Yeah. (00:55:15) >> Um I think the US could have a way (00:55:16) better grid, but we just don't want to. (00:55:18) Like, no one's made the effort or (00:55:20) initiative. You know, there's not enough (00:55:21) power. America's not built power for 50 (00:55:23) years really, right? It's like converted (00:55:24) from coal to gas and like things like (00:55:26) this but like really just have not built (00:55:28) wholesale new power on a large scale and (00:55:31) there have been a lot of times where the (00:55:32) industry blew up right independent power (00:55:34) producers IPs have blown up multiple (00:55:36) times in the 2010s when uh Korean and (00:55:39) Japanese investors like flooded the (00:55:40) market with because they saw such a good (00:55:42) return there or before in the early (00:55:45) 2000s power was growing a little bit for (00:55:47) a little bit and so people overbuilt on (00:55:48) power so power industry has been burned (00:55:50) a couple times but no one really built (00:55:51) power and then you've got data centers (00:55:52) now all of a sudden coming online and (00:55:54) going from 2% to 10% of the US grid in (00:55:56) just a handful of years. And so you've (00:55:58) got this humongous humongous change in (00:55:59) the industry. We don't have the labor, (00:56:01) right? I think ultimately that's the (00:56:02) biggest problem is the equipment and the (00:56:03) labor and equipment is basically you (00:56:05) know again labor and time takes time to (00:56:07) build a factory so you can build the (00:56:09) things. I think the equipment side of (00:56:10) things will be solved like more (00:56:12) reasonably. And one one example was like (00:56:13) gas, right? People initially thought, (00:56:15) oh, you can only use like the two (00:56:17) vendors, right? Uh Seammens or G Vernova (00:56:20) for gas turbines, but they have the they (00:56:21) have the best ones, the most efficient (00:56:22) ones. It's like, okay, well, like, okay, (00:56:24) also Mitsubishi exists and they're (00:56:26) ramping up production fast. Oh, Ducson (00:56:28) and Korea exist and they're ramping up (00:56:29) production fast. Oh, actually, I can (00:56:31) just take Cumins engines, right? Like, (00:56:33) you know, if you've ever like ridden a (00:56:34) pickup truck or like you know, like (00:56:35) diesel trucks, like everyone loves (00:56:36) Cumins, right? You know, you see the Ram (00:56:38) on the street and has the Cumins like (00:56:39) badge. It's like it's like a that's like (00:56:40) an aura symbol for a certain kind of (00:56:42) redneck from South Georgia, which I have (00:56:44) a little bit of. Anyways, I I don't have (00:56:45) a I don't have a truck. [laughter] (00:56:47) I have though. Um but anyways, like the (00:56:49) there's like all these engines like (00:56:51) people are figuring out how to make the (00:56:52) equipment. You know, solar sucks. It's (00:56:53) too intermittent. Wind sucks. It's too (00:56:55) intermittent. Nuclear sucks. It takes (00:56:56) forever to build. Coal sucks. It's way (00:56:58) too dirty. How do you make power for (00:57:00) data centers besides gas? And like, (00:57:02) okay, the grid's not willing to put the (00:57:04) gas on your site, right? That's what (00:57:05) Elon did. Now everyone's doing it, (00:57:06) right? (00:57:07) >> This other cool post just uh last week (00:57:09) or two weeks ago that was about water (00:57:11) consumption. Uh did you want to talk to (00:57:13) that? (00:57:14) >> Yeah. Yeah. So there's this annoying (00:57:15) thing where everyone's like, "Oh, AI is (00:57:17) using all the water. Oh wow, AI and data (00:57:21) centers are going to like use up all the (00:57:23) water and now we don't have any water." (00:57:24) And it's like that's so silly. Uh water (00:57:26) is a distribution problem, not a like we (00:57:29) don't have enough problem, right? Like (00:57:30) you look at California. So California (00:57:31) has shitloads of water. But people (00:57:33) decide to make oat milk which consumes (00:57:36) like 1,000x the water of like anything (00:57:40) else like regular milk even and and cows (00:57:43) obviously eat a you know consume a lot (00:57:44) of water. Um but anyways like you know (00:57:46) data centers consume very little water (00:57:48) actually right. So the US grid will get (00:57:50) to like 10% of power by like 28 27 is (00:57:52) data centers. For water consumption it's (00:57:54) not even going to crack 1%. (00:57:56) >> Yeah. (00:57:56) >> By the end of the decade. (00:57:57) >> And what was the metric? Um and so so (00:57:59) the the comparison we made is because (00:58:01) like you know it was a bit of a [ __ ] (00:58:02) post but it was like serious research. (00:58:04) Yeah. Basically like we were doing (00:58:05) serious research because we keep getting (00:58:07) this like question and debunking it and (00:58:09) we would do it seriously but then I was (00:58:11) like no no no this is like too like (00:58:12) complicated like let's make it very (00:58:14) simple. So I was like, "Guys, why don't (00:58:15) we just compare it to like hamburgers, (00:58:17) right? Cuz cuz you know, I've heard that (00:58:19) argument from some like vegetarian (00:58:20) people before or some Hindus or like I'm (00:58:23) Hindu myself, although you know, and I I (00:58:25) I do eat beef sometimes, but you know, (00:58:28) like I I'm Hindu, but like you know, so (00:58:30) so we made this comparison to (00:58:31) hamburgers, right? Hamburgers require a (00:58:34) shitload of water cuz cows, you know, (00:58:36) when to for them they require a ton of (00:58:39) water and when a cow's taking a lot of (00:58:40) water, it's not the cow itself, it's all (00:58:41) the feed you're feeding them, right? (00:58:42) Because no one grass feeds their cows, (00:58:45) you know, and just lets the rain take (00:58:46) care of the grass. They like either rain (00:58:48) the the grass or most likely they do (00:58:51) mass industrial farming of corn, (00:58:53) soybean, alalfa, etc., which uses (00:58:56) shitloads of water, right? Like, you (00:58:59) know, or like almond milk like uses tons (00:59:01) and tons of water. Like produce is like (00:59:03) the main user of water. I think the uh (00:59:05) metric was the entirety of Elon Musk's (00:59:09) Colossus data center, right? Uses as (00:59:11) much water as two and a half in-n-outs. (00:59:14) Um because that's, you know, you do the (00:59:16) calculation on how many how many b (00:59:17) what's the average revenue per in-n-out (00:59:19) and how many hamburgers does that (00:59:20) translate to, right? If everyone's (00:59:21) ordering like a combo, right? Okay, (00:59:23) let's ignore the drink, let's ignore the (00:59:25) fries, let's just talk about the (00:59:26) hamburger, let's ignore the bread, which (00:59:28) does use have grain, let's just do the (00:59:30) meat (00:59:31) >> and the cheese. And all of a sudden all (00:59:33) this water is there's so much water, (00:59:36) right? Like a single query like all of (00:59:39) your AI usage from chat GBT of the (00:59:41) average user is like a hamburger, right? (00:59:43) Like it's like okay, this is nothing, (00:59:46) right? You know, because these things (00:59:47) are the data centers actually are like (00:59:49) they're mostly closed loops and like (00:59:50) sure they evaporate some water for like (00:59:52) cooling reasons, but like by doing (00:59:54) evaporative cooling, they're using less (00:59:55) power, right? And that's actually better (00:59:56) for the environment than uh than not (00:59:59) using evaporative cool. There's all all (01:00:00) these reasons why this myth or hoax of (01:00:03) AI of AI using all the water is just (01:00:05) nonsense, right? Like Meta's data center (01:00:07) in Louisiana is getting protested (01:00:08) because the water it's it's going to be (01:00:10) the largest data center in the world. (01:00:11) It's going to be like four or five (01:00:12) gigawatts at least announced so far. (01:00:13) We're tracking some other ones that are (01:00:15) that may be as big or bigger. Uh but (01:00:17) Meta is getting protested because the (01:00:20) local population around that area is (01:00:21) like, "Oh, the water's dirty. It's (01:00:22) because of this meta data center." And (01:00:24) like there's these trucks on these big (01:00:26) trucks on these back roads that used to (01:00:27) be empty completely. They're just like (01:00:29) mad and annoyed about that, right? But (01:00:30) at the end of the day, what actually (01:00:32) made the water dirty is that that's an (01:00:34) area where you go fracking. Like (01:00:37) >> fracking is absurdly worse and almost (01:00:39) all of that gas is being shipped to an (01:00:41) LG terminal and being shipped to Asia. (01:00:44) Like you know, you know, like Japan or (01:00:46) Taiwan or China or Korea and some Europe (01:00:48) as well, right? Like like actually all (01:00:50) of this water is dirty because of (01:00:52) regulation fracking. Like I support (01:00:54) fracking by the way, but you know that's (01:00:55) that's an insane take too maybe. Um but (01:00:58) like water usage is is is like not a (01:01:00) relevant argument. (01:01:02) >> Are you bullish on the sort of energy uh (01:01:05) companies I'm thinking constellation for (01:01:08) nuclear or Vistra I guess is an (01:01:11) independent power producer. (01:01:14) >> I think IPS will do well. I think IPS (01:01:17) can secure contracts at premiums to what (01:01:21) they've previously been able to for new (01:01:22) power plants that are either uh (01:01:24) dedicated or grid connected but come (01:01:26) with a pairing of a grid load right for (01:01:29) example utilities won't let you just do (01:01:30) data centers now but if you come with a (01:01:32) a pair right you're like hey I'm going (01:01:33) to build this massive data center but (01:01:34) we're also going to have this massive uh (01:01:36) power generating asset right say you (01:01:38) know whatever it is right some IP (01:01:40) they're going to partner with and (01:01:41) they'll build the load and the uh (01:01:43) consumption even if it's connected (01:01:44) through the grid for better stability (01:01:46) and more reliability. Um or it's not (01:01:48) it's behind the meter i.e. not connected (01:01:50) to the grid at all. Um like some part (01:01:52) some data centers like partially like (01:01:54) Colossus from Elon uh the original one (01:01:56) or part of Abene's Texas OpenAI right (01:01:59) like Cruso there's a lot of room for (01:02:01) power producers to get outsized returns. (01:02:04) I'm not necessarily bullish nuclear. Um (01:02:06) existing nuclear fine yeah it'll it'll (01:02:08) it can find a higher buyer higher priced (01:02:10) buyer but majority of it will be gas but (01:02:13) like you can do like renewables backed (01:02:14) by gas and then just turn off the gas (01:02:16) and like it's cost more but whatever (01:02:17) right or you can do wind backed by gas (01:02:19) >> and why not nuclear (01:02:20) >> takes too long (01:02:21) >> takes too long (01:02:21) >> no one can build nuclear fast (01:02:24) >> even China takes like 5 years to build (01:02:25) nuclear right like it's it's complicated (01:02:28) it's unsafe right you know I love (01:02:31) nuclear I wish it would work it's just (01:02:32) not relevant in the time scale that like (01:02:34) AI's power is going crazy. Um, but yeah, (01:02:37) there's a lot of interesting stuff like (01:02:39) have clients would like had a client buy (01:02:41) a coal plant and we were advising them (01:02:43) on the transaction based on they just (01:02:44) like showed up and they're like, "Yeah, (01:02:45) we want to buy we want to buy power (01:02:46) assets. We believe in this power story." (01:02:48) It's like, "Okay, great." So, yeah. So, (01:02:49) here's all of the like power plants that (01:02:51) we know of like you can get some of it (01:02:52) from EIA blah blah blah. um which are (01:02:54) these like and then we like worked (01:02:56) through the economics and we looked at (01:02:57) the new data centers being built in the (01:02:58) region and all this and then they (01:02:59) decided to buy a coal plant and they (01:03:01) restarted it and they're like making (01:03:02) tons of money now because now someone a (01:03:05) certain hyperscaler wants to buy the (01:03:06) entire pipeline of power and put a load (01:03:08) load near it right instead of just being (01:03:10) a grid connected asset. So it's like a (01:03:12) super awesome investment. So like you (01:03:13) know power is power is going to do (01:03:15) great. (01:03:15) >> Yeah. I was going to talk about peace (01:03:17) dividends of the whole AI boom. Uh (01:03:19) generally yes right like hyperscalers (01:03:21) are paying for uh transmission grid (01:03:23) upgrades which people will benefit from (01:03:26) right or like you know investors are (01:03:27) obviously going to benefit people who (01:03:29) work in the industry electricians wages (01:03:31) are skyrocketing you know etc right like (01:03:33) plumbers wages are skyrocketing so (01:03:35) there's like a lot of trades that are (01:03:36) doing really well too I think that's (01:03:38) definitely also um part of it yeah (01:03:40) >> I wanted to come back quickly to uh that (01:03:44) um Nvidian core wave deal that you (01:03:45) mentioned as we sort of close the (01:03:47) discussion on uh on capex and a and a (01:03:50) bubble. It seems like there is circular (01:03:52) deals but also a lot of debt kind of (01:03:54) like flushing around. So I don't know (01:03:56) the specifics of of that deal but like I (01:03:58) did hear variations of this where (01:04:01) effectively you have a large player (01:04:02) guaranteeing the debt being the last (01:04:04) recourse uh for a lot of infrastructure (01:04:08) build is sort of uh this plus the whole (01:04:11) like oracle commitment. (01:04:14) there there is a fragility into this (01:04:16) whole thing that can be a little (01:04:18) unnerving. What do you make of it? (01:04:20) >> I think it's like completely fine and I (01:04:22) think like people are like freaking out (01:04:23) and making narratives where there really (01:04:25) is shouldn't be one. It's like well okay (01:04:27) Google doesn't have enough data center (01:04:28) capacity. They need people to build data (01:04:29) centers, but no one can build a data (01:04:32) center because they don't have the (01:04:33) capital. Like don't have, you know, many (01:04:34) cases capital is not the, you know, they (01:04:36) don't have capital, right? Or like no (01:04:38) one will give them a loan because they (01:04:39) don't trust some random [ __ ] company. (01:04:40) And it's like, but then Google's like, (01:04:41) well, no, we've due diligence to them. (01:04:42) We think they can build it here. We'll (01:04:44) like even guarantee we'll buy the thing (01:04:46) or start using it once they build it. (01:04:47) You know, just having a customer alone (01:04:50) spoken for it was enough, right? Um, in (01:04:52) the case of Cororewave, they were (01:04:53) actually able to no backs stop, right? (01:04:54) Right? They were able to just say, "Hey, (01:04:55) hey look, here's our Microsoft contract (01:04:57) for this many GPUs. I want to put in (01:04:59) that data center, that data center, that (01:05:00) data center. Here's the contract for (01:05:01) renting those GPUs. I want to hire these (01:05:02) people. I want to do this." No one will (01:05:04) like they don't have any money, but then (01:05:05) they were able to like have it work out (01:05:06) because they were able to get people to (01:05:07) lend to them. I think like Cororeweave (01:05:09) did that and there was no circular (01:05:10) financing. But that was when there was (01:05:11) like the scale of investment was like (01:05:13) single digit billions or less than a (01:05:14) billion. Right? Now the scale of (01:05:16) investment is hundreds of billions. (01:05:18) >> Yeah. (01:05:18) >> Um and so the question is like, oh well, (01:05:20) if I want data center capacity, how do I (01:05:22) how do I get data center capacity? I (01:05:23) just go to everyone who's going to build (01:05:25) it looks smart is smart enough to do it (01:05:27) but can't afford to do it and tell them (01:05:29) I'll I'll take it and in fact I won't (01:05:30) just take it. I'll go to your debtor and (01:05:32) be like, I'll guarantee you. Yeah. (01:05:33) >> Because, you know, obviously you're a (01:05:34) new company. I've vetted you, but the (01:05:36) debtor hasn't. And so, you know, like, (01:05:38) you know, you know, they don't want me (01:05:39) to just be able to walk away because (01:05:41) like in the Microsoft Corwave deals, (01:05:42) Microsoft could have walked away if (01:05:43) Corwe [ __ ] it up, (01:05:45) >> right? (01:05:45) >> Yeah. (01:05:46) >> There's no I mean, yeah, there's there's (01:05:47) always like uh sort of like cancellation (01:05:48) or whatever possibilities. And so, this (01:05:50) is just a further form of guarantee um (01:05:52) as far as on like a lot of these back (01:05:54) stops as far as on like Oracle getting (01:05:56) the money and then OpenAI getting money (01:05:57) and Nvidia, you know, paying and it's a (01:05:59) whole circular. It's kind of nonsense (01:06:01) because it's like Nvidia's getting (01:06:03) equity in OpenAI. They're basically (01:06:04) saying, "Hey, every gigawatt you buy, (01:06:06) we'll also buy some equity." (01:06:07) >> Yeah. (01:06:08) >> Right. Okay. Well, cool. Now, Nvidia (01:06:09) owns an asset which they think is (01:06:11) valuable. OpenAI, right? Open AAI is (01:06:13) turning around and is like trying to (01:06:14) rent those uh use the equity they buy. (01:06:17) What do they what was their use of (01:06:18) equity? People's cash pay isn't that (01:06:20) great, right? It's mostly just 99 plus% (01:06:23) of their spend at the company is (01:06:24) probably just compute. (01:06:25) >> Yeah. (01:06:26) >> Uh so so sort of like it's like, okay, (01:06:27) well then I I raise this money. I'm (01:06:29) going to do the the whole thing I (01:06:30) explained earlier, right? Year one and (01:06:31) two I lose money. Year three, four, (01:06:32) five, I hope to make money on it, right? (01:06:34) Um, and open has been doing that, right? (01:06:35) So, I'm going to Okay, I'm going to go (01:06:36) out there. I've raised $50 billion. I've (01:06:39) raised $10 billion. I'm going to raise (01:06:40) it. I'm going to rent a cluster for five (01:06:43) years for $65 billion. And I've rented (01:06:48) that contract and now I only have enough (01:06:50) to pay for the first year to be clear. (01:06:51) But I think, you know, you trust me, (01:06:52) Oracle, you think I'm going to grow and (01:06:54) you think I'll be able to pay for it. (01:06:55) Oracle's like, "Yeah, or if you're not, (01:06:56) I think I'll be able to sell it to (01:06:57) someone else." So like, okay, cool. I'm (01:06:59) going to spend $50 billion this year. (01:07:00) >> Yep. (01:07:01) >> To build that data center. And and and (01:07:03) this these this is like for a gigawatt. (01:07:05) Um and so is it like circular that (01:07:07) OpenAI is every amount of GPUs they (01:07:08) consume and gives an investment that (01:07:11) investment is turned around to pay for (01:07:12) the first year of the rent to the (01:07:13) cluster. Um or second year then first (01:07:15) two years go, you know, it's sort of (01:07:17) like it's fine. (01:07:18) >> Yeah. Yeah. (01:07:18) >> Like it's like it's like it is a little (01:07:21) bit funky, but like I don't think it's a (01:07:23) big deal. (01:07:23) >> Yeah. Love it. Contrary intake. Maybe (01:07:25) let's finish with the models and the (01:07:28) software side of things. We talked (01:07:30) extensively about hardware and supply (01:07:31) chain and all the things. I get a sense (01:07:33) that you are super super bullish on uh (01:07:35) what's happening next in in AI. Your (01:07:38) roommate Schulto I assume was the (01:07:40) roommate that you were talking about (01:07:41) earlier on this pod effectively making (01:07:43) the point that we're just starting to (01:07:45) scratch the surface and there was so (01:07:47) much low hanging fruit around you know (01:07:49) RL and all the things you were in (01:07:51) Silicon Valley circles. Is that is that (01:07:53) your sense as well and what are you (01:07:55) tracking on the model side? (01:07:57) >> One thing is like you know simple stuff (01:07:58) like uh GitHub commits other things are (01:08:01) like what's the amount of usage how much (01:08:03) are people using like all these sorts of (01:08:05) things. I think there's so many (01:08:06) different alternative data sources for (01:08:08) tracking AI model progress area (01:08:10) tokconomics uh token economics (01:08:12) tokconomics and so that's like an entire (01:08:13) practice for us. (01:08:14) >> Are you rebranding the term from crypto? (01:08:17) >> I yeah I don't believe in crypto people (01:08:19) like I've always hated them. [laughter] (01:08:21) Um, (01:08:21) >> so now you're taking the term. (01:08:22) >> Yeah. Yeah. And Jensen's used it now. So (01:08:24) I've like I've convinced him to use the (01:08:26) word. He's used it as sovereigns and so (01:08:27) I think I think we've won. (01:08:28) >> That's awesome. Congratulations. (01:08:30) >> I've said it to him. We've written it in (01:08:31) articles. It's an entire practice of (01:08:33) consulting that I just I started in like (01:08:34) 23 2023 uh was token economics and we've (01:08:38) been trying to build out these like you (01:08:39) know but basically I think the main (01:08:40) things are like people who don't code (01:08:41) can use cloud code now right? I think (01:08:43) people don't understand that like even (01:08:44) if you don't code you've never had any (01:08:46) training in software development, you've (01:08:47) never take had a job as a software (01:08:49) developer you can code. Let's take an an (01:08:51) example of what one of the one of the (01:08:52) analysts at my company did right comes (01:08:54) from a engineering background but on (01:08:56) like semiconductor systems right uh like (01:08:59) worked on mechanical systems worked on (01:09:02) these sorts of things and they coded (01:09:03) this thing which was they wanted to do (01:09:05) an analysis of area of clean rooms right (01:09:08) clean rooms are the building that you (01:09:10) the fab has all the tools in the most (01:09:11) complicated kind of building in the (01:09:12) world has every all sorts of chemical (01:09:14) systems and all this area of that a (01:09:16) company who builds systems builds these (01:09:19) systems (01:09:20) and revenue of that company, right? And (01:09:22) so it was like, okay, uh we have this (01:09:24) fab data set. Pointed it at it was like, (01:09:26) hey, here's this fab data set. What's (01:09:27) the square footage of all of them? And (01:09:29) we have this like thing that we built (01:09:30) which uh just pulls with cloud code (01:09:32) separately which for data centers and (01:09:34) and and fabs and everything else just (01:09:36) calculates the area of something from a (01:09:37) from a satellite image, right? Very (01:09:39) simple. So we have the square footage of (01:09:40) all these things. Points at that. Here's (01:09:42) the company name. Okay, go find the (01:09:43) filings. So it dig dug through all these (01:09:45) filings. It it pulled the data, right? (01:09:47) Okay, great. now told it to um compare (01:09:50) these two. Make a chart. Great. Oh, (01:09:52) wait. There's this like weird (01:09:52) inflection. Oh, that's because they (01:09:53) bought a company five years ago. Can you (01:09:55) do a proform of this analysis without (01:09:57) those financials of that of that company (01:09:58) they acquired? Okay, great. And then (01:10:00) like we were able to like like figure (01:10:02) out an investment case for our clients (01:10:05) as well as like you know some other (01:10:07) interesting details from someone who's (01:10:08) never really coded just using clawed (01:10:11) code and it like doing this all and this (01:10:13) is like not even their and it wrote the (01:10:14) note and they just like they didn't even (01:10:16) like work on this full-time for like 3 (01:10:18) hours right they just told the model and (01:10:20) would go work on other things and told (01:10:21) the model and worked on other things (01:10:22) they just did this people don't (01:10:24) understand that like the skill sets that (01:10:25) like I think like if you go talk to an (01:10:27) analyst right a very junior analyst at (01:10:29) any right? Whether it's venture or (01:10:31) especially growth venture or public (01:10:33) markets or private equity, their their (01:10:35) job is like finding data, cleaning it, (01:10:37) making charts. It's like this is cloud (01:10:38) code now. You don't need junior (01:10:40) analysts. Just like a lot of companies (01:10:42) have stopped hiring L4 engineers because (01:10:43) it's useless. Why would I hire an L4 (01:10:46) engineer? I just tell Claude to do it. (01:10:49) You you sort of like have this has (01:10:50) happened and this is a really big like (01:10:53) shift I guess like is that like (01:10:55) low-level knowledge work just doesn't (01:10:57) matter, right? Why would I why would I (01:10:59) use Excel when I can just tell Claude to (01:11:01) manipulate CSVs? Why would I use Word (01:11:03) when Claude will just generate the (01:11:05) markdown and I can copy and paste the (01:11:06) markdown directly into our WordPress and (01:11:08) then you know and that WordPress is (01:11:09) fully formatted now and it's like oh my (01:11:10) god like what's the point of Word, (01:11:12) right? Um and what's the point of doing (01:11:13) all sorts of stuff? I think when we look (01:11:15) at model progress that's just for Opus (01:11:17) 4.5. Open's new model I think will be (01:11:19) better than Opus 4.5 and it's coming (01:11:20) like somewhat soon in Marchish um time (01:11:23) frame. I maybe February, Marchish, but (01:11:25) yeah. Um because OpenAI has a better RL (01:11:27) stack than Enthropic today. It's just (01:11:29) their pre-trained models suck compared (01:11:31) to Enthropic's pre-training, right? And (01:11:33) so like if they catch up a lot on (01:11:34) pre-training and keep their better RL (01:11:36) stack, they would actually have a model (01:11:38) that's much better, right? Flip side, (01:11:39) Google has a better pre-trained model (01:11:41) than Anthropic or OpenAI, but their RL (01:11:43) stack sucks. So if they catch up on RL, (01:11:45) like these models are going to get (01:11:47) ridiculously and then Anthropic is (01:11:48) obviously advancing as well, right? And (01:11:49) so and then and then you look across the (01:11:51) ecosystem, everyone's advancing really (01:11:53) fast progress. These moments are (01:11:55) happening, right? You know, chat GPT was (01:11:57) a moment. Gibbly was a moment. Those (01:11:58) were more consumer. Those were less like (01:11:59) I mean there chat GBT is everyone using (01:12:01) it for work too. But like I think cloud (01:12:02) code is like a new moment right 4.5 on (01:12:05) cloud code is a new moment where the way (01:12:07) you work has forever changed. And so now (01:12:09) we're trying to force everyone in my (01:12:10) company. There's 54 people here. I think (01:12:12) like half of them have coded. The other (01:12:14) half we're trying to force them to use (01:12:15) like cloud code. And it could be like oh (01:12:17) well actually you come from a consult a (01:12:19) semiconductor consulting background. Oh, (01:12:20) you come from like a semiconductor like (01:12:22) engineering of like package. Oh, you (01:12:24) worked in a fab, right? Like these kind (01:12:25) of people, they're using cloud code now, (01:12:27) right? And and their productivity is (01:12:29) being boosted. (01:12:30) >> And it's like, (01:12:31) >> you know, workspace, cloud workspace is (01:12:33) new. It sucks compared to cloud code, (01:12:34) but it'll get there, right? He he he (01:12:36) said he coded it entirely in cloud code. (01:12:37) You know that, right? Or that was on (01:12:38) your pod, right? Yeah. Yeah. So, like um (01:12:41) you I've heard that and I think maybe (01:12:42) that might have been from your pod uh (01:12:44) original uh disclosure. (01:12:46) >> My pod was before that, but yes. Oh, (01:12:48) okay. Okay. It was (01:12:49) >> I had as the guy on my pod subsequently (01:12:50) said that. (01:12:51) >> Okay. I think it's like a brand new age (01:12:53) and and like there's so much low hanging (01:12:54) fruit as Shto said on the episode when (01:12:56) he was here. There's so much low hanging (01:12:57) fruit. Yeah. I mean for for the models (01:12:59) progressing and I think model progress (01:13:00) will translate to revenue. Adoption is (01:13:03) difficult but like actually the UX of (01:13:05) cloud code sucks but like give it 6 (01:13:07) months the models will be good enough (01:13:08) that the UX can be like talking to it. (01:13:10) Yep. (01:13:10) >> And you don't even have to have like you (01:13:12) know CLI integration, right? It's (01:13:14) something even easier. or like cloud for (01:13:16) XL was released recently and it's like (01:13:18) not bad you know building models and (01:13:19) like all these sorts of things are just (01:13:20) going to be like tell someone right like (01:13:22) why tell a junior analyst right when you (01:13:24) can just do it yourself I think it's a (01:13:25) whole new world and it's a $2 trillion (01:13:27) of software work but also of wages but (01:13:29) it's also we have more north of 2% 2% is (01:13:32) claw and then you know there's codeex (01:13:33) and cursor and all these other guys so (01:13:35) probably like 5% of code committed today (01:13:36) is AI generated if not higher marked as (01:13:39) AI generated what's going to happen when (01:13:41) normal workers who do spreadsheets and (01:13:44) office processing start automating their (01:13:45) workflows. I think it's a whole new (01:13:47) world. (01:13:47) >> And speaking of Schultoe, we both agreed (01:13:49) that he was a a perfect specimen. (01:13:52) >> Dude, [laughter] I' I've been I'm (01:13:54) straight, but I've been accused of being (01:13:56) uh homosexual, which is perfectly fine (01:13:58) for for how much I like praise this man (01:14:01) because like, think about it, right? (01:14:02) He's like 6'4. He's like really (01:14:05) good-looking. He's like Australian (01:14:07) accent. Sounds amazing. Like you've (01:14:09) heard his I I have like a annoying voice (01:14:11) probably. His voice sounds amazing. He's (01:14:13) absurdly good at coding. He was an (01:14:15) Olympian level fencer. Like like he (01:14:18) picks up any sport, he's really good at (01:14:20) it, right? Because he's athletic. It's (01:14:21) like, "Holy crap, you're a specimen." (01:14:23) >> Yeah. Yeah. (01:14:24) >> This clip and sent him [laughter] for (01:14:26) sure. (01:14:28) >> Yeah. It must be uh you know I guess uh (01:14:31) may maybe some people don't follow the (01:14:33) playbyplay on on Twitter and like don't (01:14:36) haven't haven't heard of like the fact (01:14:37) that all of you guys are roommates or (01:14:39) you roommate with Scholto and then with (01:14:41) Dwarish and Darkish is like the (01:14:43) podcasters podcaster. So it must be (01:14:46) absolutely (01:14:47) >> What's a podcasters podcaster mean? (01:14:49) >> Uh the podcaster that other podcasters (01:14:52) uh aspire to to to become or learn from. (01:14:56) >> Yeah. Yeah. his his when he's preparing, (01:14:58) you know, it's like he's he's he's he's (01:15:00) so locked in and he prepares so hard for (01:15:02) interviews. It's great. (01:15:03) >> No, he's he's just uh incredible. (01:15:05) >> And then and then he might only say like (01:15:07) a hundred words on the episode, (01:15:09) >> but he's prepared so hard and then like (01:15:10) I think people just realized, oh wow, (01:15:12) he's not just like, you know, it's like, (01:15:13) oh, he just has good guests. No, no, no. (01:15:15) Like he's preparing really hard, but you (01:15:16) can't tell if you're not like realizing (01:15:18) that. And then once he started writing (01:15:20) more and he started writing more, people (01:15:21) like, oh wow, he's actually really (01:15:22) really smart. It's like, yeah, cuz he's (01:15:24) studying like crazy. Like it's like, (01:15:26) "Oh, I'm interviewing an AI researcher (01:15:27) who worked on this. I'm gonna try and (01:15:28) train a freaking model." Yeah. (01:15:29) >> Right. It's like that's the level of (01:15:31) like commitment he goes to when he (01:15:32) records this stuff. (01:15:33) >> What do you guys talk about when you (01:15:34) bump into each other? Is that is that AI (01:15:36) non-stop or you talk about everything (01:15:37) but AI (01:15:38) >> with Shoto? It's like the Age of Empires (01:15:40) game, you know, because we we got super (01:15:41) into it for a bit. We talked only about (01:15:43) that in his RTS that he made. Uh with (01:15:45) with with Dwarash, it's I mean, it's all (01:15:47) sorts. It's like normal roommate stuff. (01:15:49) It's like, [laughter] "How's your dating (01:15:50) life?" "Oh, okay. You went on a date. It (01:15:52) wasn't well. It didn't go well." "Okay, (01:15:53) well, okay." Yeah. you know, like, oh, (01:15:55) you know, like that's me. That's me. You (01:15:56) know, my days don't go [laughter] well. (01:15:58) No, I'm just kidding. Um, or like it's (01:16:00) like, oh, you want to like have dinner? (01:16:02) We can invite a few friends. Like, yeah, (01:16:03) great. Or like, you know, it's like all (01:16:04) sorts of like normal stuff, too. Um, al (01:16:06) obviously we also do talk about a lot (01:16:08) about tech, right? Like we are like this (01:16:10) is our lives. Um, and tech is the most (01:16:12) fun thing. (01:16:13) >> Awesome. Well, great. Great San (01:16:14) Francisco lore. Uh, Dylan, thank you so (01:16:17) much. Uh, that was absolutely fabulous. (01:16:18) Really enjoyed it. Learned a lot. So, (01:16:20) really appreciate uh your coming on the (01:16:22) pub. (01:16:22) >> Thank you so much. (01:16:25) Hi, it's Matt Turk again. Thanks for (01:16:26) listening to this episode of the Mad (01:16:28) Podcast. If you enjoyed it, we'd be very (01:16:30) grateful if you would consider (01:16:31) subscribing if you haven't already or (01:16:33) leaving a positive review or comment on (01:16:35) whichever platform you're watching this (01:16:37) or listening to this episode from. This (01:16:39) really helps us build a podcast and get (01:16:41) great guests. Thanks, and see you at the (01:16:44) next episode.

Dylan Patel: NVIDIA’s New Moat & Why China is “Semiconductor Pilled” (YouTube Video Transcript)

Learning Modes

YouTube Video Transcript Hide

Ask AI Result

Leave a Reply Cancel reply

Other Videos:

The AI-Generated Intimacy Crisis | Bryony Cole | TED

AI CEO: How to build a $1B Company in 2...

A brilliant student, ignored by all, decides to earn a...

Two towers guarding a pit full of monsters from the...

30 سؤال واجابته عن الصيام واحكامه للشيخ مصطفى العدوى

Two teenage girls swap bodies with their mother and grandmother...

She decides to take revenge on every bully who tormented...

Why Modern Parenting Is Creating a Mental Health Crisis (ft....

I Spy on my Kid’s Phone: Strict vs Free Range...

The 2026 Business Playbook: Leverage AI Before Your Competitors Do...

How to keep children safe online? | Cybersecurity Podcast

$215M AI CEO: How I’d Build a Profitable AI Startup...