NVIDIA’s Jensen Huang on Reasoning Models, Robotics, and Refuting the “AI Bubble” Narrative (YouTube Video Transcript)

↔

Title: NVIDIA’s Jensen Huang on Reasoning Models, Robotics, and Refuting the “AI Bubble” Narrative

Duration: 01:16:21

Total Correct Answers:

Dictation Mode:

Current Caption

Correct

Show All Captions

Learning Modes

Dictation

YouTube Video Transcript Hide

Ask AI Result

The ask AI result will appear here..

Show timestamps

Display as lines

(00:00:00) Your YouTube transcript will appear here (00:00:06) Hson, thanks so much for joining us (00:00:07) today. (00:00:08) >> So great to have you guys. What an (00:00:10) amazing year. (00:00:11) >> What a year. (00:00:12) >> Happy Hanukkah, merry Christmas, (00:00:14) >> happy new year coming up. Yep. Happy (00:00:15) holidays. (00:00:16) >> So, uh, with everything that's happened (00:00:19) in 2025, (00:00:21) um, and you know, being in the middle of (00:00:23) the vortex with it, what do you reflect (00:00:24) on and say like this surprised you most (00:00:26) or this is the biggest change? (00:00:28) >> Let's see. There there's some things (00:00:29) that didn't surprise me like for example (00:00:31) the scaling laws didn't surprise me (00:00:32) because we already knew about that. The (00:00:34) technology advancement didn't surprise (00:00:36) me. I was pleased with the improvements (00:00:39) of grounding. I was pleased with the (00:00:41) improvements of reasoning. I was pleased (00:00:43) with uh uh the connection of all of the (00:00:46) models to to to search. I'm pleased that (00:00:50) it that uh there are now routers that (00:00:53) are in front of these models so that it (00:00:55) could depending on the confidence of the (00:00:57) answers go off and do necessary research (00:01:00) and and just generally improve the (00:01:03) quality and the accuracy of answers. (00:01:05) >> I'm hugely proud of that. I think the (00:01:08) whole industry addressed one of the (00:01:10) biggest skeptical responses of AI which (00:01:13) is hallucination and um generating (00:01:16) gibberish and all of that stuff. I I (00:01:18) thought that this year the whole (00:01:20) industry everything from every every (00:01:22) field from language to vision to (00:01:25) robotics to self-driving cars the the (00:01:28) application of reasoning (00:01:31) and the grounding of the of of of the (00:01:35) answers. Um big big leaps would you guys (00:01:38) say this year? (00:01:39) >> Huge. I mean things like open evidence (00:01:41) too for medical information where (00:01:42) doctors are now really using that as a (00:01:43) trusted resource like you Harvey for (00:01:46) legal you're really starting to see AI (00:01:47) emerge as one of these things that's (00:01:49) become a trusted tool or counterparty (00:01:51) for you know experts to actually be able (00:01:53) to do what they do much better. (00:01:55) >> That's that's right. And so so in a lot (00:01:57) of ways I was expecting it but I'm still (00:01:59) pleased by it. I'm proud of it. I'm (00:02:01) proud of all of the industry's work in (00:02:03) this area. I'm really pleased and and uh (00:02:06) uh and probably a little bit surprised (00:02:09) in fact that token generation rate for (00:02:13) inference especially reasoning tokens (00:02:15) are growing so fast several exponentials (00:02:19) at the same times it seems and uh and (00:02:22) I'm so pleased that that these tokens (00:02:24) are now profitable that people are (00:02:26) generating I heard somebody hurt today (00:02:29) that that open evidence speaking of them (00:02:31) 90% gross margins I mean those are very (00:02:34) profitable tokens. (00:02:35) >> Yeah. (00:02:35) >> And so they're obviously doing very (00:02:37) profitable, very valuable work. Cursor, (00:02:39) their margins are great. Uh Claude's (00:02:42) margins are great for the enterprise use (00:02:44) of OpenAI. Their margins are great. Um (00:02:46) so anyways, it's really terrific to see (00:02:48) that that um we're now generating tokens (00:02:51) that are sufficiently good, so good in (00:02:54) value that that people are willing to (00:02:55) pay good money for. And so I I think (00:02:57) these are are really great grounding for (00:02:59) the year. I mean some of the things that (00:03:02) the narrative that that um uh of course (00:03:05) the conversation with China really (00:03:06) really you know occupied a lot of my my (00:03:09) time this year. Geopolitics (00:03:12) uh the importance of technology in each (00:03:13) one of the countries. Uh I spent more (00:03:15) time traveling around the world this (00:03:17) year than just about any time in the h (00:03:18) all of my life combined. You know my (00:03:21) average elevation this year is probably (00:03:23) about 17,000 ft. You know so so it's (00:03:25) nice to be here on the ground with you (00:03:27) guys. Um and so so I think uh (00:03:30) geopolitics the importance of AI to all (00:03:32) the nations uh all worth talking about (00:03:34) later. You know of course I spent a lot (00:03:36) of time on expert control and and making (00:03:38) sure that our strategy is nuanced and uh (00:03:41) really grounded and um uh promotes (00:03:44) national security but recognizing the (00:03:46) importance of various uh various facets (00:03:50) of national security. Um a lot of (00:03:52) conversations around that. Um, you know, (00:03:54) of course, of course, uh, lots of (00:03:56) conversation about jobs, the impact of (00:03:58) AI, uh, energy, (00:04:01) >> um, uh, labor shortage. I mean, boy, we (00:04:03) covered everything, did we? Yeah. (00:04:05) Everything was AI. (00:04:06) >> Everything was AI. Yeah, it was (00:04:08) incredible. (00:04:09) >> Yeah, AI was definitely the center of (00:04:10) the storm for like every one of those (00:04:12) themes. Maybe one we can start with (00:04:13) actually um is jobs because or there (00:04:16) jobs and employment because when I look (00:04:17) at the traditional AI community even (00:04:20) before things were scaling and even (00:04:22) before AI was really working there was a (00:04:24) strong sort of doomsday component in the (00:04:26) people working on AI oddly enough right (00:04:28) the people who were most trying to push (00:04:29) the field forward were often the people (00:04:30) who are most pessimistic which is very (00:04:32) odd why would you do both at once (00:04:33) >> and I feel like that narrative has taken (00:04:35) over some subset of media or some set of (00:04:37) other things despite all the things that (00:04:39) we think are very positive about what AI (00:04:41) has done That's going to help with (00:04:42) healthcare, with education, with (00:04:43) productivity, with all these other (00:04:45) areas. (00:04:46) >> And in in general, whenever we have a (00:04:48) technology shift, you have a shift in (00:04:50) terms of the jobs that are important, (00:04:51) but you still have more jobs. (00:04:52) >> That's right. (00:04:52) >> Could you talk about how you think about (00:04:53) employment and jobs and sort of what (00:04:55) people are saying and what you think the (00:04:56) real narrative is there? (00:04:57) >> Maybe what I'll do is I'll I'll ground (00:04:59) it on uh three points in space, three (00:05:02) points in time. now. (00:05:04) >> Mhm. (00:05:05) >> Uh maybe uh uh very near future and then (00:05:09) some some point out out in the distance (00:05:11) and and maybe maybe some (00:05:14) counternarratives. (00:05:15) >> Um something else to think about with (00:05:17) respect to jobs in the near term. (00:05:20) >> Uh one of the most important things is (00:05:21) that that AI is not just AI is software (00:05:26) >> but it's not pre-recorded software as (00:05:28) you know. For example, Excel was written (00:05:30) by several hundred engineers. They (00:05:32) compiled it. It's pre-recorded and then (00:05:34) they distribute it as is for several (00:05:36) years. In the case of AI, because it (00:05:39) takes into the context, what you asked (00:05:42) of it, what's happening in the world, (00:05:44) right? Contextual information, it (00:05:46) generates every single token for the (00:05:49) first time, every time. (00:05:50) >> Which means every time you use the (00:05:53) software and and everything that we do, (00:05:55) AI is being generated for the first time (00:05:57) ever. Just like intelligence, our (00:05:59) conversation today relies on some, you (00:06:02) know, ground truth and some knowledge (00:06:04) and but it's every single word is being (00:06:06) generated for the first time here. The (00:06:08) thing that's really really quite unique (00:06:10) about AI is that it needs these (00:06:13) computers to generate these tokens every (00:06:15) single time. I call them AI factories (00:06:17) because it's producing tokens that will (00:06:20) be, you know, used all over the world. (00:06:22) Now, some people would say it's also (00:06:23) part of infrastructure. The reason why (00:06:25) it's infrastructure is because obviously (00:06:27) it affects every single application. (00:06:29) It's used in every single company. It's (00:06:32) used in every single industry every (00:06:34) single country. Therefore, it's part (00:06:35) infrastructure like energy and and (00:06:37) internet. Now, because of that and the (00:06:40) amount of computers that's necessary to (00:06:41) generate these tokens and it's never (00:06:43) happened before and because we need (00:06:44) these factories, three new industries (00:06:47) have emerged. Number one, well, three (00:06:49) new type of plants have to be created. (00:06:50) Number one, we have to build a lot more (00:06:52) chip plants. (00:06:53) >> Mhm. TSMC is building, right? SKH Highix (00:06:56) building a lot more plants and so we (00:06:59) need more chip plants. We need more (00:07:01) computer plants. These computers are (00:07:03) very different. These are supercomputers (00:07:05) that the world's never seen before. (00:07:06) Right. Grace Blackwell looks like a very (00:07:09) different type of computer than anything (00:07:11) that's ever been made. And entire rack (00:07:13) is one GPU. (00:07:15) >> And so we need new supercomputer plants. (00:07:17) And then we need new AI factories. These (00:07:20) three plants are currently being met (00:07:22) being built in the United States at very (00:07:25) large scale quite broadly all over the (00:07:27) United States for the very first time. (00:07:29) >> The number of construction workers, (00:07:31) plumbers, electricians, technicians, (00:07:33) network engineers, you know, right? The (00:07:36) the number of the skilled labor that's (00:07:39) necessary to support this new industry (00:07:40) in the near term, (00:07:42) >> it'll be enormous. Let's just face it. (00:07:45) Uh I'm [clears throat] so excited to (00:07:46) hear that electricians are seeing their (00:07:48) paychecks double. They're being they're (00:07:50) being paid to travels like like us. We (00:07:52) go on business trips. They're going on (00:07:53) business trips. And so it's really (00:07:55) terrific to see, you know, that this (00:07:58) these three industries are now three (00:08:01) types of plants, factories are just (00:08:03) creating so much so much jobs. The next (00:08:06) part is the the near-term impact of AI (00:08:10) on jobs. And one of my favorites is um I (00:08:14) love Jeff Hinton. uh he said uh you know (00:08:18) some five six seven years ago that in (00:08:21) five years time uh AI will completely (00:08:25) revolutionize radiology that every (00:08:28) single radiology application will be (00:08:30) powered by AI and that radiologists (00:08:35) uh will no longer be needed and that he (00:08:38) would advise this the first profession (00:08:40) not to go into is radiology and he's (00:08:42) absolutely right 100% of radiology (00:08:46) applications are now AI powered. That's (00:08:49) completely true and in some eight years (00:08:51) time it is now completely pervaded uh uh (00:08:56) radiology. However, what's interesting (00:08:58) is that the number of radiologists (00:09:00) increased (00:09:01) >> and so now the question is why and this (00:09:04) is where the difference between task (00:09:07) versus purpose of a job. A job has tasks (00:09:10) and has purpose. And in the case of a (00:09:13) radiologist, the task is to study scans, (00:09:18) but the purpose is to diagnose disease (00:09:21) >> and to research (00:09:22) >> and and that exactly and they're doing (00:09:24) research. And so in the case in their (00:09:26) case, the fact that they're able to (00:09:28) study more scans more deeply, (00:09:31) um they're able to uh request more (00:09:33) scans, do a better job diagnosing (00:09:36) disease, the hospital's more productive, (00:09:39) they can have more patients, which (00:09:41) allows them to make more money, which (00:09:43) allows them to want to hire more (00:09:44) radiologists. And so the question is (00:09:47) what is the purpose of the job versus (00:09:50) what is the task that you do in your (00:09:52) job? And and as you know, I spend most (00:09:54) of my (00:09:55) >> day typing. [snorts] That's my task, but (00:09:58) my purpose is obviously not typing. And (00:10:01) so the fact that somebody could use AI (00:10:03) to automate a lot of my typing, and I (00:10:05) really appreciate that, and it helps a (00:10:06) lot. (00:10:07) >> Um, it hasn't really made me, if you (00:10:09) will, less busy. In a lot of ways, I (00:10:12) become more busy because I'm able to do (00:10:13) more work. So, I think that that's the (00:10:15) second part to consider is the task (00:10:17) versus the purpose of the job. This (00:10:19) example really strikes home because my (00:10:21) my sister-in-law Erin actually leads um (00:10:23) in nuclear medicine at Stanford, right? (00:10:25) So, she's in radiology and with all the (00:10:28) technology advancements that are coming, (00:10:30) >> these doctors really welcome it and they (00:10:32) are working 20 hours a day trying to do (00:10:34) more research and serve more patients. (00:10:36) Exactly. And and I think one thing that (00:10:38) is often missed beyond the sort of um uh (00:10:42) diversity of jobs being created by this (00:10:44) investment in infrastructure is actually (00:10:46) how much latent demand there is for (00:10:49) different goods that we we need in (00:10:52) society like better healthcare. I don't (00:10:53) think anybody feels like you know what (00:10:55) we have reached the the tiptop uh (00:10:59) mountaintop of like what American (00:11:00) healthcare or global healthcare could be (00:11:03) and um the more we can make these people (00:11:06) productive the more demand there will be (00:11:07) >> that's exactly right if I if Nvidia was (00:11:09) more productive it doesn't result in (00:11:12) layoffs it results in us doing more more (00:11:15) things (00:11:16) >> I met your new hire class today you seem (00:11:18) to be hiring every week anyway yeah (00:11:20) >> that's exactly right right the the more (00:11:22) productive we are the more uh ideas we (00:11:25) can explore uh the more growth as as a (00:11:28) result the more profitable we become (00:11:30) which allows us to pursue more ideas and (00:11:33) so I think you're you're absolutely (00:11:34) right that that if if the job if if your (00:11:37) if your life if the world the problems (00:11:40) is literally already specified and (00:11:43) there's no other problem to solve then (00:11:45) productivity would actually reduce the (00:11:48) economy but it's clearly going to (00:11:50) increase the e economy I think that the (00:11:52) Next part that I would consider is, you (00:11:55) know, people say, gosh, all of these (00:11:57) robots that we're talking about, it's (00:11:58) going to take away jobs. As as we know (00:12:00) very clearly, we don't have enough (00:12:03) factory workers. Our economy is actually (00:12:05) limited by the number of factory workers (00:12:07) we have. Most people are are having a (00:12:10) very hard time retaining their workers. (00:12:13) Um, we also know that the number of (00:12:15) truck drivers in the world is severely (00:12:18) short. And the reason for that is people (00:12:20) don't want those jobs where you have to (00:12:22) travel across the country and live in (00:12:24) different parts of the world, different (00:12:25) parts of the country, you know, every (00:12:26) single night. So people want to stay in (00:12:28) their town, stay with their families. (00:12:30) And so I think that I think the first (00:12:32) part is that having robotic systems is (00:12:36) going to allow us to cover the labor (00:12:39) shortage gap which is really really (00:12:41) severe and getting worse because of (00:12:43) aging population. This is this is not (00:12:45) only United States, it's all over the (00:12:47) world as you guys know. (00:12:48) >> And so we're going to cover the labor (00:12:51) shortage. But the second part that (00:12:53) people forget and and as a result we'll (00:12:55) go there are shortages as well in other (00:12:57) places that people talk about AI being (00:12:58) relevant. Accounting would be an example (00:13:00) where there's shortages there. Nursing (00:13:02) is another example. So you know you can (00:13:04) you can go through multiple other (00:13:05) industries and say okay there's gaps (00:13:07) right (00:13:07) >> and AI is trying to help fill those (00:13:08) gaps. (00:13:09) >> That's exactly right. And so so um (00:13:11) automation is going to help us increase (00:13:13) and solve the the the the labor gap. Now (00:13:17) people also don't don't remember that (00:13:19) when we have cars, we need mechanics to (00:13:23) take care of our cars. (00:13:25) >> And if you look at the robo taxis that (00:13:27) are that are even on the streets today, (00:13:29) it's taken 10 years for that to happen. (00:13:32) Look at all the maintenance crews and (00:13:34) all of the the the various, you know, (00:13:37) hubs that they're in where you have to (00:13:39) take care of these robo taxis and just (00:13:41) imagine we have a billion robots. (00:13:44) >> Mhm. (00:13:44) >> It's going to be the largest repair (00:13:46) industry on the planet. So I I think a (00:13:48) lot of people don't they they just have (00:13:49) to think through (00:13:50) >> and this is the part where you said um (00:13:52) when we create this type of automation, (00:13:54) we create this other job. Right now look (00:13:56) at AI is creating so many jobs. Mhm. (00:13:59) >> The AI industry is creating a boom of (00:14:01) jobs. (00:14:02) >> I think one of the core challenges here (00:14:03) is it's very easy to draw a straight (00:14:07) line of extrapolation from like oh you (00:14:10) know uh there are tools that help (00:14:12) lawyers be more productive. It's going (00:14:14) to replace the lawyers but it's actually (00:14:15) it takes like a step of incremental (00:14:18) reasoning to say there's a sucking sound (00:14:21) in the economy for everything in AI (00:14:22) infrastructure. there's actually a (00:14:24) sucking sound toward all of this demand (00:14:26) that is latent in the places where we (00:14:28) have gaps where um I think a lot of (00:14:31) policy makers have focused on you know (00:14:33) we can't replace or reduce what we have (00:14:35) when it's really there's there's far (00:14:37) more demand in what we actually are not (00:14:39) >> and in the case of lawyer what's the (00:14:41) what's the purpose of the lawyer versus (00:14:43) the task of the lawyer (00:14:45) >> reading a contract writing a contract is (00:14:48) not the purpose of the lawyer the (00:14:50) purpose of the lawyer is to help you (00:14:52) resolve conflict (00:14:54) And that's more than reading a contract. (00:14:56) It's more than writing a contract. The (00:14:59) purpose is to protect you. That's more (00:15:01) than reading a contract. It's more than (00:15:02) writing a contract. And so I think just (00:15:05) it's really really important to go back (00:15:06) to what is the purpose of the job versus (00:15:09) the task that we use, (00:15:12) >> you know, to perform that job that (00:15:13) changes over time. (00:15:14) >> Yeah. The other big theme of the year (00:15:15) that you mentioned that I think is (00:15:16) really important to touch upon is both (00:15:18) uh China is sort of in the rise of (00:15:20) Chinese open source in particular where (00:15:22) you know some of the highest scoring (00:15:24) models against benchmarks now are (00:15:26) Chinese models on the open source side (00:15:27) on the closer side it's still a lot of (00:15:28) the US models but things like Quinn (00:15:30) Deepseek etc (00:15:32) >> are doing very well you've long been a (00:15:34) proponent for open source in general (00:15:35) could you could you share views about (00:15:37) both China emerging for AI for open (00:15:40) source and what the US should be doing (00:15:41) in terms of both open source as well as (00:15:42) its own industries (00:15:44) >> when you Think about these complicated (00:15:46) interconnected dependent (00:15:49) um networks of problems. These this you (00:15:51) know big goop of a mesh of problems it's (00:15:55) always good to to go back and find a (00:15:57) framework for what it is that we're (00:15:59) talking about. In the case of AI um what (00:16:04) is AI? (00:16:06) Well, of course, the technology of AI (00:16:09) and the capability, the capabilities of (00:16:11) AI is about automation. It's about (00:16:14) automation of intelligence for the very (00:16:16) first time. And you could combine it (00:16:19) with megatronics technology to embody (00:16:23) that megatronics and and make it perform (00:16:25) tasks. (00:16:27) >> So that's what's AI automation. But what (00:16:31) what is the stack that makes AI (00:16:33) possible? What's the technology stack? (00:16:34) functional stack. And of course the e (00:16:37) the easiest way to think about that is (00:16:39) it's kind of like a fivey year five year (00:16:41) five layer cake which is at the lowest (00:16:43) level is energy. (00:16:45) >> Um it transforms energy to the output (00:16:47) that I just described. The next layer is (00:16:49) chips. The next layer is infrastructure (00:16:51) and that infrastructure is both hardware (00:16:53) software right this is where land power (00:16:56) and shell this is where construction is (00:16:58) data centers are the software stack (00:17:01) >> you know for orchestrating the so it's (00:17:03) software and hardware the layer above (00:17:05) that is where everybody thinks about (00:17:07) which is AI which is the models (00:17:10) >> we know this but it's really helpful to (00:17:12) understand that AI is a system of models (00:17:15) >> and AI is a um a techn technology that (00:17:20) understands information and there's (00:17:22) human information and so we often times (00:17:26) think about AI as a chatbot (00:17:28) >> but remember there's biological (00:17:30) information there's chemical information (00:17:32) there's physical information of all (00:17:34) kinds there's financial information (00:17:37) there's healthcare information there's f (00:17:38) there's information of all modalities (00:17:41) all kinds AI is really really broad and (00:17:44) of course human language is at the (00:17:46) foundation of of many things but it's (00:17:48) not the essence of everything because as (00:17:51) you know you know biology molecules (00:17:53) don't understand English (00:17:55) >> they understand something else right (00:17:57) proteins don't understand English they (00:17:58) understand something else I think the (00:18:00) next layer the important thing is is uh (00:18:02) that's where the AI models are but (00:18:04) there's a whole the AI is very very (00:18:06) diverse and then the the layer above (00:18:08) that is is applications and it depends (00:18:10) on the industry and you already (00:18:11) mentioned open evidence there you (00:18:13) mentioned Harvey there's cursor there's (00:18:15) all kinds of right there's all kinds of (00:18:16) applications full self-driving is really (00:18:18) an application, an AI application that (00:18:21) is embodied into a mechanical car (00:18:23) >> and figure is a AI application that has (00:18:26) been embodied into a mechanical human. (00:18:28) And so, so you got all these different (00:18:31) applications. Well, this five layer (00:18:33) stack is one way of thinking about it. (00:18:36) And then the next way of thinking about (00:18:37) I just mentioned is AI is really (00:18:39) diverse. When you now have this (00:18:40) framework of what the the technology (00:18:43) capabilities are, how to how to build (00:18:46) the technology and how diverse it is, (00:18:49) then you can come back and think about (00:18:51) okay, let's ask the question, how (00:18:53) important is open source? Well, without (00:18:56) open source, you know, today, of course, (00:19:00) the frontier models, the the the leading (00:19:02) labs have chosen to to use a closed (00:19:05) source um application approach, which is (00:19:08) just fine. you know what people decide (00:19:10) to do with their business models is is (00:19:11) really in the final analysis. It's their (00:19:13) business and they have to they have to (00:19:15) calculate what is the best way for them (00:19:17) to get the return on investment so that (00:19:18) they could scale up and and make better (00:19:20) advances. Um however they they made that (00:19:23) calculus is fantastic. On the other (00:19:26) hand, uh without open source, as you (00:19:28) know, startups would be challenged, uh (00:19:31) companies that are in in uh uh different (00:19:33) industries, whether it's manufacturing (00:19:36) or transportation or um it could be in (00:19:39) healthcare. Without open source today, (00:19:42) all of that AI work would be suffocated. (00:19:46) >> And so, they just need to have something (00:19:47) that's pre-trained. They need to have (00:19:50) some fundamental technology about (00:19:51) reasoning. from that they could all (00:19:54) adapt, fine-tune, you know, train their (00:19:57) AI models into exactly the domain and (00:20:00) application they want. And so what (00:20:02) people really really miss is just the (00:20:05) incredible pervasiveness and the (00:20:08) importance of open source to all of (00:20:10) these industries. large companies uh (00:20:13) without without open source some of some (00:20:15) of 100-year-old companies that I work (00:20:17) with (00:20:17) >> in in industrial spaces in healthcare (00:20:20) spaces they would be suffocated they (00:20:22) wouldn't be able to do that (00:20:23) >> open source at this point is driving all (00:20:24) of our data centers is driving a big (00:20:27) chunk of telefan in the world in terms (00:20:28) of Android or other devices it's driving (00:20:30) exactly (00:20:31) >> you know to your point a lot of the (00:20:32) industrial applications so it's already (00:20:33) pervasive and I think the big question (00:20:34) is (00:20:35) >> open source without open source higher (00:20:37) ed (00:20:37) >> higher ed wouldn't happen (00:20:38) >> education research (00:20:40) >> startups I mean the list goes on, you (00:20:42) know, and so so (00:20:44) >> we talk we talk all day long about the (00:20:46) tip but the most visible part of that (00:20:49) the most the part that's most newsworthy (00:20:51) maybe but underneath that is such an (00:20:56) important space of open source AI and (00:20:58) whatever we decide to do with policies (00:21:01) do not damage that innovation flywheel. (00:21:05) So I spent a lot of time uh educating (00:21:08) educating uh uh policy makers to help (00:21:11) them understand whatever you decide (00:21:12) whatever you do don't forget open (00:21:15) source. Whatever you decide whatever you (00:21:17) do don't forget biology. (00:21:20) I think the counternarrative here that (00:21:22) is worth addressing is that essentially (00:21:25) like you know there should be a (00:21:28) monolithic vertical player and (00:21:30) monolithic asset in the like one model (00:21:32) that does it all and that we can't give (00:21:34) away that crown jewel to other countries (00:21:37) or non-American companies and and your (00:21:39) your argument is like we actually need (00:21:41) this huge diversity of AI applications (00:21:44) and and the American advantage is (00:21:46) actually or any any sovereign advantage (00:21:48) is in the whole stack right? The (00:21:50) capability to deliver any piece of it. (00:21:52) >> I guess someday we will have God AI. (00:21:56) >> But when is that day? (00:21:57) >> But but that someday that someday is (00:21:59) probably on biblical scales, you know, I (00:22:02) think galactic scales. Um I I think it's (00:22:05) it's not helpful to go from where we are (00:22:08) today to God AI. (00:22:11) >> And um I don't think any company (00:22:14) practically believes they're anywhere (00:22:16) near God AI. And nor nor do I do I see (00:22:20) any researchers having any reasonable (00:22:22) ability to create god AI. The ability to (00:22:25) h understand human language and genome (00:22:29) language and molecular language and (00:22:31) protein language and amino acid language (00:22:33) and physics language all supremely well. (00:22:36) That god AI just doesn't exist. (00:22:39) >> And and yet we have a lot of industries (00:22:41) that need AI. Mhm. (00:22:43) >> AI is if if you will at the simplistic (00:22:47) level, it's just the next computer (00:22:50) industry. (00:22:51) >> And give me an example of a company, an (00:22:54) industry, a nation who doesn't need (00:22:56) computers. (00:22:57) >> Mhm. (00:22:58) >> And we all don't have to wait around for (00:23:00) God AI for us to advance, right? So God (00:23:03) AI is not showing up next week. I'm (00:23:04) fairly certain of that. Okay. And God (00:23:06) [clears throat] AI god AI is not not (00:23:08) going to show up next year, but the (00:23:10) whole world needs to move forward next (00:23:11) week, next year, next decade. I think (00:23:13) that that the idea of a monolithic (00:23:16) gigantic (00:23:18) company, (00:23:20) >> country, nation, state that has got AI (00:23:24) is just (00:23:25) >> it's unhelpful. (00:23:26) >> It's unhelpful. It's too extreme. (00:23:28) >> Then in fact, if you want to take it to (00:23:31) that level, then we ought to just all (00:23:32) stop everything. (00:23:34) What's the point of having even (00:23:36) governments? I mean, why why why are (00:23:38) they doing policies? God AI is going to (00:23:39) be smart enough to avert, you know, work (00:23:41) around any policy. And so, what's the (00:23:44) point? And so, I I think that that we (00:23:46) ought to bring things back to the ground (00:23:48) ground level and start thinking about (00:23:50) things practically and and use common (00:23:53) sense. (00:23:55) >> This seems to be like a big theme in (00:23:56) general in terms of this conversation (00:23:58) where there's been a lot that's been (00:23:59) kind of put out there that seems very (00:24:02) extreme if you actually think about it. (00:24:03) It's the jobs and employment. Nobody's (00:24:04) going to be able to work again. It's God (00:24:06) AI is going to solve every problem. It's (00:24:08) we shouldn't have open source for XYZ (00:24:10) reason despite open source powering much (00:24:11) of our industries already. (00:24:12) >> That's right. (00:24:13) >> And so it seems like in general maybe (00:24:14) one of the themes of 2025 was there's a (00:24:16) lot of extremes that were sort of (00:24:18) painted in the public with AI that if (00:24:21) you look at them very closely don't (00:24:22) really follow a logical change in terms (00:24:23) of happening anytime soon. (00:24:25) >> Yeah. And so it's it's it sounds like (00:24:26) it's really important to have this (00:24:27) conversation. (00:24:28) >> Extremely hurtful frankly. And I I think (00:24:30) we've done a lot of damage uh with very (00:24:33) wellrespected people um who have who (00:24:36) have painted a doom doomer narrative um (00:24:40) end of the world narrative science (00:24:42) fiction narrative and um you know and I (00:24:45) and I appreciate that that many of us (00:24:47) grew up in and enjoyed science fiction. (00:24:50) >> Um but I but it's not helpful. It's not (00:24:52) helpful to people. It's not helpful to (00:24:54) the industry. It's not helpful to (00:24:55) society. It's not helpful to the (00:24:57) governments. Mhm. (00:24:58) >> There are a lot of many people in the (00:25:00) government who obviously aren't as (00:25:03) familiar with as as comfortable with the (00:25:06) technology (00:25:07) >> and when PhDs of this and CEOs of that (00:25:12) >> goes to governments and explain and (00:25:14) describe these end of the world (00:25:16) scenarios and extremely extremely (00:25:19) dystopian future the future. Um, you (00:25:22) have to ask yourself, you know, what is (00:25:24) the purpose of that narrative and what (00:25:26) is their what are their intentions and (00:25:28) what do they hope? Why are they why are (00:25:31) they talking to governments about these (00:25:32) things to create regulations to (00:25:35) suffocate startups? [clears throat] (00:25:36) >> For what reason would they be doing (00:25:38) that, you know, and so (00:25:39) >> and do you think that's just regulatory (00:25:41) capture where they're trying to prevent (00:25:43) uh new startups from showing up and (00:25:44) being able to compete effectively or (00:25:46) what do you think is the goal of some of (00:25:47) these conversations? you know, I I can't (00:25:50) I can't um uh guess what they what they (00:25:53) have in mind. I know that the concern is (00:25:55) regulatory capture. As a policy, as a (00:25:58) practice, I don't think companies had to (00:26:02) go to (00:26:04) um governments to advocate (00:26:08) for the regulation on other companies (00:26:12) and other industries. just in practice (00:26:15) their their intentions are clearly (00:26:18) deeply conflicted and and uh their (00:26:22) intentions are clearly you know not (00:26:24) completely in the best interest of (00:26:26) society. I mean they're obviously CEOs (00:26:28) are obviously companies and obviously (00:26:30) they're advocating for themselves (00:26:32) >> and so so I think if we can all (00:26:36) >> come back to where are we today (00:26:39) >> and think about where the technology is (00:26:41) going to be. I mean look lit literally (00:26:43) in one year's time as we were talking (00:26:45) about in the beginning uh some of the (00:26:48) most proud moments is when the industry (00:26:52) was able to invest very aggressively in (00:26:55) advancing AI technology instead of being (00:26:57) slowed down. (00:26:58) >> Remember just two years ago people were (00:27:00) talking about slowing the industry down (00:27:02) >> but as we advanced quickly what did we (00:27:05) solve? We solved grounding, we solved (00:27:08) reasoning. We solved research. All of (00:27:11) that technology was applied for good (00:27:14) improving the functionality of the AI (00:27:17) not you know (00:27:19) >> yet the end has not come (00:27:20) >> yet the end has not come it's become (00:27:22) more useful it's become more functional (00:27:25) it's become able to do what we ask it to (00:27:27) do you know and so the first the first (00:27:30) part of the safety of a product is that (00:27:32) it perform as advertised (00:27:36) >> the first part of safety is performance (00:27:40) that it's is supposed like the first (00:27:42) part of safety of a car isn't that some (00:27:44) person is going to jump into the car and (00:27:47) use it as a missile. The first part of (00:27:49) the car is it works as advertised. (00:27:52) >> Mhm. (00:27:53) >> 99.999% (00:27:55) of the time working as advertised. And (00:27:58) so it takes a lot of technology to make (00:28:00) that car or make that AI work as (00:28:03) advertised. And I'm really glad that in (00:28:05) the last couple two three years the (00:28:07) industry has invested so much in (00:28:09) enhancing the functionality of the AI as (00:28:12) advertised. And I think if if we were to (00:28:14) to look at the next 10 years, we have so (00:28:17) much work to do to make it work as (00:28:21) advertised. Meanwhile, as as you know, (00:28:23) you both of you invest so much in in the (00:28:25) in the ecosystem, you see so many (00:28:28) companies being built for um synthetic (00:28:31) data generation so that the AIs could be (00:28:34) more grounded uh more diverse uh less (00:28:38) biased more safe uh you're investing in (00:28:40) a whole bunch of companies in cyber (00:28:42) security using AI for cyber security you (00:28:44) right people think that there's this AI (00:28:47) um the marginal cost of AI is going to (00:28:49) go go down significantly and it is (00:28:52) >> and therefore the AI is going to be (00:28:54) dangerous. It's exactly the opposite. If (00:28:57) the marginal cost of AI is going to go (00:28:59) down significantly, that one AI is going (00:29:01) to be monitored by millions of AIS. (00:29:04) >> Mhm. (00:29:04) >> And more and more AI is going to be (00:29:06) monitoring monitoring each other. People (00:29:08) don't can't forget that an AI is not (00:29:10) going to be an agent by itself. It's (00:29:12) likely the AI is going to be surrounded (00:29:14) by agents monitoring it. And so it's no (00:29:17) different than if the if the marginal (00:29:19) cost of of keeping society safe was (00:29:22) lower. We have police in every corner. (00:29:25) >> So one thing that that we were talking (00:29:27) about a little bit earlier was just the (00:29:28) cost of AI and how it's been coming (00:29:30) down. And so (00:29:31) >> I I think um in 2024 the the cost of (00:29:35) GPT4 equivalent models if you look at a (00:29:37) million tokens it came down over 100x. (00:29:40) Um you know somebody in my team did this (00:29:42) analysis to show that. Uh so the costs (00:29:44) are dropping pretty dramatically and (00:29:45) very rapidly and part of it is all the (00:29:47) advancements you all have been driving (00:29:48) on and the Nvidia level but also just (00:29:49) across the stack getting big efficiency (00:29:52) gains. (00:29:52) >> Yeah. (00:29:53) >> Um at the same time model companies are (00:29:54) talking about how the costs are rising (00:29:56) how there's enormous sort of capital (00:29:58) modes to building these things out. How (00:30:00) do you think about cost of training and (00:30:01) cost of inference over time and what (00:30:03) that means for the average end user or (00:30:05) the average startup company trying to (00:30:07) compete or people trying to do more in (00:30:08) this industry? I forget the statistic (00:30:10) that but but you know Andre Andre (00:30:12) Cararpathy um estimated the cost of (00:30:15) building the first chatbt I think (00:30:18) >> versus now I think you could do that on (00:30:20) the PC now. (00:30:20) >> Yeah. Yeah. It's probably tens of (00:30:22) thousands of dollars at this point or (00:30:23) maybe even less. (00:30:24) >> Right. And so it costs nothing. (00:30:26) >> Mhm. (00:30:26) >> And and (00:30:27) >> he has an open source project that you (00:30:28) can do in a weekend. (00:30:29) >> Oh, is that right? Okay. That's (00:30:31) incredible. Right. We're talking about (00:30:32) three years. Mhm. (00:30:33) >> Mhm. (00:30:34) >> What people people said cost billions of (00:30:37) dollars (00:30:39) um supercomputers built raising billions (00:30:42) of dollars in order to do all that now (00:30:45) >> cost you know something that you can do (00:30:47) on a weekend on a PC. And so that tells (00:30:50) you something about how quickly we're (00:30:52) making making AI more cost effective (00:30:54) >> or Spark sorry probably not quite a PC. (00:30:56) >> Okay. Not quite a PC. Yeah. We're (00:30:59) improving our architecture and (00:31:03) performance (00:31:04) um every single year. The first GBTU I (00:31:07) think was trained on Voltus. (00:31:08) >> Mhm. (00:31:09) >> And then uh Ampear (00:31:11) um you know and and it wasn't I think (00:31:14) the first breakthroughs none of it (00:31:16) included Hopper. (00:31:17) >> Mhm. (00:31:18) >> And um of course Hopper the last couple (00:31:20) two three years and um uh we're off in (00:31:22) Blackwell for the last year and a half (00:31:24) or so. And um every single one of these (00:31:27) generations the architecture improves (00:31:30) and of course the number of transistors (00:31:32) go up and uh the capacity goes up every (00:31:35) single generation very easily every (00:31:37) every single year from a computing (00:31:39) perspective. The combination of all that (00:31:41) getting 5 to 10x every single year (00:31:44) >> is not unusual. And here comes Reuben (00:31:46) just around the corner. And so we're (00:31:48) seeing 5 to 10x every single year. Well (00:31:51) compounded it's incredible. Moore's law (00:31:54) was two times every year and a half (00:31:56) >> and over the course of five years is 10x (00:31:58) over the course of 10 years is 100x (00:32:01) >> in the in the in the case of AI over the (00:32:03) course of 10 years is probably 100,000 (00:32:05) to a millionx okay and that's just the (00:32:08) hardware (00:32:09) >> then the next layer is the algorithm (00:32:11) layer and the model layer the (00:32:13) combination of all that the fact that if (00:32:16) you were to tell me that in the cost in (00:32:17) the in the in in the span of you know 10 (00:32:20) years we're going to reduce the cost of (00:32:21) token generation about a billion times. (00:32:23) I would not be surprised. (00:32:24) >> Mhm. (00:32:25) >> Okay. And so that's the tokconomics (00:32:28) of of of AI. On the training side, it's (00:32:31) not quite as aggressive in in cost (00:32:33) reduction, but it's close. If you were (00:32:35) to say that that every single year we're (00:32:38) increasing by two or 3x over the course (00:32:40) of 10 years, incredible. But the (00:32:43) important idea is when somebody says it (00:32:45) cost $und00 million to train something (00:32:48) or half a billion dollars to train (00:32:50) something. (00:32:51) Well, next year it's 10 times less. Next (00:32:54) year it's 10 times. (00:32:54) >> For people to scale these things up, (00:32:56) though, right? So the counter argument (00:32:57) is, well, we'll just get bigger every (00:32:58) year by 10x or 100x or, you know, we'll (00:33:01) try to offset that decrease in cost by (00:33:03) scale (00:33:05) >> and others can't keep up. (00:33:06) >> Yeah. But really what's happening is is (00:33:09) you're and and this is where come in as (00:33:12) you know the scale went up by a factor (00:33:14) of 10 but the computational burden did (00:33:16) not go up by a factor of 10 because (00:33:18) you're getting the compounded benefits (00:33:20) of all three things. The hardware is (00:33:22) going up the the algorithms of the (00:33:25) training models are going up and of (00:33:26) course the model architecture is going (00:33:28) up and we're getting the benefit of (00:33:29) learning from each other. This is, you (00:33:31) know, let's face it, Deep Seek was (00:33:33) probably the single most important paper (00:33:37) that most Silicon Valley researchers (00:33:39) read from in the last couple years. (00:33:41) >> It was the only thing that felt frontier (00:33:43) that was open. (00:33:44) >> That's right. (00:33:45) >> In years, (00:33:47) the value of open source again putting (00:33:49) out these papers. (00:33:49) >> Literally, Deep Seek (00:33:51) >> benefited American startups and American (00:33:54) AI labs all over (00:33:55) >> and infrastructure companies (00:33:56) >> and infrastructure company all over. (00:33:59) probably the single greatest (00:34:00) contribution to American AI last year. (00:34:03) >> And so if you said this out loud, of (00:34:05) course, you know, people (00:34:08) >> kind of shudder um that we're uh (00:34:11) American AI is actually getting learning (00:34:13) from and benefiting from uh AI from (00:34:16) other nation. But why would that be (00:34:18) surprising? You know, AI researchers in (00:34:20) all over America, all over America are (00:34:22) uh Chinese natives and come from (00:34:24) different countries. We benefit from (00:34:26) every country. become benefit from every (00:34:28) researcher and no all of the world's (00:34:31) ideas don't have to come from the United (00:34:33) States and so I I think um back to your (00:34:36) your original question it is the case (00:34:39) that (00:34:40) you know some of the narratives around (00:34:42) around the cost of AI is about scaring (00:34:45) everybody out of the market you know (00:34:47) nobody ought to do pre-training but us (00:34:49) nobody should do you know training these (00:34:51) frontier models but us because the (00:34:53) because of innovation of models (00:34:56) algorithms (00:34:57) and the computing stack, the cost of AI (00:35:00) is actually decreasing well more than (00:35:03) 10x every single year. And so if you're (00:35:04) just one year behind or even six months (00:35:06) behind, you could you could really stay (00:35:08) close. (00:35:09) >> And I think one thing that felt very (00:35:11) different to me about 2025 is um Ilia uh (00:35:15) said recently that uh you know we're in (00:35:17) the age of research again versus an age (00:35:20) of scaling. I think both things are (00:35:21) happening by the way. Everybody is also (00:35:23) trying to scale on multiple dimensions. (00:35:24) >> Yeah, exactly. both are happening. (00:35:26) >> You know, being 6 months behind or being (00:35:28) at 100 versus a 200k cluster, I think (00:35:31) matters if you are competing (00:35:33) symmetrically, but now you have people (00:35:35) from Frontier Labs or um at the very top (00:35:38) of the game who have very different (00:35:40) ideas about how to progress from here or (00:35:42) who are working on diversity of (00:35:43) problems, right? Uh and and I I think (00:35:45) that felt different from 24 maybe where (00:35:48) there was a lot of energy focused on (00:35:50) just pre-training scale and LLM. (00:35:52) >> Yeah. And several several other (00:35:54) dynamics. Um, as the market grows, each (00:35:57) one of these models could choose to have (00:36:00) verticals (00:36:02) >> or segments where they want to (00:36:03) differentiate. (00:36:04) >> Somebody could decide to be a better (00:36:06) coder. Somebody could decide to be just (00:36:09) better at being easier to be accessible (00:36:11) so that it could be a greater consumer (00:36:13) product. (00:36:14) >> You know, the diversity of these models. (00:36:15) As a result, you could you could (00:36:18) probably make a niche leap without (00:36:22) having to be great at everything else (00:36:23) and still be super valuable to the (00:36:25) market. (00:36:26) >> It's no longer necessary to boil the (00:36:28) entire ocean. The f two years ago, (00:36:31) because it was called pre-training pre, (00:36:34) you know, people people said, well, you (00:36:36) know, pre-training is over. First of (00:36:38) all, pre-training is not over. But the (00:36:41) point of pre-training is to train (00:36:43) yourself for training. That's why it's (00:36:45) called pre-training to prepare yourself (00:36:47) to do the real training. And now we call (00:36:49) it post-training. It's kind of weird. I (00:36:52) I think it's just training, but (00:36:54) pre-training is pre-training and (00:36:55) therefore it's training. Training as you (00:36:58) as as we all know is is where uh compute (00:37:02) scaling directly translates to (00:37:04) intelligence. You you've you've largely (00:37:07) now now this the the data the the data (00:37:10) necessary to train a model is actually (00:37:12) pretty small. Maybe it's just the (00:37:14) verifiable results. Now it's really (00:37:16) algorithmic, very compute intensive and (00:37:19) so and you don't have to be good at (00:37:20) everything in life as you know just like (00:37:23) all of us we don't we could decide (00:37:24) because we don't have time to learn (00:37:26) everything equally well. We decide to (00:37:28) choose a specialty and focus all of our (00:37:31) energy on it and we become superhuman or (00:37:33) incredibly good at something that other (00:37:35) people are not. And so I think AI labs (00:37:37) are going to start doing the same. (00:37:39) They're going to start bifurcating into (00:37:41) various segments and over time you're (00:37:44) gonna and startups will do the same. (00:37:46) >> They'll find a micro niche and they'll (00:37:47) take something open and then be (00:37:49) incredibly good at it. (00:37:50) >> Well, I think one of the most optimistic (00:37:51) views here is uh actually that these (00:37:53) microniches are quite valuable, right? I (00:37:56) was talking to Andre um because I've (00:37:57) been talking to a lot of people about (00:37:58) their predictions for next year. We'll (00:38:00) ask you yours as well of course. Um um (00:38:03) but he asked you know what is a what's (00:38:05) an example of a prediction that would (00:38:06) have been preient last year uh and my (00:38:10) answer everything's easy in retrospect (00:38:12) is that coding would be the first (00:38:14) application level business that gets to (00:38:16) a billion of AR as an AI native app (00:38:18) right and I I think if you taken an old (00:38:21) world view of this (00:38:23) >> um you would have believed like one of (00:38:24) two narratives right one is uh single (00:38:28) model does everything and it'll all just (00:38:29) be subsumed into something monolithic (00:38:31) Mhm. (00:38:31) >> And two is that developer tools never (00:38:33) get very big, right? Well, kind of (00:38:35) depends on how valuable the developer (00:38:36) tool is. Now, I think many more people (00:38:38) understand software engineering is in a (00:38:40) niche and there's more demand than ever (00:38:41) for it, (00:38:42) >> but I think we'll see more like that. (00:38:44) >> Also interesting, uh we are using we we (00:38:47) use cursor here and we use cursor (00:38:50) pervasively here. Every engineer uses it (00:38:52) and the number of engineers, you just (00:38:53) mentioned it, the number of people we're (00:38:55) hiring today is just incredible. (00:38:56) >> Yep. (00:38:57) >> Right. Monday is come to work at Nvidia (00:38:59) day and and um uh why is that? Uh this (00:39:03) is now the purpose and the task. (00:39:05) >> The purpose of a software engineer is to (00:39:08) solve known problems (00:39:10) and to find new problems to solve. (00:39:15) Coding is one of the tasks. (00:39:18) >> And so if the purpose is not coding, if (00:39:21) your purpose literally is coding, (00:39:23) somebody tells you what to do, you code (00:39:24) it. All right? Maybe you're going to get (00:39:26) replaced by the AI. But most of our (00:39:27) software engineers, all of our software, (00:39:29) their goal is to solve problems. And it (00:39:32) turns out we have so many problems in (00:39:33) the company and we have so many (00:39:34) undiscovered problems. And so the more (00:39:37) time they have to go explore (00:39:38) undiscovered problems, the better off we (00:39:40) are as a company. Nothing would give me (00:39:42) more joy than if none of them are coding (00:39:44) at all. They're just solving problems. (00:39:46) >> You see what I'm saying? And so I I (00:39:48) think that this framework of purpose (00:39:50) versus task is really good for everybody (00:39:52) to apply. For example, somebody who's a (00:39:54) waiter, their job is to not to take the (00:39:59) order. That's not their job. As it turns (00:40:00) out, their job is so that we have a (00:40:02) great experience. And if somebody if (00:40:04) some AI is taking the order, their job (00:40:07) or even delivering the food, their job (00:40:09) is still helping us have a great (00:40:11) experience. They they would reshape (00:40:13) their jobs accordingly. And so so I (00:40:16) think the um the question about about (00:40:19) cost of compute um uh is really (00:40:23) important. Let's let let me come back to (00:40:25) one the the reason why we are so (00:40:28) dedicated to a programmable architecture (00:40:32) versus a fixed architect. Remember a (00:40:34) long time ago (00:40:35) >> uh a CNN chip came along and they said (00:40:37) Nvidia is done. (00:40:39) >> And then and then a transformer chip (00:40:41) came and Nvidia was done. (00:40:42) >> People are still trying that. Yes. (00:40:43) >> Yeah. NP and and the benefit of these (00:40:46) dedicated AS6 of course it could perform (00:40:49) a job really really well and (00:40:50) transformers is a much more universal AI (00:40:55) network but the transformer as you know (00:40:58) the species of it is growing incredibly (00:41:00) >> the attention mechanism (00:41:01) >> the attention mechanism how it thinks (00:41:03) about context (00:41:05) diffusion versus auto reggressive (00:41:08) >> these hybrid SSM transformation (00:41:10) >> hybrid SSM for example Neotron we just (00:41:12) announced a new hybrid SM SM and and so (00:41:15) the architecture of transformer is in (00:41:18) fact changing very rapidly and over the (00:41:20) next several years it's likely to change (00:41:21) tremendously and so we we dedicate (00:41:24) ourselves to an architecture that's (00:41:25) flexible for this reason so that we can (00:41:27) on the one hand adapt with remember (00:41:30) because MOS law is largely over (00:41:32) transistor benefit is only tens 10% (00:41:37) maybe a couple of years (00:41:39) >> and yet we would like to have hundreds (00:41:40) of X every year and so the benefit is (00:41:44) actually all in algorithms and an (00:41:46) architecture that enables any algorithm (00:41:48) is likely going to be the best one right (00:41:50) because the transistor didn't it didn't (00:41:52) advance that much and so I I think the (00:41:54) the our dedication to programmability is (00:41:57) number one for that reason we have so (00:41:58) much optimism for innovation and (00:42:01) algorithms and iteration software that (00:42:03) we protect our programmability for that (00:42:06) reason the second thing is is by (00:42:09) protecting this architecture (00:42:11) our installed base is really large. When (00:42:13) a software engineer wants to optimize (00:42:16) their algorithm, they want to make sure (00:42:18) that it doesn't run on just one this one (00:42:20) little cloud or this one little stack. (00:42:21) They want it to run on as many mo on as (00:42:24) many computers as possible. So the the (00:42:25) fact that we protect our architecture (00:42:27) compatibility then flash attention runs (00:42:30) everywhere. So SSM run everywhere, (00:42:32) diffusion runs everywhere, auto (00:42:34) reggression runs everywhere. Just (00:42:36) depending it doesn't matter what you (00:42:37) want to do. CNN still run everywhere. (00:42:39) LSTM still runs everywhere. And so that (00:42:41) this this architecture that is (00:42:43) architecturally compatible so that we (00:42:45) have a large installed base programmable (00:42:47) for the future is really important in (00:42:50) the way that we help to advance and as a (00:42:52) result all of this drives the cost down (00:42:55) [clears throat] and and I'm super proud (00:42:56) that that um uh our latest innovation (00:42:59) MVLink72 (00:43:01) we're the lowest cost token generation (00:43:04) machine in the world by enormous amounts (00:43:07) and the reason for that is because are (00:43:10) really really hard (00:43:11) >> and so you know people didn't expect (00:43:13) that um that forees it's probably easier (00:43:17) to train but for inference is incredibly (00:43:19) hard to generate tokens on but as as (00:43:22) cost drop usually you open up new (00:43:23) applications or new verticals that (00:43:25) become more and more accessible (00:43:27) >> and we talked a little bit about coding (00:43:28) like cursor and cognition and other (00:43:30) companies that are really benefiting (00:43:31) from that in this last year do you have (00:43:32) any thoughts or predictions in terms of (00:43:34) what the next breakthrough industries (00:43:35) will be or new applications or areas (00:43:37) that you're most excited about coming in (00:43:38) 26 in particular like Are there one or (00:43:40) two things that you think will (00:43:41) >> because of three things I because of (00:43:44) because of a couple two three things I I (00:43:47) think I think several industries are (00:43:49) going to are going to experience their (00:43:50) chat moment. Um I believe that (00:43:54) multi-modality (00:43:57) and um very long context is going to (00:44:01) enable of course really really cool chat (00:44:04) bots. Um but the basic architecture that (00:44:08) in combination with breakthroughs in (00:44:10) synthetic data generation is going to (00:44:12) help create the chat GPT moment for (00:44:16) digital biology. (00:44:18) >> That moment is coming. (00:44:19) >> And by digital biology, do you (00:44:20) specifically mean other aspects of like (00:44:23) protein folding or protein binding or (00:44:24) protein diagnosis? I see proteins. (00:44:26) >> I think we're good at protein (00:44:28) understanding. Mhm. Now multi-proin (00:44:30) understanding is coming online and we (00:44:33) recently created a model called LA (00:44:35) prina. It's open. Um it's for (00:44:37) multi-proin (00:44:39) >> um understanding and and represent (00:44:41) representation learning and generation. (00:44:43) Uh so so I think that the protein (00:44:46) understanding is is advancing very (00:44:47) quickly. Now protein generation is going (00:44:49) to advance very quickly. Chat GPD moment (00:44:52) proteins. (00:44:52) >> Yeah. There are a lot of interesting (00:44:53) companies working on molecule design in (00:44:55) endtoend way like chai. (00:44:57) >> Exactly. And then and then of course (00:45:00) chemical understanding and chemical (00:45:01) generation and then protein chemical (00:45:05) >> confirmation understanding and (00:45:07) generation. Is that right? And so that (00:45:09) combination the chat GBT moment the (00:45:11) generative AI moment all of that stuff (00:45:13) is coming together for for um digital (00:45:14) biology (00:45:15) >> and to your to your point about like new (00:45:17) industries or you know the way I think (00:45:19) about it is like investing in the inputs (00:45:21) for this AI as well. All of these things (00:45:23) around biology and chemistry and (00:45:25) material science, they require real (00:45:27) world data generation and (00:45:28) experimentation, right? And that's new (00:45:30) infrastructure too. (00:45:30) >> New infrastructure, uh, synthetic data (00:45:33) is going to be really important because (00:45:34) they just have such sparse, right? Spar (00:45:36) sparity of data and they just don't have (00:45:38) as much as human language. And there the (00:45:40) the real breakthrough is going to be (00:45:42) when we can train a a world foundation (00:45:45) model, a foundation model for proteins, (00:45:48) a foundation model for cells. I'm I'm (00:45:50) very excited about both of those things. (00:45:52) Once we have a a foundation model, our (00:45:55) understanding capability, our generative (00:45:57) capability, that data flywheel is really (00:45:59) going to take off. (00:46:00) >> The this this the second area that I'm (00:46:02) excited about, um, of course, reasoning (00:46:04) made huge breakthroughs in language, but (00:46:06) because of reasoning, cars are going to (00:46:09) be able to perform better. So, instead (00:46:10) of just perception cars and planning (00:46:13) cars, they're going to be reasoning (00:46:14) cars. So, these cars are going to be (00:46:16) thinking all the time. And when they (00:46:17) come up they come up to a circumstance (00:46:19) they they've never en encountered before (00:46:21) they can break it down into (00:46:22) circumstances they have encountered it (00:46:24) before and construct a reason reasoning (00:46:28) system for how to navigate through it. (00:46:30) And so the out of domain out of you know (00:46:33) out of distribution (00:46:35) >> part of AI is going to very much be be (00:46:38) addressed by reasoning systems or and as (00:46:41) a result we could do more things than we (00:46:43) were taught to do between uh generative (00:46:44) AI uh and um multimodal uh you know (00:46:49) vision language action models and (00:46:51) reasoning systems. I think we're going (00:46:53) to see big breakthroughs in human robots (00:46:55) or multi-mbodiment robots. you know does (00:46:58) >> what do you think what do you think is a (00:46:59) time frame for that because if you look (00:47:00) at the self-driving analog and obviously (00:47:02) self-driving technologies were based on (00:47:05) very different types of neural networks (00:47:06) than what we're using today in terms of (00:47:08) you know there's been a big swap over (00:47:09) the last two three years (00:47:11) >> in terms of how we do a lot there (00:47:12) >> we started too soon (00:47:14) >> self-driving cars really had four eras (00:47:17) era was smart sensors (00:47:20) >> connected into a car (00:47:22) >> the mobile [clears throat] eye era (00:47:23) >> the mobile eye era and even even the (00:47:25) very earliest days of of (00:47:27) Yeah. (00:47:28) >> Yeah. Even the earliest days of Whimo, (00:47:30) >> the the um you're talk you're using (00:47:32) smart sensors um a lot of human (00:47:35) engineered algorithms (00:47:36) >> and education severe mapping as far (00:47:40) >> extreme mapping (00:47:41) >> mapping and then different systems for (00:47:42) planning and perception. (00:47:44) >> Exactly. And so so you're essentially (00:47:46) creating a car that is driving on (00:47:49) digital rails, right? It's no different (00:47:50) than than the rails at Disneyland. There (00:47:53) are digital rails. And so that's the (00:47:54) first generation. the second generation. (00:47:57) Um and during that generation you have (00:47:59) perception, world model and planning. (00:48:01) >> Mh. (00:48:02) >> And and the these modules um and each (00:48:05) one of these modules have the limits of (00:48:07) their technology and and perception was (00:48:09) first imple was was first affected by (00:48:11) deep learning uh first and then and then (00:48:14) uh and then it propagated through the (00:48:15) pipeline. (00:48:16) >> And so that but that system was too (00:48:18) brittle (00:48:19) >> and it only knows how to perform what (00:48:20) you taught it. And now where we are are (00:48:23) endtoend models and then and then where (00:48:27) we're going to go next are end to end (00:48:28) models. There you go. So that those are (00:48:31) kind of the four eras in a lot of ways. (00:48:34) If we would have started self-driving (00:48:35) cars probably three years ago, (00:48:38) >> we would probably be exactly the same (00:48:40) place. (00:48:41) >> All our poor friends who were working in (00:48:42) self-driving. Yeah. (00:48:43) >> And and I don't I don't mind it. I've (00:48:44) been working on on it for 10 years. (00:48:46) Nvidia's self-driving car stack, by the (00:48:48) way, number one rated safety in the (00:48:51) world today. (00:48:53) >> Number one, we just got we just got that (00:48:55) rating today uh last week. And number (00:48:57) two is Tesla. So, I'm very proud that (00:48:59) two American companies are up on the (00:49:00) >> Are you um So, from a robotics (00:49:02) perspective, you think because we've (00:49:03) already built all these sorts of (00:49:04) technologies in the modern era, robotics (00:49:06) won't have the same 10, 15 years. That's (00:49:10) right. (00:49:11) >> I'm much more optimistic with robotics (00:49:13) because we we've kind of (00:49:15) >> advanced foundational technology. (00:49:18) >> Now, you know, people are thinking about (00:49:20) human robotics. Human robotics has a lot (00:49:22) of challenges. I mean, there's all the (00:49:24) megatronics challenges there. You know, (00:49:26) like for example, (00:49:27) >> it's not helpful if the robot weighs 300 (00:49:29) lb (00:49:30) >> and what happens if it falls over and (00:49:32) interacting with kids and so on so (00:49:33) forth. And so, so you got all kinds of (00:49:35) challenges to deal with. I'm certain (00:49:37) that we're going to we're going to solve (00:49:38) those. But remember the fundamental (00:49:40) technology that goes into a human robot (00:49:42) robot can go into a pick and place (00:49:44) robot. (00:49:45) >> Um it could be it could be um how do you (00:49:48) think about one thing I've been curious (00:49:49) about for robotics in particular is if I (00:49:51) look at who won or who who who's (00:49:53) perceived as winning in self-driving. (00:49:55) >> It's largely incumbents, right? It's (00:49:57) Whimo, it's Tesla. You mentioned uh the (00:49:59) safety rating Nvidia's gotten. And so (00:50:01) it's people who've been working on this (00:50:02) for a long time. It took a lot of (00:50:04) capital. It was really intensive to get (00:50:05) there. You have supply chain, you have (00:50:06) hardware, you have all this extra (00:50:07) complexity. Do you think the same thing (00:50:09) will be true in robotics? Are the (00:50:10) winners basically going to be Tesla with (00:50:12) Optimus and other people who have both (00:50:14) been in the industry for a while but (00:50:16) also have all those sort of incumbent (00:50:17) effects? Do you think there's room for (00:50:18) startups? (00:50:19) >> They will be one of the leader one of (00:50:20) the one of them and and and (00:50:24) surely a major one. Um but everything (00:50:28) that moves will be robotic. (00:50:31) >> Everything that moves will be robotic. (00:50:33) And everything that moves is a very (00:50:35) large space. It's not all human or (00:50:37) robot. And yet every AI will be (00:50:40) multi-mbodiment meaning you know just (00:50:43) like just like a human with our m our (00:50:46) multi-mbodiment (00:50:48) AI ourselves (00:50:49) >> we could sit in a car (00:50:51) >> and embody that (00:50:53) >> we could pick up a tennis racket embody (00:50:54) that we could pick up a chopstick embody (00:50:56) that (00:50:57) >> and so we could embody the (00:50:58) >> people are general purpose right they (00:51:00) can do all these things (00:51:00) >> exactly and so AIS are going to become (00:51:02) general purpose so you have one arm pick (00:51:04) and place maybe it's two arms pick and (00:51:06) place could be six arms pick and place, (00:51:08) you know. So, so I think you're going to (00:51:09) have all kinds of different sizes and (00:51:11) shapes. It could be a caterpillar. It (00:51:12) could be, you know, it could be an (00:51:13) excavator. It could be all kinds of (00:51:15) stuff. And so AI will embody those just (00:51:17) as a just as a a construction worker (00:51:20) embodies an excavator embodies a (00:51:22) tractor. You know, they you know, (00:51:25) >> could there be a small number of (00:51:26) companies then that do the embodiment (00:51:28) for everything or are you saying more (00:51:29) there's going to be niche applications? (00:51:30) You should definitely see a lot of (00:51:31) software companies and then those that (00:51:33) software (00:51:34) company could serve a lot of a lot of (00:51:36) different (00:51:37) >> verticals but each one of the verticals (00:51:39) will still have solution providers that (00:51:41) then grounds it all turns it into (00:51:43) something that works perfectly. Does it (00:51:45) make sense? Because in the case of AI (00:51:47) for consumers if it works 90% of the (00:51:49) time you're delighted you you're you (00:51:51) know you're mind blown. If it works 80% (00:51:53) of the time you're satisfied. In the (00:51:55) case of most industrial and physical (00:51:57) AIs, if it works 90% of the time, nobody (00:52:00) cares about that. They only care about (00:52:01) the 10% that it fails. Basically, you (00:52:03) know, 100% dissatisfaction. And so, you (00:52:06) got to take it to 99.99999. (00:52:09) So, the core technology might be able to (00:52:11) get get you to 99%. (00:52:13) >> And then a vertical solution provider (00:52:15) like a Caterpillar or somebody, they (00:52:17) could take that core technology and make (00:52:19) it 99.999% (00:52:21) great. Do you think that's what happens (00:52:23) like earliest on because in in markets (00:52:25) that are this immature it seems one of (00:52:27) the fastest paths to market could be (00:52:28) full verticalization right because you (00:52:30) just have control of iteration speed (00:52:33) >> the different the the difficulty (00:52:34) difficulty of of verticalization for (00:52:37) technology that that is general purpose (00:52:39) is that you don't have the R&D scale to (00:52:42) build a general purpose technology. Now, (00:52:43) of course, open source helps that (00:52:45) tremendously, (00:52:47) >> which is the reason why you're going to (00:52:48) see a, you know, a a big surge of (00:52:51) vertical (00:52:52) opportunities in AI in the next several (00:52:54) years. (00:52:55) >> My my prediction would be over the (00:52:57) course of the next five years, the (00:52:59) excitement is going to be (00:53:00) verticalization. (00:53:02) >> Notice we we're excited about Open (00:53:05) Evidence, we're excited about Harvey, (00:53:06) we're excited about Cursor. cursor is is (00:53:09) a horizontal but it's kind of a (00:53:11) horizontal vertical (00:53:12) >> you know and so um I'm I'm super excited (00:53:14) about all the verticals (00:53:16) >> you know a lot of people said yeah AI is (00:53:18) gonna get so god AI is going to get so (00:53:19) good that all these rapper companies are (00:53:22) going to be obsolete it's just it misses (00:53:24) the big point (00:53:25) >> you know the reason why you could talk (00:53:27) about the reason why somebody can talk (00:53:29) talk about somebody is creating (00:53:32) technology could talk about the life of (00:53:33) a surgeon is because they've never been (00:53:34) a surgeon the reason why somebody who (00:53:36) builds at AI and talk talks about the (00:53:38) life of a accountant and a tax, you (00:53:40) know, a tax expert because they've never (00:53:42) been a tax expert, you know, and so so I (00:53:45) I think they just the reason why (00:53:47) somebody could talk about being a bus (00:53:48) boy without being a bus boy is they (00:53:50) never been a bus boy. And so so I I (00:53:51) think you you you've got to be a little (00:53:54) bit more empathetic about the depth of (00:53:55) the complexity of the work (00:53:57) >> and and tr try to truly understand the (00:53:59) purpose of the work. Often times the the (00:54:01) technology addresses the task, it (00:54:04) doesn't address the purpose. So I guess (00:54:07) one of the other narratives from we're (00:54:09) looking at narratives that are true (00:54:10) versus not true, you know, for 25. One (00:54:13) other narrative that's come up has been (00:54:14) more about energy and energy utilization (00:54:16) and will we have enough energy to (00:54:18) support AI. How do how do you think (00:54:20) about that? On the first week of (00:54:22) President Trump's administration, he (00:54:23) said drill, baby drill. He got so much (00:54:25) flack for that. (00:54:27) If not for this entire change in in (00:54:31) sentiment about energy growth in our (00:54:33) country, (00:54:34) >> we can all concede now we would have (00:54:38) handed this industrial revolution to (00:54:40) somebody else. (00:54:42) >> And we're still power constrained. (00:54:43) >> We're still power constrained. Yeah. (00:54:45) >> Without energy, there can be no new (00:54:48) industry. (00:54:49) >> Mhm. And of course, we've been energy (00:54:52) starved now for what, a decade. If not (00:54:54) for the fact that President Trump (00:54:56) reversed that narrative, we would be (00:54:58) completely screwed. (00:55:00) >> Mhm. (00:55:00) >> Without energy, you can't have (00:55:02) industrial growth. Without industrial (00:55:04) growth, the the nation can't be more (00:55:06) prosperous. Without being more (00:55:08) prosperous, we can't take care of (00:55:09) domestic issues. We can't take care of (00:55:11) social issues. You know, on and on and (00:55:13) on. And so, the fact of the matter is, (00:55:15) we need energy to grow. We need every (00:55:17) form of energy. We need, you know, (00:55:19) natural gas. We need to be, of course, (00:55:21) we need more energy on the grid. We need (00:55:23) more energy behind the meter. Uh we're (00:55:25) going to need nuclear. Uh wind is not (00:55:28) going to be enough. Solar is not going (00:55:29) to be enough. Let's just all acknowledge (00:55:31) that we'll take it. We'll take (00:55:32) everything we can. Um but the fact that (00:55:34) matters, I think, for the for the next (00:55:36) decade, (00:55:37) >> natural gas, you know, is probably the (00:55:40) the only way to go forward. What's (00:55:42) really interesting is I I agree the (00:55:43) timeline is too far out to address (00:55:46) people's um you know power generation (00:55:48) issues in 27 and 28 where uh you know (00:55:51) large players building clusters are very (00:55:53) concerned but the the biggest drivers of (00:55:57) like climate innovation in the US have (00:56:00) actually been as a result of this AI (00:56:02) infrastructure problem right because (00:56:04) people look at the demand (00:56:05) >> finally that's right demand (00:56:07) >> they look at the demand and the demand (00:56:09) is driving people to create massive of (00:56:11) new battery companies, solar (00:56:13) concentrators. It's put new energy be (00:56:15) new energy like you know willpower (00:56:18) behind (00:56:18) >> SM the AI industry is driving all of (00:56:22) that sustainable energy industry. (00:56:24) >> Yeah. (00:56:24) >> Um because people see that there is (00:56:26) going to be demand for it right so even (00:56:28) if and I think there is no practical (00:56:30) answer in the small number of years time (00:56:32) frame versus uh large gas right um uh it (00:56:37) still drives climate innovation. Yeah, (00:56:38) no question about it. No question about (00:56:40) it. And I I think that's exactly right (00:56:42) that that you know doomer messages um (00:56:46) causes policy and that policy may may (00:56:50) affect the industry in some way. But (00:56:52) there's nothing more powerful than (00:56:53) demand. Look at all the jobs that's (00:56:55) being created. Look at all the the (00:56:56) industries that's being formed around (00:56:57) it. um sustainable energy likely and (00:57:00) when history rewrites it as Sarah, I (00:57:02) think you you're going to be absolutely (00:57:03) right that that if not for AI, well AI (00:57:08) was is probably the biggest driver for (00:57:10) sustainable energy ever. (00:57:11) >> Yeah. A friend of mine has a saying that (00:57:13) uh doomers are the people who sound (00:57:14) smart at dinner parties and optimists (00:57:16) are the people who drive humanity (00:57:17) forward. And I think that's very true (00:57:18) for for all these things we've talked (00:57:20) about. Yeah. So (00:57:21) >> yeah, it's really true. (00:57:22) >> Yeah. Well, that that's one of the big (00:57:24) big um takeaways for for uh this last (00:57:27) year, the battle of narratives. (00:57:30) >> And it's too simplistic (00:57:33) um to say that everything that the (00:57:35) doomers are saying are irrelevant. (00:57:38) That's not true. A lot of very sensible (00:57:40) things are being said. Um it is too (00:57:42) simplistic to say that when somebody is (00:57:44) optimistic that they're just naive. (00:57:46) >> It needs to be grounded in reality. (00:57:48) Yeah, that optimistic people are just (00:57:50) naive, you know, (00:57:51) >> and that that's obviously not true. (00:57:54) >> Um, but I think we just have to be (00:57:55) mindful of the balance of it. (00:57:59) >> When 90% of the messaging is all around (00:58:02) the end of the world and doom and the (00:58:04) pessimism and you know, I think we we're (00:58:07) scaring people (00:58:08) >> from making the investments in AI that (00:58:11) makes it safer, more functional, more (00:58:13) productive (00:58:14) >> and more useful to society. And so we (00:58:16) just, you know, more secure. We, you (00:58:18) know, all of that takes technology. (00:58:20) Security takes technology. Safety takes (00:58:22) technology. I appreciate that my car is (00:58:24) safer today because it has better (00:58:26) technology than a car 50 years ago. (00:58:28) >> And so so I I think it takes technology (00:58:30) to be safe, technology to be secure. And (00:58:33) so I I'm I'm I'm delighted to see that (00:58:36) the the advancement of technology is (00:58:38) still accelerating and ongoing. And so (00:58:41) we just have to make sure that the the (00:58:42) policy makers around the world, the (00:58:44) governments um are able to are are (00:58:47) thinking about balancing these two (00:58:49) ideas. (00:58:50) >> How do you So I guess we've talked a lot (00:58:52) about 25 (00:58:53) >> and the narratives of 25. How do you (00:58:54) think about 26? What are you excited (00:58:56) about? What do you see coming? What do (00:58:58) you think are big changes that we should (00:58:59) be aware of? (00:59:00) >> I am optimistic that that um our (00:59:04) relationship with China will improve. (00:59:05) Mhm. [clears throat] (00:59:06) >> that President Trump and the (00:59:08) administration um has a really really (00:59:11) grounded and common sense um attitude (00:59:14) about um and philosophy around around (00:59:17) how to think about China that that (00:59:20) they're an adversary (00:59:22) >> um but they're also also a partner in (00:59:24) many ways and that the idea of (00:59:27) decoupling is naive and the idea of (00:59:30) decoupling um for whatever reason (00:59:33) philosophical reasons or national (00:59:34) security reasons It's just not not it's (00:59:37) not based on any common sense and the (00:59:39) more you the more deeply you look into (00:59:41) it the more the two countries are (00:59:44) actually highly coupled. (00:59:47) >> Um both countries ought to ought to (00:59:48) invest in their own independence. Um I (00:59:52) you know when you depend too much on (00:59:53) someone the relationship becomes too (00:59:55) emotional uh as you know [laughter] and (00:59:58) so it's good to have some independence (01:00:00) or as much independence as either either (01:00:02) would like but to recognize that there's (01:00:04) a lot of coupling a lot of dependence (01:00:06) between the two countries and and I (01:00:08) think there's a there needs to be a (01:00:10) nuanced strategy a nuanced attitude (01:00:13) about how to how to how to manage this (01:00:15) relationship in a productive way for all (01:00:17) of the people of two countries and for (01:00:20) all of the people around the world, (01:00:21) everybody depends on a productive, (01:00:24) constructive relationship of the two (01:00:26) most important nations and the single (01:00:29) most important relationship for the next (01:00:30) century. And so we have to find that (01:00:32) answer. And I'm I'm I I'm just really (01:00:35) delighted uh that President Trump is (01:00:37) looking for a constructive answer. And (01:00:40) so I I think that next year uh will be a (01:00:42) much better better better year than the (01:00:44) last several. I'm happy with the (01:00:46) administration was able to to to suggest (01:00:49) a a an export control um policy that is (01:00:54) grounded on national security (01:00:56) recognizing that they already make so (01:00:59) many chips themselves and they they can (01:01:01) depend on Huawei themselves for their (01:01:03) military for their national security. (01:01:05) they got ample technology to do that. (01:01:08) And so that American technology, (01:01:10) although general purpose um is unlikely (01:01:13) to be used by their military because (01:01:14) their military is too smart, just as our (01:01:16) military is too smart to to use their (01:01:18) technology. And so it's grounded on (01:01:20) national security. It's grounded on on (01:01:23) uh technology leadership. It's grounded (01:01:25) on national prosperity. You know, one of (01:01:28) the things that that we just always have (01:01:29) to remember is that the world's (01:01:31) mightiest military uh is supported by (01:01:34) the world's mightiest mil economy. And (01:01:37) so the wealth that we generate um brings (01:01:40) jobs home, creates prosperity in the (01:01:42) United States, um provides for tax (01:01:44) revenues, and ultimately funds the (01:01:47) mightiest military on the planet. And so (01:01:49) that circular system, that (01:01:51) interconnected system requires a nuanced (01:01:54) strategy. and and um uh and and and and (01:01:57) I'm I'm I'm pleased to to to to see some (01:02:00) of the progress in that area that allows (01:02:03) American technology companies to keep (01:02:05) America first and keep America ahead (01:02:08) >> and to to support American technology (01:02:10) leadership on the one hand um to win (01:02:13) globally (01:02:15) >> and and then and then China of course is (01:02:17) sorting itself out you know I mean not (01:02:19) sorting but they're sorting out the (01:02:21) attitude about how to think about (01:02:22) American technology and there (01:02:24) >> because historical argument there has (01:02:25) been that if if you look for example at (01:02:27) the internet um there was what was known (01:02:29) as a great firewall right China (01:02:31) basically (01:02:31) >> prevented US competition into China (01:02:33) while the opposite wasn't as true (01:02:36) >> um there's been mass expatriation of US (01:02:38) jobs and industry to China as sort of (01:02:40) part of the development of the 90s and (01:02:42) 2000s and so I think a lot of the things (01:02:43) that people have brought up from a China (01:02:45) US policy perspective besides just the (01:02:47) military adversarial relationship um or (01:02:50) spheres of influence or you know all the (01:02:51) various things like that is also just (01:02:53) the economic imbalances that have (01:02:54) perceived to exist between the two (01:02:56) countries. The way that I would think (01:02:58) through that is go back to the first (01:03:00) principles of technologies again (01:03:03) >> and and let's say the internet you have (01:03:05) the chip industry you have the systems (01:03:07) industry the software industry you have (01:03:09) the services industry on top remember (01:03:11) China's internet growth has been a boon (01:03:15) for Intel and AMD selling CPUs (01:03:18) >> Micron selling DRAM skinex and Samsung (01:03:21) selling DRAM (01:03:23) >> it is the second largest internet (01:03:26) market for American technology industry (01:03:29) >> and so so maybe maybe it wasn't helpful (01:03:32) to some layer of the stack (01:03:33) >> the Googles of the world (01:03:35) >> but don't exclude every layer of the (01:03:38) stack always come back every single one (01:03:40) of these things take a step back and (01:03:41) look at the whole stack (01:03:43) >> maybe that's a theme for today as well (01:03:44) and it makes sense that you would you (01:03:46) would send this message but you know (01:03:47) technology is actually not just the the (01:03:50) sort of internet software application (01:03:52) layer that's been very dominant for two (01:03:54) decades (01:03:54) >> it's the whole stack and Remember as as (01:03:57) as Intel and AMD prospered (01:04:01) >> uh with the internet industry uh in (01:04:03) China growth the China industry growth (01:04:05) don't forget China also contributed (01:04:07) tremendously to open source. No country (01:04:10) in the world contributes more to open (01:04:11) source than China. And look at all the (01:04:13) startups here in America that were able (01:04:15) to benefit from that open source to (01:04:17) create the the new startups that are (01:04:18) here. And so you can't look at one area (01:04:23) in isolation. You have to look at the (01:04:25) whole life cycle of the technology and (01:04:28) look at every layer of the stack. Does (01:04:29) it make sense? When you take a look at (01:04:31) that from that lens, (01:04:33) >> China's internet industry (01:04:36) generated enormous prosperity for (01:04:39) America. (01:04:40) >> Mhm. (01:04:41) >> Just not at the internet company per se. (01:04:44) >> Jensen, my other investor friends will (01:04:45) not forgive me if I don't ask you about (01:04:47) 2026 um uh on the business side. Uh are (01:04:52) we in an AI bubble? AI bubble. Yeah, (01:04:54) there's a lot of ways to reason through (01:04:55) that. (01:04:56) >> And so, so again, um, you know, when (01:05:00) when asked that question, my mind goes (01:05:02) to what is AI and where are we in that? (01:05:07) There's AI, (01:05:09) then there's computing. You know, as you (01:05:11) know, Nvidia invented accelerated (01:05:14) computing. Accelerated computing does (01:05:16) computer graphics and rendering. AI (01:05:17) doesn't. Um, accelerated computing does (01:05:20) data processing, SQL data processing. AI (01:05:22) doesn't. (01:05:23) >> Um, accelerated computing does molecular (01:05:25) dynamics and quantum chemistry. AI (01:05:27) doesn't. You know, all these are all (01:05:29) things that people could say someday AI (01:05:31) will, but it doesn't today. Accelerated (01:05:33) computing is really essential for uh (01:05:35) classical machine learning, XG boost, (01:05:38) recommener systems, the whole process of (01:05:40) uh feature engineering, extract, load, (01:05:42) and transform. That entire data science, (01:05:46) machine learning life cycle, accelerated (01:05:48) computing is used for all of that. The (01:05:50) first thing to go to is in the context (01:05:52) of Nvidia. (01:05:54) What we see is the the the dynamic is (01:05:57) the shift from general purpose computing (01:05:59) to accelerated computing because MOS (01:06:01) laws largely ended. You can't use CPUs (01:06:03) for everything anymore like you used to. (01:06:06) And so it's just no longer productive (01:06:09) enough. It's not deflationary enough. (01:06:12) >> And so so we have to move towards a new (01:06:14) computing model. And that's where (01:06:15) accelerator comes in. If you if (01:06:17) generative AI well excuse me if chatbots (01:06:20) let's just go you know open AI and (01:06:22) Anthropic and Gemini if none of that (01:06:24) existed today Nvidia would be a (01:06:27) multiundred billion dollar company and (01:06:29) the reason for that is because as you (01:06:31) know the foundation of computing is (01:06:33) shifting to accelerated computing (01:06:36) >> that's the first thing to to realize is (01:06:38) is to take a step back and ask yourself (01:06:40) what is actually happening now the next (01:06:43) layer up the question about AI now (01:06:45) becomes What is AI? Now, we ask that we (01:06:48) ask the AI bubble question and we always (01:06:51) go back to OpenAI's revenues 100%. Don't (01:06:54) we? (01:06:54) >> Mhm. (01:06:54) >> You ask somebody, hey, is there an AI (01:06:56) bubble? Everybody goes directly to (01:06:59) OpenAI's revenues. First of all, if (01:07:01) OpenAI currently has twice the capacity, (01:07:04) their revenues would double. You guys (01:07:05) know that if they have 10 times the (01:07:07) capacity, their I really believe their (01:07:09) revenues would 10 times. And so, they (01:07:11) need capacity. This is no different than (01:07:13) Nvidia needs wafers from TSMC. Just (01:07:16) because you know Nvidia exists and and (01:07:18) we're doing great doesn't mean we don't (01:07:19) need capacity. We need capacity. We need (01:07:21) capacity of DRAM. We need and so in our (01:07:23) world it's sensible to everybody. We (01:07:25) need capacity. Well, in their world they (01:07:26) need factories (01:07:27) >> and if they don't have factory capacity (01:07:29) how they generate tokens, which is where (01:07:30) we started our conversation today and so (01:07:33) they need factory capacity in order to (01:07:35) increase their revenue growth. But (01:07:37) nonetheless, we also said that AI is (01:07:41) more than chatbots. It includes all (01:07:43) these different fields of science. Um, (01:07:45) Nvidia's AV business is coming up on 10 (01:07:48) billion dollars. Nobody ever talks about (01:07:50) that. And you have to train world (01:07:52) models. You have to train these AI AVs (01:07:54) and it's happening robo taxis happening (01:07:56) all over the world. Our AI work with uh (01:07:59) digital biology, our AI work in (01:08:01) financial services. The whole industry (01:08:04) of quants, quantitative trading is (01:08:07) moving towards Yeah, exactly. They used (01:08:10) to be classical machine learning. A (01:08:12) whole bunch of human featured they call (01:08:14) quants, right? These these specialized (01:08:17) mathematicians were trying to figure out (01:08:19) what the predictive features are. Now we (01:08:21) use AI to figure it out. And so in order (01:08:23) to have instead of having quants, you (01:08:25) need a lot of supercomputers. Financial (01:08:27) services is one of our fastest growing (01:08:28) segments. billions of dollars in in (01:08:31) quants, you know, in financial services, (01:08:33) billions of dollars in AV, billions of (01:08:36) dollars in robotics coming up, billions (01:08:38) of dollars in digital biology. And so (01:08:41) how big can that all that be? Well, (01:08:43) simple logic is this simple math. (01:08:45) Whether you you think that AI is going (01:08:47) to replace shortage, labor shortage or (01:08:50) workforce shortage in any kind, um, (01:08:53) let's ignore that for a second. The (01:08:55) world is at hundred trillion dollars in (01:08:57) GDP. out of that let's just say 2% 2% (01:09:02) annually is R&D and let's just go back (01:09:05) in time five years ago if you were to (01:09:07) take the largest drug discovery company (01:09:09) in the world drug company in the world (01:09:10) and where's all of their R&D wet labs (01:09:14) >> today what are they do doing building (01:09:17) supercomputers (01:09:19) >> and so there's a fundamental shift in (01:09:22) how they think about that $2 trillion (01:09:25) >> it used to be $2 trillion for the old (01:09:27) way of doing things. It's now going to (01:09:28) be $2 trillion in the AI way of doing (01:09:31) things. Well, $2 trillion is going to (01:09:33) need $2 trillion of R&D is going to be (01:09:36) powered by a whole bunch of (01:09:37) infrastructure. And that's the reason (01:09:39) why we're building supercomputers (01:09:41) everywhere around the world. And so so I (01:09:43) think if if you reason about it from the (01:09:45) outside in, you know, either from the (01:09:48) foundation up, from the outside in, you (01:09:50) come to the conclusion that what we're (01:09:52) experiencing, what all three of us are (01:09:54) experiencing, which is the amount of (01:09:56) computing demand is insane. (01:09:59) >> Give me an example of a startup company (01:10:01) that goes, "No, we're good." (01:10:03) >> They are all dying for computing (01:10:04) capacity. Give me an example for a (01:10:07) researcher in any university, a (01:10:09) scientist in any company who says got (01:10:11) plenty of capacity. Everybody is dying (01:10:14) for capacity. And so we have a global (01:10:17) multi- company, multi-industry shortage. (01:10:21) It's not just about open AI even though (01:10:22) open AI could use a lot more capacity as (01:10:24) well. So I think I think how we think (01:10:26) about this what with the narrative the (01:10:29) narrative is not helpful and it's a (01:10:31) little bit too superficial to say how do (01:10:34) you prove there's an AI bubble$12 (01:10:36) billion of revenues (01:10:39) hundreds of billions of dollar (01:10:40) infrastructure being built is a little (01:10:42) bit too simplistic. (01:10:44) >> Yeah. The other thing people um tend to (01:10:45) point out is the MIT study. There (01:10:47) there's some study that I think came out (01:10:48) of MIT that claimed that most enterprise (01:10:50) deployments of AI weren't that useful. (01:10:52) And you're like, well, did you do the (01:10:53) change management? Did you do a reorg? (01:10:55) Did you integrate into tooling? Did you (01:10:57) like how long did it even take to (01:10:58) implement it? If a planning cycle in an (01:10:59) enterprise is a year and is something in (01:11:01) six months and so it feels like there's (01:11:03) a lot of these kind of again overstated (01:11:05) things that get a lot of attention, but (01:11:07) then you map it against what's actually (01:11:08) happening. (01:11:09) >> Yeah. (01:11:10) >> And the growth of these companies using (01:11:11) AI and it's just a completely different (01:11:12) world. And and and if you want to find (01:11:14) out where the world's innovation's (01:11:17) happening, I would not go find out at an (01:11:19) enterprise. (01:11:20) >> Would you guys agree? (01:11:22) >> Yeah. (01:11:22) >> Enterprise is like the slowest adopters (01:11:25) of new technologies. I would go talk to (01:11:27) all of the startups, the 30, 40,000 (01:11:30) startups that are currently doing this (01:11:32) stuff. I would go talk to Open Evans. (01:11:34) How how's it working? I would go go talk (01:11:36) to cursor. How's coding working by the (01:11:37) way? You know, I would just go talk to (01:11:39) these people. (01:11:40) >> I think it's really interesting that you (01:11:41) see that. Um, of course you do have (01:11:43) companies making, you know, hundred (01:11:45) million plus, multiund million plus (01:11:47) progress of AR in enterprise sales, (01:11:50) Harvey, Sierra, etc. But some of the (01:11:52) fastest growing companies have been (01:11:54) enduser adopted even in conservative (01:11:57) industries, right? Like healthcare, you (01:11:59) know, skeptical industries like (01:12:00) engineering, (01:12:01) >> healthare, the most right, the most (01:12:03) conservative of all. But guess what? (01:12:05) They are so concerned about getting the (01:12:08) right answer (01:12:10) >> that the ability to have something like (01:12:12) open evidence. (01:12:12) >> Yeah. (01:12:13) >> To do grounded research, high quality (01:12:15) research and get that get that research (01:12:18) as information to you. Nobody wants to (01:12:20) do research. They want answers. Nobody (01:12:22) wants to do search. They want answers. (01:12:23) Is that right? (01:12:24) >> A bridge is a great example of that too (01:12:25) where they're basically making it really (01:12:26) easy to do the physician knows instead (01:12:28) of the physician sitting there and doing (01:12:29) it. Back to your point on task versus (01:12:31) >> task versus purpose. Exactly. And I (01:12:32) think a different way to think about the (01:12:33) demand is like there are so many jobs (01:12:35) where you're asking the the work is (01:12:38) actually like an impossible ask right of (01:12:40) a doctor or a radiologist keep up with (01:12:43) the world's biomedical knowledge in R&D (01:12:46) which is accelerating you know computing (01:12:47) and otherwise um and then (01:12:49) >> like archive papers (01:12:51) >> there was a time you s you and I read (01:12:54) >> you and I both both used to do I don't (01:12:56) do that anymore but here now now I just (01:12:59) load it all into chat (01:13:00) >> GBTh you Now I just load it all in with (01:13:03) all of the the ones that are interesting (01:13:05) and and I make it learn it (01:13:07) >> and then I you know make it summarize (01:13:09) and another summary and I I interact (01:13:11) with it. But but the point is uh we used (01:13:14) to do search. We don't do it that (01:13:15) anymore. I don't do search. We used to (01:13:17) do research. You know the goal is to get (01:13:20) answers. The goal is to get smarter. And (01:13:22) these AIs allow us to help us do all (01:13:24) that. And I think all of it all of it (01:13:28) comes back with it. It's all more (01:13:31) helpful if you come back to the (01:13:32) framework that says AI is a multi-layer (01:13:36) cake (01:13:37) >> and that AI is not just a chatbot. AI is (01:13:41) very very diverse in all of the (01:13:43) industries and modalities and (01:13:45) information and applications that it (01:13:47) addresses. When you think about wanting (01:13:50) to win (01:13:51) >> that America should win AI, it should (01:13:54) not just be America should have this (01:13:57) company win AI, but it we should try to (01:13:59) win across the board (01:14:01) >> and across domains. (01:14:02) >> Across domains. Exactly. And when we (01:14:04) think about open source, all of a sudden (01:14:06) this this is a helpful framework. When (01:14:08) we think about winning, it's a helpful (01:14:09) framework. When we think about uh energy (01:14:11) is a helpful framework that because we (01:14:13) need factories. Factories need energy. (01:14:16) And without energy, we have no factory. (01:14:17) without factories we have no AI that's a (01:14:19) helpful framework and so I think if if (01:14:22) um if we if we have a better (01:14:25) understanding a system a framework for (01:14:28) understanding what AI is I think the (01:14:30) narratives will become more common sense (01:14:32) the narratives will become more (01:14:33) pragmatic (01:14:35) >> become more balanced we want to keep (01:14:37) people safe (01:14:38) >> but one of the best ways to keep people (01:14:40) safe is advancing advancing of (01:14:42) technology quickly (01:14:43) >> and and I think the industry is doing (01:14:45) that and I'm very proud of the industry (01:14:47) for doing that. (01:14:48) >> No one wants to drive a car from, you (01:14:50) know, the first decade of cars. And so I (01:14:53) I think uh (01:14:54) >> ABS is a really good thing. (01:14:56) >> Yes, (01:14:56) >> ABS is a really good thing. Lane keeping (01:14:58) is a really good thing. There's no (01:15:00) question FSD is a really good thing. (01:15:02) >> And I think people will be excited about (01:15:04) the, you know, third or fourth year of (01:15:06) AI. (01:15:07) >> Yeah. No, no doubt. And and I I say with (01:15:10) great pride that the industry made (01:15:14) tremendous strides this last year. (01:15:17) all the technologies we've mentioned. (01:15:19) Um, and that the scaling laws are so (01:15:21) intact that we we now know that more (01:15:26) compute, more intelligence (01:15:28) >> and and um uh gosh, the the the the (01:15:34) innovations in one in in one sector (01:15:37) diffuses and spreads across all of the (01:15:39) other sectors so fast. I'm so happy to (01:15:42) see all that. And so I think the next (01:15:44) five years it's going to be (01:15:45) extraordinary. No, no doubt about it. (01:15:46) And I think next year is going to be (01:15:48) incredible. (01:15:48) >> Amazing. Well, we're excited to talk to (01:15:50) you at the end of next year, too. (01:15:51) >> Yeah. Looking forward to it. Thank you (01:15:53) guys for all the work that you guys do. (01:15:54) Congratulations. What a great year. (01:15:55) >> Wow. Amazing year. (01:15:57) >> Yeah. A lot. Thank you. (01:15:58) >> Yeah. Thank you. Happy New Year. Happy (01:16:00) New Year. (01:16:03) >> Find us on Twitter at No Prior Pod. (01:16:05) Subscribe to our YouTube channel if you (01:16:07) want to see our faces. Follow the show (01:16:09) on Apple Podcasts, Spotify, or wherever (01:16:11) [music] you listen. That way you get a (01:16:13) new episode every week. And sign up for (01:16:15) emails or find transcripts for every (01:16:17) episode at no-briers.com.

NVIDIA’s Jensen Huang on Reasoning Models, Robotics, and Refuting the “AI Bubble” Narrative (YouTube Video Transcript)

Learning Modes

YouTube Video Transcript Hide

Ask AI Result

Leave a Reply Cancel reply

Other Videos:

Google DeepMind CEO Demis Hassabis: AI’s Next Breakthroughs, AGI Timeline,...

L’era Glaciale (Non/Disney) – “Oh no, un pirata. Aiuto! Aiuto!”

The Future Badge | Full Episode | Hey Duggee

Timon and Pumbaa Interrupt Tom and Jerry Meet Anastasia (2021)...

Best Neovim Plugins in 2025 (AI, Productivity & More)

The Nottoway Effect: It’s Time to Burn Down America’s Historical...

Think the Outcome of the Civil War Was Wrong? The...

Britain’s STRICTEST Headmistress Debates The Power Of Her Teaching Methods!

How childhood wiring impacts adult life, in 90 minutes |...

How We Are Still Complicit in Perpetuating American Slavery

Jonathan Haidt on Gen Z Fragility, Social Media, and the...

Parenting Strategies for Raising Resilient Kids — Dr. Becky Kennedy,...