Home Videos

NVIDIA Just Changed Robotics Forever With GR00T N1 – See It in Action! (YouTube Video Transcript)

Need transcripts for other videos? Try our YouTube Transcript Generator →
Title: NVIDIA Just Changed Robotics Forever With GR00T N1 – See It in Action!
Duration: 00:13:44
Total Correct Answers:
Current Caption
Correct

Learning Modes

YouTube Video Transcript Hide

Ask AI Result

The ask AI result will appear here..
(00:00:00) Your YouTube transcript will appear here (00:00:00) Let's go talk about robotics, shall (00:00:02) [Music] (00:00:05) we? Let's talk about (00:00:08) robots. Well, the time has come. The (00:00:11) time have has come for robots. Uh robots (00:00:15) have the benefit the benefit of being (00:00:17) able to interact with the physical world (00:00:18) and do things that otherwise digital (00:00:20) information cannot. Uh we know very (00:00:23) clearly that the world is has severe (00:00:26) shortage of of human labors, human (00:00:28) workers by the end of this decade. The (00:00:31) world is going to be at least 50 million (00:00:34) workers short. We'd be more than (00:00:36) delighted to pay them each $50,000 to (00:00:38) come to work. We're probably going to (00:00:40) have to pay robots $50,000 a year to (00:00:42) come to work. And so this is going to be (00:00:44) a very very large industry. There are (00:00:46) all kinds of robotic systems. Your (00:00:48) infrastructure will be robotic. billions (00:00:50) of cameras and warehouses and factories, (00:00:53) 10, 20 million factories around the (00:00:54) world. Every car is already a robot as I (00:00:57) mentioned earlier. And then now we're (00:00:58) building general robots. Let me show you (00:01:00) how we're doing that. (00:01:04) [Music] (00:01:11) [Music] (00:01:27) Hey (00:01:28) [Music] (00:01:36) hey (00:01:38) a hey a (00:01:44) [Music] (00:01:46) I'm (00:01:55) [Music] (00:01:59) a boy. (00:02:03) [Music] (00:02:24) [Music] (00:02:28) Why? Heat. (00:02:32) [Music] (00:02:40) [Music] (00:02:47) Heat. Hey, (00:02:50) [Music] (00:02:55) Heat. (00:02:56) [Music] (00:03:06) [Music] (00:03:13) [Music] (00:03:18) Heat. Heat. (00:03:33) Heat. (00:03:34) [Music] (00:03:42) Heat. (00:03:43) [Music] (00:03:59) [Music] (00:04:03) Hey, hey, (00:04:07) hey. Heat. Heat. (00:04:14) [Music] (00:04:28) N. Heat. Heat. (00:04:30) [Music] (00:04:54) [Music] (00:05:02) [Music] (00:05:10) [Music] (00:05:18) I'm physical AI and robotics. (00:05:22) are moving so fast. Everybody pay (00:05:25) attention to this space. This could very (00:05:27) well likely be the largest industry of (00:05:29) all. At its core, we have the same (00:05:33) challenges. As I mentioned before, there (00:05:36) are three that we focus on. They are (00:05:39) rather systematic. (00:05:41) One, how do you solve the data (00:05:44) problem? How where do you create the (00:05:48) data necessary to train the AI? Two, (00:05:51) what's the model architecture? And then (00:05:53) three, what's the scaling loss? How can (00:05:57) we scale either the data, the compute or (00:06:01) both so that we can make AIs smarter and (00:06:05) smarter and smarter? How do we scale? (00:06:07) And those two those fundamental problems (00:06:10) exist in robotics as well. In (00:06:12) robotics, we created a system called (00:06:14) Omniverse. It's our operating system for (00:06:17) physical AI. You've heard me talk about (00:06:18) Omniverse for a long time. We added two (00:06:22) technologies to it. Today I'm going to (00:06:24) show you two things. One of them is so (00:06:28) that we could scale AI with generative (00:06:31) capabilities. A generative model that (00:06:34) understand the physical world. We call (00:06:36) it cosmos. Using (00:06:39) Omniverse to condition Cosmos and using (00:06:43) Cosmos to generate an infinite number of (00:06:47) environments allows us to create data (00:06:51) that is grounded grounded controlled by (00:06:55) us and yet be systematically infinite at (00:06:59) the same time. Okay. So you see (00:07:02) Omniverse, we use Candy Colors to give (00:07:04) you an (00:07:05) example of us controlling the robot in (00:07:09) the scenario perfectly and yet O Cosmos (00:07:13) can create all these virtual (00:07:14) environments. The second thing just as (00:07:17) we were talking about earlier, one of (00:07:20) the incredible scaling capabilities of (00:07:22) language models today is reinforcement (00:07:24) learning verifiable rewards. The (00:07:28) question is what's the verifiable (00:07:29) rewards in (00:07:31) robotics? And as we know very well, it's (00:07:34) the laws of physics, verifiable physics (00:07:38) rewards. And so we need an incredible (00:07:41) physics engine. Well, most physics (00:07:44) engines have been designed for a variety (00:07:45) of reasons. It could be designed because (00:07:47) if we want to use it for large (00:07:49) machineries or uh maybe we design it for (00:07:51) uh virtual worlds, video games and such. (00:07:54) But we need a physics engine that is (00:07:57) designed for very fine (00:08:00) grain rigid and soft bodies designed for (00:08:05) being able to train tactile feedback and (00:08:09) fine motor skills and actuator controls. (00:08:13) We needed to be GPU accelerated so that (00:08:16) we these virtual worlds could live in (00:08:18) super linear time, super real time and (00:08:22) train these AI models incredibly fast. (00:08:25) And we needed to be integrated (00:08:27) harmoniously into a framework that is (00:08:31) used by roboticists all over the world, (00:08:33) Mujoko. And so today we're announcing (00:08:36) something really, really special. It is (00:08:39) a partnership of three (00:08:41) companies, Deep (00:08:43) Mind, Disney Research, and Nvidia. And (00:08:47) we call it (00:08:52) Newton. Let's Let's take a look at (00:08:55) Newton. (00:08:59) [Music] (00:09:04) [Music] (00:09:16) [Applause] (00:09:18) [Music] (00:09:21) [Applause] (00:09:24) Thank you. All right, let's start that (00:09:26) over, shall (00:09:28) we? Let's not ruin it for them. Hang on (00:09:31) a second. Somebody talk to me. I need (00:09:33) feedback. What (00:09:36) happened? Who? I just need a human to (00:09:38) talk (00:09:42) to. Come on. That's a good (00:09:45) joke. Give me a human to talk to. (00:09:48) Janine, I know it's not your fault, but (00:09:50) talk to me. We got We just got a two (00:09:52) minutes left. I'm right here. They're (00:09:53) re-rackcking it. They're re-rackcking (00:09:55) it. I don't even know what that means. (00:09:57) Okay. (00:09:59) [Music] (00:10:17) [Applause] (00:10:23) [Music] (00:10:29) What did you do? (00:10:32) [Music] (00:10:42) Tell me that wasn't (00:10:45) amazing. Hey, (00:10:47) Blue. How are you (00:10:49) doing? How do you like How do you like (00:10:52) your new physics engine? You like it, (00:10:54) huh? Yeah, I bet. I know. Tactical (00:10:58) feedback. (00:11:00) Rigid body, soft body (00:11:03) simulation, super real (00:11:05) time. Can you imagine just now what you (00:11:07) were looking at is comp complete real (00:11:09) time (00:11:10) simulation? This is how we're going to (00:11:12) train robots in the (00:11:13) future. Uh just so you know, Blue has uh (00:11:17) two computers, two Nvidia computers (00:11:22) inside. Look how smart you are. (00:11:26) Yes, you're smart. Okay. (00:11:30) All right. Hey, Blue. Listen. How about (00:11:33) let's take them home. Let's finish this (00:11:35) keynote. It's (00:11:37) lunchtime. Are you ready? Let's finish (00:11:40) it up. We have another announcement to (00:11:43) You're good. You're good. Just stand (00:11:44) right here. Stand right here. Stand (00:11:47) right (00:11:49) here. All right. Good. Right (00:11:52) there. That's good. All right. Stand. (00:11:57) [Applause] (00:12:06) Okay, we have another amazing (00:12:09) news. I told you the progress of our (00:12:12) robotics has been making enormous (00:12:14) progress and today we're announcing that (00:12:18) Groot (00:12:19) N1 is open sourced. (00:12:22) [Applause] (00:12:31) I want to thank all of you to come (00:12:35) to let's wrap up. I want to thank all of (00:12:38) you for coming to GTC. We talked about (00:12:39) several things. One, Blackwell is in (00:12:42) full production and the ramp is (00:12:46) incredible. Customer demand is (00:12:48) incredible and for good reason because (00:12:50) there's an inflection point in AI. The (00:12:52) amount of computation we have to do in (00:12:54) AI is so much greater as a result of (00:12:57) reasoning AI and the training of (00:12:59) reasoning AI systems and agent agentic (00:13:02) systems. Second, Blackwell MVLink 72 (00:13:05) with Dynamo is 40 times the performance (00:13:10) AI factory performance of Hopper and (00:13:12) inference is going to be one of the most (00:13:14) important workloads in the next decade (00:13:16) as we scale out AI. Third, we have an (00:13:20) annual annual rhythm of road maps that (00:13:23) has been laid out for you so that you (00:13:26) could plan your AI infrastructure. And (00:13:28) then we have two we have three AI (00:13:30) infrastructures we're building. AI (00:13:31) infrastructure for the cloud, AI (00:13:33) infrastructure for enterprise, and AI (00:13:36) infrastructure for robots. (00:13:43) [Music]

Leave a Reply

Your email address will not be published. Required fields are marked *