↔
Title: Unlocking AI Innovation: A Conversation with Eric Boyd and Asha Sharma | Studio06
Duration: 00:16:17
Total Correct Answers:
Current Caption
Correct
Learning Modes
YouTube Video Transcript Hide
Ask AI:
Export as:
Ask AI Result
The ask AI result will appear here..
(00:00:00) Your YouTube transcript will appear here
(00:00:10)
SETH JUAREZ: Hello, my friends.
(00:00:12)
Welcome back.
(00:00:12)
This is the Build Live stage.
(00:00:15)
My name is Seth, and no pressure for
(00:00:17)
me because this is my boss.
(00:00:20)
Then my boss' boss. How's it going, my friends?
(00:00:26)
ERIC BOYD: Very well. How are you?
(00:00:27)
SETH JUAREZ: Eric and Asha,
(00:00:28)
why don't you introduce yourself?
(00:00:29)
We'll start with you and then Asha.
(00:00:30)
ERIC BOYD: I'm Eric Boyd.
(00:00:31)
I lead the AI platform team.
(00:00:33)
ASHA SHARMA: Asha, I lead product
(00:00:34)
for the AI platform team.
(00:00:36)
SETH JUAREZ: Let's start with
(00:00:37)
what is the now
(00:00:38)
of AI because there's a lot of stuff going on,
(00:00:40)
and maybe you can help us rationalize it.
(00:00:42)
Want to get your thoughts and
(00:00:43)
then Asha on product, how that looks.
(00:00:45)
ERIC BOYD: There's so much
(00:00:46)
going on in AI.
(00:00:47)
I think some of the most exciting things to
(00:00:49)
think about are what are our customers doing?
(00:00:52)
If I start with Mercedes Benz,
(00:00:55)
everyone's been in a car and has
(00:00:57)
tried to talk to that car voice interface,
(00:00:59)
and it doesn't understand
(00:01:00)
you and can't say anything back.
(00:01:01)
Mercedes Benz integrated with GPT-4 for
(00:01:04)
their MBUX and so now you can
(00:01:06)
have a much more natural conversation in the car,
(00:01:08)
being able to get directions and get things done.
(00:01:11)
They even integrated GPT-4 vision into the dash cam,
(00:01:14)
so now they can see
(00:01:15)
things going on around the car and near you.
(00:01:18)
It's so exciting, things like that.
(00:01:20)
You can go from car manufacturers
(00:01:23)
all the way to developers with the unity platform,
(00:01:26)
which has now made a bot that can help
(00:01:28)
developers figure out all the things
(00:01:30)
that you're doing on Unity.
(00:01:31)
Unity is a platform for making 3D games.
(00:01:33)
Helping you figure out all the APIs
(00:01:35)
and how everything works,
(00:01:37)
all within the unity environment and not having to leave.
(00:01:39)
When you look at those two extremes of just
(00:01:42)
how much these applications are just all over the place,
(00:01:45)
it just shows you how quickly companies are
(00:01:47)
adopting AI across the spectrum.
(00:01:50)
We're just building tools as
(00:01:52)
quickly as we can to try and serve those needs.
(00:01:55)
Companies need things like search to be able to
(00:01:58)
get their data in and store it.
(00:02:00)
Azure AI Search, which is
(00:02:02)
now 12 times as much storage for the same price,
(00:02:05)
really all integrated in.
(00:02:07)
The Azure AI Studio, making it easy.
(00:02:09)
There's just so much going on in AI
(00:02:11)
just really bringing all those applications together.
(00:02:13)
It's really fun and exciting to see.
(00:02:15)
ASHA SHARMA: It almost feels
(00:02:16)
like applications today are
(00:02:17)
feeling increasingly broken when
(00:02:19)
they don't have AI as part of it.
(00:02:20)
We're starting to see
(00:02:22)
organizations go from experimentation
(00:02:24)
now to scale and
(00:02:26)
even think about customizing the experiences.
(00:02:28)
A lot of the tools that we're working
(00:02:30)
on are going towards, how do they fine tune?
(00:02:33)
How do they leverage data to
(00:02:35)
ground their application and things like that.
(00:02:37)
SETH JUAREZ: What are
(00:02:37)
some cool things that people are
(00:02:39)
doing now that you're getting a sense for?
(00:02:42)
Because we talk about AI all the time,
(00:02:44)
and everyone's like, AI, AI.
(00:02:46)
But what are some things that
(00:02:47)
people are doing to sort of soften
(00:02:49)
the edge around software and hardware and people?
(00:02:53)
ERIC BOYD: You just see it in
(00:02:54)
all the different use cases.
(00:02:55)
I talked about a couple where we
(00:02:57)
see what Mercedes Benz and Unity is doing.
(00:03:00)
But there are so many other examples where they're
(00:03:04)
trying to build solutions to
(00:03:05)
make it easier to interact with people.
(00:03:07)
I love the example Satya gave in
(00:03:09)
his keynote this morning at the Khan Academy.
(00:03:11)
The ability to have an individualized education tutor
(00:03:15)
really connecting directly with each student is
(00:03:19)
really powerful and the types of
(00:03:21)
things that they are the applications you
(00:03:23)
couldn't even have really dreamed of
(00:03:25)
a few years ago until AI really got to this point.
(00:03:28)
SETH JUAREZ: What are your thoughts?
(00:03:28)
ASHA SHARMA: Yeah, I think multimodal
(00:03:30)
is becoming a big aspect of that.
(00:03:32)
Now with the power of GPT-4o,
(00:03:34)
which is GA today on Azure OpenAI and Azure AI Studio,
(00:03:38)
I think there's going to be a new world
(00:03:40)
of possibilities to actually
(00:03:41)
interact with your AI
(00:03:43)
much more fluently than ever before.
(00:03:46)
SETH JUAREZ: I love that because they
(00:03:48)
let me play with these models sometimes,
(00:03:50)
and they're actually really fun
(00:03:51)
to imagine the things that you can do.
(00:03:54)
We started talking about GPT-4 Omni,
(00:03:57)
but there's a lot of other models
(00:03:58)
that we're looking at right now.
(00:04:00)
Eric, why don't we start with some
(00:04:01)
of the cool things that are out today?
(00:04:03)
Then Asha maybe some ways
(00:04:04)
that we're helping people use those things.
(00:04:06)
ERIC BOYD: Some of the other things
(00:04:08)
that we're excited to see today,
(00:04:10)
we announced the Phi series of models, Phi-3.
(00:04:13)
We've got Phi-3-mini,
(00:04:14)
small, and medium,
(00:04:15)
which are 3, 7,
(00:04:17)
and 14 billion parameter models.
(00:04:19)
The thing about these Phi series models
(00:04:21)
is we took a different approach to training them,
(00:04:24)
where we really tried to think of
(00:04:26)
the data that they train on as a curriculum,
(00:04:29)
like you would give to a student where you're trying to
(00:04:31)
teach the model step by step,
(00:04:33)
how do we make it learn everything about the world.
(00:04:37)
In doing that, we got a model that
(00:04:39)
is much smaller and more powerful.
(00:04:41)
All the Phi-3 models,
(00:04:43)
they basically punch a full weight class
(00:04:45)
ahead for their size,
(00:04:46)
which is really interesting as
(00:04:48)
customers are looking for what's
(00:04:50)
the most power that I can get from
(00:04:51)
a model at the right price point.
(00:04:54)
Looking to select all sort
(00:04:55)
of along that price performance curve,
(00:04:57)
what are the places
(00:04:58)
where the model is going to work really well?
(00:05:00)
It's exciting to see what we've done with that.
(00:05:02)
We also added, Asha is talking about multimodal.
(00:05:06)
Phi-3 for Vision is also available today.
(00:05:08)
Now you can in these smaller model still get
(00:05:11)
those same rich vision capabilities that
(00:05:13)
you're used to seeing with GPT-4V and GPTo.
(00:05:16)
Really excited to see that.
(00:05:18)
ASHA SHARMA: We also expanded
(00:05:19)
our partnership with Hugging Face,
(00:05:20)
so there's a lot of open source models.
(00:05:23)
We are also doing a lot around the tool chain.
(00:05:25)
I think that's what's most special.
(00:05:27)
Eventually, we have to teach people to get
(00:05:29)
the right model for the right job at the right cost.
(00:05:32)
If they can do that, then it's about
(00:05:33)
the tool chain to build those applications.
(00:05:35)
We announced the general availability of Azure AI Studio,
(00:05:39)
but we're also announcing a lot around responsible AI to
(00:05:42)
keep those applications safe and secure end to end,
(00:05:45)
so content filters, prompt shields,
(00:05:48)
all of those things are going to become
(00:05:49)
even more important as we start to scale.
(00:05:51)
As we look at this from a perspective of developers,
(00:05:57)
I might be like, oh, I got to use AI.
(00:05:59)
I don't know what to do.
(00:06:00)
Because I've been to conferences around the world,
(00:06:02)
and people are literally being told by their management,
(00:06:06)
you need to use AI,
(00:06:07)
and they're like, what?
(00:06:09)
What is some of the stuff that we provide to
(00:06:12)
help developers get started?
(00:06:14)
We'll start with you then Asha.
(00:06:15)
ERIC BOYD: There are
(00:06:15)
all these different tools that
(00:06:17)
we make to make things easier.
(00:06:20)
You can go to Azure AI Studio,
(00:06:22)
which is the integrated place to go
(00:06:24)
and build your generative AI applications.
(00:06:26)
The most common pattern that we're
(00:06:29)
seeing is this retrieval augmented generation,
(00:06:32)
the rag pattern of how do I take
(00:06:34)
some data and put it into a search engine,
(00:06:37)
usually one that has vector search.
(00:06:38)
That's why Azure AI Search is
(00:06:41)
super helpful for building those applications.
(00:06:43)
It's a really rich search engine that
(00:06:45)
brings all the power of vector search with
(00:06:48)
all the things that we've learned over decades of doing
(00:06:50)
keywords based search into a single platform,
(00:06:53)
and then has a semantic ranker on top of
(00:06:55)
them to really retrieve the most relevant results.
(00:06:58)
Then you feed that to your model that then
(00:07:01)
the model now has all of
(00:07:02)
the information for your application,
(00:07:04)
just all that built into it.
(00:07:06)
That's the pattern that
(00:07:07)
developers are really going after.
(00:07:09)
How do we make it easy for them to build that,
(00:07:12)
to put all those pieces together,
(00:07:14)
and then to start experimenting
(00:07:15)
and iterating and learning,
(00:07:17)
how do I tweak this to get the best quality out of it.
(00:07:20)
ASHA SHARMA: It seems like some of
(00:07:21)
the more sophisticated developers,
(00:07:22)
once they do that,
(00:07:23)
they're starting to turn to tools
(00:07:25)
to actually help them solve complex tasks,
(00:07:27)
so Assistants, Assistants API,
(00:07:30)
which we have for Azure OpenAI.
(00:07:32)
That's becoming increasingly more popular.
(00:07:33)
I saw Coca Cola is doing
(00:07:35)
something really cool where they have
(00:07:36)
Assistant API templates for every single department,
(00:07:40)
so it makes it much easier than
(00:07:41)
for a business decision maker to actually
(00:07:43)
get started with the tools in a really non scary way.
(00:07:48)
SETH JUAREZ: The cool end.
(00:07:49)
Did you want to say something?
(00:07:49)
ERIC BOYD: You can say,
(00:07:50)
you even mentioned prompt shields,
(00:07:51)
which we also didn't talk about is like.
(00:07:53)
The whole capability of building
(00:07:55)
these tools responsibly and safely.
(00:07:57)
We provide the Azure Content Safety system,
(00:08:01)
which is built into the Azure OpenAI service,
(00:08:03)
and you can then add on to any
(00:08:04)
other model that you want.
(00:08:06)
But it gives you the full set
(00:08:07)
of safety tools that you might want,
(00:08:09)
things about how to give different levels of control
(00:08:12)
around violence or sexual content
(00:08:15)
or how do you want to manage that,
(00:08:16)
all the way to reducing hallucinations,
(00:08:19)
to managing prompt shields,
(00:08:21)
which is really controlling
(00:08:22)
jail break and things like that.
(00:08:23)
All those are just making the developer's life
(00:08:25)
easier as they go and build these applications.
(00:08:27)
SETH JUAREZ: I want to lean in here just
(00:08:29)
a little bit before we get to some of
(00:08:30)
the other questions about safety and responsibility.
(00:08:33)
Can you tell me how Microsoft is taking
(00:08:35)
this particular task seriously
(00:08:38)
in helping developers do it as well, Asha.
(00:08:40)
ASHA SHARMA: Well, we've been
(00:08:41)
working on it
(00:08:41)
for the last eight years.
(00:08:43)
We think about this as
(00:08:45)
cutting edge technology needs cutting
(00:08:47)
edge responsibility and we're still pretty early in it.
(00:08:50)
But after eight years,
(00:08:51)
we have 90 tools, hundreds of features.
(00:08:54)
We do a lot of the work ourselves.
(00:08:57)
Before we release a model,
(00:08:58)
there's a lot of work that we do to
(00:09:00)
provide the Azure promise,
(00:09:01)
but then we also provide tools so
(00:09:03)
developers can actually customize their need.
(00:09:05)
Custom Filters is a big feature that we released today,
(00:09:09)
and that's allowing you to then set
(00:09:11)
up different tools for moderation,
(00:09:13)
for things that maybe we haven't even thought about yet.
(00:09:15)
ERIC BOYD: That's right.
(00:09:16)
SETH JUAREZ: That's really cool.
(00:09:17)
The thing that I
(00:09:18)
think now I want to dive into,
(00:09:20)
if it's possible, is the notion of,
(00:09:22)
what are customers doing with?
(00:09:24)
I know we spent a lot of time talking
(00:09:26)
about here's all this good stuff,
(00:09:27)
goodness that we have, but what are customers doing?
(00:09:29)
We'll start with you, Eric, and then
(00:09:31)
maybe Asha tell us about that.
(00:09:32)
ERIC BOYD: We just keep telling all
(00:09:33)
of the customer stories of
(00:09:34)
all the amazing things that they're able to
(00:09:36)
go and deliver on top of these tools,
(00:09:38)
and it's been in every industry
(00:09:41)
that we've really worked with.
(00:09:43)
You see obvious cases in customer support
(00:09:45)
where people are trying to build customer support bots
(00:09:48)
to help answer users questions or
(00:09:50)
even for their own agents to
(00:09:52)
go and help them develop that.
(00:09:54)
But I think the interesting things
(00:09:56)
are the new applications,
(00:09:57)
the things that we just couldn't imagine
(00:09:59)
being lit up previously and now seeing
(00:10:02)
how those are coming to light
(00:10:03)
with the way the customers are
(00:10:04)
pulling all these different tools
(00:10:06)
together in incredible applications.
(00:10:08)
Some of the most interesting things are this new space,
(00:10:12)
of having multiple agents talk to
(00:10:14)
each other and sort of string
(00:10:16)
different conversations together.
(00:10:17)
I think that's a really interesting space
(00:10:19)
where we're starting to see
(00:10:21)
the standard computer science abstractions
(00:10:24)
and separation of control,
(00:10:25)
that's starting to materialize into
(00:10:27)
these much richer applications.
(00:10:30)
ASHA SHARMA: We now have more than
(00:10:32)
53,000 customers that are on Azure AI.
(00:10:35)
One of the things that we're thinking deeply about is,
(00:10:37)
how do we start to do safety and security by default.
(00:10:40)
Again, removing as much work
(00:10:42)
as possible from our customers.
(00:10:43)
The other interesting thing is that
(00:10:45)
Microsoft runs on Azure AI.
(00:10:47)
All of the copilots,
(00:10:49)
all the millions of users,
(00:10:50)
we're getting a first class ride at learning
(00:10:53)
and using the technology for our customers internally,
(00:10:56)
and then we put it in the platform and we
(00:10:57)
extend it to our third party customers.
(00:10:59)
SETH JUAREZ: Microsoft has always been
(00:11:01)
customer number zero?
(00:11:02)
ERIC BOYD: Literally from the start
(00:11:04)
when we first started
(00:11:06)
building out our capabilities
(00:11:08)
with the Azure OpenAI service.
(00:11:10)
We first launched Bing Chat,
(00:11:12)
and then we launched M365 Copilot and GitHub Copilot,
(00:11:15)
even 18 months before that.
(00:11:17)
We've built all of that as Asha said,
(00:11:19)
on the same platform
(00:11:20)
that we make available to our customers.
(00:11:22)
They know we've literally
(00:11:24)
bet all of Microsoft business on it,
(00:11:25)
so they can trust it with their business as well.
(00:11:27)
SETH JUAREZ: That's awesome, because, like
(00:11:29)
I said, I I use this stuff,
(00:11:31)
and I know what it does.
(00:11:33)
Every once in a while, it's
(00:11:35)
pretty surprising what you can do.
(00:11:38)
As people are getting immersed in the rag pattern
(00:11:41)
and doing that development
(00:11:43)
and they're getting out to work.
(00:11:44)
What are some things that people should look
(00:11:45)
forward to or how should they
(00:11:47)
orient their thinking when it comes to
(00:11:49)
experiences that they may be able to develop?
(00:11:52)
ASHA SHARMA: One of the big things is,
(00:11:54)
I feel like we have to just be thoughtful to
(00:11:56)
not do AI for AI's sake.
(00:11:58)
It has to create real values.
(00:11:59)
It has to improve the way you live,
(00:12:00)
improve the way you work.
(00:12:02)
There was a study that showed that skilled workers that
(00:12:04)
use GenAI are seeing
(00:12:05)
a 40 percent improvement in the way that they work.
(00:12:08)
We are constantly thinking about
(00:12:10)
that from a product perspective, which is,
(00:12:11)
how do we build tools to help
(00:12:13)
developers solve real problems.
(00:12:15)
I would say, don't lose sight of that.
(00:12:17)
The second thing to look forward to is,
(00:12:19)
and we showed this in one of our sessions.
(00:12:21)
We're moving much more code first,
(00:12:23)
so we want to meet developers where they are.
(00:12:25)
More and more, we're seeing
(00:12:26)
all applications becoming AI applications,
(00:12:28)
and we have the best IDs in the world.
(00:12:30)
We want to make that really
(00:12:32)
simple to wire up your applications,
(00:12:34)
change out your models,
(00:12:35)
and do so without any thrash.
(00:12:37)
ERIC BOYD: I would add, too.
(00:12:39)
Asha touched on multimodality earlier.
(00:12:41)
This is something that we've been
(00:12:42)
seeing coming for a while.
(00:12:44)
But really, with GPT-4.0,
(00:12:46)
it really brought home just how
(00:12:48)
powerful this could be to have
(00:12:50)
that much fluency with voice
(00:12:52)
and talking to a model and having it talk back.
(00:12:56)
There's these inflection points on
(00:12:58)
all technologies where suddenly you get to
(00:13:00)
a point where it's good enough that it
(00:13:03)
opens up scenarios that didn't use to previously work.
(00:13:05)
In the early 2000s,
(00:13:08)
speech recognition had to be word at a time,
(00:13:11)
and now it's much more natural for me to talk to
(00:13:13)
my phone than to swipe text messages.
(00:13:16)
I think this level of voice fluency
(00:13:19)
is going to change the way that we interact with
(00:13:22)
machines where we'll just
(00:13:24)
talk much more naturally in our voice as opposed to
(00:13:26)
doctoring things to machine
(00:13:28)
speak and making it work that way.
(00:13:30)
I think that combined with vision,
(00:13:32)
is going to be really
(00:13:33)
interesting the types of things that we can do.
(00:13:34)
ASHA SHARMA: Now I was just going
(00:13:35)
to say, I agree.
(00:13:36)
In many ways, we've taught ourselves to
(00:13:37)
prompt engineer queries for search.
(00:13:40)
Your demo this morning was pretty
(00:13:42)
amazing because it was just you
(00:13:44)
expressing yourself to an application that now becomes
(00:13:48)
much smarter over time and can help you in
(00:13:50)
any mode that you want in any language that you want.
(00:13:52)
SETH JUAREZ: For those wondering,
(00:13:53)
that actually I coded that up.
(00:13:54)
That's a real thing.
(00:13:55)
ASHA SHARMA: I noticed
(00:13:56)
SETH JUAREZ: It's pretty amazing because,
(00:13:58)
as I was interacting with it,
(00:14:00)
I forgot the interface.
(00:14:04)
It wasn't a keyboard thing anymore.
(00:14:07)
Where should people go to get
(00:14:08)
started with this stuff, Asha?
(00:14:11)
ASHA SHARMA: Azure Studio is
(00:14:12)
the best place.
(00:14:13)
We have a playground, so you can just start to
(00:14:14)
dabble with the models and
(00:14:16)
then you can consume the models.
(00:14:17)
That's a great place to get started.
(00:14:19)
SETH JUAREZ: Any final thoughts from you,
(00:14:20)
Eric and then Asha.
(00:14:22)
ERIC BOYD: There's so much happening
(00:14:23)
in AI these days,
(00:14:24)
that the main thing for developers
(00:14:26)
is they just have to get their hands wet,
(00:14:28)
like just dig in
(00:14:29)
and start building something and just start
(00:14:31)
to see what you can do with it
(00:14:33)
because until you have that experience,
(00:14:35)
until you're really seeing how things
(00:14:36)
compose and how you can use it.
(00:14:39)
You're just not going to have that full understanding.
(00:14:41)
That's the main thing I would encourage people to do is,
(00:14:43)
go try these tools,
(00:14:45)
go build applications, and you'll
(00:14:47)
be surprised at the power
(00:14:48)
that you can really create with things.
(00:14:50)
SETH JUAREZ: Asha for your last word.
(00:14:51)
ASHA SHARMA: He took it.
(00:14:52)
I feel like AI just
(00:14:53)
say get second. That's okay.
(00:14:55)
No. I think that we just need
(00:14:58)
to imagine the world as
(00:15:00)
every application becoming an AI application.
(00:15:03)
I would say, think at a much bigger scale
(00:15:05)
than what we have before.
(00:15:06)
SETH JUAREZ: Well, this has been amazing,
(00:15:07)
and thank you so much for letting me work
(00:15:09)
with you on these cool models.
(00:15:11)
It's really exciting. Thank you so much,
(00:15:12)
and we'll see you after this.
