Microsoft's Cloud & AI Head on the AI Buildout's Risks and ROI — With Scott Guthrie
Channel: Alex Kantrowitz
Published at: 2025-10-01
YouTube video id: g6vFyuCZrVs
Source: https://www.youtube.com/watch?v=g6vFyuCZrVs
Microsoft's head of cloud and AI joins us as we ask, is this AI buildout going too far? That's coming up right after this. Welcome to Big Technology Podcast, a show for coolheaded and nuanced conversation of the tech world and beyond. We're joined today by Scott Guthrie. He's the head of cloud and AI at Microsoft. And he is the perfect guest to give us some context on the massive and some would say insane buildout of AI data centers taking place today. What does it mean? Is it going too far? What will it lead to? Scott, I'm so thrilled to have you on the show. Welcome. >> It's great to be here, Alex. Thanks for having me. >> All right, let me take you through the headlines over the past couple weeks. It's crazy that this has just been over the past few weeks, but uh here we go. Nvidia agreed or announced that it would invest up to hundred billion in OpenAI, uh starting out with 10 billion. Oracle announced it would invest 30 billion uh in OpenAI or a $30 billion uh build out with with the company. Anthropic raised 13 billion. So, you know, we're just talking about a cool 143 billion. No big deal. Uh, is this crazy? Is this overinvestment? >> Well, I think there's uh it's a great question. I'm I'm sure that's top of mind for everyone. I think, you know, stepping back for a moment, I would say if you look at AI and the impact I think it's going to have in the economy, uh, it's going to be, I think, the most profound technology shift in in our lifetimes. And so I I think if you look at um the long-term trend uh I I don't see I I don't worry about overinvesting. I think that you know there will be a question on the horizon of different companies are making different strategies in terms of their investment and how they get their return in the one two three year horizon. So you know am I going to say that every company is perfectly timed? I don't I'm not going to make that assertion. But at the same time, I do think the long-term secular trend of AI is going to be that we're going to need more infrastructure. There's going to be more ROI from it and it's going to be more widely used. And so, you know, I think directionally from an industry perspective, um the investments do make sense and and will ultimately yield pretty profound um results. >> You think so? You think that this level of buildout is healthy? I I think we definitely are not nearly at the point at which there is too much AI infrastructure given I think the number of AI workloads that are coming for the world. Um and um you know I think we're seeing on a over the last couple years as people use AI they get value they use it more the models get better and people then use it even more for new use cases and and I think at this point across the industry with AI we're still more supply constrained than we are demand constrained and I think um I you know I expect that to continue over the next couple years as the technology continues to evolve and as people start to integrate AI into more and more workflows. >> Okay. So, let me put it bluntly then. Um, Microsoft has a partnership with OpenAI, has invested 13 billion thereabouts, has the capacity to build big data centers. Um, why did Microsoft make the decision not to do the hundred billion level buildout with OpenAI uh or even the $30 billion that the company is doing with Oracle um and leave it to other partners to do that? Well, we we have a great partnership with OpenAI and it's gone back many many years and and continues going forward and we are building out and doing uh uh a lot of projects with OpenAI and um uh across the Microsoft cloud we're building out AI data centers all over the world. uh and you know at the same time you know we are balancing um you know our investment uh to make sure that it maximizes um the AI infrastructure for both our first party Microsoft offerings uh that we're investing in our customers uh AI offerings that we're investing in and obviously OpenAI's uh offerings um that we're deeply enabling. So, you know, we are very invested. I don't think it's a binary. Are we are we uh building out for OpenAI or not? We definitely are building out for OpenAI. Uh and at the same time, the way our partnership works is we're supportive of of others u participating in that as well. >> Okay. Okay. But I I just want to put a fine point on it because u again like if if you believe that uh this technology is going to be massively transformative which you stated uh and that we're not at the uh at the sort of optimal point of the AI buildout yet that there's room to continue to do more. Um and again it's the partnership with what is the consensus leader in the space. They needed more infrastructure. There must have been some calculation within your group or your company to say uh is it worth it for us to be the one that goes out and builds this massive massive footprint you know in partnership with them or somebody else. So I definitely understand there's there's multiple stakeholders but what made Microsoft pause on that front? Well, we we have a balanced view and so it's we and we take both a a long-term view in terms of making sure that we're building out in all the locations that we want to build out that we're being um uh thoughtful in terms of kind of the investment spend and the infrastructure that we're building and also recognize that we don't have to do it all. And so I think we're always trying to kind of take a a continually balanced view of that and and as you've seen from our capex and as you've seen from our earnings calls, we are investing a lot in infrastructure and building out like crazy. Um but again, at the same time, you know, we're we're always constantly re-evaluating and watching closely, you know, which data centers in which markets to what specifications and making sure that that we keep, you know, good discipline as we're doing it. um that optimizes for both the long-term, near-term, and midterm uh horizons. >> Okay. I I'll just ask one more follow-up and then we can move on. Um good discipline. What about uh this would have been undisiplined to have gone uh to this level. >> Well, I don't think it's so much the volume level. I think it's it's uh we one of the things that we do when we add new data data center capacity or AI infrastructure is you know making sure that that we can use this infrastructure for a variety of different AI use cases. I think one of the things that's really going to differentiate um AI infrastructure companies in the future uh is that ability to kind of maximize yield on the infrastructure like how are you driving down the cost of you know tokens per watt per dollar and um you know part of what makes the Microsoft portfolio so unique is the fact that we have a lot of our own AI uh products Microsoft 365 copilot GitHub copilot um the work that we're doing with nuance and and dragon and healthcare. Uh we've got the world's largest consumer application with chat GPT uh that runs on top of Azure and we have thousands uh hundreds of thousands and millions of businesses that are also building their own AI applications on top of us. And so as we think about like what market are we going to build a new data center? Is it for training? Is it for inferencing? Um and uh you know how do we make sure that that infrastructure is going to be maximally used? You know we we feed in kind of each of these different customer scenarios into our calculus and there are certain tranches of capacity that we're happy to build out because we can see very clear line of sight in terms of how we're going to maximize the usage and the revenue from it. And there's others that were maybe less uh uh likely to see the immediate or the ROI that we'd like. And so we try to be disciplined about it. Um as we've kind of shared in our blog post, you know, we do kind of look at every request first and we do have an opportunity on that. And and as you've seen from our capex, uh we are swinging in a lot of opportunities. Um but that doesn't, you know, we're not going to be undisiplined and say blanket we're going to do everything. you know, we we know that um you know, some opportunities will have uh more certain returns than others, and we're trying to make sure that we maximize our focus around those. >> You know, as as we're talking, I'm kind of laughing at myself because, you know, my my question is basically boiling down to you've like talked about your capex. Isn't Microsoft expected to spend like 80 billion on infrastructure this year or in the neighborhood? I think that's what we shared in our last >> and I'm like well why aren't you doing another 100red billion um but but the fact that you're not as actually very interesting and um and it goes to a point that you that you just made um and and I'm trying to read behind the line between the lines and you tell me if I'm if I'm getting this right. uh you think you're you're talking about where you invest and where you're pretty sure you're going to get an ROI and um to me if I'm sitting in your shoes the question I would be asking is uh is it worth spending you know all that money on training where there where there's been a a lot of um noise about diminishing returns of training larger models with these unbelievably massive data centers now you have the startups uh like OpenAI and uh Anthropic, you know, their their belief in the scaling laws seems unabated. Uh and so the numbers get bigger and bigger and uh they seem to believe that they'll continue to get an exponential return from training these bigger models. But is your decision in terms of being disciplined based on a belief that you're not you're not sure if if scaling up will continue to work and therefore it's too big of a risk to make such a large bet on uh training an even bigger model in a bigger data center. >> Well, I think there's there's a couple different elements of that. I think one is um recognizing that you want to have the best models. So, you know, training is super important. uh because if you don't have the best models then you know your actual ability to monetize AI goes down you know at um uh you know then and you know part of what makes our partnership with OpenAI unique is the fact that we do have access to the best models frankly whether they're trained on our infrastructure or anywhere else um you know that's part of our um partnership that's really important um and I also think when you think about training training is evolving uh from maybe where uh simplistically we'd think of training a couple years ago of you you do training in one place and then you do inferencing where you are uh executing the models and building applications. You know there's now multiple types of training. There's pre-training, there's post-raining, uh there's reinforcement learning, there's fine-tuning. Um there's a lot of new techniques that uh both sometimes require lots of contiguous infrastructure and sometimes requires lots of infrastructure but sometimes it's smaller sizes uh that can be used for very specific tasks. Uh and so when we think about the investments of our infrastructure we're trying to think about all of this and compose it all end to end. Um, you know, for us that means for example, we want to make sure that we have lots of inferencing capacity because that ultimately is how customers pay us. Um, and how ultimately you make money from any product AI product that you build. Um and I think increasingly on the uh inferencing side, you know, one important element is the geopolitics of the world have gotten complicated over the last many years. And uh you know, customers in Europe want to make sure that their AI is in Europe and the customers in Asia are going to care about their AI in Asia. Obviously, the customers in North America and the United States are going to care about their AI being delivered uh uh in in North America. And so, you know, even as we build out our infrastructure, we want to think about it not just narrowly as we want to have one giant pool all in the US. We need to kind of be distributed around the world to kind of meet those uh geopolitical needs and to make sure that our AI is as close to the customers that are going to be using the AI as possible and can can meet all of the data residency and data sovereignty needs. And so even if you look at our infrastructure builds around the world, we have regions in more countries in more locations than any other right infrastructure provider. And um again, as we balance out the investments we're making on AI info, you know, we're trying to keep that in mind versus narrowly put it all in one location. >> I totally understand that, but I have to go back to the diminishing returns of training uh question. What where do you stand on that? Well, I think if you look at training broadly, um, I think you're going to continue to see more value from the models by doing more training. But kind of going back to my answer earlier, I don't know if that's always going to be pre-training. I think increasingly lots of post-training activities are going to significantly change the value of the model. And so by post training I mean take the base model and how do you add financial data or healthcare data or something that's very specific to an application or a use case. What's nice about post- training is that you don't have to do it in one large data center in one location. And so part of the the technique that we've been focused on is how do we take this inferencing capacity around the world and a lot of it is idle at night as people go to sleep. you know, how are we doing increasingly post- trainining in a distributed fashion across many many different sites? Uh, and then when employees come to work in the morning, we serve the applications. And so having that kind of flexibility and being able to dynamically schedule your AI infrastructure so that you're maximizing revenue generation and training ideally in a very swappable dynamic way. I think is one of the things we're investing in heavily and I think is one of the differentiators for Microsoft. >> Okay. Okay, but you'll forgive me for going back to this uh scaling pre-training question. Um >> I'm just trying to see what you believe here and uh you haven't said it outright, but from your answers, it does seem to me like you believe that uh spending wildly on scaling pre-training is a bad bet. >> I wouldn't necessarily say that. I think we've definitely seen as the the scale infrastructure for um pre-training has gotten bigger, we're we are seeing the models continually improve and and we're investing in those types of pre-training um sites and infrastructure. You know, we recently, for example, announced our fairwater uh data, you know, data center regions around the US. We have multiple fair waters um and you know we we did a blog post recently of one of our new sites in Wisconsin and these are you know hundreds of megawws um uh hundreds of thousands of the latest GB200s and GB300 GPUs and are um yeah we think the largest contiguous block of GPUs anywhere in the world um in one giant training infrastructure that can be used for pre-training um and So we're investing heavily in that uh as you could see kind of from the the the photos uh from the sky uh in terms of massive infrastructure. Um and you know we do continue to see the scaling laws improve. Now will the scaling laws improve linearly? Will they improve at the rate that they have? I think that is a question that everyone right now in the AI space is still trying to calculate. Um but do I think they'll improve? Yes. Um, and the question really around what's the rate of improvement on pre-training? And I do think with post- training, we're going to continue to see dramatic improvements. Uh, and that's again why we're trying to make sure we have a balanced investment both on pre-training and post-training infrastructure. >> And and yeah, and just to parse your words here, it's uh you can see improvement by making by doubling the data center, but that's why I use the word bet because are you going to get the same return if it doesn't improve exponentially? and just improves on the margins and that I think is the big question right now right >> it's a big question and and you know the thing that also makes it um uh the big question is uh it it's not like a law of nature that's immovable and so there could be one breakthrough that actually changes the scaling laws for better and and there could be a lack of breakthroughs that means again things will still improve but do they improve at the same rate that they historically did uh from a raw size and scale perspective and that is the trillion dollar questions. >> Okay, great. I do want to get to the ROI of of uh AI spend uh in in a moment. Um you know, it's always great to have a chance to speak with someone who's in a position like you are within Microsoft because we get a chance to like take some headlines and uh which might paint a portion of a story and then ask you what the truth is. Uh there were some stories over the past year talking about Microsoft had like cancelled options to build data centers in certain locations and people took those headlines and they read into it that there was no demand for AI or that um uh it wasn't going as well as Microsoft's telling us. Uh but but what is the what is the reason for um why those those data centers that there were the options and they were canceled? What what happened there? Well, we're we're constantly um well, I think in general the headlines were focused on things that we canceled as opposed to all the things we signed. Um and so if you look at a given >> It's amazing how news works that way, right? If it bleeds, it leads. So >> if you um if you look at kind of the overall investments and certainly if you look at the overall capex spend, um it has been going up uh and up and up and so um as has again the revenue that comes from it. And so I think um I would kind of focus on the overall picture as opposed to individual tanches or individual projects that we potentially made decisions on. Now you know the thing that we did do and we continually do is look hard at every single investment decision we make and we don't take this level of investment and this level of project and infrastructure lightly. It's critical that we uh invest wisely. Is it critical if we invest that we make it successful and that we bring it to market on time with the right quality and the right security? Um, and it's critical that we have the right go to market to monetize it. And so, you know, part of our calculus that we do as a leadership team is constantly looking at the variables for all of those. And there are places and times when we slow down or pause projects and there are times when we accelerate projects somewhere else. Um and kind of going back to my comment around the world, you know, the also the um you know regulation geopolitics of of how AI is going to be used going forward has changed quite a bit and what Europe thinks about where GPUs can be based has evolved quite a bit I'd say in the last 12 to 18 months and I think it's going to continue to evolve around the world. And so even as we think about the investments we're making, we're also being very very thoughtful in terms of where geography based are we investing so that we can again maximize um the uh AI tokens we can serve in in real production applications. Uh and then ultimately use that maximization to uh ensure that we're delivering a good return on investment for every capital dollar we spend. >> Okay. And I I have some technology questions for you, but um just to keep on speaking about the financing of this stuff because it's so important. Um so there has been some interesting reporting about how uh the AI infrastructure buildout has begun to be funded by debt, not just profits. Great story in the Wall Street Journal this week. It says, "Debt is fueling the next wave of the AI boom." I'll read the beginning. In the initial years of the AI boom, companies uh comparisons to the dotcom bubble didn't make sense. Three years in, growing level of debt are making them ring truer. Early on, wealthy tech companies were opening their wallets to out joust each other for leadership in AI. They were spending cash generated largely from advertising and cloud computing businesses. Uh there was no debtfueled splurge on computing and networking infrastructure like the one that inflated the bubble two and a half decades ago. Uh however uh that is starting to happen. Now uh OpenAI's deal with Oracle um is uh has been uh pushed Oracle to start taking on debt. Um they say this is according to the story. Analysts at Key Bank Capital Markets estimated in a recent note that Oracle would have to borrow 25 billion a year over the next four years. Um, obviously you guys are not Oracle, but um, you know, you're you're watching this happen as it plays out and see the parallels to the dot boom, I'm sure, is not not fun. You've been at Microsoft for I think 27 years, >> 28 years, >> 28, sorry, I don't want to miss that that last year there. Um, so you've seen it. Uh, Scott, uh, is this this seems to be an issue at least from the outside. What do you think about it being on the inside? Well, I think I um I think uh obviously there's a tremendous amount of spend from lots of different companies and I I would say um yeah, the thing I can speak most to is what we're doing. Um and kind of uh you know per my comments earlier, I think we're trying to make sure that we have a smart um investment play uh and a long-term u strategic play that allows us to ride the AI revolution that we think is going to transform the world and do it in a way that leverages some of the strengths that we have at Microsoft which is we have a very good cash flow. We have a very diverse portfolio of businesses in particular in the commercial enterprise space. Um whether it's cloud infrastructure, productivity applications, business applications, security etc. All of them are going to be transformed by AI. uh and you know if you look at say to your comment earlier on the Wall Street Journal post I think if you even read further in the post you know it does show the ratios for different companies and there are some companies that are 400% debt to equity ratios and >> that would be Oracle >> uh and then there are other companies that are are much smaller and that would be Microsoft and I think you know we want to make sure that we're not um and I think again based on our capex spend and uh uh and the rate at which our capex spend is going up. You know, we're not going to sit on the sidelines and not be bold as we invest. And at the same time, you know, I think the thing that our investors expect and ultimately I think every investor of every company will expect is to see that revenue growing um in terms of AI services and products that are being delivered in terms of net revenue recognized in a quarter and making sure that the proportionality of that to the spend and in particular to the obligations that maybe are being undertaken with debt are balanced. And um yeah, I can that's the thing that we've been focused on. I think um you know if you look at our last quarterly earnings I think people were pretty pleased with the getting the balance right there and you know every every quarter going forward people are obviously going to be looking at making sure that that balance is right so that they see us investing for the long term and going to win and at the same time doing it in a way that um is sustainable and allows us to kind of ride through um uh you know the ups and downs that inevitably will happen over the next many years as this technology you know transforms the world. >> What are the consequences if this goes wrong with the debt? >> Uh well >> you're obviously not taking on the same amount of debt. So there's a rationale behind it. What happens if Yeah. it breaks? >> Uh well I mean we have the ability we're not constrained. I mean our borrowing costs ironically right now are low. >> Yeah. industrywide, big picture, industrywide, not Microsoft specifically. >> Well, I think the thing that um uh that we as an industry, I think, you know, again, you need to have that thesis of how you're going to use the infrastructure and is it do you have I I would focus less on the megawws um that sometimes get reported in the press and more at where are those megawatts and what are you going to do with those megawws? Is it going to be ultimately capacity that you can use to serve customers? Is it to build better models that help you serve customers? And you know what is the line of sight in terms of the the product services and revenue that comes from it? And I think that's a place where again between chatbt which is the number one AI app in the world, between Microsoft 365 which is the number one enterprise AI app in the world and between GitHub which is the number one developer AI app in the world. I feel good that we have applications using our infrastructure and maximizing it. And I feel good about the investments we're making in terms of capital spend and buildout in the right locations to kind of continue to do that. And I think not every company probably has that um level of um game plan and I don't think that maybe not every company is probably doing the same level of thoughtfulness of that and you know at some point you know different companies will probably be hit by it but you know we're very focused on what we do and how do we make sure that we stay um aggressive yet disciplined and make sure that we get that balance right. All right, I want to take a quick break and then I'm going to ask you a couple uh technology questions about the state of the buildout, GPUs, uh custom silicon, and uh and then maybe we can get a little bit into this ROI question. In fact, we will we have to talk about the ROI of AI. We'll do that right after this. And we're back here on Big Technology podcast with Scott Guthrie, the head of cloud and AI at Microsoft. Uh Scott, we have a Discord here at Big Technology and I asked some of our members, you know, what they would ask you and we got a flood of excellent questions. Uh and I think I think the they were great because they focused on the technology. Some questions that I don't think you hear too often um in the common conversation about this technology. So, uh you're the perfect person to ask. I'm going to ask them to you. One of our members asked, "What is the working life of a GPU and how long until they burn out?" they are there use cases for GPUs once they are no longer top of the market right we hear often about okay well unlike the the um the laying of the fiber the GPU depreciates after a couple years so I think this is a a pretty important question can you can you hand that tackle that for us >> yeah I think kind of going back to the comments we had earlier on balance I think as you think about your GPU buildout one of the things that we think about is the lifetime of the GPU you and how we use it. I think you know what you use it for in year one or two might be very different than how you use it in year 3, four and five or six. Um and so you know I think that is something where um you know so far we've always been able to use our GPUs even ones that we deployed multiple years ago for different use cases and and get positive ROI from it and that's why our depreciation cycle for GPUs is what it is. Um but I do think that's you know as we build out um our infrastructure we are definitely consciously thinking about that because uh you don't want to have your entire fleet in two years suddenly have to be replaced because um you know that that that would be expensive and so you know we are very thoughtful on that um and again that's I think I talked earlier about um different training I also think even as you think about training we we often in the past used to monolithically call training training. There's lots and lots of different training use cases now. There's pre-training, there's synthetic data generation that goes into training, there's post-raining with RL and fine-tuning and and other different techniques and um you know having infrastructure that's very fungeable and that you can use for a variety of different training infra scenarios and at the same time be used for inferencing where you ultimately enable an application to perform a query or perform an AI uh invocation. um is key and I think that's uh uh does go goes beyond just the the GPUs even though I think people often narrowly focus on that. It also is around the data center architecture. It's around the storage and the compute that's near the GPUs. And it also really comes into play with the network because if you are for example building one large data center that only does training and it's not connected to a wide area network around the world that's close to the users, it's hard to use that same infrastructure for inferencing um because you can't go faster than the speed of light. And so someone elsewhere around the world that wants to call that GPU if you don't have the network to support it um uh you can't use it for those inferencing needs. And so again going back to kind of some my comments earlier about how we're trying to be very thoughtful about where we place infrastructure and how we um maximize the utilization. We're definitely thinking of that not just for this year or this quarter, but thinking about it on that four or five or six year horizon for how we want to basically leverage and and use it. >> Okay. Uh here's another question. Are there any cool technological breakthroughs that would change the economies of data centers as we know them now? GPUs started as uh graphics processing units for video games. Uh are there resources you found that might do as well but with fewer constraints? Well, I think one of the biggest changes that's happening right now from a data center perspective, and you're seeing this with the latest Nvidia GPUs, and I think you're going to see this in a more profound way over the next um two years, is the shift from air cooled data centers where you use, you know, effectively giant air conditioning units or chillers um to a uh liquid to liquid cooled facility water where you're actually pumping in water in order to to cool the equipment uh in a in a closed loop circulating system. So in other words, you feed in cold water, you run it over the GPUs effectively, extract the water, cool it down again, and then do it again throughout the building. That's a massive technology change. Uh and it does mean that older data centers um that are air cooled, you know, they can't just drop in liquid cooling uh and be effective. And so that is something that I think everyone that's in the AI space is designing for and needs to be thoughtful of again with their infrastructure projects to make sure that they're ready for that technology shift. It also is going to have a big difference and big impact in terms of the staffing. Um when you have a aircooled data center you'd have very few employees often per server. when you all start to involve water and liquid, you know, it's not massively more, but at the same time, it does change staffing because there are more things that break when you have pipes that are actually uh continuously flowing liquid uh into a data center. So, it's there's a lot of technology shifts that are happening right now behind the scenes beyond the GPUs. And then obviously GPUs are the things that dominate the press in terms of innovations both in terms of the silicon but also in terms of the network because at the end of the day if you have a chip that can process a lot more information but you don't have the ability to get that information to the chip and extract it or have it communicate with other chips you know then you don't get the the yield out of it. And so um I think it's fascinating right now in technology the pace at which so many things are evolving so fast both with the GPUs to the question but then also the data centers even the power and cooling infrastructure for the data centers and the network and um you know as a technologist it's it's exciting times >> right you mentioned staffing so I want to ask a follow-up on that front u I think for those who don't live those who are not in this uh deep into it. There is a perception that data centers um they're placed uh near communities in some cases. They use up a lot of water. They don't provide a lot of jobs. Is that a misconception? >> I think it's a misconception. Um uh I mean >> like some numbers to sort of flesh out what they actually bring to a community that they appear next to. >> Yeah. Yeah, I mean, we we've talked about with our Wisconsin uh fairwater site that we did some press on recently and um uh talked about uh including with the governor of Wisconsin and and others that were attending. You know, it's it's thousands of jobs that we've created on the construction of the site. Um uh I think we've we've shared over 3,000 um jobs and these are these are very skilled jobs. These are um you're talking about electricians, you're talking about plumbers, you're talking about welders, you're talking about skilled tradescraft uh and um uh and you know highquality jobs and and I think you know if if you look I mean we have phenomenal work site, phenomenal workers there and a phenomenal safety culture which has allowed us to attract some of the the best workers to work on that project. I think what people are missing sometimes when they say, "Okay, but when the project's done, how many people are going to be in the data center um and there will be, you know, hundreds of people that will be in the data center." What people are missing is the fact that right next to that data center, we're building another data center. And so those thousands of people that have been working on the first Fairwater data center, we just announced are now going to be starting work on the second one. Uh and then after the second one, we will do a third one. And and if you look at the land and you look at the power we've accumulated in that area, it's multi- gigawatts of land um or multi gigawatts of power and it's it's an awful lot of land. And so you you're going to see us continue to employ thousands of very skilled trades um uh workers in that community. And as each one of those data centers comes online, we're going to add net new employees that will actually operate it and uh and manage it. So, you know, that's would be an example, I think, of a community and we, you know, we have over 400 data centers around the world. So, they're not all that size obviously, but um replicating that and I think as more infrastructure gets built out, you're going to continue to see um not just jobs created, but well-paying jobs that really require real uh real uh trade craft. Do you feel what do do you feel the um I don't know the right word to put it the pressure of competing with China um because from my understanding China has a much looser regulatory approval process and they're just stacking you know data centers they have abundant electricity uh in the United States in particular um I imagine Europe is the same way uh that that is not the case so what's it like oh >> certainly I think the world has a very different regulatory approval process. I mean I think one thing that when I talk to people and they say how can you build data centers faster um you know there's obviously things that we can do from a technology and are doing from a technology and from a manufacturing perspective but you know candidly here in the US the longest part of building a data center is getting permitting. It's not actually the construction. It's it's making sure that you you know uh get permitting approval for all the steps that you want to take. And you know, different states and different parts of the country have different regulatory environments. And I think even if you look at a sort of a heat map, if you will of where data centers are being built in the US, you definitely see pockets. And I would say some of that approximates to where there is land and where there's power and some of it really, you know, closely correlates with where it is easier or faster to kind of complete the permitting process. Um, you know, in Wisconsin, we had a, you know, a phenomenal partnership with the governor and the local county. Um, we were able to to purchase some land and power that, um, a manufacturer was previously going to use and they they pulled out of a project. And so, you know, I think that the the the local communities recognized if they weren't able to work with us, you know, they were gonna lose jobs and and have, you know, impact on the community and they leaned in with us and, uh, you know, can't say enough positives in terms of the speed with which, you know, we went through all the process. We got all the approvals. It was a very thorough process, but it was streamlined so that we could move fast and that we could actually help ensure that jobs weren't lost and that instead they were created in the community. And I think there's more opportunities for public private partnership like that that we'd certainly welcome as part of it. >> Okay, another Discord question. Uh what time frames are you looking at to get ROI on these investments? So when would you stop investing if prior investments weren't showing returns? And how do you know when it's time to stop building? Well, I think you know at the end of the day I think you know every every quarter we we share our revenue growth and we share our capex spend and you know to some extent I think the you know markets keep um uh companies honest in terms of um uh that balance and u you know sometimes markets can be slightly irrational at times but in the limit the markets keep you honest and >> um yeah that's a big part of why we focus so much on making sure we get that balance right. Make sure we're we're again investing for the long run. I don't think anyone if you look at our capex spend and our our commitments and our investments would say that we are um uh not being bold. But at the same time, you know, we have a report card every quarter where we need to kind of demonstrate and uh prove not just with press releases, but you know, here's how much revenue growth we had. You know, last quarter we grew Azure at 39% year-over-year on a very large number and a lot of that was driven by AI and then also driven by the other systems that come with AI. um because there are databases and there's compute and there's storage sold with that AI and you know I think investors were happy with the um both the spend and the aggressiveness that we were building out but also the return and you know I think that's going to be true um you know forever and you know making sure you get that balance right and and again as part of that balance in markets want to know you're investing to win the long run uh and at the same time they want to make sure you have some level of discipline And and I think our portfolio the the balance that we have both across the the products we build but then also the fact that we have the largest AI product in the world called Jad GBT running on top of our cloud. Um gives us a unique opportunity to get that balance and that growth and that investment right. >> Uh is is chat GPT by the way going to stay on Azure even though OpenAI is making these partnerships with uh Nvidia and Oracle? >> Uh yes. >> Okay. All right. It's good to get something definitive on that. Um, you know, you mentioned your 39% Azure growth and you know uh I'm uh I'm looking at your quarterly numbers every every uh quarter and uh often talking about them on CNBC and the numbers are are massive. And the other side of it though is so that's spend coming from clients, right? And there have been multiple studies that have come out recently that have talked about how enterprises aren't getting the ROI that they've anticipated on their AI projects yet. Uh when you see those studies, do they ring true to you? How do you react to them? >> Well, I think um uh I think when you say AI in general, it's a very broad statement. Um, and >> this is gener it's this is obviously I mean not obviously it's in large part this is generative AI where companies everywhere have tried to adopt LLMs and try to put some version of that into play in there and it's not recommener engines basically. >> Yeah. But I think what you need to do is double click even further from Genai to GitHub copilot or healthcare or uh Microsoft 365 copilot or security products built with geni. I do think ultimately, you know, the closer you can kind of doubleclick on is this really delivering ROI, then then you have much more precise data because I do think a lot of companies have dabbled or done internal kind of I'll call it proof of concepts and some of them have paid off and some of them haven't but um um yeah and and but I think ultimately a lot of the solutions that are paying off that we we continually hear from our clients and our and our customers is um uh you know a bunch of the applications for example that we've built um I think similarly you know a bunch of the applications that our partners have built on top of us and you know ultimately the Azure business is um you know we get paid based on consumption um it is it's a consumption based business meaning if people aren't actually running something we don't get paid it's not like they're they're pre- buying a ton of stuff um you know we recognize our revenue based on when it's used and so you know the Good news is when you look at our revenue growth, it is um you know it's not a bookings number. It's actually a consumption number and uh you can tell that people are consuming more and um you know you know the last two quarters our our revenue growth's accelerated on a big number and that that is um you know a statement of the fact that I think people are getting a lot of ROI at least with the projects that they're running on top of our cloud. >> Yeah, I think that's an important point to to bring home. It is consumption based. Um, so you you talked a little bit about water cooling versus air cooling. I love the term for the air cooling. It's called chillers and that's what my friends in high school called ourselves, you know, back in the day. Um, and um, I want to end on the GPU side of things or the the silicon side of things. Um, what do you think the potential is for for custom silicon in the AI world? I mean, like we talked about previously, GPUs were designed for gaming. they happen to do parallel processing actually ended up being really good for uh you know large language models the training and the and the inference. Um what's your perspective on on whether this industry is going to continue to run on that type of chip uh and what the potential is for custom silicon? I think um a couple things I think one is it um I think that increasing the um the the number of tokens you can get per watt per dollar is going to be the game over the next couple years and and maximizing u the ability of our cloud to deliver the best uh volume of tokens for every watt of power for every dollar that's spent where the dollar is spent on energy, it's spent on the GPUs, it's spent on the data center infrastructure, it's spent on the network and it's spent on everything else is is the thing that again we're laser focused on and uh it is you know there's a bunch of steps as part of that GPUs being a critical component of it. Um, and you know, one of the things that our scale gives us the ability to do is to invest for uh kind of nonlinear improvements in that type of productivity and that type of yield. You know, if you've got, you know, a million dollars of revenue on couple hundred GPUs, you're not going to be investing in custom silicon. Um, when you're at our scale, you will be. Um, and you're not just investing in custom skeleilican for GPUs for pre-training or for inferencing. You're you're looking at what could we be doing for synthetic data generation with silicon. What can we be doing from a network compression perspective with custom silicon? What can we be doing from a security perspective? And we have bets across all of those, many of which are now in production and are actually powering a lot of these um AI experiences. In fact, I think every GPU server that we're running in the fleet right now um is using custom silicon at the networking compression storage layer that we've built. Um now the GPUs themselves are also going to be a prize that people are going to try to optimize like the actual instructions um for doing um the GPUs. Nvidia is a fantastic partner of ours. Um we're probably one of if not the biggest customer in the world of theirs. Um and um uh we partner super deeply with Jensen and and his team. You know, at the same time and and partly why they're so successful is they're executing incredibly well. Um you know at the same time if you look at the history of silicon um not every silicon company or it's rare to have a silicon company that every single year is is doing the absolute u perfect work that's differentiated and and kudos to Jensen for what he's done and I know he's going to keep trying to do it going forward but you know there will be other opportunities from other companies where people are going to look for a niche that's going to be big enough in this AI space to be truly differentiated versus what Nvidia is delivering and then we're doing our own silicon investment in house. So, because we're going to be going after those same opportunities and and ultimately the way we've tried to build our infrastructure, uh, none of our customers know when they're using Microsoft 365 or GitHub or or any open models what silicon they're running on. And we're going to be constantly tuning the use cases based on the applications. >> And if we find ways that are breakthroughs, we're absolutely going to be taking advantage of them for those use cases. And again, at our balance of scale and our balance of use cases, I'm very confident that we're going to find use cases where custom silicon will make a difference. And I'm also very confident we're going to continue to be a great partner to Nvidia and others in the world that are going to be selling us great solutions. >> All right, Scott, I want to end on this because I'm always I've always been curious about the human aspect of this. like you're going out and working on designing your own chips that are trying to be you know better than GPUs for certain parts of this AI uh uh application layer uh and training and then you're you said one of Nvidia's biggest customers if not its biggest customers. So is this like a situation where like you go to Jensen and you're like we're gonna just both give a shot at building the stuff and may the best chip win and it's friendly like friendly competition or is there any awkwardness in there cuz you're like kind of building the thing that is making them the most valuable company in the world? >> Well, I think probably different companies handle that differently. I think the nice thing about Microsoft is a we've been around a while and I think also we're we're you know we compete almost in every market in some way shape or form. So like there's none of my partners that I'm not also a competitor with. I think is probably a true um form and the important thing is I think you have that enterprise maturity to be able to recognize you know I want Jensen to do the best possible work because it's going to benefit me. Um and we've leaned in. We were the very first cloud to deliver uh live GB200s, you know, which is a massive architectural shift for Nvidia. That's the first of their liquid >> gracewell. >> Yeah. And we were the first >> the first one running, first rack running, the first cluster running, the first data center running of any cloud or neo cloud provider in the world. And so, you know, that's an example where we really leaned in and moved at the speed of light together. And we're going to continue doing those types of projects. and at the same time, you know, he recognizes and understands we're going to be doing lots of things. And I also recognize he's going to work with other providers as well. So, I think the ability to kind of keep a complete thought and recognize it's not zero sum on every single decision. And that at the end of the day, um, you know, it's a market, we're all going to compete and we're also going to partner and, um, you know, I think we have the maturity at Microsoft to do that. Again, the balance I keep I think I've said balance multiple times. I do think balance in life but especially in business um and especially in technology that is the devil's in the detail but if you can get that right and do it consistently those are the companies that win and those are the companies that um really you know have the ability to set the agenda and and that's what we're focused on. >> Well Scott, uh I just want to say thank you for taking the time. I know you don't do this often, so I appreciate why why did you uh say, "Okay, hey, I want to come out and speak about this today." >> Uh well, a bunch of people internally said, "Hey, are you going to talk to Alex?" And so >> that is always a good advice to follow. >> And so I Okay. And um so it's uh it's fun to fun to get a chance to do and uh really I really enjoyed the conversation. >> As did I. Yeah. Thank you again for taking the time again. I know it's rare for you to come out and speak about these things. uh you're running a massive massive and fast growing business and so it was great to be able to speak with you and get into peak get a peak into it today and a look as to what the rest of the industry is doing and your perspective on that. So thanks for coming on the show Scott. Appreciate it. >> Thanks for having me Alex. >> All right everybody, thank you so much for listening and watching. We'll be back on Friday to break down the week's news with Max Ze of Techrunch. It's going to be a great episode and we hope to see you there. Thanks again and we'll see you next time on Big Technology