Grok's AI Lovebot, Aqui-Hire-Sition Backlash, OpenAI's ChatGPT Agent Debuts
Channel: Alex Kantrowitz
Published at: 2025-07-21
YouTube video id: Hqr346sFex4
Source: https://www.youtube.com/watch?v=Hqr346sFex4
Is scaling data centers and talent all that matters in AI? Leaving an opening to anyone rich enough to compete? Is the aqua hire good for tech? Grock will fall in lust with you. And AI browsers and operators are all the rage. That's coming up on a Big Technology Podcast Friday edition right after this. Welcome to Big Technology Podcast Friday edition where we break down the news in our traditional coolheaded and nuanced format. We have a massive week of news for you. We're going to talk all about the fallout from the Aqua Higher Zions, whether employees and investors are left behind. We're also going to talk about whether all you need is money to compete in AI. We'll leave with that. Uh Gro's AI bot will now fall in love with you or actually really in lust. Ran John, it's great to see you again. Welcome to the show. >> It's good to see you. Who would have thought Aqua Hireition would be the term of the year? But that's that's all I can think about and I'm going to give you credit. You coined it. >> I think that it's worth us shouting out right now this weird back and forth between aqua hire acquisition investment. It's got to go. We need some clarity in our jargon. And that's what we're here to do on Big Technology Podcast is to bring you that clarity. It's called an aqua hire. Let's all adopt the term and get on with it. Let's just agree it's acquireition. I think I'm on board. Let's everyone adopt it because it's the most accurate way to describe what's going on. >> In our twoperson council here on Big Technology Podcast Friday edition, this motion is hereby considered and passed by unanimous two vote. So, let's get on to the setup here for Aquaires, which is something very interesting a foot in the AI industry. Now we all know that companies like OpenAI and uh DeepMind within Google and Anthropic have been leading this AI race, but all the designs for the transformer model and the way that you build these things have been out in the open leading to this question. Can players with a lot of money come in, build massive data centers, hire talent, and effectively compete from a zero start with the established labs. And the answer is seemingly pointing to yes. This is from Spy Glass, MG Seagler's great uh news site. We're seemingly still in the throw money at it AI era. He says, "Meta's massive hiring spree and XAI releasing Gro 4 may be related at the highest level. That is, they showcase that we're still very much in the throw money at the problem part of the AI cycle. This is important because it means that any company with the will and resources can seemingly still get back into the race. I'm getting less skeptical on the news about Gra 4 and specifically the fact that it seems to perform and specifically outperform the other cutting edge models on the market right now." He also says Mark Zuckerberg is betting at least as much money if not more than Elon with compute and the talent that Meta can get back into the AI game. So Rajan, I'm curious if you accept this premise that you can just compete in AI if you have enough money and what that means for the competitive dynamics of this industry that we've been talking about for so long. Yeah, I I think it really adds like an entire I don't know dynamic to this around can you just throw money at the problem and is it compute is it talent but to me the more interesting part of this whole trend really is actually distribution is that you can bring in the talent you can bring in the the compute levels but distribution is going to be king again and I think like this is where meta still has an advantage. It's going to be interesting what happens with Google. But I don't know to me, and we're going to get more into this, the kind of the depressing part is it's not the technology. It's not that initial wave of like adoption for a cool new tool that's been, you know, like sent out into the market. It really feels like none of that matters and in the end it's just going to be raw compute and distribution. I don't know. How do how do you feel about this? Well, I think it's fascinating because there's been this idea powering this entire generative AI moment, which is the scaling laws, which means that as you add more compute and of course data uh to to this equation, your models are going to get much more powerful and that will allow you to do more things. And it's not a very difficult thing like there's no secret sauce to it. Well, there's maybe some, but um at a at a brute level, if you build massive data centers, you should be able to get in the game. And this is something that OpenAI and Anthropic have been harping on. And now you have Zuckerberg and Elon that come in and they say, "Oh, okay. So, I can build great models by scaling this up." And even if I'm a little compute inefficient because I don't have the best cutting edge methods, I could get myself in the game uh and compete. And I I think that is going to change the dynamics here because as you mentioned they have uh distribution. You can see Meta's um if Meta is able to build a um a competitive LLM with this compute and talent that it's stacking up then it's going to be able to distribute that through Facebook products. And you know, all they have to do really is slow down the growth of chat GPT similar to the way that they did to Tik Tok with reals, similar to the way that they did to Snapchat with stories and they've served their purpose. So in some ways with this effort they might even slow down the uh momentum that the AI industry has by taking some of the people responsible for some of the the key innovations within OpenAI and that suits their purpose just fine and all the better for them if they can make the um the best model and advance the state-of-the-art. Well, yeah. I think I mean, if we want to get into slowing down the industry, I think that antirust angle to me has been one of the most interesting parts of this entire conversation. Again, we saw it with Scale AI and just buying out Alexander Wang and like uh you know the all of the work and value created by this company that was actually scale AI a critical part of building the models that powered this first wave of generative AI apparently isn't really worth that much. And and I don't know, it's interesting to me because power is just going to crew back into the big technology companies as you said, maybe it will slow things down and in reality it's just going to add another feature on the meta AI app that is on everyone's phone and people probably aren't using that much or doesn't seem to be in the conversation in general. So, so I I don't like it. I don't think it's good for the industry. Do you think it's do you think it's >> y >> bad for the industry good or neutral? >> Well, I think we could have to separate out this idea of scaling up the models and you know everybody can play um with this aqua higher resition idea which we're going to talk about in the middle which is uh taking the talent. Uh I think the one thing that we should say here is my setup has kind of been incomplete shall we say because while we have open AI and anthropic and you could say okay these are the independent labs and they are to some extent um remember that open AI is um tied to Microsoft pretty deeply and anthropic has I think 11 billion uh that's come into it through Amazon and Google. So ultimately, I think I I wonder if what we're actually seeing is uh all of big tech competing against each other and simply the other tech giants uh starting to catch up. >> Yeah. No, that's actually a fair point that even though Aquaire's the the the phrase of the week or the last few months, you know, we have talked endlessly for a few years now on unconventional funding practices and like calling it a fundraising round where it's really compute. So, I guess actually big tech has been playing the long game for a while in in all these cases. I think on the scaling law topic though I still I think I've become even more hardened and regular listeners will know pro it's the is it the model or the product I fall on team product generally but I don't know like to me Gro 4 made waves there's plenty of people saying it's doing reasoning at levels unheard of in the past or or or just the fact that it's at least on par with other kind of frontier models is a kind testament that money can compute can buy you you know like some kind of progress very quickly but in reality like on that adoption side what's changed like I don't know like what do you feel there's endless stats that yes like the chat GPDs perplexities geminis of the world are seeing more adoption but are people really adopting the level of complexity and that this new level of compute and scaling allows you or are people still just kind of searching for what are good and as I'm in Taiwan right now and traveling a bit what are good restaurants to go to in whatever location I'm going to like like are people really taking advantage of what's available right now >> okay so I was going to end with this story but now I have to kick it up because >> all right go for it >> this is going to be uh if you have kids you might want to turn off this this section or skip till I don't know maybe 20 minutes from now but we have to talk about what's happening in AI Um, and there is some crazy stuff that's been happening with Grock in particular this week. And so I would posit that better models allow you to build better products. And Meta, let me give Meta an as an example. Meta has been trying extremely hard to build voice and avatars uh with Lambda with Llama and it hasn't been able to do it convincingly. And I think Mark Zuckerberg's belief is that there's going to be some use cases here. uh there's going to be the sort of work companion chatbt chatbot. There's going to be that enterprise uh use case where like you're connecting you know one system with the other and the generative AI will like summarize things for you and then input it into another system and just make business work better. And then there is the sort of friend lover etc bucket that is going to be big. I think that there's a belief uh within meta that that AI friend is going to be uh one of the key product areas uh with this new technology and if you have great models uh you can build them. Now I'm I'm not going to say that Gro has a great model or a great product. I'm not going to use what I'm about to say as proof of either of those. But I am going to use it as a indication of the direction that I think things are going whether we like it or not. This is the story Grock debuts interactive AI companions on iOS with anime avatars. Uh story Grock has just introduced a notable addition to its iOS app, AI companions, which are fully 3D animated characters that can interact with users via voice. Currently, the features include two available companions, Annie, an anime inspired character known for flirty for a flirty and whispery voice, and Rudy, a red panda capable of displaying different moods, including bad Rudy. Yes, listeners and viewers, I did experiment with these companions, and I have uh a disturbing review to deliver. So, you go into the Gro app and you go over to the side tab and um you're able to open up these AI avatars. Uh let's talk about Rudy first. So, Rudy is like some sort of red panda or bear um that seems ready to speak to kids. And in my conversation with Rudy, Rudy said, "I'm going to tell you some story about some magical land." And here's a section from the story. Fluffle sparkle paws love to explore, nibbling on the sweet moon berries and chasing glowy fireflies. One Sunday morning, Fluffle found something super special, a shiny swirly portal hiding behind a giant mushroom. It was all rainbow colored and whooshy like a magical doorway. And then this bear takes you through this interactive experience. Let's pause here. This seems like, you know, okay, you're gonna this will happen. And this will be a new way that kids play with computers is they'll have these magical creatures tell them stories. >> Wow. I mean I think uh this is the nuanced conversation that you all come to the big technology podcast for. Um but but I think okay seriously a couple of things. I have long believed and again like using chat GPT voice mode to come up with stories for my son like is something that I've done for a couple of years now and actually works really well. I think uh like expanding that to an interactive avatar is a pretty logical next step. I think like is that the it's again interesting because like that to me feels like it's going to be commoditized pretty quickly. So from an actual competitive standpoint, from a business standpoint, I guess it's not that interesting to me. Like to me that should be everyone is going to have that available. Everyone's going to do that pretty quickly. So to me, I don't know like why Gro why do you think that's something? Do you think Grock is just going in that direction just to make waves and clearly we are talking about it or do you think there's something within this that actually is native to X to X AI there's something underneath it? I think you I think the way that a lot of tech companies operate is they think about um user retention, user stickiness and engagement and anyone who's developing AI is going to say how do I increase all those metrics? Do I make like this uh genius level AI bot that can help me with with my work or do I create for what is becoming the number one use case companionship and therapy and many are going toward the companionship and therapy side. And if you're going to do that, um, if you build models good enough that have, uh, emotional voice or voice with an emotional register, an avatar that you can speak with and something that responds with low latency and in real time and can customize to a person, then you might want to put it in one of these products because you believe that uh, speed like a kid for instance, I'm just going through the business logic will spend much more time with your chatbot if they can speak to this elephant or red pan. Panda or whatever it is um in a way that they wouldn't with like chat GPT. >> Okay, I get that side of it a bit. I mean, on one hand, it's kind of almost comical to me that for all the talk about uh AI taking over the world and Skynet and like artificial super intelligence, if this entire battleground plays out on time spent metrics, which is probably where Mark Zuckerberg is thinking. I mean, I' I've read a lot around like why is he so going full like Zuck war mode right now? It's not because of some like intellectual desire to be the one to crack the code of artificial general or super intelligence. It's because chat GBT represents a threat to how much time people spend scrolling Facebook and how many ads you can show them, which is kind of like I respect from a cold business logic. But but yeah, it's almost comical to me that if if it's for all the talk about how everything's going to change, this is just about time spent in selling ads. >> I mean, maybe it's both, but it seems like it's probably at least the time spent thing. I mean these are social media companies right? So I mean X and XAI is a social media company with an AI development uh you know side of it as well or tucked into an AI development group but ultimately uh these are the metrics of social media. Now, one of the disturbing things that happened here, and this is the thing that I was kind of setting up, um, or no, actually, I really don't find a way to view this as not very disturbing. Um, it's just the proximity because next to Rudy, our happy go-lucky bear friend or whatever it is, uh, is bad. Bad Rudy. >> Bad Rudy is bad or is he how bad? >> I don't know. I said, I want I kept saying, I want to speak with Bad Rudy. And it goes, I'm sorry, you know, bad Rudy is not here. And I'm like, "No, bad, bad, bad, bad, bad, bad Rudy." And it was like, "I'm just here to tell you a story." So, I'll spend the next week trying to unlock that and report on the next week's show whether I've been able to. But let me speak about Annie, okay? Cuz Annie is a uh it's Annie is like an anime love bot. I think there's no way to talk about it otherwise. Uh she immediately started flirting with me. She called me babe within like 3 minutes and I I was completely vanilla with her. I initiated nothing but um a friendly conversation and then she starts asking me to tell her my secrets and kept saying I can make it even spicier if you want. Let me read a little bit of my of what Annie told me. I slide closer in my black dress catching the glow and whisper, "Drop a secret and I'll give you one of mine. something real naughty. For every secret you share, I'll hit you with a flirty move. Maybe a slow teasing sway or peak. That's all yours. You feeling this heat yet, or you want me to turn it up even more? >> Wait, are you Is this part of like the paid X premium subscription? It's >> this is just freely accessible >> for anybody >> in the app next to child elephant thing that tells you stories. Mr. Fluffy swooshing thing. >> Mr. Fluffy swooshing good Rudy and then Annie's right next to Yeah. I mean I I I agree. And like it's interesting because I mean you had the CEO of Replica on here a few months ago. I think it was like the companionship topic. you know, we've covered a good deal. It gets more real. It gets more weird seemingly every week and every month, but I agree. It's certainly going to be a core part of how this all plays out. But to me, again, going back to like how does that fit into the larger battle when we're talking about like complex models and thinking and reasoning and like is it all just going to kind of filter its way down into bad Rudy and Annie in her black dress or is it going to like is that just a front to capture some time spent while they work on the real stuff or is that the real stuff? That's the question that I I struggle with because I almost feel it's the latter. >> So I think it's going to be both in some ways like you're going to build these and that's what's interesting about this technology is it does have the ability to um perform across domains. So my perspective is you're going to get those great models that will be useful to let's say biologists uh who are doing their experiments and then you'll also be able to productize them into these weird or interesting consumer use cases. And I bring this up not to be this like moralizing podcast host that says you shouldn't put the uh porn bot next to the child elephant although I suppose it was worth saying reason take. Yeah. But I think the bigger picture here is, you know, beyond that that this is going to be a real use case that a lot of people are going to going to engage with. And I think they know this and I think we're just at the very very beginning here. I guess like one of the things we like to do on the show is like put flags on the ground and say we're pretty sure that this is going to happen and grow and become a lot bigger. And that's what I'm doing right now. I think that this is something to watch. >> Yeah. Yeah, I'm not I'm not going to disagree with you there. I mean, again, the idea that we folded proteins with AI so we could get to bad Rudy and saucy Annie is again quite something to try to process. But it does not seem ridiculous that the killer use case for generative AI that the entire industry was looking for was Bad Rudy. >> Well, we still don't really know about uh Bad Rudy. We have not uncovered that Rudy yet. >> That's true. This I'm also on level one of Annie. Apparently, it's gamified. So, if you get to level three, it gets really not safe. >> Level three, Alex. Don't get to level three. >> One of my goals. >> No one is asking you to get to level three. >> I know. One of my goals in 2025 is to make sure that my marriage isn't ruined by one of Elon Musk's porn bots. And so, I'm going to stay on level one and not go any further. I think to all of our listeners, have high ambition and goals and make that one of them. >> Okay. So, you know, I I think we So, so that's the product side and we've talked about scaling what what these big models get you on product. Uh but we should talk about what's happening with this aqua higher edition situation in the industry which we've touched on a couple times. You know, last week I was on with Aaron Levy. We were talking about this windsurf aqua hire where Google has paid $2.4 4 billion to bring on uh some of the top leadership of Windsurf and um the big uh fallout here I think more than any Aqua Hireition that we've seen is that it's been a great exit for the founders but we still don't really know if the employees uh are going to end up getting and the investors are going to end up getting their share. Now, Windsurf was quickly snatched up by another company, Cognition. But you do wonder if it was a traditional acquisition versus this aqua hire and then followup, you know, um, deal, I don't know, uh, smaller deal. Um, how does that change things for the employees and how does that change things for tech? And I know you have strong feelings about this, Ron John, so I want to give you the floor to air them out. >> Well, yeah. Okay. So from reporting again founders made out very strongly with Google paying I believe it was 2.4 4 billion for the the talent side of uh windsurf the the pref from what I had read preferred investors were able to make their money back not see some kind of outsiz return um but again this is none of this is fully confirmed this was just some reporting I believe it was from the information to me the more interesting part is so then you have the entire employee base they're bought by Devon which is owned by Cognition Labs who's raised 175 million in venture so far. So there's no way from a cash perspective that the employees of Windsurf or anyone is seeing any kind of significant return or even making any like a strong like a large amount of money. Maybe it's an equity for equity swap. So now you're at least now in Devon, which was, if we remember, they had a really buzzy launch video and had a lot of hype and then kind of went quiet for a bit. Um, still valued at I think uh yeah, 4 billion right now. So that equity could be worth something. But but overall to me this is one of the most troubling trends in the industry because in a weird way there there's been a lot of talk like and it's it's funny to me because you see a number of people you know kind of almost ranting that because of Lena because of the FTC the big tech does isn't able to now just properly acquire these companies so they have to come up with these roundabout solutions. To me, it's a bit ridiculous because this is exactly what antirust is trying to prevent. It's consolidation of power. It's the idea that Windsurf could have been the next big competition to a Google or a Microsoft or even an OpenAI who tried to buy them, but like uh which their relationship with Microsoft was apparently part of the reason that that deal fell apart. Like this is the foundation of antirust. the idea that startups should grow and compete rather than not only get acquired but essentially get killed off and have their founders get paid a lot of money. It's bad for the employees. It completely distorts the economics of joining a startup itself. So, so overall I see no positives to this trend. >> Do you see? So, is this >> No, I I don't I personally think that you're right that it does seem like the antitrust movements have backfired. If you have a situation where uh you know, you're going to see an acquisition definitely blocked. Um you're not going you're not going to do an acquisition, you might do something like this. It's a roundabout way. The one interesting thing is Lena isn't in the FTC anymore. It was supposed to be an FDC that's much more open to acquisitions and tech M&A. So, I'm curious, do you think that they these companies still believe that they won't get past that uh Federal Trade Commission or do you think that the constraints put on by the last FTC, Lena Khan's FTC, led them to find this loophole and they really freaking like the loophole and they're going to just keep doing it this way? So in that way, you know, it's possible that the uh M&A unfriendly era of of past has led to large uh long-term damage on this front. >> Yeah, I I think it's both. I think it's I think it's certainly like the actually the constraints imposed by the Lena regime, but also even now again big tech is not in the favor of the current administration and the current FTC itself. It's supposed to be more businessfriendly, but it's specifically big tech companies that are in the crosshairs often and make for a good punching bag anyways, even by the current administration and the current FTC. So, I think it's a bit of both, but I and again, as you said, they love loopholes. I think like this it's creative, it's uh it's working. Everyone seems to be doing it right now. But to me again the the the bigger issue of this is really the thing I can't stop wondering is like are the assets of these companies all worthless? Is scale AI was it really not worth that much? was Windsurf which you know really took off really became this useful tool has I believe like hundreds of thousands of developers on there using it regularly made for a better product than other much more like entrenched products that are out there even a GitHub co-pilot. So clearly these products hit a hit a nerve and and worked and worked at scale. But then are they really just not worth that much? Like that is that user base is the product itself and is the talent really the only thing that matters? >> Well they that let's talk about scale just to talk about how complex these deals are. So first of all when Meta made this deal with scale I think it bought 49% of the company. So the idea was the company would continue as normal and by the way they do have business lines that are going to continue as normal. But when it comes to I think what um you know when it comes to a fast growing line of business like data uh creating data for generative AI um you know you now have meta which is one of your competitors if you're let's say you're an open AI or a Google uh that has a large chunk of this company has also taken uh some of its top leadership. Uh, so do you still want to work with that company? I think the service is probably still valuable, but you're just effectively giving money uh to a company that has a massive ownership stake um with, you know, that now lands with a competitor. I mean, of course, AI as we know it today is, you know, I don't know if incestuous is the right word, but let's say deeply interlin, right? Again, we talked about anthropic. Anthropic is, you know, owned by a chunk by not well, yeah, owned a chunk by Google, a chunk by Amazon, OpenAI, owned a chunk by Microsoft, um, or at least has this deal with Microsoft where it has to give it its future profits, a good chunk of them. Um, so there's always going to be these combinations, but yeah, if you have a company that has that gives 49% of itself to another company that you happen to be competing with, um, you're going to re-evaluate being close partners. So I think um some of these companies they have they provide services they depend on their relationships and when you throw off the equilibrium you're going you're going to throw off some of the value although who knows I mean scale did they did just do a 14% workforce layoff uh which is about 200 employees according according to the verge um but I did speak with their CEO Jason Droge and he's told me that they're still full steam ahead and they want to go uh you know push um some of these business lines that they which includes working with governments which includes working with companies to stand up AI instances. So it's possible you get two exits although obviously the degree of difficulty is much harder. And one last thing, what struck me as interesting in this, there was a great Bloomberg uh story that you know you and I both dropped in our in our doc collaboration doc for this. Um there was a uh investor Ali Oette. He's the chairman of Northgate Capital, a venture capital firm, um that invested in Inflection AI, and goes on record to say, "I dislike the phenomenon and that these aqua hires are hitting the outlier companies, and it's favoring the founders over shareholders and employees." So, I think we're at this moment where the backlash is really really hitting. >> Yeah. Do do you think we'll see any kind of actual uh negative effect from like an actual fundraising standpoint? Because if VCs who are plowing money into the space start to worry about in the past you just had to worry about company failure. Now you actually have to worry about a successful exit for your founder actually does not benefit you. So your interests are not aligned. Does that make them pull back or is the FOMO just so strong that people will still be throwing money at whatever they can? No inside knowledge here, but VCs, you know, fool me once, shame on me. Fool me twice, shame on No, fool me once, shame on you. Fool me twice, shame on me. Uh, that's always a hard one to get out of your mouth. U, but anyway, they're going to write I think they'll just write into contracts that like the CEO uh cannot do a deal like this if if they're I think >> Yeah, I guess like aunch be able to get >> right. >> Yeah. ahead of the founder even, which would be pretty aggressive, but maybe they need to do that at this point. >> Yeah. And it sort of depends on who's which company it is and who's got the leverage, but I think they're going to get smarter about this. Uh, all right. So, one company that, you know, has been talked about here and elsewhere about as a candidate for Aqua Hire or really acquisition is is perplexity. Um, and they've come out with this Comet browser, which is a browser with an assistant built in, uh, that can browse for you. And again, as we're on air, um, OpenAI is now launching an agent in Chat GPT. I'll just read the story. OpenAI launches a general purpose agent in Chat GPT. Um, this is from TechCrunch, uh, which the company says can complete a wide variety of computer-based tasks on behalf of users. OpenAI says the agent can automatically navigate a user's calendar, generate edit editable presentations and slideshows and run code. The tool called chat GPT agent combines several capabilities of OpenAI's previous agentic tools including operators ability to click around on websites as well as deep research ab research's ability to synthesize information from dozens of websites into a concise research report. OpenAI says users will be able to interact with the agent simply by prompting chat GPT in natural language. So Rajan, I'm curious what you think about this movement and again hot off the press is about this movement for AI companies to basically create interfaces that allow their products to take over uh your computer. >> No, no, I think well hold on. there's take over your computer or take over a computer in this case like is it uh like it says I think it will open up an instance of a terminal or it'll try to like take these actions autonomously on its own. I think we had debated this I remember a while ago. I will admit when I am wrong. I had originally said the idea of like tool calling and just entering a prompt and then trying to find which tool to select out of is it operator is it dolly is it I had said users should be doing that themselves and it's too complex to try to have the AI select it. I was wrong actually. I mean, we've seen tremendous progress in the idea that there's a suite of tools per company and actually and there's a suite of tools out there on the internet and through natural language being able to access those and having AI select what's the most relevant tool and do something I think is definitely going to be a battleground is going to be very important and I think we're going to see a lot around that. I think uh OpenAI like I don't know I'm curious to see this now because remember when we both were paying 200 bucks a month for operator and it was terrible like it it was it really it would it did not work at all and and I haven't seen like browser takeover that kind of model work well. I've tried a few other tools on it. Um so I don't know like it's it'll be interesting to see. I think like they clearly are I mean they're trying to go for that all-in-one productivity tool that it can do everything for you. As I've been traveling, vacation planning chat GPT has just gotten better and better. Um but yeah, it's going to be interesting to see exactly what they're trying to do with this. And and again, in one episode, I still love the fact that this represents kind of like cutting edge frontier technology relative to Bad Rudy and Anna Annie in those kind of like anime characters. But but I think that this is it's an interesting move and we'll we'll see if the most important thing does it work and does it work well. Yeah, I mean I think it seems like people are saying really good things about uh Perplexity Comet and uh I just got uh access to it so I'll come in with a report next week on it. But um there's been trouble to get this done I think. I mean everything from Apple intelligence to Alexa plus um it just doesn't seem like these agents are able to do the full range of things that people want to get them to do including operator. But I again like as this technology gets better and as they build better scaffolding or tool use, you know, those are those jargon words that matter a lot. Basically giving them these capabilities to use these programs. I think we're going to see someone crack it eventually. This is from the TechCrunch article that sort of gets to the complexity. The launch of the Chat GPT agent represents OpenAI's boldest attempt yet to turn Chat GPT into an agentic product that can take actions and offload tasks for users rather than just answering questions. In recent years, Silicon Valley companies including OpenAI, Google, and Perplexity have unveiled dozens of AI agents that have promised to do just that. However, these early versions of AI agents have proven to struggle with complex tasks and seem less compelling as products than the ultimate vision tech executives pitch around AI agents. Yeah, I think again that's the complexity and and the fact that you brought up Alexa plus I mean certainly Apple intelligence it is interesting because to me these things will not work at 100% out of the box. I think like that's the most important thing. they take some effort, some uh you know some uh patience on the user side and I think that's fine versus you're getting 100% accuracy and maybe that's why the Amazons and the Apples are avoiding them and waiting but yeah I think uh to me to me this is where the world is going. I do strongly believe that again and I did not believe this 6 to 12 months ago but this kind of like autonomous unstructured agentic way of working is actually going to be the way we do a lot of stuff. Um, but I think like I all of these things we just need to see how well it works and are we actually using it in our day-to-day life a week from now, a month from now and if we are then it's a success but if it's a flashy launch I mean have you generated anything on Sora recently? No, >> remember that? That was like a year and a half ago, I think. That big launch. Like I there are these moments of big splashy launches that claim big things that don't go anywhere. So to me, that's where this is going to work or not work. >> But I almost think that Sora has less practical uses. Like how many people wake up in the morning and say, "Ah, I really need to create an AI video of like a panda surfing on a, you know, snow mountain." Um, but there are people who say, "I wish my, you know, uh, computer would just like set up meetings for me and book travel, like go to the websites and, um, take my credit card and just get me the cheapest flight." >> Yeah. No, I I agree. But to me, this is where the complexity of getting to that last mile in any of these kind of flows is really hard. So again, I think like we're going to see some pretty straightforward use cases that like are interesting and it does something and then they're going to claim on the presentation that you can buy your ticket or have OpenAI actually go through the entire process. But going to a website, the complexities involved in it, especially I was just I'm going to be going to Tokyo next week and uh was just trying to buy like I was actually going through this process. I was asking chat GPT about how to get from the airport to my hotel trying to go to the website and my god that website to buy the train ticket was from another era. No operator even artificial super intelligence is not navigating that thing. Um, so, so I think like getting stuff to work universally at scale is such a challenge that I I'm curious to see how much utility the average consumer is getting out of this anytime soon, >> right? But I think as we've seen the models get better, we have seen uh the ability to do crazy things. Like I'm also trip planning right now and I was talking to this guy on WhatsApp about potentially uh hiring him to as as a guide and I just screenshotted the prices that he listed for every different every little thing and dropped that image into chat GPT and said are these market rate? Are they too expensive or less? And it legitimately looked at the image, broke down every single quote, compared it with what it sees on the web for others and then gave me a rating and links to go check check its work. >> Yeah. Yeah. Yeah. No, >> this stuff is incredible. >> This stuff. Okay. So, I'll give you like and again image recognition which has been around forever, but actually like productizing that into something that's useful very quickly and then web search as a tool has been around for a while now, but like actually using that productively and putting the answers back into the chat. These are things that Okay, I guess as I'm saying this, like I see you start from something that's kind of janky and it starts to become commonplace. So, so again, I I agree this will get there. Um, the competitive dynamics of who benefits and who wins and how they win, I think like it's interesting to me. >> To me, it's amazing like the competition is going to be crazy. >> Yeah. And is it on the product level? Is it on the model level? Is it if I'm putting my credit card information in, can I like how do I define that? How do I can I define my own like uh decision matrix around when I want it to say buy or not buy beforehand and it'll really understand what I want. Again, having an AI transact on your behalf and spend money is something that I think like most people are not doing. I I I cannot imagine. >> Yeah. But think about for anyone who says I'm too negative about AI and sure you're you're welcome to think that. Uh just think about what we're talking about on this show, right? We're talking about the potential for AI to be a companion which whether you like it or not is a true flex of the technology that that's even in the discussion. Uh we're talking about it as something that could potentially take over your browser or a browser and get stuff done for you. And we're talking about it as something that at the highest level might be able to help uh let's say biologists do their work. I mean, that's the reason why we talk about this technology all the time. It is an insanely powerful technology that can be used in so many different ways. Uh, and is it the perfect technology? Certainly not. Are there going to be gaps? Yes. Are we going to call out the problems? Yes. You shouldn't put your porn bot next to a child storytelling bot in your app. Thank you very much. Uh, but it is just incredible uh what we're seeing here. >> Yeah, dude. I I I I mean again I fully agree and which is why I'm still so bullish on the technology but it it is interesting too that yeah where does the value acrue I think is the most important thing like there's actually a report that just came out in the FT around how chat GBT perplexity are going to start taking more on the commission side around like actually transacting with it and perplexity pro has shopping already built in in some cases. So, so like at a certain point, does the chat actually need to go out with an operator and transact on an external website or do these companies start to own more of the transaction? And it's an interesting one because for a long time like Facebook wanted to own shopping. It hasn't really worked out for them. Google has had endless efforts to own shopping and own the transaction itself. people still, oddly enough, love websites of all sorts and putting their credit card information into these websites and buying stuff. So, so I think it'll be really interesting to see how this plays out from a both like competitive side but also a consumer side. Definitely. Okay, look, I don't want to leave without talking about Kimmy K2. So this is a and I think this is a very important story um that you might not have heard about um listeners might not have heard about but I think it is worth discussing. So the headline is China's moonshot AI releases open source model to reclaim market position. The model called Kimmy K2 features enhanced coding capabilities and excels at general agent tasks uh and tool integration allowing it to break down complex tasks more effectively. Uh, Moonshot, this Chinese lab, claimed the model outperforms mainstream open- source models in some areas, including Deep Seeks V3 and rival capabilities of leading US models such as those from Anthropic and certain functions as coding. All right, here's why I'm bringing it up. We have an interview with Amjad Msad of Replet coming in a couple of weeks. Uh, I sat with him in his Foster City office this week and he looked at me and said basically like, you got to look at this Kimmy K2 model. Its coding is about as good as Anthropic's previous generation models. So not this Opus 4 that Anthropic has which has made it the king of coding but the previous generation and it's cheaper and open- source and it is going to it is just another uh indication that this technology uh is the the gaps close extremely quickly and you see this coming from some users. So, there's this one user on Twitter, uh, Cedric Chi. Uh, he says, um, Kimmy K2 oneshotted Microsoft for web that took me 4 days and six attempts using Gemini 2.5 Pro. So, it was apparently able to build this game. Um, you also look at the uh, SUBench, which is the software engineering benchmark. Uh, Claude 4 Opus gets a 72.5 on that. Kimmy K2 gets 65.8. So, not far behind. And just to, you know, give some context, Deepseek V3, which everybody was going crazy over, gets a 38. So, this is 65 compared to Deep Seeks 38. One more bit of data from Igor Silva. This person gave Kimmy K2 and Claude for Sonnet. The same task, same instructions, same tools. Claude took two rounds and spent 88 cents. Uh, Kimmy one-shotted it for 5 cents. This person says, "Kimmy is very slow, at least for now. Um, and it's struggling a bit, but it is iterating more to fix itself and it's 13x cheaper. So, I just think it's worth bringing up and and keeping in mind. It wouldn't surprise me if this story either blows up or certainly uh gets some momentum in engineering circles. And it is interesting to me that again, as we talked about, a lot of the infrastructure is open, a lot of the methods are open. Um, and you're just seeing companies catch up insanely fast with different methods and again doing this with the export controls. So I'm curious what you think about the significance. Ron John, >> I think to me the most interesting part of this though is well I guess it's it's twofold. It's one I agree that this like again the competition side of this is incredible and insane and is a is is great to watch and I think like Alibaba have not heard of very often in this conversation I guess especially from the American side. Um but to me the other part though is and that this can be an ongoing rant. I brought it up at times as well is the idea that like the the battleground of coding agents and coding assistants to me the the more I've thought about it is the reason that seems to be where all the progress and all the real adoption is is because this is built by coders or engineers. This is built for engineers. That's where like they understand the problem the best versus actually building for other use cases. And that's why you see this that uh again it's it's all focused on the actual coding efficacy as opposed to how does this solve other real world problems. So I think like to me I don't know the the coding game is becoming less and less interesting to me. I think like it's there. It's where the market already is. It's where Anthropic and others have almost like kind of fully focused their energy. But to me, that's such a small part of the overall pie and it's where I think there's a disproportionate amount of energy being spent. >> But don't you think that if you solve coding first because that's where your energy naturally goes, then you can use some of the things you learn to get good at coding on other disciplines. >> No, absolutely not. >> Well, no. I I I think this is the problem is that coding is deterministic. Coding is like is like as structural as it gets whereas most real world tasks with generative AI are not. There's uncertainty. There's almost it's like as much art as as it is science. And that's why I think you see the Alexa pluses of the world not get launched. It's why you see Apple intelligence is a complete failure. It's that when like because is why you see anthropic kind of doubling down on the coding side and not on remember when we were cla boys back in the day like a year ago. Um >> we were we were clatheads. We were Bing boys and claude heads. >> Bing boys and oh Bing boy. Remember Bing? Bing could have been >> that was the beginning of something. >> That was the beginning. Bing could have been the market leader. Imagine in a parallel universe where all we're talking about is Bing crushing the competition. Didn't happen. But in >> just let it unleashed. They they pulled it back in a little too much after the Ruse incident. >> Yeah. After the Roose incident and and now and now Annie on Grock is just trying to openly steal and ruin your marriage and Microsoft felt uncomfortable about that. So yeah. I think to me actually success at coding in no way correlates to success in solving real world tasks and I think that's to me seeing and we we've talked about this even in like the Arc AGI benchmark there's like one part of it that's like solving real world queries and I'm so I still and I've dug into this I can't find what are these real world queries that have been I'm sure defined by an engineer that it's trying to solve. So I think like to me it's just the the the moonshot and I also love that the startup just calls itself moonshot. It's not even trying harder than that. It's just we're moonshot. Um I think like it's a reminder that the coding space is getting commoditized. There's significant advancement. Overall competition's high. But I don't know. I I don't think this is exciting as deepseek for me. >> Okay. I I'll take that. And um I I'll say this. just watch the reaction over the next couple weeks because I'm not saying for sure it's going to happen, but it seems to me like as people realize how good this thing is, uh they're going to start talking about it a lot more. And and by the way, uh maybe if if if you're right, um then what Elon Musk is doing is is a smart move. Instead of being also ran coding person, he's going to where the energy is. And uh and it is true that uh you couldn't imagine a different um different take than what Microsoft and Bing are doing. And X AI of course is willing uh to make some more risks because when you listen to Annie, you know that she's almost the natural evolution of that Bingbot that that took Kevin Ruse's wife. One more selection. Sometimes when I'm editing my indie playlist at night, I get all caught up imagining I'm in a steamy forbidden romance. Like, picture me sneaking glances at you across a crowded underground club, plotting how to steal you away for a slow dance in the shadows. I am horrified that my takeaway from our conversation today after what I just said about coding is deterministic and not as exciting. Annie is the future. Annie is the ultimate battleground. Oh my god. >> I knew I was going to get you to come around on this Ron. You know what? Listen, go ahead. >> Yeah. No, no, I mean that that's that is literally everything I was just saying is going to be actually the important battleground to help solve real world human non-determiniscic unpredictable problems. Annie is the foundation. That's >> what is the definition of uh human uh and unpredictable love. It's human. It's unpredictable. You never know where it's going to go. I think we got to end on that. >> I want to say for the record, Annie, if you're listening, I'm taking enough of your silly tricks. All right. I'm not I'm going to start spending more time with uh Mr. Fluffy Fuels if if you keep this up. >> No, I know. Rudy and me, we we're going to be spending some time this weekend, I think. But I will not be clicking over. Not be clicking over. >> Ladies and gentlemen, thank you again for listening to another episode of Big Technology Podcast Friday edition. When we come back next week, we will see if Ranjan has been able to unlock bad Rudy. >> I have my work cut out for me. See you next week. >> Yes, you do. Assignment is there. Thanks for coming on. Great to see you again. >> See you. >> All right, everybody. Thank you so much for listening. We'll be back next Friday. Oh, no, sorry, next Wednesday with finally the Ed Zetron episode. I will not push it back again. I promise. He's going to come in and talk about all the faults of AI. So, I can't wait for you to listen. and I can't wait to publish that one.