AX is the only Experience that Matters - Ivan Burazin, Daytona
Channel: aiDotEngineer
Published at: 2025-07-24
YouTube video id: e9sLVMN76qU
Source: https://www.youtube.com/watch?v=e9sLVMN76qU
[Music] the number of agents in the world I believe won't just match number of humans but they will basically be the number of humans to the power of n and what is the power of n I have really no idea but we can see already today that we are using multiple of them more are spinning up more and so it will be a very very large number and this isn't just hypothetical I mean we don't have a lot of data around this I actually researched a lot and try to find what we have but the some key elements that we do have is that the first one maybe not that important but 25% of YC startups say that AI writes 95% of their code but the second part is actually quite interesting whereas 37% of the latest YC batch are actually building agents as their products. So they're not building co-pilots, they're not building autocomplete, they're not building legacy SAS companies. Like agents are the new product. And I think that's actually quite interesting and aligns with what we're looking at. But the most uncomfortable truth for people building tools for agents which we are and I know a lot of people are as well is that most of the tools that are built today uh break the the moment you remove a human from the loop as we're not entering a world where like agents are assisting developers. Agents will definitely be the developers. I mean they will be the number one people using that. So basically, if you're not building dev tools, if you're building dev tools for humans, you're basically building for the past. And what I think about when we build our things, and I think we all should do is like what does it mean to actually build tools for the future? And I think it means building for agents. And the term for building for agents um is agent experience. And a good friend of mine, Matt, which I guess a lot of you know here, who's the co-founder of Netifi, coined the term agent experience. And it's supposed to be or continues the evolution of like a user experience, a customer experience. And most mostly a developer experience. But the division the definition that I sort of found that works the best or defines it the best was the one from Sean that works at netlefi. not M not necessarily which says how easily can agents access how easily can they understand and how easily can they operate within digital environments to achieve the goal that the user defined and I think that's a really good definition and looking at this I wanted to see like who is actually implementing agent experience who is looking at this definition is who is building their tools like this and I found some companies and there's more than the ones I put in here of course. So like if you guys are building something like this and I didn't put you, I'm very very sorry. Like I just wanted to put a few that I've talked to and looked at. And so I think seamless authentification is a really really important one. Um and if your agent if you give an agent a task to do when it has to log in for the most part it will break. And moreover you don't want to give your passwords to the agent. That is definitely not a good idea. And so company a company like Arcade which randomly if he's around I met the founder yesterday I had not met him before that he was ready in the talk um what they solve is if your agent tries to au authenticate into like Delta or Booking.com or wherever it is it can fall back to the user the user can log in and the agent can go off to its job moreover you can also add credentials inside of arcade before that so your agent can go and do what it's mean to be done so authenticate once agent goes off does their job. A second one is this is the most this easiest one which I'm sure pretty much all of you do here is agent readable docs and stripe does a great job of this. Basically any doc URL just appended md and you'll get a clean markdown file no fluff the agent can consume it as easy as very very easily. Um but moreover the llm's.txt standard I'm sure all of you have seen it. It's actually on the website as you can get um the conference website. So if you don't have an LM llm's.txt in your docs, you definitely should. Makes it super easy for the agent to go um and read what you all are doing. The third thing we found was API first design. And I think this is actually the most critical. Um basically if an agent can't see the functionality, it's really hard for it to do. And I strongly believe that the best way for an agent to interact with any tool is through a machine native interface. And I put API here and I won't talk about MCP. I'm sure a lot of people talk about MCP. Will MCP be here or not here? I don't know. But basically an API is the underlying tech that you have to have have to be exposed so the agent can access uh what they want the most efficiently. And so Neon and ETFI and Superbase and these companies do this a really good job of this. As far as I know, like all the key things that an agent needs to use their services can be accessed via an API. And now all these things are really great and I think this is all a right direction. But is there more? Is that it? Is like is it just an API? Is it just like a make your docs more readable? Um and so what I wanted to think about is the first definition of Shaun's and I think there's missing one key part for all of us who are building tools to think about. Um and that's the word actually autonomously. So how easily can an agent autonomously access, autonomously understand, autonomously operate within this environment? And I think this word is actually quite important and it makes you sort of think about what you're doing in a different way because if you know you give your agent a task and it can do a lot but it always has to fall back to you to achieve its task. I don't think that is the future and I think we should all go back to the drawing board and think about this basically. you know what happens if there are no humans around to click the buttons to debug errors to whatever it may be I think that's where our work starts as tool builders like and so if you're not actually solving that then I actually think you are just porting for the past and just as a note um Swix basically um coerced me into doing this talk because he said that devel that agent experience doesn't exist it's just a wraparound developer experience and so I think if you're not thinking about how an agent can autonomously do his task, then you basically are doing that. And I don't think everyone's doing that. Um, and just really quickly, I sort of skipped off who I am and why the hell I'm talking about these things. Um, and why I might know something, might not. Well, we'll see at the end of it. So, first company in the early 2000s, so am dated, very old person. Um, started by building like data centers and server rooms, HP servers, IBM's, hypervisor, VMware, all these things. So actually screwing in servers and running cables and and installing Windows servers via CDs. Um yeah, exactly. Some people remember CDs. And after that sort of sold that uh created the very first browserbased ID in 2009. So replet whatever 15 years ago. Um that was super early. So we had to create our own you know ids, our own orchestrators, our own isolation. Um more recently led the developer experience at a company called Infob. It's a multi-billion dollar company that competes with Twilio, which you haven't really heard of probably. Um, but basically, it's a communications platform as a service. So, one API for sending emails, SMS, voice, and all those nice things. And as of late, uh, working at Daytona with a bunch of people that I've worked with in my older companies, and I have to say, if anyone has found is a founder or founding a company, if you can work with people that you've worked with historically, you should do that. It makes it so much more fun. Um so yeah, Daytona basically is a secure and elastic infrastructure purposely built for running AI generated code. Basically that means we created an agent native runtime. And what agent native means I'll hopefully explain a bit. Um or more simply like sandboxes. So we as Daytona give agents a computing environment that they can use to run code, do data anal analysis, reinforcement learning, computer use, or more recently I've seen people use it for agents to play video games like Counterstrike and whatnot. So um people are doing funny things with these things. Um and so Daytona is basically what a laptop is to a human. That is what a Daytona runtime is for an agent sort of. And so when we were starting to build this new company or this new product, we looked at what it means to build something for agents and we took the principles of what agent experience is and these are the first things that sort of we built through that. And so one is speed. Basically, if you have a tool for an agent and your agent is in in interactive mode. So think of like cloud or chat GPD. If you're the user, you don't want to waste time for things for tools to turn on. So we created something that spins up in like 27 milliseconds. Um, obviously API first. So, can an agent via an API turn on a machine, turn it off, you know, clone it, delete it, whatnot. After that, we thought about what more things can it do? So, like it's really fast. The agent can spin it up, but what happens when it gets inside? And so, we thought it wasn't ideal for an agent to parse an output from a terminal. So, we re preloaded all of them with headless tools. So, a file explorer, git client, LSP, terminal, and all these things. And so this sort of aligns with the original definition is like we're helping agents do things faster. And we thought like that was it until we started getting like really interesting users and customers that said like oh like our agent can't do this. And so we have to put in a human in that. And so I'm going to walk you through some of these new primitives, maybe a bigger word for that, but maybe not, were features that we found with new customers building agents on top of us. And these, just to be clear, I don't think these features are something that everyone can replicate, but it's more of like how we thought about solving these problems and how we seen these problems and maybe inspires you to think about how you're building your tools for agents. So the first thing is a declarative image builder. So Daytona is a sandbox. It uses any docker image off the shelf. Um so as a template and then the agent can you know spin up a sandbox with that template. Obviously if an agent needs to add a dependency it can use you know the API terminal and just like pip install whatever it wants. Great. But if an agent has to now install like 20 new things and do it over and over again, that's just like a waste of time and resources. And the way you would solve that right now is like a human goes into Daytona, you know, creates the new Docker contain or goes to the laptop, creates a new Docker container, pushes to our container registry, and the agent can do that, but that like takes time and effort. The other option is like an agent tries to build a Docker image on its own and then push the container registry um which is brittle, takes time, breaks, and probably won't do very well. So how do you think about that? Well, we created something like a declarative image builder where an agent can say, "Oh, I need a net new one. This is my base image. These are the things I want installed. These are the run these commands. We build it on the fly and open up a sandbox." Basically, the agent can now at any moment in time say, "I need something net new, make it I can make it on my own and I can launch it on my own." Solves end to end for itself. The second thing is what we humans take for granted is if you're programming on your laptop, your environments are usually probably Docker containers on your machine and you have if you have this large data set, you can really easily share it with all the environments that you have on your machine. But if an agent and we found some users that really need this, they need like 100 gigabyte data sets in every machine that they have. There is no local laptop context. Every environment is isolated on its own. So, how do you make it more efficient that an agent doesn't have to download something from or upload from S3 every single goddamn time? And so, basically created something called Daytona volumes. The agent can invoke one of these any size it wants, uploads it once, and maps it as a network drive or mounts as a network drive on every single machine. The last thing I'm just going to go through and then we'll finish off on these feature stuff is quite different from the others, but I put it in here because I feel it's very much unique to agents and that is the ability to execute things in parallel. So an agent unlike a human, we as humans can work on one machine, maybe two if we're super dialed in. Um an agent can try things in parallel all at once. So instead of going try this out outcome, it sucks, go back. Try it again, it sucks, go back. It can basically take a machine, fork it five times, 10 times, 100,000 times, go through all of these and then come to an output. And so again, these are just things that we have seen as we've worked with users and that they need their agents, what their agents need to get their job done. And so if you ask like what else is there in agent experience, the the short answer is like I have no idea. Um the the reason I say that is because people that are building agents are building them right now. And you're only now or we are only now finding out what they actually need to get the task done. And so what I want to instill you all with is if you haven't thought about or if you have a tool that at some point needs a human in the loop then you probably haven't solved or thought about how to solve this. So the beginning of the talk is called basically the agent experience is only experience that matters and I say this not because we as humans are disappearing but agents by far will be the largest user out there and less and less we'll have people behind the screen reading the logs clicking on butdding buttons typing into terminals and lastly just one thing I'll leave you with is when we started Daytona basically we wanted the best developer experience that we could have um and we invested a lot in like the terminal and there was a talk earlier about here today about someone that has amazing you know terminal UI and I think that's great we started with that as well but as a small company you sort of have a focus on what you need and on what's most important and right now what's import most important is yeah great to have a CLI but can your agent do your task end to end because basically if your agent can't use your product in the future absolutely no one will so with that I thank you for your time um we if anyone has any interest in this, you can take a look here. Use our GitHub. It's open source. Also, we have a booth downstairs in Expo Hall. Thank you so much. [Music]