Small AI Teams with Huge Impact — Vik Paruchuri, Datalab
Channel: aiDotEngineer
Published at: 2025-07-15
YouTube video id: K-iYKDMFKhE
Source: https://www.youtube.com/watch?v=K-iYKDMFKhE
[Music] Okay. Uh, my name is Vicas. I'm the CEO of Datal Lab. And today I'm going to talk about how we got to 40K GitHub stars, seven figure ARR, and train state-of-the-art models with a team of three. So I spent the last year training these models like Britney mentioned marker and sura I also built repositories around them. I left my AI research job and I started a company and raised a seed round. Uh I did not get enough sleep. It's important and this is data lab. So we made our first hire in January. We're now a team of four. Faraz is new enough that he's not pictured. Um we've grown in revenue 5x since January. We're at 7ig and our customers include tier one AI labs, universities, Fortune 500 and AI startups, including Gamma, who I used to make this presentation. Um, so today's focus, I'm going to talk about how we've grown with a small team. I'm going to talk about my philosophy on building teams and why I think we're at kind of an inflection point in how we think about building teams. And I'm really going to talk about this idea that headcount does not equal productivity. There's like this really persistent notion in Silicon Valley that you raise money, you hire a bunch of people, and you build more, but it almost never in my in my opinion works out perfectly that way. All right, so my last company was called Data Quest. I'm very fond of the data prefix apparently. Uh, and we scaled to 30 people and 4 million AR bootstrap during COVID. It was an online education startup. Um, and then unfortunately we had to do two rounds of layoffs postco when online education kind of tanked. We went from 30 to 15 and then again from 15 to 7. And it was obviously awful for the people we had to lay off. But I noticed something really interesting. Productivity and happiness increased a couple of months after both layoffs to the point where we were actually much more productive after both cycles than we were at the beginning. And I started to wonder why that was. Like how could reducing the team so much actually improve productivity? And I came up with these four hypotheses. One, we'd hired a lot of specialists. So as you scale like Grant mentioned in the earlier talk, you end up building these very specialized functions and teams and those specialists often can't flex across the company to solve the key issues of the company. Two, we were a remote team which required a lot of intentional process and heavy syncing which just eats into your time and just just makes it really hard to get on the same page. Um because of that we had a lot of meeting overload and especially once we got middle management in place people whose job is kind of professionally to manage we ended up with just a lot of meetings on people's calendars and not enough time to actually work and then senior people we hired kind of a mix of experience like most companies do. We hired junior, mid-level, senior, and then senior people ended get up getting kind of tied down in doing a lot of work uh to manage the more junior people. I we actually had a case where we had a three-person team and we cut it down to one and the team actually got much more productive because it freed up the senior person's time. Um and kind of every company I feel like goes through this journey. There's this initial golden period when everyone is aligned. You're on the same page. you're building this amazing stuff and that's really when you build the core thing of your company um like Google uh with search or Microsoft with Windows. It's kind of when you figure out your business model and then you hire a bunch to fill out the edges around it. Like you hire a bunch of enterprise sales, you hire a bunch of marketing, you hire a bunch of engineers who are kind of in very small boxes to build very small features. I had a friend at Amazon who worked there for two years and built a shopping cart button. Um it's it's fine, right? But I mean at at that scale of org, that's kind of the tiny box you get fit in. Uh and you end up with a lot of bureaucracy, a lot of syncs, a lot of unclear priorities. Um and this pattern is unfortunately very common. But I started to think, what if that golden period just lasted forever? Why why do you actually need to end it? And as I started working with Jeremy Howard at Answer AI, I got to understand his philosophy for building a company a little bit better. And his idea is basically hire less than 15 generalists. So people who can really do everything across the stack and really understand all aspects of the company, fill in the edges with AI and internal tooling. So Jeremy's invested a lot recently in fast HTML and things like Monster UI because he sees them as kind of building block libraries to really build out the other tools that the company's working on. Uh and then use simple boring tech, right? Like you don't need to get too fancy. You don't need a Kubernetes cluster when you're a threeperson company. Um but this unfortunately requires uh kind of a high cultural bar for folks. Um you need people who really want to and can understand everything you're doing. So you need engineers who talk to customers. You need go to market people who actually build. Uh and that's that's not necessarily easy to find. You need high trust. So um basically you need people who are in it because they're building something together uh and not in it for other reasons like politics or personal advancement etc. And everyone needs to really care about the customers and focus on them. Um I think these are the prerequisites for this kind of team working this less than 15 person team of generalists. I I'll give you a quick example. So we recently trained a model uh Syria OCR 3. Uh it we recently shipped it but have not announced it yet. So it's 500 million parameters that supports 90 languages and 99% accuracy on our challenging internal benchmarks that include math. Um, and it also does some features that no other model does like character level bounding boxes. It uses PDF text as grounding at a line level. Um, so it was a very challenging model to train and in order to do it, Darun, who's a research engineer at data lab and I had to handle the entire process from end to end. So that included talking to customers, figuring out what they wanted. Uh it included reading a bunch of papers and figuring out the right architecture, prototyping, doing the model training itself, which you always hope is 90% architecture, but is always 90% data cleaning. So building a data pipeline library, building out the data sets, then we had to write the inference code. So we had to connect it to our repos, get the inference written for all our customers, and then integrate it into our products. So this is a scope that in a big company you'd probably have four, 10, you'd have a lot of teams doing this. And every time you hand off between teams in a traditional company, you lose context, right? The people who talk to the customers lossily communicate it to the people who build, who lossily communicate it to the people who train the model. Um, it just gets it becomes very inefficient. You end up eating a lot of time in just syncing context. It never gets fully synced. You're not able to build a great endto-end experience as a result. And you have very slow feedback loops, right? Like you talk to a customer today and it might impact your model training in months. Um whereas if you have generalists who can work across the stack, you get seamless context, right? You never need to share context and do inefficient syncing. You get a really tight integration between all aspects of the company and very very fast feedback cycles. Um and the reason we were able to do this is we used AI to to take kind of the easy lowleverage pieces of this um like building a data pipeline library or helping us really figure out how to integrate it into the API whereas we did the higher level work in each of these silos. So I if you get one thing from this talk this is the thing more people does not equal more productivity. Um all right and like how do you make this work like how do you operationalize this? So the first thing you have to do is hire senior generalists. And senior to me does not mean years of experience. It really means maturity. You need people who can look at a problem and say I'm going to figure out how to solve this. I'm going to do what it takes and I really care enough to iterate with the customer to solve it. um you need to avoid over complication right like I'm an engineer a lot of us are engineers we love over complicating things like hey let me deploy this Kubernetes cluster and multi-stage pipeline to solve like a data extraction problem um but in reality you need people who can go back and like kind of set aside the fixation on shiny tech and just do the simplest possible thing which usually is I'm just going to write a shell script to run this on one machine there's that famous like Hadoop versus shell script blog post uh from a few years ago when you like you could replace a whole Hadoop cluster with just like a 64 core machine. Uh you need people who who appreciate that ethos. Um and you need to work in person. I personally think um remote is great for a lot of reasons, but it's not great for a small team that needs to move fast. Um because you need to set up a lot of process and process to me is kind of the death of this really fast collaboration and tight feedback loop. And then how do you do it architecturally? So um I I alluded to this a little bit, but you have to reuse components aggressively. So we reuse a lot of components between our on-prem and our API deployments. We keep our technology super simple like we don't use React. We don't use any fancy front-end frameworks. It's all server rendered HTML with like light HTMX and Alpine and then super clean modular code that AI can really add to very well. Like we rearchitected our marker repo to be extremely modular and easy to to work with and well documented and that makes it much easier to use AI to actually add to it. All right. So basically keep everything simple. Code is clean, readable, maintainable. Architecture as few moving pieces as possible. Um minimize your surface area and then process. Minimize bureaucracy, high trust, continuous discussions. Um if if you feel like someone's going to need a lot of management, like don't hire them. Like you need people who can who can move fast without being managed. Um all right. And then how do you fill in the edges with models? So, a challenge we're going to face as we scale is this idea that we're we're a document processing document intelligence company. And every customer has a slightly different way that they want to parse their docs. And if you go back kind of to the last generation of OCR companies, the way they solve this is they hired a bunch of Ford deployed engineers. You sat at a client site and you just kind of iterated with them until it was good enough. But in the future, you can really train a model to handle this complexity, right? Like we can train a model to essentially loop over customer outputs until it gets to the the right state. So you can kind of replace that entire forward deployed engineering side of the org. Um, and then when does this model fail? Like we're early, right? I don't know exactly when this model falls apart. Um, but Gamma, as as we just saw, is a great example of a small team with with very very meaningful growth in ARR. I think the key is being able to say no, right? A lot of these edges are choices, right? You can choose to go hire a bunch of Ford deployed engineers and put them at your client sites or you can choose to solve it a different way. And maybe that different way is slightly less efficient in terms of revenue. Um, but it might be more efficient in terms of your long-term company trajectory and health. Um, so it's really unknown if this will work forever. But in my opinion, like it's your choice, right? Like you can choose to make this model work or you can choose to to do the less efficient let's scale to hundreds of people model. Um, all right. So LLMs are surprisingly bad at generating ven diagrams. So that explains why this slide is is is not so well done. Um but basically we have three core roles and the the responsibilities overlap a lot. So everybody talks to customers. Um everybody builds product in some way and research engineer and full stack engineer overlap quite a bit. Um and then go to market is really like your traditional kind of sales, marketing, support functions all collapsed into kind of like a more generalist role. Um, and really like I feel like politics are the death of small teams, right? Like we want people who only care about the work, the people around them and customers, right? Like minimal ego. You need some ego to to kind of advance your own ideas, but not so much that you're willing to fight for them at the detriment of of kind of the health of the company. Um, we pay top of market salary, right? Like it's always weird to me that startups pay 150 or 200k when they've raised 20 million, right? like you should be able to hire fewer people with higher salaries and get more done in my at least that that's what I've seen um meaningful work so big challenges in scope right like if you come in you get to work across the stack you get to ship things end to end uh and that's very exciting for some people it's it's not exciting to other people and they kind of self- select and then you really need a good way to screen for low ego and GSD right like you need people who will ship not talk about shipping um and that's another downside of remote culture in my opinion it's very it gets very hard to tell the two apart. Um, and then patience, right? Like the worst hires I've personally made have all been when I thought I had to fill a role very quickly. All of my best hires have been when I said, "Okay, let me find the best person and and hire them." Even though I may not necessarily have a role today, they're a great generalist. Um, this is actually a big debate in NBA and NFL drafting, too. Like best player available versus drafting for fit. Um, all right. So really I think the thing to think about as you scale is like how do we scale productivity not headcount and you can do that in a few ways right like you can raise salary bands as the company grows so you hire more and more experienced people into the same role um you can invest more in compute right like a one researcher with access to eight GPUs is less productive than one with access to 64 GPUs you can invest in AI tools that multiply productivity right there's so many tools out there now um that are worth paying for that can abstract away a lot of these edges for And finally, uh I'd be remiss if I didn't say if this culture sounds interesting to you, drop me a line. Those are all my socials. Um we'd love to chat. All right. Yes. Uh I think we do the microphone for questions, right? So, um when you went from 30 to 15 to seven, I mean, my take away from this whole talk is like the human touch points are really what slowed things down, right? Um was there any uh additional po um focus on reducing the domains that you were focusing on or like your capability sets or it was like basically your same product offering just with less folks focused on it? Yeah, that's a really good question. So at at a very high level we offered the same product but we cut some features that were less relevant. Like we'd we'd built up a lot of those those edges that you kind of like end up building over over the years. Uh and we ended up slicing a lot of those edges. So I think I think what happens when you hire a lot of people is you don't have enough work and you start making work for people, right? And they end up building all of these edges that actually aren't that useful to the customer. But when you have a tiny team, there's so much work that you actually have to ruthlessly prioritize. And I think you always want to be in that zone. And that's kind of where we ended up back. Oh, sorry. No worries. So uh it's a hypothetical question for you. So, we take you and drop you in the middle of a giant company that's been around for a hundred years, hundreds of thousands of employees, lots of bureaucracy, lots of ego, got super comfortable with a revenue stream. Um, and they're clearly folding over on themselves with too many people. How do you change that culture? Yeah, I'm not the right person for that. I've never done that before. Um I I would say I would say you the people who want to change the culture go start a small company and build the same thing just build it better. That that's a common pattern, right? Like that's a common disruption growth cycle. Um I think that's the best way to do it. Like it's it's just once a culture gets oified like I've worked at the State Department, Pepsi, UPS, like once a culture gets oified enough like you're not going to change it. Like it's just it just is what it is. Generally with that pattern, what happens is these companies recognize that they're a target and they start to buy up those small startups and crush them. Yeah, sometimes that happens, but like I mean Google is a great example of where that didn't happen, right? So, you haven't talked about how you source these these uh really good generalists, I think. Yeah, that's that's a great question. Well, one way is is this. Okay. Uh another way is uh is just open source and Twitter are great ways uh to hire. Like uh a lot of a lot of best candidates have actually come from Twitter, which is weird. I refuse to call it X. It's still Twitter. Um, but yeah, uh, I I don't I don't have a great answer to that, but I think if you do good work and you put it out in public and you talk about how you're building, like that seems to attract people who really care about this mission and want to build in the same way. At least that's been my experience. Thank you. Yeah. Uh, well, uh, actually it's related. Uh, so how do you structure your interview process in recruitment? Like how does it look like you maybe do a trial period or Yeah, that's a great question. So, uh, three steps. So, step one is people come in, we do a short chat. It's really like talking to a peer. Like, uh, here's a challenge I'm having. Let me talk it through with you and see if we can solve it together. If that goes well, step two is let's think of a project we can build together. So, we do a paid project. It's usually around 10 hours. We pay $1,000. It's like, it sounds like a lot, but it's actually a tiny amount of money to to figure out if someone's a fit or not. Um, and then we review the project, and if it's good, we come in and just do a culture fit. Like how does it feel if we're all just interacting as humans and people and and does if it feels like a good fit like it's a hire? Yeah. And what is your like success rate there? Like maybe 10% of the people that goes through the pro through that process uh get Oh, that's that's an interesting question. Like usually we don't once we kind of get someone to the beginning of the process, we have high confidence that we could like we don't want to waste anyone's time. Um, but we probably of the people we've interviewed, I think 40% have we've ended up hiring. Yeah. Nice. Thank you. All right. I'm out of time. Thank you folks. This is great. [Music]