Dust’s Gabriel Hubert and Stanislas Polu: Getting the Most From AI With Multiple Custom Agents

Founded in early 2023 after spending years at Stripe and OpenAI, Gabriel Hubert and Stanislas Polu started Dust with the view that one model will not rule them all, and that multi-model integration will be key to getting the most value out of AI assistants. In this episode we’ll hear why they believe the proprietary data you have in silos will be key to unlocking the full power of AI, get their perspective on the evolving model landscape, and how AI can augment rather than replace human capabilities. Hosted by: Konstantine Buhler and Pat Grady, Sequoia Capital 00:00 - Introduction 02:16 - One model will not rule them all 07:15 - Reasoning breakthroughs 11:15 - Trends in AI models 13:32 - The future of the open source ecosystem 16:16 - Model quality and performance 21:44 - “No GPUs before PMF” 27:24 - Dust in action 37:40 - How do you find “the makers” 42:36 - The beliefs Dust lives by 50:03 - Keeping the human in the loop 52:33 - Second time founders 56:15 - Lightning round

Published: Published Nov 26, 2024
Uploaded: Uploaded Jun 11, 2026
File type: POD
Queried: 00

Full transcript

Showing the full transcript for this episode.

AI-generated transcript with timestamped sections.

0:00-1:51

[00:00] We've asked the entire world to move from calculator technology, punch the same keys, you'll get the same results, to stochastic technology. Ask the same question, you'll get a slightly different result. This has not happened. This is the biggest shift in, you know, the use of the tools that we have since the advent of the computer. We're asking an entire cohort of the workforce to move to a stochastic mindset. And the only way you get that is by having a risk reward ratio that you're comfortable enough. It's like, you know what, I'm not asking it to be right 100% of the time. I'm asking it, [00:30] give me a draft that saves me time many, many, many times over. And that distribution of ROI is something that I'm comfortable exploring with and iterating on. And I think that that is really one of the predictors that we see in people who've tried ChatGPT or in people who are just curious with new technology is they expect that some of it's going to be a bit broken, but the upside scenario to them is so clear and so 10x that they're willing to make that trade-off or that local risk to get things started. [01:00] Bye. [01:11] Welcome to Training Data. This week we welcome Gabriel Hubert and Stanislas Polu, the co-founders of Dust, a unified product to build, share, and deploy personalized AI assistants at work. [01:24] Founded in early 2023, [01:25] after spending years at stripe and open ai [01:29] Second-time founders Gabe and Stan started Dust with the view that one model will not rule them all, and that multi-model integration will be key to getting the most value out of AI assistance. They were early to be convinced that access to the proprietary data you have in data silos will be key to unlocking the full power of AI, and they know that you want to keep that data private.

1:52-3:27

[01:52] We've worked together for 18 months, and their predictions have been consistently prescient. [01:57] So today we decided to ask them about those predictions. [02:00] We'll get into their perspective on how they see the model landscape evolving. [02:04] on the importance of product focus over building proprietary models, and on how AI can augment rather than replace human capabilities. [02:13] - Stan, Gabriel, welcome to Trading Data. - Thank you, glad to be here. - Yeah, thanks, Constantine, super happy to be here. - Guys, first thing that I wanna ask is, you started this company in early 2023. [02:26] At the time, it seemed like one model might rule them all. And that model at the time was... [02:33] I think it was probably GBT 3.5. I don't know if 4 had yet come out, but that was way ahead of the curve, and people were super blown away. [02:41] You guys came out with a pretty contrarian view that there actually would be many models. [02:46] and that the ability to stitch those together and do advanced workflows on top of that would be important. So far, you've been completely right. How did you get the confidence to make that decision a year and a half ago? [02:56] Yeah, I think on the middle part, it was clear that many labs were already emerging. It was not clear from the kind of general audience, but for the people that knew the dynamics of the market, it was clear many labs were emerging. And I think... [03:14] It was kind of natural to us that there would be competition in that space. And as a result, there would be value in enabling people to quickly switch from one model to another to get the best value depending on their use cases.

3:28-4:59

[03:28] Yeah, and I think from a user standpoint, the point on being able to quickly evaluate and compare is obviously important. Looking ahead or already at some of the conversations we're having, it seems that the levels of scrutiny, security, sensitivity of the data that's being processed may also influence some different use cases. And so we're excitingly seeing people thinking about running smaller models on device for some use cases. [03:58] something that's [03:59] Less sensitive, absolutely crucial to get like cutting edge reasoning capabilities for, and some smaller classification or summarization efforts that could be done locally, while the interface that you use for your agent or your assistant remains the same. And that switching sort of requires the ability to have a layer on top of the models. [04:19] You guys have been right about this every time as you've called this out. And so many of your predictions over the past couple of years of partnership have been [04:28] uh, [04:28] Non-obvious and then correct. [04:31] I think this is still not obvious, as in there will be many models. You'll have some local models. You'll have some API call. [04:39] And then you actually, as a customer, want to choose between them or want to have some control. [04:44] First of all, [04:46] Why do you think that will be? As in, why will there be multiple models? [04:49] Secondly, why doesn't it get abstracted away by some sort of router mechanism, some hypervisory layer? [04:55] And does that happen? And would you be that hypervisory layer?

4:59-6:34

[04:59] Uh... [05:00] Yeah, help me understand that. [05:02] I think there's really two modes of operation in FreeSync about the future. So it's a bimodal distribution, basically, of the future. There's one where the technology as it stands today keeps progressing rapidly, in which case there's still going to be competition from rather big lads because there's going to be an incredible need for GPUs to build those larger and larger models because the only way we know to get those models better today is mostly by scale. [05:30] In that world, this kind of dynamic of being able to switch to the best model at time t will remain true for a long time, I guess, until we reach... [05:38] Whatever it is that is at the end of that dynamic. [05:42] And then there's the hypothesis, and we can talk about that later in more details, but the hypothesis of maybe the technology plateauing, in which case, [05:51] It's not going to be one model. It's going to be a Gaussian model, and eventually everybody will get their model. And eventually on your MacBook M6, you'll be able to train a GPT-6 in a few hours, in a couple of years. And then the kind of router, the router need kind of disappears because the technology is really commoditized in terms of just producing the tokens. And every company will have their own model. We would have our own model in that world. [06:17] Well, we got to push you on that. So there's you're building a business where you sort of win, regardless of which one of those worlds we go into. Which one of those worlds do you think we're going into? [06:26] This one is definitely tricky. I mean, it's interesting because, so in terms of,

6:34-8:04

[06:34] capabilities of the models, we've seen or had the perception that the ecosystem was moving very quickly over the past two years. We've seen larger context, support for audio, support for image and stuff. But at the same time, the one core thing that matters for changing the world is the resonating capabilities of those models, right? And the current resonating capabilities of those models has been actually pretty flat over the past two years. They are at the level of GT4 as the end of its training, [07:04] years ago if I remember correctly for the end of the internal training at OpenAI. [07:08] And so that means that over the past two years, in terms of reasoning capabilities, it's been somewhat flat. Well, so hang on. So there's the Kevin Scott point of view, which is there's actually exponential progress, but you only get to sample that progress every so often. And so in the absence of a recent sample, people interpret it as having been flat when, in fact, there's just sample bias. You can't see it. So do you... [07:29] Do you think he's right? Do you think there's actually exponential progress? We just haven't gotten to see it yet. Or do you think it's actually like asymptoting and reasoning breakthroughs have not progressed at the rate one might hope? [07:41] As far as I'm concerned, I have a strong feeling that it hasn't been moving as fast as I would have expected in my most... [07:49] optimistic views of the technology. And so that's why I'm allowing myself to ask the question or simply consider the different scenarios. [07:59] What I think one of your predictions for 2024 also is that we would have a major reasoning breakthrough. Do you think it's coming?

8:04-9:35

[08:04] Yeah, it's going to be a tough one because this one hasn't come for sure. And even GPT-5 or GPT-N plus one or Cloud N plus one, doesn't matter who cracks that first, hasn't come yet. And there's many reasons to believe it might not be a core technological limitation. You can make many hypotheses as to why it might be the case that it takes time. The scale of the clusters required to train the next generation of model is humongous. [08:34] of complexity from an infrastructure and really programming standpoints because GPU fails. When you scale to that many GPUs per cluster, they fail pretty much all the time. All the training is very synchronous across the cluster. And so it might just be the case that scaling up to the next order of magnitude of GPUs needed is just very, very, very hard. And that wouldn't be kind of an inherent limitation. It's just a phase where we learn how to go from Red 1 to Red 5, but for GPUs basically. [09:05] Stan, you were at OpenAI at a pretty critical point in time. So... [09:11] People know you from the Dust experience, but one thing that... [09:14] What has to be remembered about, Stan, is you were a critical researcher at OpenAI from 2019 through late 2022. [09:20] You've got a bunch of wonderful publications. Some of them relate to mathematics and AI. You worked on these with Ilya Sitskiver and the crew at OpenAI.com. [09:31] Do you think that mathematics will be essential to...

9:35-11:12

[09:35] this type of reasoning breakthrough, or is it orthogonal? It's something that we're actually going to learn on [09:40] textual or language data. [09:42] I remain quite convinced that it's a great environment to study. And it was a thesis that we had at the time with Guillaume Lamp, who then founded Mistral. He was working at FAIR on exactly the same subjects. And our motivation was really shared at the time. We really were frenemies competing in the workspace, but really friends by the ideas. [10:04] I think the idea there was that mathematics, in particular in its form of formal mathematics that gives you perfect verification, is a… [10:16] very unique environment to study reasoning capabilities and to push reasoning capabilities because [10:21] you have a verifier. So you're not constrained by being able to verify the prediction of the model that in an informal setup would require humans checking them to some extent. And so that's... [10:34] very bits is probably something that has to unlock something at some point. It hasn't yet for many reasons, but at some point it should unlock something. So I remain extremely bullish on the kind of mass and formal mass and LM studies. [10:48] Yeah, I remember one of the ways you were presenting as me when I was still very much ramping up was, you know, maths is the door to software, software is the door to the rest, and started with some of the critical systems that were the very only ones to have been hand-proven and hand-verified as an example of how much more costly it was to do it by hand than do it by machine. And an indication of the future gains we could expect from being able to extend that and democratize that.

11:12-12:53

[11:12] up. [11:13] You guys see a lot of action through the Dust API calls. When you build a Dust Assistant, you're able to choose what type of underlying model to use. [11:22] You're able to call many different models. Me as a user, I often call [11:27] not just cloud three, but I call GPT-4, and I call the dust assistant, and I call... [11:33] In my custom assistance, I select one of many options. What have you guys seen in terms of trends? [11:39] what's performing really well. I've personally been super impressed by the Anthropic models as of late. [11:45] But you guys have [11:46] A much closer view of that. [11:49] I think the... [11:51] I mean, so a word of caveat on... [11:54] On trends, you're going to have the usual cognitive biases. The grass is always greener. People are going to want to switch just to see what it looks like on the other side. And so when you're observing those switches, you're not necessarily observing a conviction that the bottle on the other side is better. You're observing the conviction people want to try. But it is true we've gotten great feedback on Claude's latest sonnet release. [12:15] and [12:17] So empirically, we're seeing some stickiness on that model in our user base. I think that word on the street is for some coding application, CodeStral is actually performing very, very well. We haven't yet made it available through Dust. But it's yesterday. [12:36] Ah, there we go. Sorry. See, this is the thing you get for reading in San Francisco and waking up to do a recording at seven o'clock in the morning. So yeah, CodeStrial apparently is really interesting on some coding capabilities. And then you have to mix it in with the actual experience that people are getting.

13:06-14:44

[13:06] You could see latency literally in the API as people were waking up on the West Coast. So people have use cases that may be more or less tolerant of those. So we cover the Gemini models, Anthropics models, OpenAI and Mistrada right now. And we have seen some interest in moving away from the default, which when we first launched were OpenAI's models. Not to say that 4.0 isn't performing very, very well. Over the past year, there's been a lot of enthusiasm about open source models. [13:36] And it's actually one of your predictions. Stan, you have these great predictions every year about AI. I always really enjoy reading them. One of them was that at some point this year, an open source model takes the [13:46] Brief lead. [13:47] for LLM quality. [13:50] That doesn't seem to have happened yet. [13:52] And it also seems like the enthusiasm around, not the enthusiasm around, but rather the lead slash acceleration of... [14:01] The open source models in comparison to the closed source models has maybe slowed down a little bit, maybe back to that Kevin Scott point about we're sampling at discrete times as opposed to continuous times. We just haven't seen it yet. [14:11] But where do you think the open source ecosystem is going to go? [14:15] Will it actually at some point [14:17] surpass the closed source ecosystem. [14:19] I mean, that echoes with what we said earlier. It's really in that by model distribution, there's one distribution where OpenSource goes nowhere, and there's one distribution where OpenSource wins the whole thing, right? Because if the technology platters, OpenSource obviously catches up, and eventually everybody can train their high-quality model themselves. And at that point, there is no value in going for a proprietary model.

14:49-16:25

[14:49] at the end, which would be a fun turn of event, obviously. And then in the current dynamic, it's true that OpenSource has been lagging behind so far. Obviously, I think the one that has to be called out is really Facebook or Meta efforts because they have what it takes to train an excellent model. And so far, they've been releasing every model. [15:13] very openly. And so that's exciting to see what will come out of them in those next four months to maybe make the prediction true. The caveat to that is that assuming the best model are the [15:31] assumption, yet it can be discussed. [15:34] It means that that model will be humongous to some extent. [15:38] And so that means that even if it's open source, nobody will be able to [15:42] make it run, right? It'll just cost too much money. You'll need eight GPUs just to do in France. And so that will really trump the kind of usage of those models, even if they're better in the current state of affairs in terms of costs of running them. [15:58] It's a point for consumption that's interesting because that means that you might still have a world where there's a lot of API-based inference, demand for API-based inference, regardless of whether the model on the other end is controlled, hosted, open, wait, whatever, just because of the technical abilities to perform it. One of your founding assumptions kind of related to model quality and model performance, and this goes back almost two years now, was that

16:25-17:59

[16:25] Even as of two years ago, the models were powerful enough and potentially economically viable enough that you could unlock a huge range of unique and compelling applications on top. And that the bottleneck, even at that point, was not necessarily model quality so much as. [16:43] product and engineering that can happen on top of the model. I don't know if that's a consensus point of view today. You know, we still hear a lot of people who are sort of waiting for the models to get better. For what it's worth, we happen to agree with you. But the question is, [16:57] What did you see in 2022? [17:00] that gave you that point of view, [17:02] And if we fast forward to today, what has your lived experience been deploying this stuff into the enterprise in terms of where are the product and engineering unlocks that need to happen to bring this stuff to fruition? [17:15] My triggering point for living up on AI was seeing and playing with GP4, and it was coming from two very contradictory motivations. The first was... [17:27] I said before, it is crazy useful. [17:30] Nobody knows about it. Nobody can use it yet. And still it exists. And literally, it's almost already in the API. I mean, at the time, it was 2P3.5 in the API, which was kind of a slightly smaller version of GP4, but on the same training data. It was a crazy good model, which was-- and it was basically Codex, the base model. [17:52] It was much better than chat.tpd. It was available in the API. And yet the ARR of OpenAI

17:59-19:35

[17:59] was ridiculously small at the time. [18:02] Like in existence. [18:05] by all standards of what we see today. [18:08] And so that was kind of the motivation. And that was mixed with the fact that I was starting to feel the [18:15] I had the intuition that it would be hard to [18:20] in events [18:22] an artificial mathematician with the current technology. [18:25] And so I was kind of seeing not a dead end, but a very long path, slow path forward on what I was working on. And at the same time, I was seeing the utility of those models already. [18:37] when you use them for your day-to-day tasks. And so that was first motivation. [18:41] And the very contradictory motivation that I shared with Gabriel at the time was, [18:46] If that technology goes all the way to AGI, it's the last train to build a company. [18:51] So we better do it right now because otherwise next time it's going to be with sheets. [18:57] And I absolutely didn't answer your question, but I'll let Gabriel answer your question. [19:03] I think what got me excited and when we did start brainstorming on the ways to deploy this just raw capability in the world, what – [19:12] Where it made sense to dig was... [19:15] I have one insight on some of the limitations of the hype around fine-tuning at the time. People were talking a lot about fine-tuning. A lot of consultancy firms were selling a lot of slides that were essentially telling big companies to spend a lot of money fine-tuning. The two things that cut it for me was Dan saying, one –

19:35-21:00

[19:35] It's expensive and you do it regularly and nobody knows that they'll have to do it regularly. And two, it's really not the right idea for most of the things people are excited to fine-tune on. And in particular, fine-tuning on your company's data is a bad idea, as opposed to maybe sometimes fine-tuning on some specific tasks where you can see gains. But the idea that bringing the context of a company, which is obviously every real company's obsession, like how does this work for me? How do I get it to work the way I like it to work? [20:03] was going to happen with technologies that weren't just changing the model itself, but rather controlling the data it has access to, controlling the data any of its users have access to. And those are somewhat hybrid models between new world and old world. The very old world version of it is the key holders are still the same. The CISO is the one deciding how new technology is exposed to members of a company, the guardrails that are in place, the observability that's available to the teams to measure its impact and any data leaks. [20:33] software problems, but they still need to be rolled out on very new interfaces because the interfaces now are these assistants, these agents. And then some of the new problems are around access controls. Does access controls look and feel the same in a world where you have half of the actions done by non-humans? Now, I might want to have access to a file. That's like 2020. Like, do I have access to the file? Yes or no.

21:03-22:36

[21:03] access to the file and can give me a summary of it that leaves out some of the critical information I should not have access to, but still gives me access to some of the decision points that are important for me to move on with my job. And that set of primitives, that set of nuances just doesn't really exist in how documents are stored today. So if you think about deploying the capability in a real world environment where people are still going to have to face those controls and those guardrails, [21:29] The product layer is actually very thick. The application layer to build the logic and the usability to ensure performance but also adoption is quite thick. And I think that was the go to say, all right, there's a lot to do here. I might get started. [21:44] Maybe you can dig into that because when we intersected in Q2 2023, we were going to [21:50] Q1, Q2, 2023. A lot of people were still starting these foundation model companies. [21:54] And you guys had a very specific opinion, which is the future is application layer. [21:59] and there's going to be a lot going on under the hood, and we're just going to be an abstraction layer on top of that and let things happen as it happens. We're going to succeed in any case by building something that people actually use and love. [22:11] First, how do you have the conviction for that? Secondly, [22:15] How has that been playing out? What has been the hard part about it? You mentioned the CISOs and the enterprise and enterprise deployments. [22:21] uh, [22:22] You guys have been way ahead of the curve on RAG. [22:25] I mean, Edward was talking about fine tuning, but you guys have done so much in terms of [22:29] retrieving, it was just before it was even called that, really, retrieving and actually making smart decisions around information,

22:36-24:10

[22:36] Walk us through the step-by-step from the idea of application layer to where you are today. [22:41] You can imagine the application layer conviction existing in a world where you still decide to build a frontier model. The reason we split those two is, one, it seemed like a lot of money for a lot of risk. And I mean a lot of money for a lot of risk to try and develop a frontier model or an equivalent to a frontier model and also make a bet on the way it was going to be distributed. And it's so our internal slogan was no GPUs before PMF. [23:11] training our own model until we actually know which use cases it's going to get deployed on and there are much cheaper ways to explore and confirm which use cases are actually going to make most of value and generate most of the engagement the second reason was was really about this this [23:29] This data contradiction, like the fact that the cutoff dates for training on Internet data are hard to set continuously, the fact that you can't actually get an internal understanding of what happened last week in a frontier model means that fine tuning is a hard problem, that it is not a solved problem at scale. And so if you walk from that conviction backwards, that means like it's there are many cases where it's not solved. So another technology has to be the one to deliver most of most of the gains. [23:59] And extracting a small piece of context from documents where it lives, feeding it into the scenario, the workflow that you need help for.

24:10-25:41

[24:10] The one trend that seemed interesting was that actually many decisions require limited amounts of context and information to be greatly improved. So the context windows at the time that were small were already compatible with some scenarios of saying, let's just bring the information in. [24:40] the frontier model. And what we've experienced is, first of all, it takes time for people to understand those distinctions. It's hard and that you have to get yourself out of your own bubble regularly to realize that it's true. The world, the future isn't quite evenly distributed yet. And people have varying assumptions on what it means to roll out AI internally or roll out the capabilities of these frontier models on their workflows. And you have to walk them back on [25:10] things. I want to work faster. I want to know the stuff that I'm missing out on. I want to be more productive or more efficient in some tasks that I find repetitive. And then only bring the explanation of what technology is going to solve that when it's absolutely necessary, because people will worry about their experience and how they feel about it more than how it's working under the hood [25:31] 99% the time. [25:33] The big insight that's happened and that I think we're leaning into, we have been for a while, and it's great to see some of the market also doing that is,

25:42-27:30

[25:42] People are actually really good at recognizing which tool they need in the toolbox. I think we've not respected users enough in saying, you need a single user that does absolutely everything. And the routing problem should be completely abstracted from you. You should ask this question to the one oracle, and the oracle will reply. People are pretty comfortable telling a screwdriver from a hammer. And when they want to get to work and they need a screwdriver, they're very, very disappointed. When they get a hammer, and it sounds like a hammer response. [26:12] design, deploy, monitor, iterate on, improve, all those verbs that require product service, it was quickly apparent to us that people were very comfortable with that. [26:23] And so the number one question that made us feel like we had an insight to hang on to and lean in on was... [26:30] everybody asking us about DAS was obsessed with the top use case. It's like, what are people using it most for? What is the top use case across companies? And I could almost see the Amazon eyes trying to decide which diapers.com they're going to verticalize and integrate. Like, which verticalized use case should we now just build as a specialized version of this? But I think the full story is fragmentation. I think the story is like giving the tools to a team or to a company to see opportunities for workflows to be improved on, augmented, [27:00] and understanding the Lego bricks that are going to help them do that. So rather than [27:05] encapsulate the technological breaks that are useful and abstract them away from users, exposing them at the right level gives people a ton more autonomy and really the ability to design things that we had never thought of. Some of the scenarios that have come up, we literally could not have imagined ourselves. That idea makes sense, like the fragmentation and providing people with Lego blocks to see what sort of use cases emerge. Just to make it a little bit real, though, can you

27:35-29:29

[27:35] surprising or particularly valuable? Just something to make it a little more tangible? There's obviously a ton that people are thinking about. The category of obvious use cases that have been interestingly and quickly deployed are enablement of sales teams, support teams, marketing teams. And that is essentially context retrieval and content generation. I need to answer a ticket. I need to understand what the answer to the ticket is and generate a [28:05] need to understand which vertical they're in and how our product solves their problems and draft an email to follow up on their objections. I need to prepare a blog post to show how we're differentiated from the market. Again, I'm going to go and plow into what makes us special and generate with our tone of voice. Those were pretty obvious and quite expected. What I've been excited by is to see two types of things. One, very individual assistants, personal coaches, people, [28:35] asking for advice on a weekly, on a daily basis? Like, how did I do today versus my goals? Where do you think I should focus my attention in the coming days? Can you actually break down my interactions on Slack and in Notion over the past couple of days and say where I could have been more concise? [28:50] I'm getting the feedback that I'm sometimes talking to theoretically. Can you point out the ways in which I can improve on that in these two notes that I'm about to send? And so that's exciting because... [29:00] Our bet was, you know, we want to make everybody a builder. We want to make everybody able to see that it's not that hard to get started. And by reducing the activation energy there to see small gains immediately rather than wait for the next model or the next version that's going to really solve everything for them. Personal use cases have been great for that. The second family of use cases that I'm excited by are essentially cross-functional. So where the data silos exist because the functions don't speak the same.

29:30-31:01

[29:30] but they don't speak the same language. And so understanding what's happened in the codebase when you don't know how to code [29:37] is powerful. Having an assistant translate into plain English what the last pull request that's been merged does is powerful. It's powerful to people that were blocked in their work, didn't know who they should bug to actually get an update. So, you know, marketing to engineering, sales to engineering. The other scenarios are extracting technical information from a long sales call is powerful because it means that the engineer doesn't need the abstraction of a PMM [30:07] with a key account. They can just actually focus the attention of an assistant on that type of content on their own project and get those updates. So I'd say that's the family of assistants that we're excited by because they really represent [30:21] I think the future of how we'd love fast-moving, well-performing companies to work where the data that is useful to you and the decisions you should make is always accessible. You don't need to worry about which function decided on it or created it. You can access it, and that fluidity of information flowing through the company helps you make better and faster decisions day in, day out. [30:43] Yeah, any other examples I'm missing, Stan, that you think you're excited by? No, I think what I wanted to add is the fact that, as you said, the usage is extremely fragmented. We see over and over the same scenario, and so we have data to back that kind of proposition is fragmented.

31:01-32:46

[31:01] We built Dust as a sandbox. [31:04] which makes it extremely powerful and extremely flexible, but also has the complexity of making activation of our users not trivial. [31:14] Because when you have an horizontal sandbox-like product, you're like, yes, but for what? [31:18] And so generally the pilot phase that goes with our users starts by clearly identified use cases. So they really kind of try to answer the question, what are the use cases I should care about for my company and try to identify a couple of them. [31:32] And we always see the same pattern. We see first use case get deployed, use stage starts. We try to move laterally to another use case. Second use case gets deployed, usage picks up a little bit more. And then we generally go through a phase where the usage is kind of flat, increasing slowly. And eventually it reaches a critical mass of usage, and all of a sudden it skyrockets to something like 70% of the company. And that's kind of the pattern of kind of oxidation of our users. [32:01] and the skyrocketing to 70%, the usage peaks up to Tom, the original use case that were identified by the stakeholders become just anecdotal compared to the rest of the usage. And that's where we feel like this provides odd value. And it's very hard to know for us what are all those use case because [32:18] We have examples of company with a few hundreds people and a few hundred assistants. And so it's just it's just hard to answer the question. What are the best use cases like? [32:27] Those are great examples. And that calls to mind an analogy that I would like to try out on you guys. And you may puke on this analogy, but this is what just showed up in my brain, which was a lot of those use cases you described, you could imagine some sort of vertical application being built around those use cases, but

32:47-34:30

[32:47] And the analogy that comes to mind is, [32:49] There are a gazillion vertical applications, and yet, where does a lot of work happen? [32:54] spreadsheets. [32:55] Why does it happen in spreadsheets? [32:57] Everybody knows how to use a spreadsheet. [32:59] They're there. They're flexible. You can you can customize them to your heart's content. And so the analogy that I'm wondering about is this almost like the spreadsheet of the future. You know, some of these applications may get peeled off at a vertical specific applications. But even then, people are still going to come back to the to the personal agent because it's just it's there. It's available. It has access to your data. It's familiar. You know how to use it. You can build what you want quickly and simply and effectively. [33:29] what this... [33:30] I think it's an amazing analogy for another thing that I'm thinking about, which is it took me the longest time to get Stan to use spreadsheets when we started to work together. And this is way back when. This is like, I don't know if it was 20 years ago, 15 years ago. And then at one point Stan uses it for something and is like, oh, wow, this is kind of like a cool REPL interface where you can just get the results of your functions in real time. And I was like, yeah, that's now the work. He's like, it's a cool REPL interface for non-engineers. I get it now. [34:00] Experimentation cost is very very low if you think about [34:04] The way in which some of our customers try and describe the gains that they're experiencing or that they're seeing and their excitement for the future is... [34:14] Some functions, we've had 80% productivity gains. Some functions, we're seeing 5% productivity gains, and we're not even sure that we're measuring them right. But we're seeing gains when the specialization of the assistant is close enough to the actual workflow that is able to augment.

34:44-36:19

[34:44] is going to be a fit is sometimes complicated, when sometimes that's where the performance gains are the most obvious. [34:51] One of our users has seen like 8,000 hours a year shaved off two workflows for an expansion into a country where they decided not to have a full-time team. And so basically sparing you some of the boring details, but like the ability to review websites, compare them to incorporation documents in a foreign language, have a policy checker that was making a certain number of checkpoints very clear to the agents that were reviewing the accounts, [35:21] because they were really exploring the country, and immediate gains, like very, very easy iteration on the first version of the assistant, two weeks to launch it into production, roll it out to three human agents that were then assisted by these assistants, and their CTO sharing, like, you know, we're seeing north of 600 hours a month. I'm thinking our pricing is terrible, but what I'm excited by is that [35:51] verticalized sales motion. Because I just don't know how you get to that fairly junior person in a specific team and actually are able to pitch them and deploy that quickly. Whereas if you have that common infrastructure that people understand the brakes of, [36:08] Not everybody knows how to do some products. Not everybody knows how to do a pivot table, but everybody understands that they can just play around with the basic things and probably get help from somebody close to them. That's the other thing we've seen, you know,

36:19-37:49

[36:19] The map of builders within companies, this heat map of people, [36:25] What's amazing about it is that it's people who are just excited about iterating, exploring, and testing new stuff, which I think correlates well to high performance or high potential in the future. I think Dust is heat-seeking for potential and talent across your teams because the people using it the most are people who are the most comfortable saying, [36:44] I don't feel threatened by something that's going to take the boring and repetitive side of my job away from me. I'm excited to have that go away and focus on the high value tasks. [36:51] I think for the first six months, I was one of the loudest voices saying, what is that main use case? I think you guys heard many, many times. And then eventually I realized this is a primitive. We're talking about spreadsheets. You could talk about, frankly, a Word document. You could talk about Office Suite. [37:07] When I interface with Dust, I think about it like Slack, except I'm not slacking my colleagues. I'm slacking assistants, and they actually do this kind of work for me, and I can show them the kind of work. So it feels, Pat, to your point, something like a spreadsheet meets the ergonomics of a Slack. [37:24] As it's brought to me as opposed to I have to go to it. [37:29] And that is... [37:31] It took me a while to get there, and now I see how the fragmentation is the power. [37:37] of what you're going after. Gabriel, I have a quick question on sort of the psychographic of your user, because you're [37:43] your comment that it's like heat seeking for the people who are sort of ambitious and innovative and stuff like that. [37:49] Um,

37:50-39:04

[37:50] I don't know if you have a name for them, but let's call them the makers, you know, the people who are not afraid to try to try new things and try to build stuff. [37:58] Have you come up with a systematic way to find those people or do they tend to find you through word of mouth or some other thing? Because that's not, you know, LinkedIn profiles don't say, you know, Gabriel, migger, right? I think it's a super interesting question at a couple of levels, but our motion is dual, right? So the things that predict a great outcome with dust, I'm coming out of a call and trying to think about what was most powerful about this call I had yesterday with the chief people and systems officer of the company. [38:28] that could not stop interrupting me five minutes into my pictures. Yes, I did the talk on this. Yes. I've already read about this. I've got a blog post on this. Okay. When can I demo? Where do I put my credit card? Let's call you next week. And it's, [38:39] The top-down motion is enthusiasm and optimism about this technology changing most things for most people who spend most of their days in front of a computer. You need that. That's a necessary condition because I think it unlocks three things. One, it unlocks the belief in a horizontal platform for exploration, the ability for security to be in the supportive business rather than a blocker, and genuinely sometimes example setting.

39:09-40:50

[39:09] last week and leadership meetings are being asked, they're doing off-site about like, how are you going to get better at answering some of your team's queries faster with us? So once you have that, then you have the right sandbox, I'd say that the right Petri dish. [39:25] I don't think we've fully cracked the builder identification. So right now it's more like bait. It's like the product is incredibly easy to use. Anybody can create an assistant, even if they have not been labeled a builder by their organization. And it's just the sharing capabilities of their assistant that are somewhat throttled. But we can see from the way in which people explore the product, create assistance for themselves, share them with their teammates in a limited way, a great predictor of that type of personality. [39:55] who are going to be in that family. I'd say the number one discriminator is... [40:01] It's somewhat to a degree, it's a bit ageist, but like people who are maybe earlier on in their careers who have a mix of tasks that they obviously know they can get an assistant to help with. And so they have use case one just laid out for them. People who have repetitive tasks and people who have scripted their way out of a lot of repetitive things before. Just to be explicit, like we had the conversation. I think it's okay to say. [40:25] Like it is people under 25. Like we were saying yesterday, the power users, the people that are using this all the time, [40:31] at the companies are the people under 25 because they aren't set in their ways. Just to be explicit, and that doesn't mean everyone. You can be [redacted address], but in general, they don't have the pattern that they've been set to. And by the way, that's true of a lot of the next generation of productivity notion, which Pat works really closely with.

40:51-42:34

[40:51] That is a under 25 power law type business. And, you know, the teammates here under 25 keep pushing me to transfer over to Notion. And it's just a different type of thinking. It feels like a very similar motion. [41:04] Add dust. [41:05] Yeah, I think that the – [41:08] But the one thing we had that we have, which is useful, is that the immense B2C success of ChatGPT as a now obviously world famous product has made it really easy to set up pilots by just telling teams, do you know what? Send a survey out. Ask people how often they've used ChatGPT for personal use in the last seven days. Like rank by descending order. And that's your pilot team. [41:38] You know, we've asked the entire world to move from calculator technology, punch the same keys, you'll get the same results, to stochastic technology. Ask the same question, you'll get a slightly different result. This has not happened. This is the biggest shift in, you know, the use of the tools that we have since the advent of the computer. We're asking an entire cohort of the workforce to move to a stochastic mindset. And the only way you get that is by having a risk-reward ratio that you're comfortable in. It's like, you know what, I'm not asking it to be right 100% of the time. [42:08] asking it give me a draft that saved me time many many many times over and that distribution of roi is something that i'm comfortable exploring with and it's rating on and i think that that is really one of the predictors that we see in people who've tried chat gpt or in people who are just curious with new technology is they expect that some of it's going to be a bit broken but the upside scenario to them is so clear and so 10x that they're willing to make that trade-off or that local risk to uh to get things started

42:34-44:07

[42:34] So you guys have a lot of very strongly held beliefs. [42:38] internally. [42:40] and externally. And the good news is you've consistently been right. [42:43] without the strongly held beliefs. [42:45] You've named a few of them. I mean, you've talked about this shift from deterministic to stochastic way before it was mainstreamed. [42:52] You talked about rasterization and vectorization. I think about that. That can be unpacked if you'd like. Certainly would need big unpacked on the show if we go down that rabbit hole. You talked about no GPUs versus PMF. [43:05] Right. Can you just walk through [43:08] some of the [43:09] beliefs that dust lives by. [43:13] It can either be philosophical or [43:16] as a couple of these are, or tactical, like the Node DPUs before PMF. [43:21] The first one is really the continued belief that focusing on products is the right thing to do, because it really feels to me like we are only scratching the surface of what we can do with those models. [43:34] Right now, we are starting from the conversational interface, so that's why you use the Slack analogy. And I really, truly believe that that analogy, the Slack analogy, will not sustain time because the way we interact with that technology will change. It started with the conversation interface, but it will happen in a very different place in my opinion. [43:52] Basically, those models are kind of the CPUs of the computer. The APIs and the tokens are really the bash interface. What we're doing right now is merely inventing bash scripts.

44:08-45:38

[44:08] And we have yet to invent the UI. We have yet to invent multiprocessing. We have yet to invent so many things. We are really at the very beginning of what we can do from a product standpoint with that technology, whether it evolves or whether it stays like it is. [44:24] One word that I think is going to be important, and I feel recent news has actually changed [44:31] uh, uh, helped confirm or is an interesting new drop in the bucket for his, um, [44:36] The notion, so one of our product mottos is augmenting humans, not replacing them. And it's not just the naive version of saying like, we're not here to get people fired. It's really that we think there is tremendous upside in giving people who will still have a job in five to 10 years time, the best possible exoskeleton. And that it's a very different kind of company and kind of product conversation to be like, all right, how many dollars are we going to take away from your OPEX line next year? [45:06] is the number of latent opportunities that you are not able to explore as a business because your people are dragged down and pushing like stale slide wear around or not even knowing what dependencies they have on the rest of the company like this is how much friction you've imposed on the smart people you've spent so much money hiring because half of their day or part of their week is spent doing things that we should literally not be talking about in 2024 so that's one and the thing that comes back to to to the drop for a second you've been saying that from the beginning gabriel [45:36] And in the beginning, you didn't use the word productivity.

45:38-47:17

[45:38] Like you didn't want to use the word productivity. I wonder if that shifted. [45:42] And if so, the nuance around [45:45] why you chose not to. [45:46] I think productivity, there's two terms that I was hesitant on. Productivity to me sometimes feels like an optimization when really... [45:53] um there's two ways to be productive there's doing the same things faster and there's doing just better things and i think you know the mixed effect of productivity is is is enshrined in an effort versus impact at the end of the day your boss is never going to be mad if you spent no time doing the things you were assigned to do but brought in the biggest deal for the company nobody's actually going to make any comments on on that being the bad decision because i think the more you grow in your career and the more you're close to the leadership of the company the more [46:23] comes in sometimes unplanned, hyperplenary, completely like left field ways where it's like [46:30] Of course, we needed to focus on this and it's clear in hindsight, but you need to free up time, space, energy and mental cognitive space for that. The other one was enterprise search. I just feel like enterprise search is one that we didn't want to put on the website because retrieval of information is obviously a use case that people are very excited about very quickly. [46:48] But we're just very convinced that looking for the document is a step that people are not particularly passionate about. Nobody wakes up in the morning and is like, I'm so happy that I'm going to just get the right document the first time around when I do the search. People just want to get their job done. And it just so happens that using context from three different documents across seven data silos help them get it done faster or better. And so I think the search bit is just it's never the job to be done. Nobody really wants to search. They want to they want to complete. They want to they want to prove they want to test.

47:18-49:02

[47:18] The search bit is a step that we think will get abstracted. And going back to Stan's point, I think that the interfaces and the experiences we have with this technology will sort of really try to forget about what the original data source was quite fast, potentially, once we've gone over the trust hurdles that exist today. [47:36] um the the thing that that this all comes back to is collaboration collaboration between human and non-human agents and and i think projects uh by by anthropic are an amazing um an amazing example here um we thought about co-edition last summer we have an amazing intern uh from from mit with us last summer and uh who spent their time working on a co-edition interface how do you chat to an assistant to make something that you're thinking about better whether it's an app or a [48:06] or a document or a script. And this is something that obviously the recent release by Anthropic has made very palpable to many more people. That is to me the interface and the interaction that we need to get right. And that will be in the future. So we say augmentation and we'll stick to it because I think it really helps us focus on the interfaces that help humans and non-humans make progress faster. It's going to be about proposals. [48:36] loop with a proposal that's written just in the right way to decide if we swipe left or swipe right on it. It's going to be co-edition. How do I have the language of the human in front of the assistant be as easy to interpret and as foolproof as possible for the final project to move into its final form as quickly as possible? And so you need that interface, that interaction between the agent and the human.

49:02-50:41

[49:02] And you forget that when you replace too quickly. When you focus on just replacing and removing, you've built something that is fire and forget, essentially. And you'll see the gains. You'll see the dollar gains. But if you've automated 100% of your customer support tickets, [49:19] You still need the insights from what people are pissed off about. You still need to understand and have your finger on the pulse of why people are stuck. Otherwise, you're slowing down your product development efforts. And product development efforts today live and die by some of the comments that are coming in from support tickets. And so how you've made that problem go away and become like actually maybe cheaper, sure, but also virtual and harder to connect to is amazing. [49:44] is not, I think, a super long-term view of how your product and business is going to serve your customers best. [49:51] because [49:52] You still need to think about the ultimate interfaces that are going to enable the decision making to make it better and strategic and the best option for your customers in the future. [50:02] So keeping the human in the loop always. I mean, it is human one way to stay it, but it is drip this human driven, like the whole point of all of this technology that we are building is to serve humans better. [50:13] And as soon as you remove that, you've made a terrible mistake. [50:17] Because someone else is not going to do that. And they're going to actually have a better experience with customers. [50:22] and employees. [50:23] and stakeholders, and then they're going to win. [50:26] You know, obviously there's scenarios in which you're going to catch me and you'll be like, you know, this one, this, you know, we know that humans get it wrong way more. And so we should obviously replace it. And this is a complex and nuanced problem. So I'm sure there's certain areas of it where pure replacement has...

50:42-52:13

[50:42] Fully. [50:43] understood, non-external, with no negative externality value. [50:48] But I'd venture that we're pretty poor at modeling where value is created and how it's funneled through the parts of our company today. And, you know, economists have been great at showing that when you don't price negative externalities well, we end up in pretty messy situations. [51:01] And so this is this is the question that I that I post to leaders who are asking you know, what should I automate first? I'm like, well, I [51:08] I don't know which parts of the company do you worry about the most. And often I just find that CEOs are panicked about what their customers say on support tickets. And so making that problem go away, making that problem less visible might be great for some OPEX conversations and your stock price. [51:22] could have unforeseen consequences if you haven't funneled it through in the right places. But also, I think... [51:30] There's so much more to do than to shave 3% off your balance sheet. The, the, the, the, the, the, [51:35] The spectrum of opportunity that you're giving your team [51:39] if there's technologies in their hands and if they're able to come up with ideas. [51:42] is broader than just [51:45] Firing... [51:46] people out of their jobs. And I'm not saying you shouldn't do that. I don't want dust to be perceived as naive in this ecosystem where the disruptive nature of this technology is going to take some people's jobs away because those jobs... [51:59] were currently being done by humans for lack of a better alternative. I think in certain situations you could see those jobs as having been created because we were waiting for the robots, having been framed in a way that was because we were waiting for the robots.

52:13-53:43

[52:13] But I don't know that that's what leaders of companies are excited by. I think that the upside, the future, the way in which we need to be resilient, anti-fragile for what's to come and what our competition is going to come up in, those are the ways in which... [52:28] energy and support I feel should be fueled to support teams. [52:32] You guys, second time founders. [52:35] You started your first company over 10 years ago. You were an early acquisition of Stripe. You guys were there super early on. [52:43] What have you learned and done differently? [52:45] This time. [52:46] as second-time founders? I think... [52:49] Really understanding that a few explosive bets are more likely to get you anywhere meaningful than over-optimizing too early on on something that is still meaningless in the market. [53:02] That's one thing that I think we think about differently. [53:05] So like exploring versus exploiting, uh, [53:08] and all those frameworks. That's one. I think the transparency, the trust and empowerment that you give to your team [53:16] is we i don't think we were against it it's more that we were clueless about how much more empowering you could be so uh the the idea one of the best words from my stripe years was paper trail and it was you know you you had two people in a corridor have a conversation and then one of them would take the time to just write a paper trail in slack or in a document say you know what we just had this exchange and we've moved the needle in this in this direction and it saved

53:46-55:24

[53:46] to go in a meeting room or figure out that this decision has been made. [53:50] And it feeds a graph network of trust and respect for your co-workers that is, I think, second to none in how you can then just achieve more as a team. [54:01] So culturally, you need to sort of push that to begin with because... [54:06] especially people who are earlier in their career, will not always feel comfortable with how information should be shared. So I think that's one where example is important. [54:15] Um, [54:16] big markets that you really believe in for a long time. We loved technology when we started our first company like 12, 13 years ago. It was like, this is great. This is amazing. These are QR codes. Everybody's going to use them. [54:28] And it's like, no, we have to wait for a pandemic to sell QR codes. Okay, I'll do that next time. And so falling in love with the technology and not really fundamentally understanding how big the business could be if it's successful. [54:39] And asking that question early and unabashedly is one thing that I feel is different. So what we kept is our experience together. I think it's an entire advantage to having built a company with a person because you've explored everything. You've explored the beauty, the terrible, the joy, the pain, and you know pretty much the entire API in and out. And so that enables a much more efficient solution. [55:06] co-founding, I mean, co-founder [55:09] interaction and collaboration. I think it's a really big unfair advantage. [55:14] Um, [55:15] I think the biggest one that I think is completely different for me and that Gabriel mentioned is about empowering people.

55:24-56:54

[55:24] Really, as a founder, it's not to you. I mean, it's not to you early and it's to you to build and to build that initial spark. But then for the sake of the company, you are not the one that has to build. You're the one that has to build. [55:39] create an environment for people to be empowered to build those things and explore and create new stuff. And the best value you can give is [55:48] I don't like to use that word. Leadership was coming to mind. It's not necessarily leadership. It's really guidance and trying to create an environment where everybody has the chance to do what they want, but yes, in a guided environment so that everything works as a whole. But that would be the biggest difference and something that we learned about a time at Stripe at least, personally. [56:10] So guys, let's move to a lightning round. We've got a couple of questions for you. All right. Lightning round. Question number one. [56:17] Stan, you share these predictions for where the world of AI is going on Twitter from time to time. [56:23] At this moment. [56:24] What is your top contrarian prediction for where the world of AI is going? [56:31] And don't give me this bimodal little bit of this, little bit of that. Let's hear a point of view. What's your top contrarian prediction for where the world of AI is going? I see it. [56:42] Thank you. [56:43] Thank you. [56:44] It's a lightning, so I have to answer something. It's going to be tough. I think we're on a very drive entering a pretty tough period. [56:53] How so?

56:54-58:27

[56:54] The excitement will go down. Maybe it'll take time to get to the next stage of the technology. There's tremendous value to create, but people will not see it yet. And it'll take a long time for it to diffuse through society. So there is massive amount of value to create, but it's going to be a... [57:10] We might have tough times in front of us. [57:13] All right. Short-term pessimist, long-term optimist. I'll tell you that. All right. Lightning round question number two. [57:17] And this is for both of you. [57:20] Who do you admire most in the world of AI? [57:23] Ilya is just incredible. I've had the chance to work with him. He's my favorite people in AI. [57:31] He's extremely smart, but he's not a genius builder. He's a genius leader. He's just a visionary. And I think that's been incredible. [57:39] Carl Partier, I actually don't know him, but I admire him a lot. And in terms of pure genius in AI, I think it's Shimon and Jakub at Hob&AI. [57:50] They have crazy last names, so I'll let people look it up. But Shimon and Iakou are... [57:54] Thank you. [57:55] I'm... [57:56] impressed by those who've been around for a while and are good, and they're acting as good resistance and condensator elements in the system. They're just, you know, like providing the friction to, uh, remain, uh, [58:10] optimistic but cautiously so uh [58:14] And to me, in one of the first... [58:17] I think it was, I can't remember if it was a tweet or a podcast or an article, but hearing Yandakarond would be like, you know. [58:23] We can make pretty good decisions with a glass of water and a sandwich.

58:28-59:59

[58:28] And these things require... [58:30] Power Station sized data sources and are not making great decisions on some things. So we've [58:36] We feel something is missing. [58:39] And just like elegantly putting that back into perspective has been interesting to me. [58:45] Because it's hard to not cave in. [58:47] to the hype, I think. And so in some ways, pushing for a simple ideal like being open, which I think Yannakang is doing quite aggressively, despite that not always probably being the easiest decision. [59:02] And also saying, you know, we probably haven't solved everything all the time. [59:07] is nice. And from my personal experience, the researchers that have worked for or with him [59:15] have learned and taken from that quite a bit. And so that [59:20] And it's not French, but, you know, some touch of modesty, touch of temperance. I've appreciated in my discovery of the generative side of artificial intelligence. Like after 10 years of just doing your prediction and classification from fraud and risk and onboarding at Stripe and healthcare claims management and things like that, it's nice to feel like there's... [59:46] some people who've seen a lot, done a lot, and are just questioning rather than questioning. [59:53] Affirming. [59:54] All right. So that brings me to the third and final lightning round question. You chose a Frenchman.

1:00:00-1:01:54

[1:00:00] for your most admired Gabriel. [1:00:03] And dust is proudly made in France. Paris has been in an epicenter. [1:00:08] certainly an epicenter for all things ai [1:00:11] your take on the Parisian ecosystem and what do you want to say for the French founders listening to this podcast? Other than I'm sorry it was in English. [1:00:20] It's their fault, not ours. [1:00:23] Yeah, I think the friendship system is awesome because we compared to where it was 12 years or 15 years ago when we started our first company, now we have tenants because there's been a generation of scale-ups that went through the market and trained all that tenants and [1:00:41] Most recently, that kind of explosion of AI talent as well, which is super exciting. So I think it creates a pool of talents and with the right conditions to create incredible companies. Obviously, tackling the U.S. market from France is a challenge. And so that's something to be taken into account, of course. [1:01:03] Yeah, I think if you have ambition, there's a lot more to do. And then as long as you're not naive, where there are still some realities, you can fight some aspects of narratives, you can't fight gravity, or at least you shouldn't, you should probably work with gravity way more than you should fight it. [1:01:22] Uh, [1:01:23] But there's a ton more we can do. And I think we have to behave a little more like... [1:01:29] Tech countries like Israel, I think, in mixing ruthless ambition, a recognition for where talent is and how it's already connected and has high trust connective tissue, which I think is a great catalyst and accelerant in making great companies happen. But a recognition for where the markets are, where people are buying, where people are paying and how quickly people are making decisions on shifting to new technologies, especially in that space.

1:01:59-1:03:04

[1:01:59] if you've always been friends, you have kind of that feeling that something magical must be happening in the US, something special, there must be something special about those people. Well, I'll tell you, I've been at Strive, I've been at OpenAI, I'm working with Sequoia, these all are... [1:02:15] Normal humans. They don't have any magical capabilities. They're just like us. And so it's really important to really be ambitious and believe strongly that you can... [1:02:27] You can make it, you can do it, whatever it is, from France versus the U.S. [1:02:32] Wonderful. [1:02:33] That's a good place to end it. Thank you, gentlemen. Thank you, guys. [1:03:03] .

Want to learn more?

Ask about this episode