Neal Richardson - MCP, or not MCP | Pydata London 26

Transcript#

This transcript was generated automatically and may contain errors.

Thanks a lot. I'm really excited to be here. You all are just coming from the keynote, so you're all ready to talk about MCP and Fast MCP and stuff. So I'm going to talk about MCP, and when it's great, maybe when it's not the best tool for the job. Who here has used MCP before? Who has made your own MCP server? Cool. Well, I hope through this talk that you'll develop an understanding of when you might want to do that and how you might do that. And just to tease that a little bit, it's really not that hard.

I want to start by talking about how I first finally came to understand what MCP was about and when I would want to use it. Because when it first came out, I was like, oh, this sounds cool. I should use this. But I didn't quite grasp, like, how. Before I do that, to give a little context, Katrina just introduced me, but that's me. I've been working in open source and data and the intersection of that for most of my career. I'm a member of the Apache Arrow PMC. For a number of years, I developed Apache Arrow. I was the chair of the project management committee last year, and I'm currently VP engineering at Posit.

One of the teams that I work with at Posit is the Posit Connect team. So for those of you who don't know, Connect is a platform for deploying and sharing custom apps, dashboards, reports, APIs, notebooks, things that you write in the language or framework of your choice, whether it's Streamlit or Dash or Gradio or Shiny , FastAPI, now have Node.js support, and so on. And you can also host MCP servers there.

The problem MCP solves

Like many of you, we've really been leaning into what we can do with agentic software development on our engineering teams. And particularly over the last year, we found ourselves just finding more and more things that Claude Code is just really good at doing, as the models have gotten better and the harnesses have gotten better. And this is a very strange time to be making software, I think we can all agree. But one of the things that's really positive about this and really exciting is that there's so many problems that maybe seemed to be hard or out of reach for us in the past that now they're now accessible to us.

You know, we can be much more responsive to suggestions of good ideas. We have a lot of customers that give us lots of feedback about how we can make Connect better. And now it's so much easier for us to say, I don't know, let's see what that would look like. Claude, let's go fire off a get work tree and take a look at that. Without having to do all of the heavy duty context switching that you would have to do to stop what you're doing and really understand what the idea was.

So as we were getting more, finding more and more uses for agentic workflows, you know, it led me to think like, all right, well, what else can I throw Claude at? What other problems are there that we think are hard or painful that maybe aren't anymore? And one of them was responding to customer support escalation. So Posit uses Zendesk for customer support. Customers will file tickets on Zendesk that get answered by our support team. And if the support engineers can't address it right away or it's going to require some more complex debugging, it gets passed off to the engineering team. And so an engineer is going to have to stop what they're doing and try to reason about the customer's environment and the peculiarities of it, because Connect is on prem software that can be run in all sorts of configurations. And so it's just, it's a heavy duty context switch that is hard and not enjoyable.

And so I thought, well, Claude's really good at reading through the code base and understanding how, you know, things that are very far apart work together. What if I could just like, can I just pipe the Zendesk information into my Claude Code session? Because there's great information about, you know, error messages, logs, sometimes, you know, Cloud server diagnostics. And I just want to say, like, take that and tell me what's going on. I want it to do this. I want to paste the link to the ticket and say, help me with this.

But this doesn't work. Because, well, at first it requires authentication. And so Claude will try and say, oh, I can't. You need to, can you just paste me the contents here? Just defeats the purpose. And so I can't, there's not a way for me to just give my credentials to Claude to log in for me. And I wouldn't want to do that anyway. Because I don't want my API key or my username and password, like, in the context here. So that's one problem. But even if I could do that, you know, Zendesk has a REST API. But the API is not designed to work with AI agents. It's not designed for them to be efficient with tokens. And it's not designed for security.

So that's what MCP is for. So MCP provides a way in the specification for you to authenticate securely with APIs without having to give your credentials to your agent. And by writing custom MCP servers, you can control exactly what tools are made available and what capabilities they expose. So this is going to improve the performance of the model, but also protects you against some security vulnerabilities. And then by hosting your own MCP server, you can easily share it with others on your team.

So that's what MCP is for. So MCP provides a way in the specification for you to authenticate securely with APIs without having to give your credentials to your agent. And by writing custom MCP servers, you can control exactly what tools are made available and what capabilities they expose.

So I'm going to talk about MCP today and what it's good for, what it's less suited for, and some best practices. Because when it's the right tool for the job, it can really unlock some great superpowers. A key point that I want to make sure comes across is that you can write and host your own MCP servers. You can do exactly what you need. There's a lot of public MCP servers out there. A lot of companies provide MCP with their products now. You can use those. But I'm really going to be focused on the ones that you write yourself. Maybe you're composing, you're kind of proxying and taking some tools from MCP servers that are out there. Or maybe you're just purely writing your own. And writing your own is a lot easier these days with tools like FastMCP, and agents can write them for you.

As a human, I can read the JSON dictionary into memory and I can just choose not to look at the two other copies of the comment body. But the agent doesn't have that choice. It's all just tokens, right? So less context means lower cost and more likely to get the answer that you're looking for.

And finally, I didn't actually do this in this particular MCP server, but it's also good to just be mindful of your data hygiene. And so if there's any personally identifying information that you're pulling in potentially and the agent doesn't need it, you can strip it out. So here's just a function that strips out the username but keeps the domain for an email. So I could still see that somebody from Posit left this comment, but the individual is not identified.

So again, when you're building the MCP server, you might not know right away what agents it's going to get pulled into. I have restricted to only read-only tools, so I'm not exposing within this particular MCP server to the lethal trifecta. But if someone else is in their agent, they take this MCP and they take another one that's got the ability, maybe it doesn't expose private data, but it has the ability to write, then we're kind of back in there. But this is something that we can and must, as owners of these projects, take care of.