Gordon Shotwell | Socure | Creating Secure Systems for Growth | Posit
Setting Up Secure Systems for Growth
Presentation by Gordon Shotwell, Lead Data Scientist at Socure

Abstract: One of the main challenges of doing data science is getting access to the data in the first place. Data scientists need to be able to look at data in detail to do their work effectively, but that necessarily creates data security and data governance problems for the organization and its clients. In this presentation, we go through the social and technical processes that can create a secure data analytics environment and set your team up for success.

Speaker Bio: Gordon Shotwell is a Lead Data Scientist at Socure, where he helps develop tools for data scientists to securely and efficiently work on sensitive data. They're hiring too! https://www.socure.com/about/careers

Timestamps:
09:45 - Make friends with the security team in your org
19:40 - Set up child-proof data science environments
22:11 - Developer experience is a security problem: people will do the easy thing, so make the right thing easy
35:34 - Buy tools, don't build them: an economic lesson in understanding cost centers

Read the related blog post here: https://blog.rstudio.com/2021/10/26/how-data-scientists-and-security-teams-can-work-together/
Transcript
This transcript was generated automatically and may contain errors.
Thank you so much for having me. I'm located in Halifax, Nova Scotia, where it's quite beautiful today. What I'm going to talk about is security: how to approach getting your data scientists working on data in a secure environment, and how to interface with the other parts of the company. One thing I realized while figuring out what to say is that there isn't really a single right answer for what level of security any of these systems needs. So these are general strategies. The particular things you need to do, like "we need this kind of authentication" or "the data needs to be stored this way or tracked in some fashion", are going to be individual to the place that you work. This is more about how you work with the people and systems at your company, and how you set those things up to be successful.
About Socure
So a little bit about Socure. Socure is an identity company: we build models and data products that help people identify fraudsters. Whenever you apply for a bank account or a credit card, your bank sends your information to a company like ours, or to an internal model, to check: are you really who you say you are? This is partly for regulatory reasons, to make sure people aren't laundering money, and partly just loss prevention: fraud costs companies a lot of money, so they try to do things that reduce it.
And we're the best at that; we have some of the best products in these markets. A lot of that has to do with the fact that, from the start, we set up systems where we're able to confidently work with very sensitive data of various kinds, and to segregate data that's more sensitive from data that's less sensitive. We're also going through a period of very fast growth. Revenue is growing, depending on how you count it, between 100 and 200% year over year. We started the year at about 120 people and will probably grow to about 500 people by the end of this year. So we're hiring.
This puts a lot of pressure on security, because the ways you have to think about security when you're growing this fast are slightly different from the ways you can think about it at a larger company. I started at Socure two years ago and I'm one of the more senior people there, which means all the people who joined after me have less context: why do we have things set up this way? What are the risks? What's the threat model? But I've found it a really helpful environment for learning this stuff, because if you can build secure systems at a fast-growing company, it really helps you think about how to make them work at all different types of companies.
The security bad place
I want to talk about what I think of as the security bad place that a lot of organizations find themselves in. It comes from a basic conflict between data scientists and really everybody else. You can see a similar conflict between salespeople and security people, and to some degree between engineers and security people, but between data scientists and security people it's the worst. The basic conflict is that the safest way to deal with sensitive data is to just delete it: to not use it. If you can't delete it, you want to make sure you don't give anybody access to it. And if you can't prevent people from accessing it at all, you want a small number of people to have access, and it should be really, really annoying to access.
From a security perspective, that's the hierarchy of data safety. And as soon as somebody says, "I'm going to open up a data set and look at the emails, or look at the names," that's a security nightmare. But from a data science perspective, there's a lot of insight in that data, and you actually do need to look at it. For example, if we have a fraud ring coming in, they might have a particular email pattern, which we don't know about until somebody actually looks at the scores and asks: what are the emails that are causing us to decide this person is fraudulent? Eventually you can maybe build an automated system to detect that pattern, but at some point data scientists do need to be able to look at the data in some environment.
Here's what I've noticed happens with security groups, which I'll just refer to as "security": the compliance people, the security engineers, the people who are there to make sure your company doesn't get hacked. Security will come into a situation and try to fix something that's broken: some old system that was set up a certain way, maybe it was good at the time, or maybe it was just never thought about, but it's a bad system. They'll say, look, this is a big problem, there could be an attack here, we've got to fix it. But in order to fix it, they have to redirect money from new projects to the old ones, and in most companies the groups responsible for security don't actually have the power to direct resources like that.
But the power that security does always have is the power to veto new things. When a new project comes in, it usually goes through a security review, and if it fails, it doesn't get put in place. So you get this situation where the company can't fix the old security problems, but also can't put new systems in place, because they get vetoed. Everybody thinks the other side is unreasonable: the data scientists feel like security is just putting needless roadblocks in their way, security thinks the rest of the business doesn't care about security, and then everybody just stops talking.
I think this is really common. And it's worth knowing that even if this isn't a real problem at your business, most of the people coming into that business have had experiences like this at other companies. Pretty much all the security people I know have been in situations where they say, "at my last job, there was this giant fire of a bad system that never got fixed, and it was so frustrating that I left." They carry that from job to job. And data scientists have often had the experience of, "I could produce so much value, but I just can't get access to the data I need to actually produce it."
So that's the bad place. It's a really big problem, because it's all of the friction of a high-security environment with none of the security. These hacks and security issues are usually adversarial: somebody is looking for the weak link in your company's armor. So it really doesn't matter whether 40% of your systems are secure or 90% of your systems are secure, because attackers will tend to find the holes. You always want to focus your energy on lifting the floor of your business, rather than making some small number of things super duper secure. The analogy I use is: if all the windows in your house are open, it doesn't really matter if you lock the door. A burglar is just going to go through the window.
Being an ally to security
As a data scientist, you can help with this problem by being intentional about being an ally to the security parts of your organization. When decision makers hear about security issues from several different places in the company, those issues are much more likely to get fixed. If it's not just coming from the people who are always bringing up security issues, if it's coming from somebody else as well, that's a really powerful way of making progress on these things.
Something that's helped me is to think about the mindset of somebody whose job it is to identify low-probability risks at a business. They live in a lonely, anxious world. They're often saying no to awesome stuff: everybody else says "this would be so awesome," and the security person says, "no, we're not doing that, that would be awful." There's also a lot of anxiety, because they don't get any positive feedback. When nothing happens, nobody says "you did such a great job." They're just constantly worried about the small, low-probability things they might have missed that would sink the business. It's a high-anxiety job.
Understanding that, and putting yourself in that mindset when you're talking to people in security, is really helpful, because it keeps you from thinking they're being ridiculous. The thing that seems overly strict to you is often there because, when you spend your time thinking about rare but really tragic events, you're more likely to find those things and think they're important.
Another good thing to do is just to talk about security projects. Pretty much everywhere I've worked, there's a list of important security projects that have not been done, because for whatever reason they never got to the top of the pile; there are always new things going on top of the pile, and the old ones get worse and worse over time. Being somebody who's not part of the security organization but is aware of those issues, and who continues to bring them up over and over again, helps the compliance and security teams realize you're on their side when you're talking about building any of these systems.
Then do things like have one-on-ones with people: talk to them about their jobs and what they're working on, and develop those relationships. The last one, which I think is really important, is to prove that you can make progress on these things. Prove to security that you, a data scientist who handles this stuff, can push these things through and accomplish them. You can start with small things, like security bad practices on the data science team: improve those a little bit, and be able to demonstrate it. Why is that important? Because when you're setting up a system, it allows you to make promises to security and compliance groups and have them believe you. You can say: we're putting this in place now because it's better than what we had before, and here are the three or four things we'd like to improve over the next six months. If you don't repeatedly demonstrate that you're going to move that security ball forward, they're going to say no, we have to meet all the requirements today, on day one.
Because otherwise they won't trust that you're actually going to deliver on those improvements. If you do all that, you can start thinking of yourself, or maybe a couple of people on the data science team, as a liaison. At Socure, this is something I've done: acted as a liaison between the big group of data scientists and the big group of security people. Now those groups don't feel like they need to talk to each other as much; they can both talk to me and my team, and we can translate. We have the context of what the data scientists are trying to do and the context of what the security people are trying to do, and we can find the solution that works for both groups.
Security priorities list
Once you have that relationship, I think it's important to come up with an ordered list of security priorities as they relate to data science. And it's important to write it in really basic language, because a lot of the time, people who work in security don't have a good sense of what it is that data scientists do. The programming that data scientists do is very different from the programming you would do as an application software engineer; it's much more a process of discovery than a process of implementation.
There are two things I would really highlight. The first is to precisely articulate the business value of the data science work. At Socure this is easy on one level, because we're a data science company: all we sell are statistical models, so we can say our work is the main product the company sells. But being really precise about it helps to narrow down what kind of data access you actually need. When you're more precise about the business value you're producing, you might find you can get the same business value with only 10% of the people having access to the sensitive data, or with the same productive velocity using one kind of environment versus another.
Similarly, from the security side, get them to precisely articulate the threat model they're worried about. This goes back to that vague anxiety: you have a vague anxiety about everything, so you veto things because you don't want any kind of breach. When you're more intentional about what kinds of attacks you're trying to prevent, you have a better chance of right-sizing the security access. For example, you might worry about mistakes: you don't want employees accidentally putting a file somewhere they shouldn't. Or you might worry that employees are malicious and will actually try to steal data and sell it to somebody else. Those are very, very different threat models, and the systems you design that are good for the first one are not going to be good for the second one.
Three principles for building secure systems
Okay, so that was all prologue. Here are my three principles for building secure systems. One: childproof rooms. Two: developer experience is a security problem. Three: buy things, don't build them. Let's talk about childproof rooms. This is my daughter's playroom, and I feel like it's the perfect analogy for what you want to do with data scientists, especially new data scientists who are just joining your company. You want to give them a place that has all of their stuff, that is really nice, and where they can't burn your house down. It's a place where you can just say: go play, go do your thing in here.
These are places where people can't hurt the company, and that's the only way you can onboard people quickly and have it be safe, because there's no way a new employee in their first month of work is going to understand the security and regulatory environments that Socure operates in. The only way we're going to get them to abide by those policies is to make the environment they're in enforce the policies. When you embed the security inside the system you're building, you don't have to tell people about it as much, because that's just how it works. You don't want a situation where you're having people read 100-page policy documents and expecting them to abide by them. Nobody can do that.
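The "childproof room" idea can be sketched in code: rather than documenting where sensitive files may go, the tooling refuses to write anywhere else. This is a minimal, hypothetical Python sketch; the approved prefixes, error message, and function names are all illustrative, not Socure's actual setup.

```python
import csv

# Hypothetical policy: the only places analysts may write data.
APPROVED_PREFIXES = ("s3://analytics-sandbox/", "/secure/scratch/")

def check_destination(destination: str) -> str:
    """Raise unless the destination is an approved data location."""
    if not destination.startswith(APPROVED_PREFIXES):
        raise PermissionError(
            f"{destination!r} is not an approved data location; "
            "use the analytics sandbox instead."
        )
    return destination

def save_rows(rows, destination: str) -> None:
    """Write rows as CSV, but only to policy-approved locations."""
    with open(check_destination(destination), "w", newline="") as f:
        csv.writer(f).writerows(rows)
```

The point is that a new hire who calls `save_rows` with the wrong destination gets an immediate, explanatory error instead of silently violating a policy they haven't read yet.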
The next one is the idea of developer experience as a core security problem. I think this is not well understood by people who work in security day to day, because they come from a model that's kind of a police model or a legal model: I'll put some policy out, I'll say what the law is, and then everybody's just going to follow it. After all, we have really smart, thoughtful employees; they understand that they're supposed to follow the rules, so they'll just do what we tell them to do.
But that's not what happens, because it turns out you just don't have the enforcement mechanisms for those policies. And all the people you've hired, you've hired because they're really clever with computers, and people who are really clever with computers can subtly circumvent things. Even if they sort of know they shouldn't do something, there's some really urgent project and they've got to get something out the door. People will always end up taking little shortcuts around something they find burdensome. So if the thing that is secure is also a pain to use, difficult, unappealing, some people are going to not use it.
And that's all it takes: some people not using the secure process. So you can either make it impossible for them to do the wrong thing, by putting up big walls, or you can understand their motivation, which is that they want to get their work done as easily as possible. If you assume that they're going to do the easy thing, and you make the right thing easy, then you can have some confidence that they're going to tend to do the right thing. If you put up roadblocks that make the insecure process super irritating, and also give them a really convenient way of doing the more secure process, you're going to have a lot of success with adoption. And from a security perspective, I actually think adoption is one of the big problems: just getting people to do the thing.
Here are the ways I've built good developer experiences for secure projects. The first one, which I think is really crucial, is to control the client libraries. We have a set of internal R packages and a set of internal Python packages, and they're great to use; they've gotten good adoption. What that means is that we have a layer of abstraction between the user and however we're interfacing with the back-end system. For example, we have a function that connects to a database. That function has been in the package for years, but over my history there we've probably changed the mechanism of how that database connection works seven or eight times, and the users never knew. We've thought about their experience, and part of their experience is that they should never have to learn a new database connection mechanism. They shouldn't need to know how AWS access tokens work; that's not relevant to their job. They just want to connect to the database.
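A sketch of that abstraction layer, assuming a hypothetical internal package: the only thing users ever learn is `connect()`, while the credential mechanics live in a private helper whose body can change without anyone noticing. All names, environment variables, and defaults here are illustrative.

```python
import os

def _fetch_credentials() -> dict:
    """Resolve credentials however the security team currently prefers.

    In a real package this body might change many times over the years
    (env vars, a secrets vault, cloud IAM roles...) while connect()'s
    signature stays exactly the same.
    """
    return {
        "host": os.environ.get("ANALYTICS_DB_HOST", "localhost"),
        "user": os.environ.get("ANALYTICS_DB_USER", "readonly"),
        "token": os.environ.get("ANALYTICS_DB_TOKEN", ""),
    }

def connect(database: str) -> dict:
    """The one database-access function data scientists ever learn."""
    creds = _fetch_credentials()
    # A real implementation would return a live DB connection; this
    # sketch returns the resolved settings to stay self-contained.
    return {"database": database, **creds}
```

Because callers only ever write `connect("some_db")`, swapping the auth mechanism is a one-file change inside the package rather than a migration for every analyst.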
The next thing is to think about security from a product sales perspective: you're actively selling the system to the user, even if at some point you'll be able to say, "we're not asking anymore, this is just the rule, this is how it's going to happen." Start by thinking of it like you're selling a product. And one pitch for secure systems that's really helpful: for most data scientists, working on sensitive data is just freaky. You do not want to do it, it makes you really nervous, you're worried about leaving it somewhere, you're worried about making a mistake. So you can sell them on it: if you start using this system, you don't need to worry about those mistakes anymore; the system will handle it for you.
One of my first data science jobs was working on youth criminal justice data for the Nova Scotia government, and it was actually air-gapped. I had to work on a computer with no internet connection and no R libraries. On some level it was really irritating, but on another level I thought: I'm kind of happy to just be here and not have to worry that I'm going to accidentally email a file and then go to jail. There's some real comfort in having something that's secure.
The third one is to pair restrictions with power: always ship security things together with functionality. This helps a lot. If you give people something that's just irritating, they're going to be irritated. But if you give them the irritation together with something they really want, in one release, it helps make the medicine go down: some new functionality or new capacity that goes along with it. And the last one is that looks really matter. Whenever you're building a secure system, one of the most neglected things is user experience and design. It's worth spending a bit of time saying: we're building a secure web application, so let's do a sprint with a UX designer and somebody to make this look nice.
RStudio Connect case study
I have a little case study about our dear host, which is RStudio Connect. We implemented RStudio Connect to host Shiny applications. I had been putting together a way of hosting Shiny applications using Docker, and when that went through a security review, it was vetoed, because it didn't fulfill a bunch of security requirements, and I didn't have the skills to put those requirements in place myself.
RStudio Connect gave us the authentication we needed, and the logging, tracking, and auditing capabilities we needed to host Shiny applications. But as we were using it, we realized we could replace a lot of other things with it. There were a lot of ways we were communicating what we call company confidential information, which is not PII, but is our own IP that we're worried about getting out into the world. We realized we could replace a lot of those channels with hosted R Markdown and things like that. It was also something we could childproof: we can build clients that allow things in one environment but not on RStudio Connect, and we can put network-level controls on what RStudio Connect can talk to in the rest of Socure's systems.
That let us put something up where, if you're hosting something here, it's going to be safe enough, and it let us build an environment where data scientists could deploy things and we knew that was okay. Here's a pattern I've used a number of times: our internal R package doesn't have a lot of security around who is able to download and install it, but it has functions that refer to a pinned file hosted on RStudio Connect, or to an API hosted on RStudio Connect, and there we have user authentication on those files and APIs. So somebody can use a function from the R package, and it will pick up who that person is and reference those Connect-hosted resources. And we know those resources are okay, because the whole system has been verified and hardened.
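One way to picture that pattern in code: the package itself stays open, but the functions it exposes point at resources behind an authenticated host, so authorization is enforced server-side, per user. This is a hypothetical Python sketch; the host URL, path scheme, and header format are illustrative stand-ins, not a documented API.

```python
# Illustrative host; in practice this would be your hardened,
# security-reviewed server (e.g. an RStudio Connect instance).
CONNECT_URL = "https://connect.example.com"

def build_artifact_request(artifact_name: str, api_key: str) -> dict:
    """Build an authenticated request for an artifact hosted on the server.

    The package just forwards the caller's key; the server decides what
    that particular user is allowed to see.
    """
    return {
        "url": f"{CONNECT_URL}/content/{artifact_name}/latest",
        "headers": {"Authorization": f"Key {api_key}"},
    }
```

Calling code would hand this request to an HTTP client; two users running the identical function get different results, because access control lives on the hardened host rather than in the freely installable package.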
The second-last one is that it paired restrictions with power. The restriction: you're not allowed to email this stuff anymore; you have to share it in a way where we can track who's looking at every single document if we need to. The power: it's reproducible, you can schedule things, that kind of stuff. People didn't really notice the restrictions, because we were giving them something so much more powerful and more convenient for their work. And the last one is that it's just easy to use and pretty.
The economic case for buying software
Okay, the last thing I want to talk about is buying software. I want to give you an economic argument for why buying software is always better from a security perspective, and this is an argument you can use to get licensing money from your company. At most companies, security is a cost center. A cost center is something the company needs to spend money on but doesn't make revenue from in the short term. You need to spend money on office space, or you used to, but you don't directly make money on the office building; you make money on the stuff that happens in it.
Every business, or almost every business, is going to go through rough times where they don't have enough cash flow. In those times they're going to cut costs: they're going to neglect cost centers and put money into revenue centers, the places that actually put money on the balance sheet. And they'll do this even if it's a bad idea in the long run. Even if it's a long-run mistake to cut the security people or the DevOps people, when you're in a threatening economic or business situation, you need to get back into the black, so you'll tend to neglect cost centers.
But while security is a cost for you, for the vendor it is a revenue center. If Databricks or RStudio is deciding whether to invest in supporting some brand-new security protocol, they know there's going to be some customer out there who will buy their product because they did. So they'll put money into that work even in times when their business is difficult, because that's a big part of the product they're selling: something that is secure, something that is compliant, something that works for all the companies they're trying to sell to. Whenever you buy or license software, you're basically moving security from living in the cost-center world of your business to the revenue center of somebody else's, because you're offshoring it to some degree.
It's also a maintenance commitment to the future: we're buying the software now, and we're committing to paying license fees for however long we use it, which is like putting a little bit of money every year into that system getting better and more secure. This is really important if you're growing, because as you grow, your security requirements change.
Socure now has a billion-dollar valuation, we're in Forbes, and we're a much, much bigger target than we were. So our security requirements have changed dramatically. The nice thing about buying software is that your vendor has other clients, and has built these products for clients that have big-company problems. If you're a small company buying these things, you might look at the feature list and say: I don't need that, I don't need that, I don't need that. But as you grow, you suddenly discover you need all of those things.
And then the thing about growth is that at some point you start being limited by people, not money. It takes maybe six months to hire and onboard a new DevOps person, and we can't just hire 20 of them all at once. So right now we're really grateful for any opportunity to spend licensing money to avoid development work, because we have tons of development work in our core business that there are no vendors for. Anything where we can basically say, I'm going to put licensing money onto this problem, helps us grow, because we can just keep giving our vendor more cash for more licenses and support.
So RStudio Server, now RStudio Workbench, is a good case study of this. When we first implemented it, it was mostly for remote R development. We were pretty much universally an R team. We wanted something that gave people more compute power and a remote environment, so that we knew where the data lived and that it could be controlled and audited, and you could shut off people's access, things like that.
Now, we could have built this remote environment with a number of open-source tools. We could have used the open-source RStudio, or just a server with a terminal. There are tons of ways we could have done this that would have been cheaper in the short term. But after we bought RStudio Server, they kept working on it, and it kept getting better. A lot of things that, when we first set it up, were either not security requirements at all or were on our nice-to-have list became requirements later. At some point in our growth, we'd be selling something to a client and they'd have questions like, how is your data science environment authenticated? And it would be: if you're authenticating in way A, we're not interested; if you're authenticating in way B, that'll pass our audit of your work. Many times, because we had bought the system and just kept getting patches and updates, every time there was a new release we'd spend two minutes running sudo to install it, and we got those things. They weren't free, but they didn't require any work. We didn't have to do some urgent sprint; they came out of the box, because the thing had kept accumulating features over time.
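To make that server-side control a little more concrete, here is a minimal sketch of the kind of settings involved. It assumes RStudio Workbench reading its configuration from /etc/rstudio/rserver.conf; the group names are illustrative, and exact option names vary by version and authentication method, so treat this as a sketch rather than a recipe:

```
# /etc/rstudio/rserver.conf -- minimal sketch, not a complete config
www-port=8787

# Run each user's session under their own authenticated account via PAM
auth-pam-sessions-enabled=1

# Only members of this (illustrative) group can log in at all
auth-required-user-group=data-science

# Named admins can see and manage other users' sessions
admin-enabled=1
admin-group=rstudio-admins
```

The point is that access control, auditing, and the ability to shut someone off live in one server-side file, rather than on each analyst's laptop.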
And I think this is something that's really underrated about buying software. You shouldn't think of it as building a short-term thing and asking whether it's more or less expensive to buy or build. You should think of it as: do we want to spend the money and commit the people to maintain this system forever, or do we want to pay somebody else to maintain it forever? Unless the thing is actually really, really core to your company, unless you're the best in the world at that thing, you should usually try hard to buy.
Summary and closing
So those are my principles. And again, there's not much here about needing a particular technical setup; I can answer questions about that if it's helpful, although there are usually better people to answer those. I do think this is, to a large degree, an organizational, social thing. Work hard at developing a relationship with your security teams where you all feel like you're on the same team. Try to set up childproof environments, or data-science-proof environments, whatever you want to call them. Think about developer experience as a security problem: security and convenience are not at odds, it's not pick one or the other; you actually need your secure systems to be convenient for them to be successful. And last, try hard to buy things. There are good economic reasons why the one you buy is going to be better than the one you build.
And then lastly, we are hiring in huge numbers. We have a lot of dedicated R developers as well as many dedicated Python developers, and some people who do both. So if you would like to hear more about this, or if fraud is an area you're interested in, please feel free to reach out. My information is here, and I'm also in the LinkedIn group. Thank you very much.
Q&A
Thank you so much, Gordon. That was awesome. I feel like that is so valuable for so many different industries right now. I'm thinking of a lot of conversations I've had with customers in the past who are selling even the idea of data science internally to their teams, where those points are really relevant.
Gordon, one that has been upvoted is, how did you communicate R packages to your security team?
So like what an R package is, or you mean like external R packages?
Probably external ones. I've heard this from a few different teams before. If someone doesn't really know what R is, how do you explain the idea of packages? Yeah, so I think you can think about packages the way you'd think about JavaScript libraries. The way I think about it from a security perspective, and this is another place where having a robust internal package is really helpful, is that I'm able to make decisions about whether a package should be included in our work in a way that other data scientists might not be, because I've done a fair amount of package development. So I can look at a package and ask: how good is this package? How solid is its development? Who developed it? I'm much more likely to trust packages maintained by a professional, somebody whose job it is to maintain them, than ones that are ad hoc, and much more likely to trust ones that are popular than ones that aren't.
So if you have an internal package that does almost everything people need it to do, you'll be in a position to make that call: I'm not going to take on that dependency; I'll do it in base R or with the packages we already use. That's one piece of it. The other piece is articulating a threat model. I think security people will sometimes worry more about an R package than they will about a JavaScript library, even though the JavaScript library is probably going to be compromised before the R one, just because of surface area: there are more JavaScript people out there.
And actually, the requirements for putting something on npm are much lower than for releasing something to CRAN. Not that CRAN is a security guarantee. But there are all these other places where we inject dependencies all the time. So the main thing I would argue for is trying to get a fair assessment: how do we monitor whether somebody is using the right npm packages? How do we know those aren't malicious? Okay, we'll do the same thing for R.
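To make that kind of assessment concrete, here is a rough sketch in R of the signals described above, popularity and maintenance. It assumes the cranlogs package, and "somepkg" is an illustrative stand-in for whatever dependency you're vetting:

```r
# Rough dependency-vetting signals; "somepkg" is illustrative
library(cranlogs)

# Popularity: is anyone else relying on this package?
downloads <- cran_downloads(packages = "somepkg", when = "last-month")
sum(downloads$count)

# Maintenance: who maintains it, and when was it last published?
db <- tools::CRAN_package_db()
db[db$Package == "somepkg", c("Maintainer", "Published")]
```

None of these numbers decide the question by themselves, but they give the security conversation something more solid than a gut feeling.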
I see there's one other question on Slido that was anonymous: how do you deal with a security team that has less knowledge of best practices than the data science team? Well, the security people at Socure are very, very good. But I would say you have to meet people where they're at. People have power within organizations, and you just have to deal with that power. So don't get frustrated, and don't talk to somebody who has power over a system as though they're an idiot; the moment you do, you've lost, you're done, you're not going to make any progress. You need to look at their work, look at their requirements, and try to understand them in a sympathetic way. One tool I use a lot is repeating things back to people: okay, what's the most charitable way I can say my understanding of this back to you?
Then you can take that and write it down: here's the requirement, here's why we have it, here's the motivation for it, and get broad agreement on that. And then move forward from there: given all that, here's a solution that fulfills those things. But you can't treat people like they're dumb, because that puts you in that bad place where people stop trusting each other. Then you'll ask for something totally reasonable in six months, and they'll just think, that jerk, and say no right away. There's a lot of discretion in these things.
Thank you. I think that's such a good lesson in communication in general, even when you're sharing visualizations with people from the business as well.
Kevin, I see you asked a question, which may have been answered, but I want to make sure that I get to it as well. Do you want to provide any additional context?
Yeah, of course. So other than RStudio Connect, what other tools were you leveraging with RStudio to help with your security goals? Those are the two main ones. We also have RStudio Package Manager, which we use; I wouldn't say we do anything particularly interesting with it, but it's useful. I've had a good experience with all of those. These aren't the only tools we use, and they aren't the only layers of security. There's this idea in security called defense in depth, where you want many, many layers. But we do use both of those, and I've been very happy with them.
I'm also wondering which external package you're using most right now.
Oh, external, like another R package? I don't know, that's a tough question. pins was one that I used a lot, although it's something we've since stopped doing; we have another solution now. But there was a period of time when we had a really urgent data storage change to make, and the data storage project was late. So we ended up using pins on our RStudio Connect as a little stopgap until we got the real solution. I'm such a fan of pins. It's so simple: you can send a file somewhere controlled, have it on S3, and have it come back cached. It's pretty nice.
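For anyone curious, that stopgap pattern looks roughly like this in R. The bucket and pin names are illustrative, and board_s3() assumes your AWS credentials are already configured in the environment:

```r
# Minimal sketch of the pins stopgap pattern; names are illustrative
library(pins)

# Data lives in one bucket the team controls and can audit
board <- board_s3("my-controlled-bucket")

# Writer side: publish a versioned copy of a data frame
pin_write(board, results, name = "model-results")

# Reader side: downloads once, then comes back from the local cache
results <- pin_read(board, "model-results")
```

Because every read and write goes through one board, the data stays in a single governed location instead of being emailed around, which is exactly the property a security team cares about.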