When SafeGraph pulled abortion clinic data, research protecting abortion rights hit a roadblock

The move also exposed the complex ethical considerations of using data many believe is fuel for surveillance capitalism.

Map with location service

SafeGraph pulled down data for privacy reasons, but it impacted researchers too.

Illustration: Christopher T. Fong/Protocol

Martin Andersen was frustrated.

When location data company SafeGraph last week suddenly turned off access to abortion clinic data that Andersen had been using in his research, the associate professor in the department of economics at the University of North Carolina at Greensboro lost a valuable source of information. Andersen had been using the data to measure the real-world impact of laws limiting abortion rights.

“I think it’s incredibly valuable to help us understand how people are responding to laws like SB 8 or overturning of Roe v. Wade,” said Andersen. He was referring to SafeGraph’s data and its use in research gauging the effects of Texas’ Senate Bill 8, which bans most abortions after six weeks, and a potential decision by the Supreme Court to overturn Roe v. Wade.

“I am kind of frustrated they did it, but I understand why they did this as well. I think the decision to take down family clinic data is a very defensible decision,” Andersen said regarding SafeGraph’s data removal.

SafeGraph said on May 3 it would cut access to data associated with travel to and from family-planning center locations from its online self-serve data platform and from the API through which it distributes data to customers and researchers. The company sells data to business and government customers that shows where anonymized mobile devices are spotted to indicate which locations people traveled from, how long they stayed and where they traveled afterward. The company pointed to “potential federal changes in family-planning access” as the reason for its decision to remove access to the data.

However, while the move was designed to protect people’s privacy, it had broader effects on researchers studying the impact of laws on access to reproductive health services.

“Data limitation is a severe issue for research generally, but especially so for research in the area of reproductive health care,” said Jason Lindo, a professor of economics at Texas A&M University who studies the impact of abortion restrictions on reproductive health care access and the amount of travel necessary to obtain it.

“There are a lot of big questions in terms of who is served by abortion clinics, and how far do women come from in seeking care, and what factors affect the amount of care that they can provide. The SafeGraph data really opens new opportunities to get at this question,” said Lindo, who has not personally used SafeGraph data.

“If Roe falls this summer, as certainly seems a pretty distinct possibility, and policymakers and the public are trying to understand the impacts of state abortion bans, these data would allow us to have a near-immediate window into what begins to happen as women travel across state lines,” said Caitlin Knowles Myers, an economics professor at Middlebury College. She also has not used SafeGraph data in her research, but she said a student of hers has.

“There’s certainly a potential role for this type of location data to give really quick, high-frequency evidence on the effects of restrictions on people seeking abortions,” Knowles Myers said. She said location data can fill in data gaps, as some states do not keep track of the number of abortions performed in their locales — when they do, it can take years for this vital statistical data to be released.

Ethical gray areas

While some academic researchers believe that mobile location data showing travel to and from abortion clinics — such as what SafeGraph provided — is useful for their work, recent state laws limiting abortion services and the potential for the Supreme Court to overturn Roe v. Wade have exposed the privacy risks associated with the easy availability of location data.

The situation shines a spotlight on a complex debate that has plagued researchers, particularly those in need of sensitive data related to health care. Researchers want data, and the increasingly digitized world enabled through connected mobile communication creates new forms of information that can offer valuable new insights to researchers. However, researchers care about the privacy of research subjects, and must comply with academic review board data use and privacy rules.

“We see this issue constantly — the trade-off between access and privacy,” Andersen said.

For instance, in the past, privacy and intellectual property protections have limited data details and access to Facebook data for researchers; some have decided not to use it at all as a result.

Do I have an ethical obligation to think through those issues? Yes, and I am, and I don’t have an answer for you yet.

But the use of location data poses dilemmas beyond balancing privacy and access.

Most mobile location data providers rely on harvesting and selling people’s personal data obtained through covert data partnerships with unnamed ad tech companies and mobile app providers, which fuels what many perceive as surveillance capitalism. Tech and data providers are sometimes eager to open access to their information for research, not only because it helps show their products’ value and promote their companies, but because it gives them a “data-for-good” cover in an otherwise highly criticized industry.

“As I’m watching this conversation about SafeGraph play out, I have every obligation to really take a pause and think through these issues, knowing that more broadly there’s this question of whether people understand their data are being collected, whether they truly have provided consent and whether they are potentially being manipulated without even realizing it,” Knowles Myers said.

“Do I have an ethical obligation to think through those issues? Yes, and I am, and I don’t have an answer for you yet,” she said.

SafeGraph declined to comment for this story.

Location data-based research used in court

Andersen told Protocol he has two research papers in progress that rely on data from SafeGraph, both related to travel associated with abortion clinics: one that analyzes the effects of the COVID-19 pandemic and another that focuses on the effect SB 8 has on travel to reproductive health facilities.

Andersen said the SafeGraph data he had already downloaded showed the number of unique devices that had traveled from a specific census block to a reproductive health clinic on a weekly and monthly basis. SafeGraph’s decision means he will not be able to gather updated information showing future implications of legal abortion restrictions to compare against that historical data.

Andersen also co-authored a 2020 paper for the National Bureau of Economic Research that used SafeGraph foot traffic data to estimate the impact of early COVID-19-related state lockdowns on the number of abortions that may have been performed.

Research quantifying the travel burdens created by abortion restrictions has been considered in abortion-related legal cases since the 2016 Supreme Court case Whole Woman's Health v. Hellerstedt, in which the court ruled that Texas cannot place restrictions on abortion services that create an undue burden on people seeking an abortion.

“In the case of reproductive health care and abortion access, the research has mattered. Showing how travel distance affects abortion rates has been used by courts to determine that laws have constituted an undue burden,” Lindo said.

While Lindo has not used SafeGraph data in his work, his research quantifying the degree to which abortion rates decrease when people are forced to travel farther to reach an abortion provider than they would have without restrictions in place was used to inform a court decision blocking enforcement of an Arkansas law.

Research by Lindo and Knowles Myers predicting the effects on travel distances to access abortion services if Roe v. Wade was overturned or substantially weakened is cited in a 2021 amicus brief submitted to the Supreme Court.

The nuances of location data privacy

Mobile location data has been the subject of great scrutiny by privacy, data security and human rights advocates for many years. Amid the threat of more abortion bans, there is more concern than ever that location data could be used by law enforcement officials, government agencies or everyday people to detect clinics providing interstate abortions, or people obtaining abortion services outside their home states.

When Motherboard reported last week that another location data company, Placer.ai, offered easy access to data showing the approximate home locations of visitors to specific Planned Parenthood facilities, Sen. Ron Wyden linked to the article on Twitter, noting, “Researching birth control online, updating a period-tracking app or bringing a phone to the doctor's office could be used to track and prosecute women across the U.S.” Wyden pointed to his proposed Fourth Amendment is Not For Sale Act, which he said “would make it harder for Republican states to persecute anyone seeking an abortion by weaponizing their personal information.”

But some researchers say they appreciate SafeGraph’s data because, unlike other location data providers they could purchase from, SafeGraph reports foot traffic data according to census block groups rather than providing device-level information or precise GPS coordinates.

“I could go out and buy a device-level data set, but there’s no way I’m going to touch that,” said Andersen — he is more comfortable using SafeGraph’s data for that reason.

While Andersen and others argue SafeGraph’s approach provides a greater degree of anonymity than what other location data suppliers provide, in cases where a small number of devices appear in a given location, even census block-level information could be used to identify someone.

A chance for middle ground?

Already, academic researchers must satisfy the criteria of review boards to get approval for their research plans, including data sources. “Researchers are super, super accustomed to dealing with these circumstances in using these sorts of sensitive data all the time,” Lindo said. “The balance is in finding out how to make sure that people are protected.”

Lindo and others suggest that rather than shutting off the spigot to abortion clinic data to everyone, SafeGraph could consider ways to provide it safely to vetted researchers. “It is very reasonable for them to require [Independent Review Board] approval to buy data,” Lindo said. “It doesn’t have to be an all or nothing thing.”

Knowles Myers agreed. “I would like to hope that there’s that middle ground,” she said. “All these data vendors should be thinking about protecting individuals anyway.”

In a recent interview with Protocol, SafeGraph CEO Auren Hoffman said the company might consider altering its approach to data access. “We could say, ‘Only vetted researchers can get access to this data, whereas the broader public can get less access to the data,’ and that's something we might do. So we are evaluating those types of things,” he said.

“In general, we would support carve-outs for research, especially if they’re focusing on institutions that are well-vetted,” said Caitlin Seeley George, campaign director at Fight for the Future, a digital and privacy rights group. Ultimately, she said, the federal government should step in to define national rules related to data use.

“We need data privacy legislation, because in the end, we can’t rely on companies to make the decision to just share with researchers,” Seeley George said. “They’re going to do what makes the most money, which is likely selling that data to anyone who wants to buy it.”


Judge Zia Faruqui is trying to teach you crypto, one ‘SNL’ reference at a time

His decisions on major cryptocurrency cases have quoted "The Big Lebowski," "SNL," and "Dr. Strangelove." That’s because he wants you — yes, you — to read them.

The ways Zia Faruqui (right) has weighed on cases that have come before him can give lawyers clues as to what legal frameworks will pass muster.

Photo: Carolyn Van Houten/The Washington Post via Getty Images

“Cryptocurrency and related software analytics tools are ‘The wave of the future, Dude. One hundred percent electronic.’”

That’s not a quote from "The Big Lebowski" — at least, not directly. It’s a quote from a Washington, D.C., district court memorandum opinion on the role cryptocurrency analytics tools can play in government investigations. The author is Magistrate Judge Zia Faruqui.

Keep ReadingShow less
Veronica Irwin

Veronica Irwin (@vronirwin) is a San Francisco-based reporter at Protocol covering fintech. Previously she was at the San Francisco Examiner, covering tech from a hyper-local angle. Before that, her byline was featured in SF Weekly, The Nation, Techworker, Ms. Magazine and The Frisc.

The financial technology transformation is driving competition, creating consumer choice, and shaping the future of finance. Hear from seven fintech leaders who are reshaping the future of finance, and join the inaugural Financial Technology Association Fintech Summit to learn more.

Keep ReadingShow less
The Financial Technology Association (FTA) represents industry leaders shaping the future of finance. We champion the power of technology-centered financial services and advocate for the modernization of financial regulation to support inclusion and responsible innovation.

AWS CEO: The cloud isn’t just about technology

As AWS preps for its annual re:Invent conference, Adam Selipsky talks product strategy, support for hybrid environments, and the value of the cloud in uncertain economic times.

Photo: Noah Berger/Getty Images for Amazon Web Services

AWS is gearing up for re:Invent, its annual cloud computing conference where announcements this year are expected to focus on its end-to-end data strategy and delivering new industry-specific services.

It will be the second re:Invent with CEO Adam Selipsky as leader of the industry’s largest cloud provider after his return last year to AWS from data visualization company Tableau Software.

Keep ReadingShow less
Donna Goodison

Donna Goodison (@dgoodison) is Protocol's senior reporter focusing on enterprise infrastructure technology, from the 'Big 3' cloud computing providers to data centers. She previously covered the public cloud at CRN after 15 years as a business reporter for the Boston Herald. Based in Massachusetts, she also has worked as a Boston Globe freelancer, business reporter at the Boston Business Journal and real estate reporter at Banker & Tradesman after toiling at weekly newspapers.

Image: Protocol

We launched Protocol in February 2020 to cover the evolving power center of tech. It is with deep sadness that just under three years later, we are winding down the publication.

As of today, we will not publish any more stories. All of our newsletters, apart from our flagship, Source Code, will no longer be sent. Source Code will be published and sent for the next few weeks, but it will also close down in December.

Keep ReadingShow less
Bennett Richardson

Bennett Richardson ( @bennettrich) is the president of Protocol. Prior to joining Protocol in 2019, Bennett was executive director of global strategic partnerships at POLITICO, where he led strategic growth efforts including POLITICO's European expansion in Brussels and POLITICO's creative agency POLITICO Focus during his six years with the company. Prior to POLITICO, Bennett was co-founder and CMO of Hinge, the mobile dating company recently acquired by Match Group. Bennett began his career in digital and social brand marketing working with major brands across tech, energy, and health care at leading marketing and communications agencies including Edelman and GMMB. Bennett is originally from Portland, Maine, and received his bachelor's degree from Colgate University.


Why large enterprises struggle to find suitable platforms for MLops

As companies expand their use of AI beyond running just a few machine learning models, and as larger enterprises go from deploying hundreds of models to thousands and even millions of models, ML practitioners say that they have yet to find what they need from prepackaged MLops systems.

As companies expand their use of AI beyond running just a few machine learning models, ML practitioners say that they have yet to find what they need from prepackaged MLops systems.

Photo: artpartner-images via Getty Images

On any given day, Lily AI runs hundreds of machine learning models using computer vision and natural language processing that are customized for its retail and ecommerce clients to make website product recommendations, forecast demand, and plan merchandising. But this spring when the company was in the market for a machine learning operations platform to manage its expanding model roster, it wasn’t easy to find a suitable off-the-shelf system that could handle such a large number of models in deployment while also meeting other criteria.

Some MLops platforms are not well-suited for maintaining even more than 10 machine learning models when it comes to keeping track of data, navigating their user interfaces, or reporting capabilities, Matthew Nokleby, machine learning manager for Lily AI’s product intelligence team, told Protocol earlier this year. “The duct tape starts to show,” he said.

Keep ReadingShow less
Kate Kaye

Kate Kaye is an award-winning multimedia reporter digging deep and telling print, digital and audio stories. She covers AI and data for Protocol. Her reporting on AI and tech ethics issues has been published in OneZero, Fast Company, MIT Technology Review, CityLab, Ad Age and Digiday and heard on NPR. Kate is the creator of RedTailMedia.org and is the author of "Campaign '08: A Turning Point for Digital Media," a book about how the 2008 presidential campaigns used digital media and data.

Latest Stories