Facebook lends more data to the coronavirus fight

The company is releasing new, anonymized maps for researchers and partnering with Carnegie Mellon on a COVID-19 symptom survey.

Photo: Orbon Alija via Getty Images

As governments turn to tech companies to track the spread of COVID-19, Facebook will share even more anonymized data about its users with researchers in order to show how people move around, how they're connected, and where the virus is likely to spread.

In a blog post published Monday, Facebook announced a series of new mapping tools that will allow approved researchers to analyze mobility patterns and connections among people in different regions. The company also announced that, starting Monday, it will show select users in the U.S. a coronavirus survey from Carnegie Mellon University at the top of their News Feeds. Users can voluntarily complete the survey off Facebook in order to contribute to Carnegie Mellon's research. That survey asks people to report their symptoms and their ZIP codes to help researchers understand where people who might have the virus are located, even if they're not being seen by a doctor.

"The hope is with Facebook's reach and also the fact that people are on Facebook a lot, we're going to get lots of respondents and some really fine-grain geographic information about symptomatic infections," said Ryan Tibshirani, an associate professor of statistics and machine learning at Carnegie Mellon, who is leading the project. Tibshirani stressed that the partnership doesn't give him any access to Facebook users' personal data, and Facebook doesn't get access to their survey results.

Facebook has been working on disease prevention research since 2019, when it launched a series of mapping tools designed to help researchers study things like population density in outbreak areas and mobility patterns in a given region. Since the COVID-19 outbreak began, researchers have been using Facebook's anonymized, aggregated mobility data to study whether people are actually practicing social distancing. This data comes from Facebook users who have location services enabled on their phones. Last week, Google also began releasing state-level and country-level data on social distancing patterns.

But this new batch of mapping tools from Facebook will give researchers insight not just into where people are moving, but also into how they're connected. In addition to mapping out movement trends in a region or county, Facebook is offering researchers two other tools based on its user data: colocation maps and a social connectedness index.

Colocation maps reveal the probability that people from one place might come into contact with people from another place. That way, researchers can tell if people from an area with a large outbreak are likely to cross paths with people from another area with fewer cases. As Protocol previously reported, Facebook began piloting these maps in Hong Kong in direct response to the coronavirus outbreak there. Meanwhile, the social connectedness index shows friendship patterns across different states and countries, so researchers can better understand where the virus might spread next.
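To make the colocation idea concrete, here is a minimal sketch of how such a probability could be estimated from aggregated location records. It is an illustration only, not Facebook's actual method: the function name `colocation_probability`, the record layout (person, cell, time slot) and the definition used (the chance that a random person from one region and a random person from another share the same cell in a random time slot) are all assumptions made for this example.

```python
from collections import defaultdict


def colocation_probability(visits, home_region, region_a, region_b, n_slots):
    """Estimate how often people from two different regions share a location.

    Hypothetical inputs, assumed for this sketch:
      visits      -- iterable of (person, cell, slot) records, with at most
                     one cell per person per time slot
      home_region -- dict mapping person -> home region
      n_slots     -- number of time slots observed
    """
    # Group people by the (cell, slot) pair in which they were observed.
    by_place_time = defaultdict(set)
    for person, cell, slot in visits:
        by_place_time[(cell, slot)].add(person)

    people_a = {p for p, r in home_region.items() if r == region_a}
    people_b = {p for p, r in home_region.items() if r == region_b}

    # Count cross-region pairs that were in the same cell at the same time.
    colocated_pairs = 0
    for group in by_place_time.values():
        colocated_pairs += len(group & people_a) * len(group & people_b)

    # Every (person from A, person from B, slot) triple is a chance to meet.
    possible = len(people_a) * len(people_b) * n_slots
    return colocated_pairs / possible if possible else 0.0


# Made-up example data: two people from region "A", one from region "B".
home = {"p1": "A", "p2": "A", "p3": "B"}
visits = [
    ("p1", "x", 0), ("p3", "x", 0), ("p2", "y", 0),
    ("p1", "x", 1), ("p3", "y", 1),
]
prob = colocation_probability(visits, home, "A", "B", n_slots=2)
```

In this toy data, only one of the four possible cross-region meetings occurs, so the estimate is low. A real pipeline would work on aggregated, anonymized counts rather than per-person records.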

These tools are already being put to use by a global coalition of researchers called the COVID-19 Mobility Data Network. This network pairs researchers with city, state and even national governments to help them analyze local mobility patterns using location data. This work has already yielded important insights on the successes and failures of social distancing, said Andrew Schroeder, vice president of research and analysis at the nonprofit Direct Relief.

An example of a social connectedness map. Image: Courtesy of Facebook

In California, for instance, the data shows that San Francisco has seen the greatest reduction in mobility and the highest rate of people staying home in the whole state. Meanwhile, areas like Riverside and San Bernardino have seen smaller decreases in mobility. Schroeder chalked that up to the simple fact that people have different types of jobs in those places. While San Francisco's tech workforce can easily work from home, Riverside and San Bernardino are logistics hubs, where workers can't.

Schroeder said that although it's too early to tell for sure, San Francisco's early adherence to social distancing may be one reason why it's emerged as a "success story" in terms of disease transmission.

"You saw early case detections happen there, but nevertheless, the rate of increase in San Francisco has been, on average, lower than other cities of its size," he said. "Does that have to do with social distancing? Yeah, probably."

In addition to these new mapping tools that Schroeder and others are using, Facebook is also helping Carnegie Mellon's researchers with their survey design. Though no data is changing hands between Facebook and Carnegie Mellon, Facebook is helping the researchers ensure they're sampling a representative group by assigning each survey taker a random ID. Once someone has completed the survey, Carnegie Mellon will send Facebook that person's specific ID. Facebook will then tell the researchers how they should weight that response in order to correct for sampling bias.
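The weighting step described above can be sketched in a few lines. This is an illustration under stated assumptions, not the researchers' actual pipeline: the function name `weighted_symptom_rate`, the ID strings and the weight values are all hypothetical. The only link between the two sides is the random ID, so survey answers stay with the researchers and profile data stays with Facebook.

```python
from typing import Dict


def weighted_symptom_rate(responses: Dict[str, bool],
                          weights: Dict[str, float]) -> float:
    """Weighted share of respondents who reported symptoms.

    responses -- random ID -> whether that respondent reported symptoms
                 (held by the researchers)
    weights   -- random ID -> survey weight (hypothetically returned by
                 Facebook for each completed ID)
    """
    total_weight = 0.0
    symptomatic_weight = 0.0
    for respondent_id, has_symptoms in responses.items():
        weight = weights[respondent_id]
        total_weight += weight
        if has_symptoms:
            symptomatic_weight += weight
    if total_weight == 0:
        raise ValueError("weights sum to zero; cannot compute a rate")
    return symptomatic_weight / total_weight


# Made-up example: four respondents, two of whom reported symptoms.
responses = {"a1": True, "b2": False, "c3": False, "d4": True}
# Made-up weights: "a1" belongs to an over-sampled group, "b2" to an
# under-sampled one.
weights = {"a1": 0.5, "b2": 1.5, "c3": 1.0, "d4": 1.0}

unweighted = sum(responses.values()) / len(responses)
weighted = weighted_symptom_rate(responses, weights)
```

With these invented numbers the raw share of symptomatic respondents is 0.5, while the weighted share is 0.375, which shows how down-weighting an over-represented group can shift the estimate.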

This structure aims to address the growing tension between preserving people's privacy and ensuring governments and scientists have access to the data they need to track the virus. Countries like South Korea and China have deployed extensive, tech-enabled surveillance to monitor the movements of people who have developed COVID-19. In the United States, lawmakers have cautioned against embracing such tactics.

"I don't want a situation where we have data on individuals, and people are showing up and knocking on their doors with thermometers or testing or figuring out where they're traveling to as individuals," Rep. Ro Khanna recently told Protocol. "That's a surveillance state like China we should completely resist."

Meanwhile, last week, a group of lawmakers led by New Jersey Sen. Bob Menendez wrote to Verily, a health care company owned by Alphabet, asking for information on what will happen to all the data it's collecting on people with COVID-19 symptoms.

Facebook, of course, has not always been so careful with users' personal information. If this crisis had struck before 2014, researchers could have built Facebook apps to conduct their surveys and scraped all of the respondents' data, as well as the data of their friends. That's how a University of Cambridge researcher ended up selling data on millions of unwitting Americans to the political consulting firm Cambridge Analytica before the 2016 U.S. election.

But the resounding blowback to that scandal has forced Facebook to rethink its partnerships, even with scientists motivated to stop a global pandemic. Now, Facebook maintains that all of the data it's releasing is fully anonymized and aggregated, so that no single individual's anonymous location data can be reattached to their identity.

"Facebook and the wider technology industry can — and must — continue to find innovative ways to help health experts and authorities respond to this crisis, without trading off privacy," KX Jin, Facebook's head of health, and Laura McGorman, policy lead for Facebook's Data for Good program, wrote in the company blog post.

Carnegie Mellon is running a similar survey program with Google. In that case, however, Google is surveying users itself through its Opinion Rewards app, which offers app store credit in exchange for survey answers. Tibshirani said the Facebook survey gives him more flexibility in asking health-related questions, since he's running the survey himself.


Carnegie Mellon plans to share the survey results with the University of Maryland and several other schools that Tibshirani said are awaiting Facebook's approval. Once he receives enough survey results, he said he hopes to release aggregate data on symptoms at the county level. He said that may help researchers and lawmakers who are currently blind to where people might be infected before they reach a hospital or testing center.

"I am very, very excited about the potential of this data," Tibshirani said. "It has the potential for enormous impact."

Schroeder agreed the data Facebook and other companies have provided to this effort has been critical to understanding the virus' spread. But as more companies offer up their location data to the cause, he said researchers and local governments will need to be careful about what signals they pay attention to so as not to be overwhelmed.

"Before people get flooded to the point where they can't easily make decisions on it, we need to make sure we're clear on what these signals mean," he said.
