Diversity Tracker

The tech industry needs to standardize diversity reports

No tech firm reports diversity the same way. That makes tracking progress harder.

Magnifying glass

Tech lacks a standard, comprehensive method of reporting and presenting employee demographic data.

Illustration: Christopher T. Fong/Protocol

Diversity reports don't always tell the full story of what representation looks like at tech companies.

The earliest versions of tech industry diversity reports created a binary narrative about diversity and inclusion, according to Bo Young Lee, Uber's chief diversity officer. The reports, which tech companies began consistently releasing in 2014, communicated that "diversity is about gender representation, and it's about race and ethnicity, and it's about nothing else," Lee told Protocol.

"And these diversity reports kind of boiled everything down to these two sets of identities," she said. "When in fact, just knowing the representation of women or just knowing the representation of underrepresented people of color, you don't really actually get a sense of how subgroups within there are thriving."

Reports in recent years have begun to detail additional data points, including, for example, information about intersectionality, which considers overlapping identities that may contribute to discrimination or some other form of disadvantage. But the industry still lacks a standard, comprehensive method of reporting and presenting employee demographic data.

Data pet peeves

A good diversity report needs rich and clean data, according to Bernard Coleman, the chief diversity and engagement officer at human resources startup Gusto. That means companies need to be equipped with a reporting system that makes it easy to collect that data and a strategy to encourage employees to self-identify across a variety of categories.

"Maybe there's a lot of data maintenance to make sure that you truly understand your folks," Coleman said. "We've done a self-ID campaign to really understand who are you and what do you need to be successful. I think that's really important. A lot of times, I've seen folks just don't have clean data."

Lee agreed with Coleman's sentiment. She said it's "shocking" that many companies "don't actually have very good validated workforce data."

Sometimes, Lee said, the ambition to report as much data as possible can undermine the integrity of the report. Many companies, for example, have begun to report sexual orientation and gender identity information.

"But I would guarantee you almost no company probably has a statistically significant amount of information to say that the sexual orientation and gender identity [information] is an accurate reflection of their workforce," Lee said.

In addition to the standard race and gender data, Uber collects self-reported employee data around sexual orientation, gender identity, veteran status, caregiver status, disability and socioeconomic status during childhood. But it has yet to report that additional data because Lee wants at least 80% of Uber's workforce to respond to those questions.

"Otherwise, we don't really know if it's representative," she said.

Companies also vary in their methods of presenting data. At Twitter, the company previously reported 100% of workforce data but in 2017 began reporting undisclosed responses. That change resulted in Twitter appearing significantly more diverse than it once did. In 2016, for example, Twitter was 57% white. In 2017, when Twitter reported the demographics of only 80% of its workforce, Twitter was 44.3% white.

In 2018, Square began grouping Black, Latinx, Native American, Pacific Islander and employees of two or more races under an "underrepresented minority" umbrella when detailing the demographics of its tech, business and leadership teams. At Intel, the company grouped white and Asian-American employees together in a "majority population" bucket from 2017 through 2019.

"As an Asian American, I think it is very insulting," Lee said.

The limitations of the EEO-1

A diversity reporting standard does exist, but both Lee and Coleman say it's subpar. The Equal Employment Opportunity Commission requires companies with 100 or more employees to annually report demographic data via an EEO-1 filing. But this standard has its flaws.

One limitation with EEO-1 reports, Lee said, is that it's only U.S.-based data. Additionally, the EEOC asks companies to input data in a way that is "fundamentally different from the way corporations actually do it," she said.

EEO-1 reports require companies to report the raw numbers across 10 different types of roles, as well as male, female and six races. Lee argues that it doesn't allow for a deeper analysis of a company's workforce.

"So you're seeing pure gender-based data, you're seeing pure race- [and] ethnicity-based data [in the EEO-1]," she said. "So again, there's a limitation there from a reporting compliance perspective."

Coleman said the EEO-1 does serve a purpose but recognizes that there is room for a better standard. He also thinks a diversity report standard would make it easier for more companies to participate.

"A lot of folks I think sit on the sidelines because they don't want to get it wrong," Coleman said. "And I think most organizations don't want to misrepresent what their data is. [...] That would take a lot of that guesswork out, if we all had a standard we could go by that was deeper and richer than the EEO. Because the EEO is kind of limiting and I would argue decades behind."


Judge Zia Faruqui is trying to teach you crypto, one ‘SNL’ reference at a time

His decisions on major cryptocurrency cases have quoted "The Big Lebowski," "SNL," and "Dr. Strangelove." That’s because he wants you — yes, you — to read them.

The ways Zia Faruqui (right) has weighed on cases that have come before him can give lawyers clues as to what legal frameworks will pass muster.

Photo: Carolyn Van Houten/The Washington Post via Getty Images

“Cryptocurrency and related software analytics tools are ‘The wave of the future, Dude. One hundred percent electronic.’”

That’s not a quote from "The Big Lebowski" — at least, not directly. It’s a quote from a Washington, D.C., district court memorandum opinion on the role cryptocurrency analytics tools can play in government investigations. The author is Magistrate Judge Zia Faruqui.

Keep ReadingShow less
Veronica Irwin

Veronica Irwin (@vronirwin) is a San Francisco-based reporter at Protocol covering fintech. Previously she was at the San Francisco Examiner, covering tech from a hyper-local angle. Before that, her byline was featured in SF Weekly, The Nation, Techworker, Ms. Magazine and The Frisc.

The financial technology transformation is driving competition, creating consumer choice, and shaping the future of finance. Hear from seven fintech leaders who are reshaping the future of finance, and join the inaugural Financial Technology Association Fintech Summit to learn more.

Keep ReadingShow less
The Financial Technology Association (FTA) represents industry leaders shaping the future of finance. We champion the power of technology-centered financial services and advocate for the modernization of financial regulation to support inclusion and responsible innovation.

AWS CEO: The cloud isn’t just about technology

As AWS preps for its annual re:Invent conference, Adam Selipsky talks product strategy, support for hybrid environments, and the value of the cloud in uncertain economic times.

Photo: Noah Berger/Getty Images for Amazon Web Services

AWS is gearing up for re:Invent, its annual cloud computing conference where announcements this year are expected to focus on its end-to-end data strategy and delivering new industry-specific services.

It will be the second re:Invent with CEO Adam Selipsky as leader of the industry’s largest cloud provider after his return last year to AWS from data visualization company Tableau Software.

Keep ReadingShow less
Donna Goodison

Donna Goodison (@dgoodison) is Protocol's senior reporter focusing on enterprise infrastructure technology, from the 'Big 3' cloud computing providers to data centers. She previously covered the public cloud at CRN after 15 years as a business reporter for the Boston Herald. Based in Massachusetts, she also has worked as a Boston Globe freelancer, business reporter at the Boston Business Journal and real estate reporter at Banker & Tradesman after toiling at weekly newspapers.

Image: Protocol

We launched Protocol in February 2020 to cover the evolving power center of tech. It is with deep sadness that just under three years later, we are winding down the publication.

As of today, we will not publish any more stories. All of our newsletters, apart from our flagship, Source Code, will no longer be sent. Source Code will be published and sent for the next few weeks, but it will also close down in December.

Keep ReadingShow less
Bennett Richardson

Bennett Richardson ( @bennettrich) is the president of Protocol. Prior to joining Protocol in 2019, Bennett was executive director of global strategic partnerships at POLITICO, where he led strategic growth efforts including POLITICO's European expansion in Brussels and POLITICO's creative agency POLITICO Focus during his six years with the company. Prior to POLITICO, Bennett was co-founder and CMO of Hinge, the mobile dating company recently acquired by Match Group. Bennett began his career in digital and social brand marketing working with major brands across tech, energy, and health care at leading marketing and communications agencies including Edelman and GMMB. Bennett is originally from Portland, Maine, and received his bachelor's degree from Colgate University.


Why large enterprises struggle to find suitable platforms for MLops

As companies expand their use of AI beyond running just a few machine learning models, and as larger enterprises go from deploying hundreds of models to thousands and even millions of models, ML practitioners say that they have yet to find what they need from prepackaged MLops systems.

As companies expand their use of AI beyond running just a few machine learning models, ML practitioners say that they have yet to find what they need from prepackaged MLops systems.

Photo: artpartner-images via Getty Images

On any given day, Lily AI runs hundreds of machine learning models using computer vision and natural language processing that are customized for its retail and ecommerce clients to make website product recommendations, forecast demand, and plan merchandising. But this spring when the company was in the market for a machine learning operations platform to manage its expanding model roster, it wasn’t easy to find a suitable off-the-shelf system that could handle such a large number of models in deployment while also meeting other criteria.

Some MLops platforms are not well-suited for maintaining even more than 10 machine learning models when it comes to keeping track of data, navigating their user interfaces, or reporting capabilities, Matthew Nokleby, machine learning manager for Lily AI’s product intelligence team, told Protocol earlier this year. “The duct tape starts to show,” he said.

Keep ReadingShow less
Kate Kaye

Kate Kaye is an award-winning multimedia reporter digging deep and telling print, digital and audio stories. She covers AI and data for Protocol. Her reporting on AI and tech ethics issues has been published in OneZero, Fast Company, MIT Technology Review, CityLab, Ad Age and Digiday and heard on NPR. Kate is the creator of RedTailMedia.org and is the author of "Campaign '08: A Turning Point for Digital Media," a book about how the 2008 presidential campaigns used digital media and data.

Latest Stories