Payroll data is fintech’s $10 billion ‘holy grail’

A glimpse at consumers' paychecks gives crucial clues that help financial firms market mortgages, retirement accounts and more. No wonder it's such a tricky field to master.


Payroll data has become a crucial fintech battlefield.

Image: Christopher T. Fong/Protocol

Pinwheel CEO Kurt Lin launched his startup three years ago to chase what he called "the holy grail" of fintech: connecting payroll data to pretty much any app.

Payroll data is already widely used to verify employees' work status and make it easier for consumers to set up or switch direct deposit accounts. But so much more can be done with all that information recorded on paystubs, Lin said.

"There's an incredible wealth of information that has never really been programmatically unlocked before: things like who you are, how much you make, what you pay in taxes," he told Protocol.

Unlocking that data is expected to unleash a new wave of fintech innovation. And startups like Pinwheel are jockeying for position in the small but growing market as they also grapple with questions around regulations, controversial business practices and the presence of bigger competitors, including Plaid and Finicity.

Lindsay Davis, head of markets at Atomic, estimated the total addressable market for "payroll connectivity and data software" is currently around $10 billion. Alex Johnson, director of fintech research at Cornerstone Advisors, said the market "could be bigger."

Current estimates are likely "based on the initial use cases they've already identified and started selling against," he told Protocol. "There are a lot of other potential use cases that will be discovered over time."

New uses for data

Those could be even more compelling than the ones based on access to bank data, which companies like Yodlee pioneered and Plaid and Finicity later pursued. Payroll data is potentially more valuable and richer, covering other key information, such as 401(k) contributions, taxes and even a consumer's vacation and sick days tally. Even travel companies might want a look.

"Payroll is like the top of the waterfall," Johnson said. "It's where you get all of that from. There's a set of things you can do with payroll data that hasn't been done well with data further down in the stream."

Shmulik Fishman, CEO and founder of Argyle, another payroll data software company, said coming up with new uses for the information has been "one of the most fascinating parts of this business." "Every week, we get a new idea of what this same data set and same platform can be used for," he said.

John Whitfield, Pinwheel's vice president of engineering, argued that payroll software data could spark a new wave of fintech innovation, much like Plaid did by connecting startups like Square, PayPal, Robinhood, SoFi and Affirm to banking data.

"One thing we have in common with Plaid is this idea that we're an underlying platform for a fintech company that has yet to be founded," Whitfield, who is a Plaid alumnus, told Protocol. "That's something that Plaid preached and based their business on — that startup after startup would come in and immediately start using Plaid."

Johnson said the payroll data software industry could even be "stealing a certain amount of share from the data aggregators space." That could be a reason, he argued, "why Plaid is trying to get into the space — because they see it as potentially taking some of their total addressable market away."

"We are focused on the needs of our customers in service of consumers, not individual competitors," a Plaid spokesman said in a statement. "Plaid's entry into the payroll data space is a natural evolution and has been planned for some time."

He called Plaid Income, the company's offering in payroll data software, fintech's "first building block for payroll data, [which] builds on our experience working with the most widely-used fintech services in the world."

Plaid also said when it launched an income-verification product in March that payroll data was the "next frontier" in open finance and that access "to payroll data has tremendous potential to expand financial opportunity and help people lead healthier financial lives."

But it has also proven to be a tough market to crack — even for a new fintech juggernaut.

Hurdles in the quest

In a setback to its payroll data software ambitions, Plaid paused a major payroll-related product just five months into a big offensive to add payroll data to its offerings. The company has said it is not giving up on the space, and that suspending development of its Deposit Switch product does "not signal that Plaid is less committed to the payroll data API space."

But the move underlined the challenges in a market that Davis of Atomic also described as "messy." One reason is the nature of payroll data.

While consumers usually stick with one bank account for years, they may change jobs frequently and become part of different payroll systems, such as ADP, Paychex or Workday. And there are also different rules that govern the way payroll records are kept depending on where the employer and their employees are based.

"You've got the state labor laws that are fragmented state by state," she told Protocol. "You've got the employers that have enabled certain levels of access for their employees. And every employer has [its] own way of integrating with an HR system."

Then there are the different ways that payroll data can be accessed. API connections are "vastly preferred by everyone in the market," because they are more secure, Johnson said. But the big challenge for payroll data software companies, he said, is: "How do you motivate the companies that have this data to build technical integration, a business partnership that allows for access to the data?"

That prompted some companies to get data through screen scraping, where you "go to a payroll portal, ask someone for their username and password, and go in and pull down that data," Finicity CEO Steve Smith said, adding that screen scraping payroll data is an approach that "we will not adopt."

Screen scraping is a controversial and sensitive tactic in the payroll data software market — to the point that some in the industry are reluctant to discuss it.

Pinwheel CEO Kurt Lin is chasing "the holy grail" of fintech: connecting payroll data to pretty much any app.

Photo: Pinwheel

Pinwheel's Lin said "data aggregation is a difficult endeavor and we use a number of methods to connect with payroll platforms," but when asked if those included screen scraping, he said, "No further comment."

Davis of Atomic said the company has used screen scraping "when user-permissioned APIs are not available." One example is when Atomic needs to connect with state unemployment systems, which typically don't have API connectivity.

A Plaid spokesman said the company uses "a combination of API access and screen scraping at the direction of customers."

Some screen scraping tactics have raised alarm.

Plaid has taken heat for reportedly offering to pay users $500 for providing their employer payroll login details. The company denied it did anything illegal, saying it was part of "a voluntary and time-limited pilot program" that involved 12 participants and was meant to "assist Plaid in building consumer-permissioned tools that make it easier for consumers to securely share their information digitally."

Argyle faced similar accusations. Fishman, the company's CEO, denied them, saying, "That's not something that Argyle performs."

Johnson of Cornerstone Advisors said reports of the pay-for-data-access tactics didn't sit well with some payroll data software companies that saw the approach as "essentially spending VC money to bribe your way into getting larger coverage."

"Everyone I talked to in the industry, uniformly, is like, 'This is just bad for all of us. We get the temptation. We get why you might want to use this to build up coverage faster. But it's just bad for the industry overall if you do this,'" Johnson said. "This is the quickest way to regulators just shutting it down and not being on the side of this data sharing."

Setting the rules

There are looming battles over data privacy and ownership that are sure to engulf the payroll data software companies. This was highlighted by recent news that the SEC, under Chairman Gary Gensler, will look more closely into how data analytics and AI are used in financial services and possibly draft new rules that would cover these technologies.

Some companies have taken a proactive approach to the expected wave of regulations. For example, both Finicity, which was acquired by Mastercard last year, and Pinwheel have opted to become Fair Credit Reporting Act-compliant companies. Atomic is looking to be FCRA compliant. "It's not a matter of if, it's when," Davis said.

Being FCRA compliant means these companies must adhere to strict rules related to the handling of consumer data, which includes making sure, as credit reporting agencies do, that the information they collect is accurate and up to date. "We're basically on the hook for having data that actually is of high quality versus otherwise just being a data aggregator," Lin said.

Johnson said being FCRA compliant is significant in fintech where many companies "don't want to have to deal with more regulation than you have to." He added: "The way that data aggregators typically think about that is, 'We're just the pipes that pass data back and forth. We don't hold the data. We don't build consumer reports. We don't add any analysis layer or anything on top of the data.'"

He said it is a "really smart" move to "just lean in and say, 'Look, we're going to get regulated at some point, let's embrace it. Let's be the first one to talk to regulators about this. Let's get out ahead of it.'"

Despite Plaid's unexpected stumble, the company is expected to be a formidable competitor in the space given its track record as a fintech powerhouse. And its smaller rivals know this.

"Plaid's solution has been an integral part of the rise of fintech," Lin said. Davis said Plaid's decision to enter the market is "a huge validation" for the space. "It shows that it is important, that this is a market worth paying attention to, something we've believed for years now," she said.

Johnson echoed that view, calling Plaid's decision to take the "jump with both feet" into this market "a net positive." "The biggest problem you have — and we saw this in bank account aggregation as well — is incumbents just trying to kill this [space] before it gets started," he said.

Plaid, given its size and reach in fintech, can help establish payroll data software "as a category that's not going to go away," he said.

