Rima Seiilova-Olson wasn’t sure why she was the only startup founder on a panel full of academics.
“I feel a little puzzled,” said Seiilova-Olson, co-founder and chief machine-learning scientist at a mental health AI startup Kintsugi, talking to Protocol about her participation in a Feb. 16 federal task force meeting about how she might use a federally funded AI research cloud.
The National AI Research Resource, or NAIRR, would be a repository of data and tools for AI research combined with access to the computing power necessary to develop machine learning and other AI systems. The proposed project is currently being evaluated by a task force overseen by the White House Office of Science and Technology Policy and the National Science Foundation.
Just who will get to use it remains in question. The National Artificial Intelligence Initiative, established by Congress in 2020, envisions the NAIRR as a research hub “for AI researchers and students across scientific fields and disciplines” including from “communities, institutions, and regions that have been traditionally underserved.”
That legislation, which established the task force planning the NAIRR, does not exclude private-sector researchers, but some believe they should not belong in that community. Researchers from the Stanford Institute for Human-Centered Artificial Intelligence, an influential proponent of the project, have indicated that the government-funded research hub should focus on the needs of academic and nonprofit researchers.
“Public investment in AI research for noncommercial purposes may help to address some of the issues of social harm we see presently in commercial contexts, as well as contribute to shifting the broader focus of the field toward technology developed in the public interest by the public sector and civil society, including academia,” wrote the institute in an October 2021 white paper.
The case for startup access
Amid representatives from five colleges and universities, Seiilova-Olson was the lone speaker representing the private sector at the virtual panel discussion addressing the needs of various potential users of the NAIRR.
She discussed her goals as a startup founder trying to compete with Big Tech. Kintsugi is developing machine-learning models to help detect clinical depression and anxiety based on voice data, and she said that means the company needs access to costly computing power to process large-scale, unstructured data.
But in addition to the nuts and bolts of building AI, Seiilova-Olson said the NAIRR also should provide training or community resources for people without access to traditional computer-science education.
“An open resource that is freely available for people, I think it’s extremely important. It played a huge role in my journey from being a regular software engineer to being a machine-learning scientist,” she said during the panel discussion, explaining her background using online training to advance her AI skills.
Rima Seiilova-Olson.Courtesy: Rima Seiilova-Olson
“There’s a big need for small players like myself to benefit from these resources, and I’m not talking about compute power and data,” she told Protocol last week, adding that she hopes NAIRR can provide access for STEM-related educational resources for underrepresented people.
After Seiilova-Olson spoke at the NAIRR meeting, Lynne Parker, task force co-chair and the founding director of the National AI Initiative Office at the OSTP, told her she made “a really strong case for how the availability of resources can really make a huge impact, including in particular for startups.”
Jen King, privacy and data policy fellow at the Stanford Institute, said use by private-sector researchers would pose legal and logistical issues, as well as distract from the core mission of the research cloud. “Overloading this resource out of the gate to address very different sets of users — small business and academic researchers — may jeopardize its development and ultimately its effectiveness,” she told Protocol.
Seiilova-Olson said she was personally invited by Parker to join the panel discussion, but she wasn’t sure why. Her company has ties to the National Science Foundation. Kintsugi has received around $2 million in NSF grant funding with another $1.5 million in funding on the way, according to a company spokesperson.
“In NAIRR task force discussions to date, a number of task force members have suggested that small business and startups, specifically those funded by federal [small business innovation] grants, have important perspectives that could help inform the task force’s work,” said Andres Anzola, press secretary at the OSTP. He said the task force co-chairs worked with federal subject matter experts to identify a grant recipient to participate in the panel discussion.
Seiilova-Olson, who co-founded her company after a personal experience with postpartum depression in the hopes of helping other parents, said startups are important contributors to AI research in part because they develop AI that is intended to be commercialized, and therefore possibly meet people’s needs.
However, noting her respect for academic research, she also said, “academic researchers have their own ideas that may be a bit detached from the needs of regular citizens.”
King said NAIRR may not be the appropriate home for startup AI research. “Small business AI may be expressing legitimate needs and constraints with competing against big AI, but the NAIRR may not be the right solution for addressing them,” King said.
Private-sector presence on the task force
The NAIRR task force already has a private-sector presence, which is by design according to the legislation that established it.
Among its 12 members are Andrew Moore, Google Cloud’s vice president and general manager for AI and Industry Solutions; Daniela Braga, founder and CEO of AI startup Defined.ai, formerly DefinedCrowd; and Oren Etzioni, a venture partner at investment firm Madrona Venture Group who also works with new AI startups as part of an incubator fund affiliated with AI2, a nonprofit he leads that was founded by Microsoft co-founder Paul Allen.
Google has said it wants researchers who don’t need computing power from the research cloud to be able to access data in the NAIRR. For one thing, that would ensure that researchers from commercial cloud providers like Google, Amazon and Microsoft would be able to take advantage of the data flowing through the system.
Health care data giant Cerner, recently acquired by Oracle, also indicated interest in the NAIRR. The company, which helps health care customers manage patient data and is increasing its use of AI for hospital administration and patient care, emphasized public-private partnerships when it comes to how data in the resource is handled.
“It is critical that the governance structure for the NAIRR involve[s] representation from knowledgeable and relevant public-private entities for the types of data and purposes of collection that prevail,” the company wrote in an October 2021 response to a request for information by the task force. “Public policies and laws are government core competencies. However, administration of the data asset and associated technical aspects may be more suited for a public-private partner.”
Meanwhile, opponents of the project, including some advising AI policy inside the Federal Trade Commission, have raised concerns that the NAIRR will be designed primarily to enable massive-scale AI projects that, by default, would require assistance from big private-sector cloud providers.
Still, there’s no shortage of academics who see value in private cloud providers building and maintaining the NAIRR. “I would strongly urge that the NAIRR would be based on the existing cloud providers in the commercial space,” said Tom Dietterich, distinguished professor emeritus in the Collaborative Robotics and Intelligent Systems Institute at Oregon State University, during the panel discussion.
A task force working group focused on compute capabilities of the NAIRR is also weighing the possibility of working with commercial cloud providers.
During a separate session of the February task force meeting, research staff from the Science and Technology Policy Institute, a federally funded research and development center, said that partnering with established entities like commercial cloud providers could provide the NAIRR with a range of data sources, a workforce pipeline and timely updates to data architectures and data security technologies.
However, said Emily Grumbling, a research staff member with the Policy Institute, “relying on these resources could ultimately increase dependence on the private sector and for-profit based resources for the AI [research and development] ecosystem.”
This story was updated to correct the amount of funding that Kintsugi has received.