People

Microsoft wants to take AI voices everywhere

By adding safeguards, Microsoft wants to ensure deepfake voices aren't being abused.

Duolingo Bea character

Duolingo gave its Bea character its own voice, with a little help from some neural networks.

Image: Duolingo

Get ready for every brand and app to have its own voice: Microsoft started to make its custom neural voice product more widely available to commercial partners Wednesday, allowing companies to generate their own voices for chatbots and other interactive applications. Custom neural voices are based on Microsoft's Azure AI platform, and use neural networks to create voices that don't have a robotic sound, like old-school text-to-speech technology.

The company spotlighted some early high-profile customers:

  • AT&T is using custom neural voice tech to bring Bugs Bunny in its Dallas experience store to life. Customers are greeted by name, and can chat with the Looney Tunes character while exploring the store.
  • Progressive created a voice chatbot for Flo, the omnipresent face of the insurance brand.
  • Duolingo is using custom neural voice to create multilingual voices for a set of characters, meant to bring personality to its language-learning app. Soon, you'll be able to choose whether you'd rather get help with your Japanese lessons from an emo teenager, a video game-loving kiddo who eats too much candy or a speed-talker who thinks she is always right.

To create these voices, Microsoft is asking companies to supply them with speech samples; for AT&T's Bugs Bunny, a voice actor recorded 2,000 phrases and lines. Azure AI then uses two neural networks to turn text into speech that actually pronounces words correctly, and also gets the tone and duration of each and every phoneme right.

Microsoft isn't the first company to use AI for custom voices. Google and Amazon have both generated celebrity voices for their respective assistants in the past, and Amazon recently announced that it would white-label Alexa, complete with custom voices. In October, Toronto-based Resemble AI launched Localize, a service that clones voices to produce translated audio recordings in a number of different languages.

With AI getting better and better at creating voices that are indistinguishable from real recordings, we'll likely also see a whole new wave of deepfake audio. Microsoft, for its part, went out of its way to stress that it is aware of the potential for abuse:

  • The company will limit access to its custom neural voice product to pre-approved partners, who have to contractually agree to a code of conduct.
  • Customers also have to agree to add disclaimers to their applications if consumers could mistake an AI voice for a real person.
  • The company is exploring the use of watermarks to make sure that AI recordings aren't used out of context.
  • Microsoft is also asking voice actors to acknowledge within their recordings that they are knowingly participating in an AI voice project — a safeguard against voice hijacking.

"As creators of this technology, we have an obligation to make sure it's used responsibly," said Azure AI platform VP Eric Boyd. "We're careful with the partners we work with in making sure they follow the guidelines."

A version of this story will appear in this week's Next Up newsletter.

Fintech

Judge Zia Faruqui is trying to teach you crypto, one ‘SNL’ reference at a time

His decisions on major cryptocurrency cases have quoted "The Big Lebowski," "SNL," and "Dr. Strangelove." That’s because he wants you — yes, you — to read them.

The ways Zia Faruqui (right) has weighed on cases that have come before him can give lawyers clues as to what legal frameworks will pass muster.

Photo: Carolyn Van Houten/The Washington Post via Getty Images

“Cryptocurrency and related software analytics tools are ‘The wave of the future, Dude. One hundred percent electronic.’”

That’s not a quote from "The Big Lebowski" — at least, not directly. It’s a quote from a Washington, D.C., district court memorandum opinion on the role cryptocurrency analytics tools can play in government investigations. The author is Magistrate Judge Zia Faruqui.

Keep Reading Show less
Veronica Irwin

Veronica Irwin (@vronirwin) is a San Francisco-based reporter at Protocol covering fintech. Previously she was at the San Francisco Examiner, covering tech from a hyper-local angle. Before that, her byline was featured in SF Weekly, The Nation, Techworker, Ms. Magazine and The Frisc.

The financial technology transformation is driving competition, creating consumer choice, and shaping the future of finance. Hear from seven fintech leaders who are reshaping the future of finance, and join the inaugural Financial Technology Association Fintech Summit to learn more.

Keep Reading Show less
FTA
The Financial Technology Association (FTA) represents industry leaders shaping the future of finance. We champion the power of technology-centered financial services and advocate for the modernization of financial regulation to support inclusion and responsible innovation.
Enterprise

AWS CEO: The cloud isn’t just about technology

As AWS preps for its annual re:Invent conference, Adam Selipsky talks product strategy, support for hybrid environments, and the value of the cloud in uncertain economic times.

Photo: Noah Berger/Getty Images for Amazon Web Services

AWS is gearing up for re:Invent, its annual cloud computing conference where announcements this year are expected to focus on its end-to-end data strategy and delivering new industry-specific services.

It will be the second re:Invent with CEO Adam Selipsky as leader of the industry’s largest cloud provider after his return last year to AWS from data visualization company Tableau Software.

Keep Reading Show less
Donna Goodison

Donna Goodison (@dgoodison) is Protocol's senior reporter focusing on enterprise infrastructure technology, from the 'Big 3' cloud computing providers to data centers. She previously covered the public cloud at CRN after 15 years as a business reporter for the Boston Herald. Based in Massachusetts, she also has worked as a Boston Globe freelancer, business reporter at the Boston Business Journal and real estate reporter at Banker & Tradesman after toiling at weekly newspapers.

Image: Protocol

We launched Protocol in February 2020 to cover the evolving power center of tech. It is with deep sadness that just under three years later, we are winding down the publication.

As of today, we will not publish any more stories. All of our newsletters, apart from our flagship, Source Code, will no longer be sent. Source Code will be published and sent for the next few weeks, but it will also close down in December.

Keep Reading Show less
Bennett Richardson

Bennett Richardson ( @bennettrich) is the president of Protocol. Prior to joining Protocol in 2019, Bennett was executive director of global strategic partnerships at POLITICO, where he led strategic growth efforts including POLITICO's European expansion in Brussels and POLITICO's creative agency POLITICO Focus during his six years with the company. Prior to POLITICO, Bennett was co-founder and CMO of Hinge, the mobile dating company recently acquired by Match Group. Bennett began his career in digital and social brand marketing working with major brands across tech, energy, and health care at leading marketing and communications agencies including Edelman and GMMB. Bennett is originally from Portland, Maine, and received his bachelor's degree from Colgate University.

Enterprise

Why large enterprises struggle to find suitable platforms for MLops

As companies expand their use of AI beyond running just a few machine learning models, and as larger enterprises go from deploying hundreds of models to thousands and even millions of models, ML practitioners say that they have yet to find what they need from prepackaged MLops systems.

As companies expand their use of AI beyond running just a few machine learning models, ML practitioners say that they have yet to find what they need from prepackaged MLops systems.

Photo: artpartner-images via Getty Images

On any given day, Lily AI runs hundreds of machine learning models using computer vision and natural language processing that are customized for its retail and ecommerce clients to make website product recommendations, forecast demand, and plan merchandising. But this spring when the company was in the market for a machine learning operations platform to manage its expanding model roster, it wasn’t easy to find a suitable off-the-shelf system that could handle such a large number of models in deployment while also meeting other criteria.

Some MLops platforms are not well-suited for maintaining even more than 10 machine learning models when it comes to keeping track of data, navigating their user interfaces, or reporting capabilities, Matthew Nokleby, machine learning manager for Lily AI’s product intelligence team, told Protocol earlier this year. “The duct tape starts to show,” he said.

Keep Reading Show less
Kate Kaye

Kate Kaye is an award-winning multimedia reporter digging deep and telling print, digital and audio stories. She covers AI and data for Protocol. Her reporting on AI and tech ethics issues has been published in OneZero, Fast Company, MIT Technology Review, CityLab, Ad Age and Digiday and heard on NPR. Kate is the creator of RedTailMedia.org and is the author of "Campaign '08: A Turning Point for Digital Media," a book about how the 2008 presidential campaigns used digital media and data.

Latest Stories
Bulletins