May 26, 2022
Hello, and welcome to Protocol Entertainment, your guide to the business of the gaming and media industries. This Thursday, we’re exploring how Sonos built its voice assistant, and why Amazon didn’t use computer vision for its new Glow projector device. Also: Time to take a deep breath.
Sonos quietly began rolling out its voice assistant to some people in the U.S. this week, days before its official June 1 launch date. Sonos Voice Control is purpose-built for music playback, and it comes with strong privacy safeguards: Unlike Alexa or the Google Assistant, it doesn’t upload any voice recordings to the cloud, but instead processes everything on the device.
I spoke with Sonos’ Sébastien Maury, the company’s senior director for Voice Experience, and Kåre Sjolander, European head of Text-to-Speech for synthetic voice specialist ReadSpeaker, to learn more about the work that went into building the assistant.
Giving the Sonos assistant a voice. Sonos teamed up with ReadSpeaker to generate the unique voice profile of its assistant, which is based on “Breaking Bad” actor Giancarlo Esposito.
Making sure the assistant understands you. Having an assistant respond with a synthetic voice is only half the battle. Getting it to actually understand requests is just as important — and even more challenging if it’s done locally on the device.
The focus on just one use case makes things a little easier. Sonos Voice Control won’t need to tell people about their weather or commute, and speaker owners will likely use a much more streamlined set of requests.
“This is one of the advantages of running locally,” Maury said. “We have one [speech recognition] model per house.”
— Janko Roettgers
When I first heard about Amazon’s new kid-focused video calling device, the Amazon Glow, my mind immediately went to Osmo, which has been combining digital and physical play with its child-centric entertainment apps and accessories for years. There are even tangram sets for both, allowing kids to solve digital puzzles with physical puzzle pieces.
But after talking to some of the folks who worked on the Glow for an in-depth story on its development that published on Protocol.com this week, I realized that Amazon ultimately decided to take a very different approach — and the reasons for that decision show that there’s no one-size-fits-all approach when it comes to building next-generation entertainment devices.
Ultimately, Amazon went with an IR sensor that can track a person’s fingers instead of a traditional RGB camera. However, Aalund readily admitted computer vision may one day provide even better results. “We started this five years ago,” he said. We didn't have quite as powerful cameras and systems as we do today. That biased us a little bit.”
— Janko Roettgers
The digital revolution is already here – transforming the way we live, work, and communicate. Smart infrastructure is a key part of this revolution. It brings the power of the digital world to physical components like energy, public transportation, and public safety by using sensors, cameras, and connected devices.
Magic Leap is getting rid of its original headset. The company’s pivot to the enterprise is complete: Discount site Woot is selling the Magic Leap 1 headset, which used to be $2300, for $550 this week.
Netflix is eyeing console and cloud gaming. In a lengthy survey, the company asked subscribers about their interest in playing Netflix games on TV.
Niantic is building an AR map of the world. The Pokemon Go developer has been crowdsourcing its Visual Positioning System, which allows developers to create persistent AR experiences at 30,000 locations.
Netflix layoffs disproportionately impacted people with marginalized identities. A recent round of layoffs resulted in deep cuts on social media teams set up to speak to people of color and LGBTQ+ viewers.
The metaverse gets its first in-world conference. The Meta Festival, scheduled for June 28, will include speakers from Netflix, Headspace, Paramount and others.
Roblox hires a former Zynga and Twitter exec. Nick Tornow, the former chief technology officer at Zynga, is joining Roblox as vice president of Engineering for its developer team. Tornow was previously Twitter’s platform lead.
The war in Ukraine is still straining game development. The Belarusian game developer Sad Cat Studio said on Wednesday it was delaying its upcoming Xbox exclusive Replaced to 2023, citing the ongoing conflict and the impact it’s had on staff members.
An NFT nightmare: Seth Green made headlines this week when his Bored Ape NFT was stolen and resold to a buyer who has no intention of returning it. That could complicate Green’s plans for an animated TV show using the underlying art and character of the NFT.
It’s easy to feel lost and overwhelmed in a week like this. Self-care obviously won’t solve all of our problems (for one, it doesn’t get rid of assault weapons), but taking a moment for yourself can at least help to cope with some of the feelings these senseless tragedies leave us with. One way to do that is guided meditation, which is something that VR meditation company Tripp is currently offering for free in its mobile app. Plus, Tripp recently teamed up with Niantic to soon integrate AR experiences into its mobile app, so you’ll be able to find those self-care moments anywhere.
— Janko Roettgers
The potential of the IIJA to shape our future is immense; if we don’t spend the funds wisely, the effects will be felt for generations. Physical infrastructure alone does not fully address the diverse needs of our modern, information-driven economy and set us up for future success.
Thoughts, questions, tips? Send them to email@example.com. Enjoy your day, see you tomorrow.