The smart speaker revolution has unlocked an opportunity to engage and communicate by voice — we’ve all heard our fair share of “Hey Siri!” and “Alexa, what’s the weather?” Today, smart speakers are owned by an estimated 60 million people in the U.S. TV manufacturers are even installing microphones for far-field voice recognition, in an effort to make TV’s speakers function like smart home speakers.
The “always on” nature of these listening devices is leading to a new and ubiquitous form of ambient computing. Importantly, they also represent new distribution channels that will be exploited by a new generation of startups.
## 3. New modes of interaction
Audio-first modes of interaction represent the[ third major platform and UI shift of the past three decades](https://hbr.org/2019/05/why-tech-giants-are-so-desperate-to-provide-your-voice-assistant), following the Internet, the PC, and the mobile eras. The current shift to voice has increased consumers’ familiarity and acceptance with audio, shown through users’ growing comfort with speaking to smart devices, like Siri or Alexa. It’s also enabled more short-form texting inputs: for[ a culture obsessed with multitasking](https://qz.com/emails/quartz-obsession/1784195/), audio is our new best friend.
**Last decade, mobile apps were the remote control for the real world. This decade, voice is the remote control.**
> This decade, voice is the remote control.
The advent of always-on, two-way voice communication channels not only makes voice assistants like Siri and Alexa more accessible and utilitarian, but also opens the door to new social behavior. Already, a myriad of social audio networks have taken 2020 by storm. This is what continues to excite us about companies like [Discord](https://discord.com) and other emerging audio social networks, including [Clubhouse](https://www.joinclubhouse.com) and [Chalk](https://chalkapp.com) (broadcast), [Rodeo](https://www.producthunt.com/posts/rodeo-4) and [Soapbox](https://soapbox.social) (small group), [Dialup](https://dialup.com) (one-to-one), and more. We’re even seeing vertical social networks in this space begin to materialize, such as [Locker Room](https://joinlockerroom.com) for sports, [Roadtrip](https://www.producthunt.com/posts/roadtrip-3) for music, and [Quilt](https://www.beta.wearequilt.com) for meaningful connection and self-care.
(The explosion of social audio deserves an analysis of its own. In a more thorough breakdown, Rex Woodbury dissects the [opportunity for a new platform](https://digitalnative.substack.com/p/the-opportunity-in-social-audio?token=eyJ1c2VyX2lkIjoyNDYxNzI3LCJwb3N0X2lkIjoxOTI4Njc0MSwiXyI6IlBIYmo4IiwiaWF0IjoxNjA2NDI0MjI0LCJleHAiOjE2MDY0Mjc4MjQsImlzcyI6InB1Yi0zODUwMCIsInN1YiI6InBvc3QtcmVhY3Rpb24ifQ.WgnRpOi14KZir2ir4DP_C8h90BFJnSqO3EYxV_lDWnw) to emerge in audio, arguing that the platform will be highly social and will rely on users to create and distribute content.)
# Request for startups
[
“Audio listening has always adapted to the times, and that’s one of its main benefits.” - James Cridland
](https://james.cridland.net)
At Bessemer, we’re on the hunt for new audio-first products. Some areas we’re excited about include:
## The audio creation suite
During a gold rush, sell shovels. Some of the most exciting startups we’re seeing in audio serve as underlying infrastructure for creation. This opportunity has proven lucrative across other rich content types like photo and video; for example, photo creation/sharing tools were democratized by platforms like Canva and Instagram, and video by the likes of YouTube, Loom, and TikTok.
We’re bullish about tools that expand the funnel of audio creators. Earlier in this piece, we mentioned [Spotify’s strategic acquisition of Anchor](https://medium.com/anchor/anchor-is-joining-spotify-70356e3ac23f), which brought Anchor’s suite tools for podcast creation and distribution to Spotify’s community of over 200 million users. Another example in the audio creation suite is [Descript](https://www.descript.com), which makes editing audio and video content as easy as editing a word document, and provides collaborative tools for podcast and media creation. (Descript also acquired [Lyrebird](https://www.descript.com/lyrebird), which built editing tools for speech synthesis.) In music, we see startups like [Splice](https://splice.com), a collaborative tool for music production, often referred to as a “GitHub for musicians.”
Beyond creation, there are exciting innovations for fan engagement and content monetization. Audio has been [notoriously under-monetized relative to other forms of media.](https://www.matthewball.vc/all/audiotech) So far, we’ve seen products like [Glow](https://www.glow.fm), which offers a way to distribute and monetize premium podcast content.
## Audio for the masses
With an influx of audio creation tools, it’s never been easier to make and distribute audio content. But as a consumer, it’s never been harder to distinguish the best content from the crowd. We see the opportunity for there to be a “TikTok for Audio,” which we believe might help solve this problem.
TikTok is powerful in its features for both creation and discovery. On the creation front, it has enabled anyone to become a filmmaker, similar to how Instagram widened the funnel for photographers. The social graph on TikTok has allowed users to share videos to their friends and greater network, paving the way for power users to become influencers in their own right. And on the discovery front, TikTok is almost too good to be true: just look at Dreams by Fleetwood Mac, which catapulted back onto the charts more than four decades after its initial success, thanks to a [viral TikTok](https://www.tiktok.com/@420doggface208/video/6876424179084709126?lang=en&sender_device=pc&sender_web_id=6892988847333246470&is_from_webapp=1) that combined a morning skateboard session and a bottle of Ocean Spray paired with the classic hit.
Following these themes, a “TikTok for Audio” could solve some of the problems surrounding audio discovery and engagement. So far, we’ve been excited by the entrance of startups like [Shuffle](https://getshuffle.app), which helps podcast fans share the best ideas and stories locked up in long podcasts.
## Voice at work
[
“There’s a temptation in our networked age to think that ideas can be developed by email and iChat. That’s crazy. Creativity comes from spontaneous meetings, from random discussions. You run into someone, you ask what they’re doing, you say ‘wow,’ and soon you’re cooking up all sorts of ideas.” - Steve Jobs
](https://www.goodreads.com/quotes/511417-there-s-a-temptation-in-our-networked-age-to-think-that)
Voice technology is just beginning to permeate the workplace. Knowledge work is increasingly remote and solo, and [both open offices and remote work benefit from audio solutions](https://techcrunch.com/2020/11/18/virtual-hqs-race-to-win-over-a-remote-work-fatigued-market/) (AirPods are a survival tool; we already knew that, but it’s worth noting that they give employees more mobility and free up their hands, too).
![]()
[
Spotify’s #2020WRAPPED showed a sizable increase in work-from-home themed playlists.
](https://newsroom.spotify.com/2020-12-01/the-trends-that-shaped-streaming-in-2020/)
But this theme isn’t just limited to music and podcast streaming — voice experiences at work are [becoming widespread](https://www.gartner.com/smarterwithgartner/chatbots-will-appeal-to-modern-workers/). Specifically, we’re intrigued by innovations in asynchronous and spatial audio that help create a sense of presence in a digitally-native workplace. For example, [Yac](https://www.yac.com) utilizes asynchronous voice messaging to help cut down on meetings and calls, and [Slashtalk](https://slashtalk.com) decentralizes conversations so remote teams can move faster. In spatial audio, companies like [Branch](https://branch.gg) are creating virtual HQs to bring serendipity back into the workday.
Beyond this, advances in audio and voice-based technologies are reinventing sales, marketing, and customer relationship management software. [Gong](https://www.gong.io), a voice intelligence company, uses real-time natural language processing to train and suggest information to salespeople and other customer service reps. (Fun fact: Gong’s model was [trained on the full run of Seinfeld](https://www.gong.io/press/seinfeld-helps-make-machines-a-little-more-human/), in an effort to teach the platform about awkward conversations and sarcasm.)
# The future of audio
[
“If I have seen further, it is by standing on the shoulders of giants.” -Isaac Newton
](https://en.wikipedia.org/wiki/Standing_on_the_shoulders_of_giants)
Innovation doesn’t — and shouldn’t — occur in a vacuum. The iPod was built upon the work of [Kane Kramer](https://en.wikipedia.org/wiki/Kane_Kramer), who took inspiration from the Sony Walkman. Facebook is presumed to have been partially inspired by [Friendster](https://digital.hbs.edu/platform-digit/submission/before-facebook-there-was-friendster-yes-thats-right/). Similarly, many successful companies grew efficiently in their early years by standing on the shoulders of giants and exploiting creative distribution loopholes. [Airbnb scraped Craigslist](https://growthhackers.com/growth-studies/airbnb) to hack marketplace growth, Zynga leveraged Facebook’s social graph and newsfeed, and Yelp built its user base by capitalizing on Google search algorithms. This trend will be no different in audio. We’re only scratching the surface: as new platforms, technological innovations, and modes of communication emerge, audio is a space to watch—or should we say, listen?
Are you building something in audio? Let us know and reach out to Talia Goldberg (talia@bvp.com) and Gaby Goldberg (ggoldberg@bvp.com) via email.