Audiobooks are a booming business, but it’s impossible to imagine high-quality performers reading every book ever written out loud. To try to fill that gap, Audible, Amazon‘s audiobook arm, is testing out using voice clones produced with AI to speed up the process. The idea of a massive increase in books with an audio format has obvious appeal but immediately begs the question of whether they would be able to match a human performance, or whether it’s worth pursuing even if they can.
The program, in beta in the U.S., is for a group of current Audible narrators to create voice clones using AI trained on samples of their speech. They will then be able to plug these AI replicas into their book recordings. The AI voice won’t just be a plug-in for certain chapters and passages, either. Narrators can edit their AI voices to fix pronunciation, slow down or speed up the pacing, and then review the final production to ensure accuracy, just as they would in a live reading.
Audible suggested that AI voices will be present alongside human performances in a book, aiding rather than replacing the narrator. That means the narrators retain control over their AI voice, and they can audition their real and synthetic voices for reading projects. The exact payment system Audible will use for AI voices hasn’t been completely worked out yet, but the beta offers a share of royalties on income from titles. How much narrators can expect to earn from AI-generated recordings compared to live readings is less clear. Plus, Audible mentioned that beta participants will be able to produce their AI voice for free, hinting that those not in the beta may have to pay a fee, or possibly a subscription, for the synthetic version of their voice.
“Audible continues to hear from members of our creator community who are interested in using emerging technologies to make more audio content available to listeners. There is a vast catalog of books that does not yet exist in audio and as we explore ways to bring more books to life on Audible, we’re committed to thoughtfully balancing the interests of authors, narrators, publishers, and listeners,” Audible explained in the announcement. “Narrators continue to maintain control of the projects they want to audition for, using voice replica or live performance, and will remain central to the production process.”
AI bookworms
Assuming the beta test goes well, the option to add an AI-produced audio track will then be expanded to the authors, agents, publishers, and other copyright owners later this year. That suggests Audible is testing both the technical and political waters to see how the people involved in making an audiobook feel about the idea even before it becomes available to listeners.
Still, even if you don’t get an AI version of a current professional narrator, AI-generated voices are spreading thanks to improved synthetic voice cloning. For instance, ElevenLabs now offers AI versions of the voices of James Dean, Burt Reynolds, Judy Garland, and Sir Laurence Olivier for its Reader App, which can read out everything from books to websites.
Amazon clearly wants to show that it is focused on ensuring that the people involved in the project are happy. However, the company was more cavalier with its self-published audiobooks, adding AI voices through Kindle Direct Publishing (KDP). Authors could turn their books into audiobooks read by entirely synthetic voices with mixed reviews of their quality. There are now tens of thousands of titles on Audible with AI-generated voices. Professional narrators have complained about how it degrades their work and casts aside quality for cheapness even as it reduces their opportunities. The lack of a filter for AI-read books hasn’t helped matters. If Audible is going to navigate through these issues and assuage the writers and narrators, along with their fans eager to hear the books, the company will need to listen closely.