How to Create an AI Voice That Sounds Like You With ElevenLabs

Generative AI and deepfakes have collided with the development of AI voice tools. The idea is simple: you take a voice and manipulate it to speak the words you give it.

Leading the pack in this area is ElevenLabs’ AI tool, which boasts a free-to-use tier alongside some impressive paid options.


What Is ElevenLabs?

Founded by an ex-Google machine learning engineer and an ex-Palantir deployment strategist, ElevenLabs is a voice technology research company. The speech software is a key element of its strategy, but the final aim is to create a tool that “instantly convert[s] spoken audio between languages.”

ElevenLabs Voice AI is a text-to-speech model that can create a realistic-sounding human voice. Its website states: “Our mission is to make on-demand multilingual audio support a reality across education, streaming, audiobooks, gaming, movies, and even real-time conversation.”


Google Translate and its alternatives are one thing, but can you imagine a tool that instantly translates what you’re hearing? Cloning the speaker’s voice, so that you hear the translated speech as they would say it, is an important stepping stone towards that.

What Is AI Voice Generation?

Described simply, AI voice generation lets you take a voice and make it say whatever you want to hear. Simply choose a voice, provide dialogue, and the tool does the rest.

You might think “well, Microsoft Sam was doing that back in the 1990s” and you would be quite right. But Microsoft Sam and similar tools sounded like robots. ElevenLabs’ tool, meanwhile, sounds far closer to humans.


ElevenLabs offers three speech AI options: its completely free “premade” voices, an AI voice generator (allowing you to select sex, age, and accent), and the subscription-only “cloned” voices, which are built from samples that you upload.



Using AI for creative purposes comes with some moral and ethical responsibilities, and creating voices with ElevenLabs’ speech AI tool is no different.

In short, don’t use someone’s voice without their permission. Even where it isn’t illegal, they might be upset about it.


Before you proceed, remember that at the time of writing, ElevenLabs’ speech AI tool is in beta. This means that it is not the finished product.

Generating a Basic AI Dialogue

The simplest way to get started is to use the ElevenLabs free speech AI tool.

To use this, go to beta.elevenlabs.io and create an account (you can use your own email, a Google account, or Facebook).

You can also Download the generated sample.
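For readers who would rather script this than click through the web page, here is a minimal sketch of the same idea (choose a voice, provide dialogue, get audio back). It assumes ElevenLabs’ public REST API, which this article does not cover: the base URL, the `xi-api-key` header, and the `voice_settings` fields are written from memory of that API and should be checked against the current API reference. The function name `build_tts_request` is hypothetical, and the sketch only builds the request; it does not send it.

```python
import json
import urllib.request

# Assumption: base URL of the ElevenLabs REST API (verify against the official docs).
API_BASE = "https://api.elevenlabs.io/v1"


def build_tts_request(api_key, voice_id, text, stability=0.5, similarity_boost=0.75):
    """Build (but do not send) a text-to-speech request for one voice.

    Hypothetical helper: the endpoint path, header name, and body fields
    are assumptions about the API, not something stated in the article.
    """
    payload = {
        "text": text,  # the dialogue you want spoken
        "voice_settings": {
            "stability": stability,
            "similarity_boost": similarity_boost,
        },
    }
    return urllib.request.Request(
        url=f"{API_BASE}/text-to-speech/{voice_id}",
        data=json.dumps(payload).encode("utf-8"),
        headers={"xi-api-key": api_key, "Content-Type": "application/json"},
        method="POST",
    )


if __name__ == "__main__":
    # Placeholder key and voice ID; nothing is sent over the network here.
    request = build_tts_request("YOUR_API_KEY", "YOUR_VOICE_ID", "Hello from my AI voice.")
    print(request.get_method(), request.full_url)
```

If those assumptions hold, passing the request to `urllib.request.urlopen` should return the audio as bytes that can be written to a file, much like the Download button does.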

How to Make an AI Voice With ElevenLabs

If you prefer to create a new voice, you can use the Add Voice button to visit the VoiceLab screen. To generate a new voice based on ElevenLabs’ presets:

In testing, I found that both the Female/Young/Australian and the Male/Old/Australian accents were distinctly “American.” This is an issue that will probably be ironed out as the technology develops.

Creating Your Own Voice in AI

While the premade and configurable options are interesting, the really exciting element of ElevenLabs’ technology is the Instant Voice Cloning tool.

Unlike the other options, Instant Voice Cloning requires a subscription. Several plans are available, the cheapest being $5 a month. At the time of writing, this comes with an 80% discount for the first month, making it just $1.

Other options cost $22, $99, and $330 a month, with the possibility of generating up to 40 hours of audio per month.
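The first-month arithmetic can be checked with a few lines of Python. This is only a sketch using the prices quoted at the time of writing; the function names are made up for illustration, and the sketch assumes the 80% discount applies to the first month of the $5 plan only, as described above.

```python
from decimal import Decimal

# Prices quoted in the article at the time of writing (USD per month).
CHEAPEST_PLAN = Decimal("5")
FIRST_MONTH_DISCOUNT = Decimal("0.80")  # 80% off the first month


def first_month_price(list_price, discount=FIRST_MONTH_DISCOUNT):
    """Price of the first month after the introductory discount."""
    return list_price * (1 - discount)


def first_year_cost(list_price, discount=FIRST_MONTH_DISCOUNT):
    """One discounted month followed by eleven months at full price."""
    return first_month_price(list_price, discount) + list_price * 11


if __name__ == "__main__":
    print(first_month_price(CHEAPEST_PLAN))  # 5 * 0.20
    print(first_year_cost(CHEAPEST_PLAN))    # first month + 11 * 5
```

`Decimal` is used instead of floats so that the currency values stay exact.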

To use ElevenLabs’ voice cloning tool, you will need both some dialogue and a sample of your voice. Anything will do, as long as it is clear and in MP3 format. The longer the sample, the better, up to 5 minutes.
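Before uploading, the sample guidance above can be turned into a quick pre-flight check. The sketch below is not part of ElevenLabs’ tooling: `check_voice_sample` is a hypothetical helper, it treats “up to 5 minutes” as a hard cap (an assumption), and it expects you to supply the clip length yourself, since reading MP3 duration needs a third-party library.

```python
from pathlib import Path

# Assumption: "up to 5 minutes" is treated here as a hard upper limit.
MAX_SAMPLE_SECONDS = 5 * 60


def check_voice_sample(path, duration_seconds):
    """Return a list of problems with a voice sample.

    An empty list means the sample matches the article's guidance
    (MP3 format, no longer than 5 minutes). Clarity of the recording
    cannot be checked this way and still has to be judged by ear.
    """
    problems = []
    if Path(path).suffix.lower() != ".mp3":
        problems.append("sample should be an MP3 file")
    if duration_seconds <= 0:
        problems.append("sample has no audio")
    elif duration_seconds > MAX_SAMPLE_SECONDS:
        problems.append("sample is longer than 5 minutes")
    return problems


if __name__ == "__main__":
    print(check_voice_sample("my_voice.mp3", 120))  # fits the guidance
    print(check_voice_sample("my_voice.wav", 400))  # wrong format and too long
```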

From the VoiceLab screen:

With the voice added, you can adjust it in the Speech Synthesis screen as above.

What Can You Do With an AI Voice?

AI speech with premade and cloned voices has numerous possibilities. As mentioned, ElevenLabs’ final aim is live translation, but the company has highlighted various other uses.

Audiobooks are mentioned (perhaps read by a long-dead movie star) along with video games (using AI speech would save on voice actors). But it has uses beyond this, from music to satire to self-help, and probably beyond.

It’s even possible to create a podcast using AI speech, although the results could sound flat and boring.

The introduction to an episode of our Really Useful Podcast was produced using ElevenLabs.

While the results weren’t quite what we’d hoped, they’re good enough to use, and the technology can only get better.

Meanwhile, ElevenLabs is planning a generated “voice conversation” feature to be introduced at a later date.

Use Your Voice in a New Way With ElevenLabs’ Speech AI

Artificial intelligence has brought us some amazing new tools over the past few years. ChatGPT can be used to create text, answer questions, outline reports, and more. Midjourney is an astonishing tool that generates art based on prompts.

Now, the speech AI tool from ElevenLabs makes it easy to manipulate a voice. It’s like an impersonation, but with a clone of the original voice.

While there are ethical arguments against using voices without consent, this is a powerful tool with some interesting possibilities. Best of all, it’s surprisingly easy to use and delivers impressive results.
