Spot the AI Voice Quiz

How much audio is needed to clone a voice?

Modern tools like ElevenLabs can produce a recognizable voice clone from just 3 to 10 seconds of reference audio, and OpenAI's Voice Engine (announced March 2024) uses a 15-second reference sample.

What is a deepfake?

A deepfake is AI-generated or AI-manipulated audio, image, or video that convincingly imitates a real person. Audio deepfakes use voice cloning to impersonate politicians, executives, or family members in scams.

How can you spot AI-generated speech?

Listen for an unnaturally consistent tone, missing breath and swallow sounds, mispronounced proper nouns, mechanically placed pauses or filler words ('um', 'uh'), and emotional inflection that does not fit the meaning of complex sentences. Detection tools like Hive and Resemble Detect claim accuracy above 90%.