Article overview
This guide walks you through creating professional voiceovers in WellSaid Studio — from building a project and choosing a voice to fine-tuning your audio and downloading your finished files.
Create a new project
- Select Projects from the left menu.
- Click + New in the top-right corner.
- Enter a name under Project details.
- Click Create Project.
Choose a voice
Voice selection is the first creative step in Studio. It determines the model, style, and available features for your audio. You can change it later.
To choose a voice:
- Click the voice selector in the section header, or select See all voices from the right sidebar.
- Browse voice options and listen to samples.
- Use the Style dropdown to explore variations.
- Select a voice to apply it to your active section.
Save frequently used voices by clicking the heart icon on the voice card.
Select a model or language
- Open the section Style dropdown.
- Choose the model you want to use.
Features like Respellings and Rewrite for Speech depend on your selected model or language. Enterprise users can access the built-in Translation feature to translate content and generate audio directly in Studio.
Enter and structure your script
You can add your script in several ways:
- Paste, type, or click Import Script to add your content.
- Organize your script using one of these options:
- Split by paragraph — creates one section per paragraph
- Split by sentence — breaks content into individual sections
- Manual — create and manage sections yourself
Generate your first take
Once your voice and script are ready:
- Click the Play button to generate audio for the active section.
- Review the take and adjust the script if needed.
- Click Play again to generate new takes to compare.
Takes let you quickly refine your audio until it sounds exactly right.
Manage your takes
Sections support multiple takes, giving you flexibility as you iterate.
- Regenerate a take: Click the Regenerate icon (circling arrows) to create a new take.
- View take history: Click Take [#] at the top of the section or open Take History in the right sidebar.
- Repopulate a take: Open Take History, then select Use this take to restore both the audio and related text.
- Delete a take: Use the ellipsis (…) menu next to a take to delete it.
- Download a take: Click the Download icon and choose your file format (MP3, WAV, or OGG).
Fine-tune your audio
Studio provides pronunciation tools and script-shaping tools to help you guide how a voice delivers your script. Use the features below to improve clarity, pacing, emphasis, and overall performance.
Pronunciation tools
-
Respelling suggestions (Oxford & Smart Suggestions)
- Double-click or highlight a word to pull up the Smart Toolbar, view suggestions, and apply them to fix mispronunciations while keeping your script intact.
- Oxford Suggestions: Provides accurate respellings for over 250,000 British and American English words.
- Smart Suggestions: AI-generated respellings for words not covered by Oxford, such as proper names, acronyms, or brand-specific terms.
- Double-click or highlight a word to pull up the Smart Toolbar, view suggestions, and apply them to fix mispronunciations while keeping your script intact.
-
Create a phonetic respelling
- Create your own respelling by breaking a word into syllables and indicating how it should sound.
- Best for specialized terms or cases where suggestions don't match your desired pronunciation.
-
Replacements
- Use a replacement to substitute a word, term, or phrase with an alternative spelling when the pronunciation is otherwise ambiguous.
- Example: Change "1099-MISC" from "ten ninety-nine M I S C" to "ten ninety-nine miscellaneous."
- Store replacements in your Replacement Library to ensure the same word is read consistently every time.
-
Acronyms
- Some acronyms sound like words (NASA); others are spelled out (NBA).
- Use respellings or replacements when Studio misinterprets them.
- This helps voices distinguish pronunciation types.
-
Numbers
- Numbers can be read as dates, values, quantities, or references.
- Add context or replacements to guide the correct reading.
- Especially useful for addresses, IDs, and dates.
Script shaping tools
-
Voice Cues
- Use Voice Cues to shape delivery without rewriting your script. You can adjust:
- Loudness: make a phrase louder or quieter
- Pace: slow down or speed up delivery
- Pitch: raise or lower tone
- Pause: add intentional breaks at commas or periods
- Highlight text to reveal available cues in the sidebar, then apply and preview your changes.
- Or use an Emotional Preset as a starting point.
- Note: Voice Cues availability depends on the selected voice and model. Some combinations do not support cues.
- Use Voice Cues to shape delivery without rewriting your script. You can adjust:
-
Add emphasis
- Place a word or phrase in quotation marks (" ") to signal emphasis.
- This helps voices stress key points.
-
Pauses (commas, periods, ellipses)
- Punctuation directly affects pacing:
- Commas (,) create a light, natural pause.
- Periods (.) create a stronger pause with a downward inflection.
- Ellipses (...) create a longer, more conversational pause.
- For longer breaks, use Pause Cues or set pause durations when combining sections.
- Punctuation directly affects pacing:
-
Shaping questions
- Some questions need guidance to achieve a natural upward inflection.
- Use punctuation (commas, ellipses) or rephrase slightly to help the model deliver the intended tone.
Organize your sections
Studio makes it easy to manage and export complex scripts.
- Rename a section: Click the section title, enter the new name, and click outside to save.
- Rearrange sections: Copy text to a new section in the right order and generate a new take.
-
Combine sections: Before you begin, ensure your sections have been rendered.
- Select multiple sections or click Select all at the top of the section list.
- Click Download → Download audio as: Combined file.
- Choose a pause length: Small (0.3 seconds), Natural (0.8 seconds), Long (1.2 seconds), or a Custom value up to 10 seconds.
- Name your file. The file name is used for the downloaded audio.
- Optional: Enable captions (SRT/VTT).
Sections combine from top to bottom. Only active takes are downloaded.
Download your audio
You can download:
- Individual takes
- Combined audio from multiple sections
- Optional caption files
Downloads use your browser's standard download settings.
FAQs
Q: Can I change the voice after I've started writing my script?
A: Yes. You can change the voice on any section at any time. Generating a new take with the updated voice will reflect the change.
Q: What's the difference between a section and a take?
A: A section holds a portion of your script. A take is a generated audio rendering of that section. Each section can have multiple takes so you can compare and choose the best result.
Q: Do Voice Cues work with all voices?
A: No. Voice Cues availability depends on the selected voice and model. Some combinations show a message indicating cues are not supported.