How to Use Generative Audio | Runway Academy
TLDRThis Runway Academy tutorial explores generative audio, demonstrating how to convert text to speech, customize voice models, and create lip-sync videos. Users can type and generate spoken audio, preview voices, and train custom voice models with clean audio. The process also covers adding lip-sync to images or videos, with tips for seamless integration and avoiding camera motion for a natural effect. The tutorial encourages joining the community for further resources and support.
Takeaways
- 🎙️ Generative audio includes text-to-speech, custom voice models, and creating lip-sync videos.
- 🔧 Access the generative audio tool from the Runway dashboard to create spoken audio files from text.
- 🗣️ Preview and select a voice from the default list or train a custom voice model using clean audio recordings.
- ⏱️ Audio generation time varies based on script length but is generally quick.
- 💾 Generated audio is automatically saved in the 'generative audio' folder within the main assets.
- 🎧 Train a custom voice model with a few minutes of clean audio and use it for text-to-speech.
- 👤 For lip-sync videos, upload an image or video with a clear view of the person's face.
- 🎥 Use lip-sync with generated, recorded, or uploaded audio to synchronize with the visuals.
- 🔁 If the audio is longer than the video, the video will loop to match the audio duration.
- 📹 Pro tip: When creating videos, use subject motion with a motion brush to minimize the reversing effect.
- 💡 For more resources and community support, join Runway's Discord or use the dashboard help button.
Q & A
What is generative audio?
-Generative audio refers to the process of creating audio content using artificial intelligence, which includes text-to-speech, custom voice models, and creating lip-sync videos.
How do you access the generative audio tool in Runway?
-You can access the generative audio tool by logging into your Runway dashboard and clicking on the generative audio tool at the top.
What can you do with the generative audio tool after typing in text?
-After typing in text, you can preview it, choose a voice from the default voice list, and then click on the generate button to turn it into a spoken audio file.
How long does it typically take to generate audio using the tool?
-The generation time depends on the length of the script, but it usually goes pretty quickly.
Where are the generated audio files saved by default in Runway?
-The generated audio files are automatically saved to the generative audio folder inside of your main assets folder in Runway.
What is required to train a custom voice model in Runway?
-To train a custom voice model, you need a few minutes of clean audio, which can be imported or recorded directly within the generative audio tool.
How do you ensure the audio is clean for training a custom voice model?
-The audio should be as clear as possible, with minimal background noise and consistent volume levels.
What is the process for creating a lip-sync video in Runway?
-To create a lip-sync video, you need an image or video of a person with a full face visible. You then upload this media and synchronize it with generated or uploaded audio.
Can you use lip-sync with different types of audio in Runway?
-Yes, lip-sync can be used with generated audio from text-to-speech, recorded audio, or uploaded audio.
What happens if the audio is longer than the video when creating a lip-sync video?
-If the audio is longer than the video, the video will reverse and go back to the beginning for the duration of the audio once it reaches the end of its duration.
What is a pro tip for using the video workflow in Runway's generative audio tool?
-A pro tip is to avoid using camera motion parameters and instead add subject motion with a motion brush to make the reversing effect less noticeable.
Where can users find more resources and community support for using Runway?
-Users can join the Runway community on Discord for more resources and experimentation, or use the dashboard button for specific answers to their questions.
Outlines
🎙️ Introduction to Generative Audio
The video script introduces viewers to Runway Academy's generative audio tool, which encompasses text-to-speech, custom voice models, and lip sync video creation. The tutorial begins with accessing the tool from the Runway dashboard and demonstrates how to convert typed text into spoken audio. It guides users through previewing and selecting a voice, with James as a default option, and generating the audio. The script also explains how to save the audio files and briefly touches on training a custom voice model using clean audio recordings. The process involves importing audio, ensuring clarity, and naming the model for use with text-to-speech.
Mindmap
Keywords
Generative Audio
Text to Speech
Custom Voice Models
Lip Sync Videos
Runway Dashboard
Voice List
Audio Generation
Assets Folder
Lip Sync
Gen 2
Motion Brush
Highlights
Introduction to generative audio tools in Runway Academy.
Generative audio includes text to speech, custom voice models, and creating lip sync videos.
Access the generative audio tool from the Runway dashboard.
Type in text to convert it into a spoken audio file.
Preview and choose a voice from the default voice list.
Generation times vary based on script length but are usually quick.
Audio generations are automatically saved in the generative audio folder.
Option to save audio in a different location via a drop-down menu.
Train a custom voice model with a few minutes of clean audio.
Record audio directly in Runway for custom voice models.
Ensure the audio is clean for optimal custom voice model training.
Use the trained custom voice model with text to speech.
Create a lip sync video using an image or video of a person.
Upload your own media or use preset characters for lip sync.
Lip sync can be applied to generated, recorded, or uploaded audio.
Use Gen 2 to turn an image into a video for lip sync.
If audio is longer than video, it will loop from the beginning.
Tip: Avoid camera motion parameters for smoother video reversing.
Join the Runway community on Discord for more resources and experimentation.
Find specific answers to questions using the dashboard button.