Advanced Settings Tutorial - Kits AI
TLDRThe video tutorial from Kits AI provides a comprehensive guide on how to optimize the conversion of voices using their advanced settings. It begins with the removal of instrumentals and other unwanted audio elements such as reverb and backing vocals. The tutorial then delves into pitch adjustment with the pit shift tool, which is crucial when the audio's pitch does not match the selected AI model. The importance of conversion strength and volume blend is emphasized, as these settings directly impact the conversion's outcome. The video also covers pre-processing effects like cut noise and smooth volume, and post-processing effects including compression and creative options like chorus, reverb, and delay. The presenter demonstrates how to apply these settings using a clean studio recording and suggests saving presets for future use. The tutorial concludes with a comparison of the original and AI-converted audio, showcasing the improved presence and smoothness achievable with Kits AI's advanced settings.
Takeaways
- 🎧 Use the 'Remove Instrumentals' feature to separate vocals from the instrumentals in a full song.
- 🔊 Click 'Remove Reverb' to eliminate reverberation and 'Remove Backing Vocals' for ad libs or backup singers.
- 🎵 Utilize 'Pitch Shift' to adjust the pitch level of the audio to match the AI model's range.
- 🔧 Start with a 'Medium Conversion Strength' and adjust according to the needs of your audio.
- 📈 'Volume Blend' should be set high for a polished sound or low to preserve the dynamics of the original recording.
- 🔉 'Cut Noise' can help mask background noise, while 'Smooth Volume' is good for recordings with inconsistent volume levels.
- 🎛️ Apply 'Pre-processing Effects' before conversion to clean up the input audio.
- 🎚️ Use 'Post-processing Effects' like 'Compressor' for better volume control and presence.
- 🎛️ For creative effects, use 'Chorus', 'Reverb', and 'Delay' with caution, especially if you plan to use the audio in a DAW.
- 💾 Save your preferred settings as a preset for future use.
- 📈 Experiment with different settings to find the best conversion for your AI voices.
Q & A
What is the main topic of the instructional video?
-The main topic of the instructional video is about the advanced settings when converting voices with Kits AI.
What is the first advanced setting discussed in the video?
-The first advanced setting discussed is 'remove instrumentals,' which is useful for separating vocals from the instrumental in a full song.
How does the 'remove reverb and delay' feature help in cleaning up the audio?
-The 'remove reverb and delay' feature helps in cleaning up the audio by reducing the reverberation and delay effects that are common in vocal and song recordings.
What is the purpose of the 'remove backing vocals' feature?
-The 'remove backing vocals' feature is used to eliminate ad libs in hip-hop songs or backup singers from the audio.
How does the pitch shift tool help in audio conversion?
-The pitch shift tool is helpful when the audio to be converted is out of the range of the selected AI model. It allows users to adjust the pitch level to match the model without affecting the overall audio quality.
What are the two most important settings that affect the outcome of the conversion?
-The two most important settings that affect the outcome of the conversion are 'conversion strength' and 'volume blend.'
How does the conversion strength setting influence the AI voice conversion?
-The conversion strength setting determines how much the input audio is changed to sound like the AI voice. A higher setting will add more character to the AI voice, but it may also increase mispronunciation of certain words.
What is the role of volume blend in the conversion process?
-Volume blend determines the balance between the original audio levels and the AI voice. A lower model volume maintains the original audio levels, while a higher model volume smooths out the audio and makes the conversion sound more polished.
What are pre-processing effects and how do they help in cleaning up the input audio?
-Pre-processing effects are subtle changes applied to the input audio before conversion. They include cut noise, low/high shelf, and smooth volume, which help to reduce background noise, dial down certain frequencies, and even out volume levels.
What is the significance of using a compressor in post-processing effects?
-A compressor in post-processing effects helps to manage varied volumes and enhance the overall presence of the audio, making it more suitable for use in various applications.
Why is it recommended to start with a medium conversion strength and then adjust as needed?
-Starting with a medium conversion strength provides a balanced conversion that maintains the character of the AI voice without excessive mispronunciation. Adjustments can then be made based on the specific needs of the audio.
How can users save their preferred settings for future use?
-Users can save their preferred settings as a preset, allowing them to quickly apply the exact same settings in future audio conversions.
Outlines
🎙️ Advanced Settings for AI Voice Conversions
The video introduces viewers to the advanced settings available in Kits AI for converting voices. It explains the importance of using these settings to achieve the best possible conversion results. The first part focuses on removing instrumentals from audio, which is useful for full songs with vocals and other elements. It also discusses the removal of reverb, delay, and backing vocals to clean up the audio. The pit shift tool is highlighted for adjusting the pitch of the audio to match the AI model's capabilities. The conversion strength is emphasized as a key factor in how much the AI voice will be altered to match the input audio. The volume blend setting is also crucial, determining how the AI voice will blend with the original audio dynamics. The video concludes with a practical demonstration using the M strange Rock model for conversion and suggests starting with medium conversion strength and adjusting as needed.
🔊 Fine-Tuning Audio Conversion with Kits AI
This paragraph delves into the nuances of audio conversion using Kits AI, emphasizing the significance of volume blend and conversion strength. It provides a comparison between high and low model volumes and their impact on the final audio output. The paragraph also discusses pre-processing effects like cut noise and smooth volume to refine the input audio before conversion. Post-processing effects such as compression, chorus, reverb, and delay are introduced, with advice on using them judiciously, especially when the converted audio will be used in a larger project. The presenter demonstrates the application of these settings with an example audio clip, showing the difference between the original and the AI-converted version. The video concludes with a reminder to save preferred settings as a preset for future use and thanks the viewers for watching.
Mindmap
Keywords
💡Advanced Settings
💡Remove Instrumentals
💡Reverb and Delay
💡Pitch Shift
💡Conversion Strength
💡Volume Blend
💡Pre-processing Effects
💡Post-processing Effects
💡Dynamics
💡Mispronunciation
💡Preset
Highlights
Advanced settings in Kits AI can significantly improve AI voice conversions.
Removing instrumentals can help separate vocals from the background music.
Reverb and Delay can be reduced for cleaner audio conversion.
Back-up vocals or ad libs can be removed for a more focused vocal track.
Pit shift is a useful tool for adjusting audio pitch to match the AI model.
Conversion strength determines how much the AI voice changes the input audio.
High conversion strength can exaggerate certain sounds but may mispronounce words.
Medium conversion strength is recommended as a starting point.
Volume blend affects the smoothness and polish of the converted audio.
High model volume is suitable for recordings with varied audio levels.
Low volume blend preserves the dynamics of the original recording.
Pre-processing effects like cut noise and smooth volume can enhance audio quality.
Post-processing with a compressor can improve volume consistency.
Creative post-processing effects like chorus, reverb, and delay can be used for specific needs.
It's important to use post-processing effects that provide flexibility for further audio processing.
Saving presets allows for quick reuse of preferred settings.
The tutorial demonstrates the conversion process using the M strange Rock model.
AI conversion fills out the audio more compared to the original, especially with added reverb and delay.
The tutorial concludes with a clear demonstration of the advanced settings' impact on AI voice conversion.