Remove Vocals from a Song Using an AI Vocal Remover (PhonicMind)

Reuben Chng
5 Jul 202107:15

TLDRThis video tutorial demonstrates how to use PhonicMind, an AI vocal remover, to separate vocals from a song. It compares PhonicMind's capabilities with other tools like Spleeter and LALAL AI, highlighting its superior audio engine and deep learning techniques. The process includes uploading a song, waiting for the AI to analyze and split it into stems, and then downloading the desired audio tracks. The video also discusses the benefits of using PhonicMind over traditional audio plugins and the pricing structure for different conversion bundles.

Takeaways

  • 🎵 PhonicMind is an AI-based tool for removing vocals and creating stems from songs.
  • ⏱️ It takes up to a minute to split a song into four audio stems: vocals, drums, bass, and other instruments.
  • 🔊 The audio stems from PhonicMind can be combined to recreate the original song's sound.
  • 🆚 Compared to Spleeter, PhonicMind provides better results as it continues to learn and understand music through deep learning.
  • 🎶 The tool offers the ability to adjust the level of vocals, allowing for partial or complete removal.
  • 💾 Users can download the stems in various formats for different uses, such as karaoke or DJ-ing.
  • 🎚️ The process allows for muting specific tracks to create an acapella version or isolate individual elements like drums.
  • 💻 The stems can be imported into DAWs like Adobe Audition for further editing and playback.
  • 💰 PhonicMind is a paid service, with a single song conversion costing $3.99, but offers bundles for lower per-song costs.
  • 💡 The convenience and quality of PhonicMind's AI vocal removal are suggested to be worth the cost compared to manual methods.
  • 📢 The presenter encourages viewers to share their thoughts on PhonicMind's quality and their experiences with it.

Q & A

  • What is PhonicMind?

    -PhonicMind is an AI-based vocal remover and stems maker that can separate vocals, drums, bass, and other instruments from a song.

  • How does PhonicMind compare to other solutions like Spleeter and LALAL AI?

    -PhonicMind's audio engine is considered one of the best in the market, providing better separation of audio stems compared to Spleeter, which may silence certain musical elements it does not understand.

  • How long does it take for PhonicMind to process a song?

    -It takes up to a minute for PhonicMind to analyze and process a song before splitting it into four audio stems.

  • What audio formats are recommended for the best results with PhonicMind?

    -High-quality, uncompressed audio formats like .wav, .aiff, or .flac are recommended for the best results with PhonicMind.

  • Can you adjust the vocal levels after using PhonicMind?

    -Yes, you can adjust the vocal levels by turning them up or down after the initial removal.

  • What can you do with the separated audio stems from PhonicMind?

    -You can use the separated audio stems for various purposes like creating karaoke tracks, extracting acapellas, resampling, or DJ-ing.

  • How much does it cost to use PhonicMind for a full song conversion?

    -The cost for a single full song conversion is $3.99, but there are pro and extreme bundles available that can bring down the cost per song.

  • What is the difference between the single bundle and the pro or extreme bundles offered by PhonicMind?

    -The single bundle costs $3.99 per song, while the pro or extreme bundles offer a lower cost per song, making them more economical for frequent use.

  • Can you use conventional audio effects and plugins to achieve similar results as PhonicMind?

    -While conventional audio effects and plugins can be used to remove vocals, the results and quality are often not as good as those provided by an AI-based solution like PhonicMind.

  • What are some potential uses for the processed audio stems from PhonicMind?

    -Processed audio stems can be used for creative purposes such as remixing, creating mashups, or even for educational and analytical purposes to study the different elements of a song.

  • How does PhonicMind continue to improve its performance?

    -PhonicMind uses deep learning techniques and listens to music every day, which helps it understand music better and improve its performance over time.

Outlines

00:00

🎶 AI-Powered Vocal Removal with Phonic Mind

This paragraph introduces the topic of the video, which is the use of AI to remove vocals from a song. The tool being highlighted is Phonic Mind, an AI-based vocal remover and stems maker. The speaker shares their experience comparing Phonic Mind to other solutions like Spleeter and LALAL AI, praising Phonic Mind's superior audio engine. It is noted that Phonic Mind can split a song into four separate audio stems—vocals, drums, bass, and other instruments—in just a minute, and these stems can be recombined in a DAW to recreate the original song's sound. The script also touches on the limitations of Spleeter, which sometimes silences parts of the music it doesn't understand, whereas Phonic Mind continuously learns and improves through deep learning techniques. The speaker then proceeds to demonstrate how to upload a song to Phonic Mind and use its features to remove vocals and split the song into stems.

05:03

💰 Phonic Mind's Pricing and Advantages

The second paragraph discusses the pricing model of Phonic Mind, emphasizing that it is a paid solution and users will be charged for full song conversions. The speaker suggests that once a song is converted, it can be downloaded multiple times from the user's account. The pricing is detailed, with a single bundle costing $3.99 for one full song conversion, but the speaker recommends the pro or extreme bundle to reduce the cost per song to $1.99. The paragraph also addresses the value proposition of using Phonic Mind over conventional audio effects and plugins, suggesting that the time saved and the superior quality of the AI-based tool justify the cost. The speaker invites viewers to share their thoughts on Phonic Mind's quality and experiences in the comments section and concludes the video with a farewell.

Mindmap

Keywords

💡AI Vocal Remover

An AI Vocal Remover is a software tool that utilizes artificial intelligence to separate vocals from a song's instrumental track. In the context of the video, PhonicMind is an AI-based vocal remover that can split a song into different audio stems, including vocals, drums, bass, and other instruments. The video demonstrates how PhonicMind's AI engine is superior to other solutions because it continues to learn and improve its music understanding capabilities.

💡PhonicMind

PhonicMind is the specific AI vocal remover tool highlighted in the video. It is described as having one of the best audio engines in the market for splitting songs into separate audio stems. The video creator uses PhonicMind to demonstrate the process of removing vocals from a song and the quality of the分离后的结果.

💡Stems Maker

A Stems Maker is a tool that can separate a song into its constituent parts or 'stems'. In the video, PhonicMind is referred to as an AI-based stems maker, indicating its ability to isolate different elements of a song such as vocals, drums, bass, and other instruments, which can then be manipulated or used independently.

💡Spleeter

Spleeter is mentioned as one of the other solutions the video creator has tried for removing vocals from songs. It is used for comparison to illustrate that PhonicMind's audio engine provides better results, as Spleeter may not accurately represent certain musical elements and may silence parts of the audio.

💡LALAL AI

LALAL AI is another tool compared alongside PhonicMind in the video. It is part of the discussion to show that the video creator has tried various options and found PhonicMind to be superior in terms of audio quality and the accuracy of separating vocals from instrumentals.

💡Audio Stems

Audio Stems refer to the individual isolated tracks of a song, such as vocals, drums, bass, and other instruments. The video explains that PhonicMind can split a song into four separate audio stems, which can then be recombined in a DAW to recreate the original song or manipulated for various purposes like karaoke or DJ-ing.

💡DAW

A DAW, or Digital Audio Workstation, is software used for recording, editing, and producing audio files. In the video, the creator uploads the audio stems from PhonicMind into a DAW (Adobe Audition) to demonstrate how the original song can be recreated or how individual stems can be used for different purposes.

💡Karaoke Track

A Karaoke Track is a version of a song that has the vocals removed, allowing someone to sing along. The video creator shows how to use PhonicMind to create a karaoke track by muting all the non-vocal stems and downloading just the vocal track.

💡Acapella

An Acapella is a performance of a song or instrumental music without instrumental accompaniment. In the video, the term is used to describe the isolated vocal track that can be extracted using PhonicMind, which can then be used for various creative purposes.

💡Multitrack

A Multitrack refers to a recording that has been split into several tracks, allowing for separate manipulation of each element. The video creator discusses how PhonicMind provides multitrack capabilities, enabling users to download and work with individual stems of a song.

💡Pricing

The term 'Pricing' in the video refers to the cost of using PhonicMind's services. The video creator discusses the different bundles available and their respective costs, emphasizing the value of the service compared to the time and effort required to achieve similar results with conventional audio effects and plugins.

Highlights

Showcasing how to remove vocals from a song using AI with PhonicMind.

PhonicMind is an AI-based vocal remover and stems maker.

Comparison with other solutions like Spleeter and LALAL AI.

PhonicMind's audio engine is considered one of the best in the market.

The tool can split a song into four separate audio stems in up to a minute.

The stems include vocals, drums, bass, and other instruments.

PhonicMind's output can be played back together to hear the original sound.

Spleeter's output does not sound exactly like the original song.

PhonicMind continues to listen to music daily using deep learning.

The AI understands music better and improves over time.

Uploading a song into PhonicMind and removing vocals.

PhonicMind allows adjusting the level of vocals in the mix.

The ability to download vocals, drums, bass, and other stems separately.

Creating a karaoke track or an acapella by muting other tracks.

PhonicMind requires payment for full song conversion.

Once converted, the song can be downloaded multiple times.

Pricing options for PhonicMind, with single and bundle options.

The value of using PhonicMind over conventional audio effects and plugins.

The final processed song matches the original closely.

Ability to manipulate the audio stems for various purposes like DJ-ing.

Invitation for feedback on PhonicMind's quality and user experience.