Is Stable Audio AI The ULTIMATE Sample Generator???

Simulation Beats
24 Sept 202314:24

TLDRThe video script provides an in-depth review of Stable Audio, an AI text-to-music product. The reviewer shares their experience with various AI music generation tools and highlights Stable Audio's ability to create high-quality, studio-grade audio stems. They discuss the product's user interface, the importance of detailed prompts, and the capability to generate individual instrument tracks. Despite some limitations, such as the AI not fully comprehending 'no drums' requests or key specifications, the reviewer is overall impressed with the audio quality and the product's potential in the future of AI-generated music.

Takeaways

  • 🎵 Stable audio is a popular AI text-to-music product that offers high-quality audio outputs.
  • 🚀 The product allows users to generate studio-grade audio samples based on text prompts.
  • 🌟 Users can create stems, which are individual instrument tracks, for royalty-free use.
  • 📝 The interface is user-friendly, requiring only text input to generate music.
  • 🎶 The AI can produce a variety of genres and styles, including trance and meditation music.
  • 💰 There are different subscription tiers, with the free version offering 20 monthly track generations up to 45 seconds each.
  • 📈 The professional version costs $12 a month and allows for 500 track generations up to 90 seconds.
  • 🛠️ Users can request specific instruments and genres, but the AI does not always perfectly follow complex prompts.
  • 🔄 The AI struggles with certain aspects, such as excluding drums or adhering to a specific key.
  • 🔗 The product is seen as a significant advancement in AI-generated music, but not without its limitations.
  • 💡 The user's experience is a mix of excitement for the technology's potential and frustration with current limitations.

Q & A

  • What is the main topic of the transcript?

    -The main topic of the transcript is the user's experience and evaluation of Stable Audio, an AI text-to-music product.

  • How does the user describe the current trend of AI text-to-music products?

    -The user describes the current trend of AI text-to-music products as 'blowing up', indicating a rapid growth in popularity and usage.

  • What are the key features of Stable Audio that the user highlights?

    -The key features highlighted by the user include high-quality audio output, the ability to create stems (individual instrument tracks), and the generation of royalty-free samples.

  • What are the different pricing tiers offered by Stable Audio?

    -Stable Audio offers a free version with 20 monthly track generations up to 45 seconds each, and a professional version for $12 a month which allows for 500 track generations of up to 90 seconds.

  • How does the user feel about the audio quality of Stable Audio?

    -The user believes the audio quality is really good, but not quite at the level of studio quality. They note that it's close, but there's still room for improvement.

  • What issues did the user encounter with the comprehension of prompts by Stable Audio?

    -The user encountered issues where the AI did not fully comprehend the prompts, such as including drums even when specified to exclude them, and not accurately considering the key in the prompt.

  • How does the user utilize the ability to create stems with Stable Audio?

    -The user utilizes the ability to create stems to generate individual instrument tracks, such as a solo piano, which can be used to create a variety of royalty-free samples and for mixing with other genres.

  • What is the user's overall verdict on Stable Audio?

    -The user considers Stable Audio to be one of the best AI products for text-to-music currently available, but they also note that it's not significantly better than other products on the market.

  • How does the user handle the limitations of the free version of Stable Audio?

    -The user acknowledges the limitations of the free version, such as the number of track generations and duration, but finds it sufficient for the purpose of the video and their testing.

  • What is the user's process for evaluating the generated music samples?

    -The user's process involves testing different prompts, paying attention to the audio quality, checking if the AI follows the prompts accurately, and assessing the viability of using the generated samples in actual music projects.

  • What suggestions does the user have for future improvements in AI text-to-music products?

    -The user suggests that future improvements should focus on better comprehension of prompts, especially in terms of excluding certain elements like drums and accurately considering the specified key.

Outlines

00:00

🎵 Introduction to Stable Audio AI

The speaker introduces Stable Audio AI, a text-to-music product that has gained popularity. They discuss their experience with various AI text-to-music products and express their excitement about trying Stable Audio, which is noted for its high-quality audio output. The speaker explains that Stable Audio functions like other AI products, requiring a text prompt to generate an AI sample. They also highlight the unique feature of creating stems, allowing users to isolate specific instruments for royalty-free samples. The speaker admits they have not tested the product yet and will be learning alongside their audience.

05:00

🚀 Exploring Stable Audio's Features

The speaker delves into the specifics of Stable Audio's capabilities, discussing the free and professional versions of the product, their limitations in track generations, and the duration of the generated tracks. They note the importance of adding detailed prompts, such as specific genres and instruments, to achieve better results. The speaker shares their initial attempts at creating music with Stable Audio, experimenting with different genres and styles, and comments on the quality of the output. They also mention the ability to generate individual stems with a single instrument or a group of instruments, emphasizing the potential of Stable Audio as a sample generator.

10:01

📊 Evaluation and Comparison of Stable Audio

In the final paragraph, the speaker evaluates their experience with Stable Audio, comparing it to other AI text-to-music products on the market. They discuss the audio quality, the AI's comprehension of prompts, and the issues they encountered, such as the AI's inability to exclude drums and the lack of consideration for the specified key. The speaker also shares their process of downloading and sampling the generated audio, their attempts to align the tracks in the same key, and the challenges they faced. They conclude that while Stable Audio is a significant advancement in AI text-to-music technology, there is still room for improvement.

Mindmap

Keywords

💡AI text to music

AI text to music refers to the use of artificial intelligence to generate music based on textual input provided by users. In the context of the video, it is the core technology behind the product being reviewed, which takes user prompts and creates corresponding music samples. The video explores the capabilities and limitations of this technology as implemented in the Stable Audio product.

💡Stable Audio

Stable Audio is an AI-based product that stands out in the video as being different from other AI text to music offerings. It is highlighted for its purported high-quality audio output and its unique feature of generating individual stems, which are separate audio tracks for different instruments. The video's creator is particularly interested in testing this product and shares their first-hand experience with it.

💡Stems

In music production, stems refer to individual audio tracks that have been separated from a mixed recording. Each stem typically contains the isolated performance of a single instrument or group of instruments. In the context of the video, the ability of Stable Audio to create stems is a significant feature, allowing users to have more control over their music by being able to manipulate individual elements of the generated tracks.

💡Royalty-free samples

Royalty-free samples are audio clips or loops that can be used in music production without having to pay ongoing royalties to the original creator or copyright holder. These samples are typically available for a one-time fee or sometimes for free, and they offer creators a wide range of sounds to use in their projects without legal or financial constraints. In the video, the creator is impressed by Stable Audio's ability to generate such samples, which can be a significant advantage for musicians and producers.

💡Pro version

The Pro version of a software or service typically refers to a premium subscription plan that offers additional features, capabilities, or resources beyond the basic or free version. In the context of the video, the Pro version of Stable Audio provides users with more track generations and longer track durations compared to the free version, catering to the needs of more serious or professional users.

💡Prompts

In the context of AI text to music, prompts are the textual inputs provided by users to guide the AI in generating music. These prompts can include specific genres, descriptive phrases, or instrument choices, and they serve as the foundation for the AI to create the desired audio output. The video emphasizes the importance of detailed and clear prompts to achieve better results with Stable Audio.

💡BPM (Beats Per Minute)

BPM, or Beats Per Minute, is a measure of the tempo or speed of a piece of music, indicating the number of beats that occur in one minute. It is a crucial aspect of music production and performance, as it helps to set the pace and rhythm of a track. In the video, the creator specifies a BPM of 72 for some of their Stable Audio prompts, aiming to generate music at a particular tempo suitable for certain music styles.

💡Audio quality

Audio quality refers to the clarity, depth, and overall sonic characteristics of a recording or generated sound. High-quality audio is typically characterized by a rich, full sound with clear and distinct frequencies, while lower quality audio may exhibit distortion, noise, or a limited frequency range. In the video, the creator discusses the audio quality of the samples produced by Stable Audio, noting that it is close to studio quality, though not perfect.

💡Comprehension of prompts

Comprehension of prompts in the context of AI text to music refers to the AI's ability to accurately interpret and respond to the textual input provided by users. A high level of comprehension would mean that the AI can generate music that closely matches the user's intended style, mood, or instrumentation as described in the prompt. The video highlights this as an important aspect of AI music generation products and notes that Stable Audio performs well in this regard.

💡Key

In music, the key refers to the tonal center or main note around which a piece of music is structured. It defines the scale used in the composition and gives the music its overall mood or feeling. In the context of the video, the creator expresses disappointment that Stable Audio does not always take the specified key into account when generating music, which can lead to samples that are not in the desired tonal center.

💡Sampling

Sampling in music production involves the use of existing audio recordings, typically short snippets or loops, as the basis for creating new music. These samples can be manipulated, layered, and combined to produce a fresh sound or track. In the video, the creator discusses the potential of using Stable Audio to generate samples for music production, particularly highlighting the fun and creative possibilities it offers.

Highlights

Stable audio AI is gaining popularity in the AI text to music field.

The product offers high-quality audio output, claimed to be studio-grade.

Users can create stems, isolating specific instruments like a solo piano.

The AI generates music based on text prompts entered by the user.

The free version allows 20 monthly track generations up to 45 seconds each.

The professional version offers 500 track generations of up to 90 seconds.

The product emphasizes the importance of adding detailed prompts for better results.

The AI can generate a variety of music styles, such as trance and meditation.

Users can request specific instruments and genres in their prompts.

The AI's ability to generate individual stems is a significant feature.

The product allows for the creation of royalty-free samples.

The AI sometimes does not follow prompts accurately, such as including drums when instructed not to.

The AI's comprehension of prompts is considered strong, but not perfect.

The AI's output is not always in the same key as specified in the prompt.

The AI's ability to generate music is seen as a big step in the future of AI text and music.

The product is considered one of the best AI products for text to music currently available.

Despite some limitations, the product is praised for its audio quality and prompt comprehension.

The user's experience with the product is shared in a video, providing a real-time review.