Generate Music & Sound Effects with AI! | Stability AI’s NEW Stable Audio Review

The Prince of Prompting
24 Sept 2023 | 08:27

TLDR: The video explores the capabilities of Stable Audio, an AI tool for generating music and sound effects. It provides an overview of the website, pricing, and user guide, and shares examples of generated sound effects and music. The reviewer finds the tool particularly effective for music production, though sound effects generation has room for improvement. They recommend Stable Audio for musicians and express excitement for future advancements.

Takeaways

  • 🎵 Stable Audio can generate up to 90 seconds of music and sound effects.
  • 🌐 The website is simple, with main sections including Generate, Pricing, User Guide, and Licensing.
  • 💰 The pricing model is considered reasonable by the reviewer.
  • 📚 The User Guide provides examples and information on prompts, models, and training data.
  • 🔍 Sound effect generation may require multiple attempts to achieve satisfactory results.
  • 🎶 Music generation appears to be the tool's strong suit, with better results out of the box.
  • 💡 Each generation, regardless of duration, consumes one generative credit, making longer durations more cost-effective.
  • 🐉 Unique sound effects like a dragon's roar can be generated, but may have some noise and imperfections.
  • 🎹 Classical piano and orchestral music seem to be less effectively rendered compared to other genres.
  • 🚀 The tool is most suitable for musicians and may become more valuable as users learn its intricacies over time.
  • 📈 The reviewer plans to create a comprehensive guide for using Stable Audio as they continue exploring its capabilities.

Q & A

  • What is the main topic of the video?

    -The main topic of the video is an overview of AI-generated music and sound effects using Stable Audio, including its capabilities, pricing model, and examples of usage.

  • What are the capabilities of Stable Audio?

    -Stable Audio can create up to 90 seconds of music and sound effects based on text prompts, and it offers a variety of models and durations for generation.

  • How does the pricing model work for Stable Audio?

    -The pricing model for Stable Audio is based on generative credits, with each generation, regardless of duration, consuming one credit. It is noted as being reasonable in the video.

  • What kind of information is provided in the user guide of Stable Audio?

    -The user guide provides information about prompts, model, training, data, and licensing, as well as examples ranging from full instrumentals to individual stems and sound effects.

  • What limitations did the video creator notice with sound effect generation?

    -The video creator noticed that while some sound effects like a rainy ambiance were successful, others like a dragon's roar and classical piano sounds were not as accurate or had unwanted noise.

  • How did the video creator find the music generation capabilities of Stable Audio?

    -The video creator found the music generation capabilities to be more effective than sound effects, with examples like a jazz tune and Western lo-fi music being particularly successful.

  • What was the video creator's overall verdict on Stable Audio?

    -The video creator concluded that Stable Audio is a fun tool with powerful results, especially for music generation, but may not be the best option for sound effects compared to other stock audio providers.

  • What advice does the video creator give for users interested in trying Stable Audio?

    -The video creator advises musicians to check out the tool and suggests that they will continue learning and eventually create a comprehensive prompting guide as Stable Audio progresses.

  • What was the video creator's most memorable musical piece generated with Stable Audio?

    -The most memorable musical piece for the video creator was a Western lo-fi style song that felt like a fusion between Western and Middle Eastern or Indian lo-fi music.

  • How long does it take to generate a sound effect or music piece with Stable Audio?

    -It takes about 15 to 20 seconds to generate a sound effect or music piece with Stable Audio, regardless of the chosen duration.

  • What is the video creator's strategy for using Stable Audio efficiently?

    -The video creator suggests using the maximum duration for each generation since it consumes the same amount of credits and gives more material to work with.

Outlines

00:00

🎵 Introduction to AI-Generated Music with Stable Audio

This paragraph introduces the concept of AI-generated music and sound effects using Stable Audio. The speaker discusses the capabilities of the tool, which can create up to 90 seconds of audio, and provides an overview of the website's layout, including the generate section, pricing model, user guide, and examples. The speaker also shares their initial impressions and expectations for using the tool, mentioning the potential for exploring more intricate prompts over time. The focus is on exploring sound effects first, with a tip on the optimal use of generative credits, followed by a range of examples from rainy ambiance to a dragon's roar and an underwater city with whale songs.

05:02

🎶 Diving Deeper into Music Generation

In this paragraph, the speaker transitions from sound effects to music generation, noting the tool's apparent focus on instrumentals and stems rather than classical or orchestral music. The speaker shares examples from the user guide, such as beachy trance and post-rock guitars, before trying out their own creative prompts, such as a Western lo-fi tune and epic cinematic battle music for a space-age setting. The speaker reflects on the tool's strengths and weaknesses, ultimately concluding that while it is fun to use and powerful for music generation, it may not yet surpass existing stock audio providers for sound effects. The speaker expresses excitement for the tool's potential and plans to create a comprehensive guide as they continue learning and using Stable Audio.

Keywords

💡AI Generated Music

AI Generated Music refers to the creation of musical compositions using artificial intelligence algorithms. In the context of the video, it highlights the capability of Stable Audio to produce music by interpreting text prompts provided by the user. The video explores the effectiveness of this technology in creating various types of music, such as jazz tunes and classical piano pieces, and evaluates its potential for music production.

💡Sound Effects

Sound Effects are audio elements used to enhance the auditory experience of a production, often by simulating real-world sounds or creating imaginary ones. In the video, the reviewer tests the ability of Stable Audio to generate sound effects, such as 'rainy ambiance' and 'dragon roaring with wing flaps', and assesses the quality and usability of the generated sounds in comparison to traditional stock audio.

💡Stable Audio

Stable Audio is a platform that utilizes AI to generate music and sound effects based on user-provided text prompts. It is the central tool discussed in the video, with the reviewer exploring its features, pricing model, and user guide. The video provides an overview of the website's interface and delves into its effectiveness through various examples.

💡Pricing Model

The Pricing Model refers to the structure by which a service charges its users, typically based on usage or subscription. In the context of the video, the reviewer comments on the reasonableness of Stable Audio's pricing model, which charges users based on the number of generative credits consumed for creating music or sound effects.

💡User Guide

A User Guide is a document or resource that provides instructions and information to help users understand and effectively use a product or service. In the video, the reviewer refers to Stable Audio's user guide, which includes examples and information about prompts, models, training, data, and licensing, to help users navigate the platform and optimize their use of the AI-generated music and sound effects.

💡Text Prompt

A Text Prompt is a piece of text that serves as a starting point or input for an AI system to generate a response or output. In the context of the video, text prompts are crucial for the Stable Audio platform, as they dictate the type of music or sound effects that the AI will attempt to create.

💡Duration

Duration refers to the length of time that something lasts or is intended to last. In the video, the reviewer discusses the option to change the duration of the generated music or sound effects on Stable Audio, noting that each generation takes the same amount of time regardless of the set duration.

💡Generative Credit

A Generative Credit is the unit Stable Audio uses to meter its AI generation service. Each generation consumes one credit regardless of duration, so the video recommends generating at the maximum duration to get the most audio out of every credit.
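
The credit arithmetic can be made concrete with a small sketch. It assumes the flat cost described in the video (one credit per generation, whatever the duration) and the 90-second maximum; the 15- and 45-second durations and the function name `seconds_per_credit` are hypothetical, chosen only for illustration.

```python
def seconds_per_credit(duration_seconds: float, credits_per_generation: int = 1) -> float:
    """Seconds of audio obtained per credit, assuming a flat cost per generation."""
    if credits_per_generation <= 0:
        raise ValueError("credits_per_generation must be positive")
    return duration_seconds / credits_per_generation


# Hypothetical shorter durations compared against the 90-second maximum.
for duration in (15, 45, 90):
    value = seconds_per_credit(duration)
    print(f"{duration:>2} s clip: {value:.0f} s of audio per credit")
```

Under these assumptions, a 90-second generation yields six times as much audio per credit as a 15-second one, which is why the reviewer always picks the maximum duration and trims afterwards.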

💡Instrumentals

Instrumentals are musical compositions that are created without lyrics or vocals, focusing solely on the harmony and melody produced by musical instruments. The video discusses the user's experience with generating instrumentals on Stable Audio, noting the platform's strengths in creating certain types of instrumental music.

💡Classical Piano

Classical Piano refers to a style of piano music that is typically associated with the traditional compositions of the Classical period, characterized by its structured form and expressive melodies. In the video, the reviewer's attempts to generate classical piano music with Stable Audio are met with mixed results, indicating that the AI may struggle with certain complex or nuanced musical styles.

💡Cinematic Battle Music

Cinematic Battle Music is a type of music composed to accompany intense action sequences or dramatic conflicts in films, often characterized by its grand and epic sound. The video explores the potential of Stable Audio to generate music fitting for space battles, using a string-heavy orchestral approach to create a sense of grandeur and tension.

Highlights

Stable Audio can generate AI music and sound effects up to 90 seconds long.

The website is simple but effective with a few main sections including a generate section, pricing model, user guide, and information about prompts, model, training, data, and licensing.

The pricing model of Stable Audio is considered reasonable.

The user guide provides examples ranging from full instrumentals to individual stems and sound effects.

The tool's capabilities include generating sound effects like a rainy ambiance and a thunderstorm with thunderclaps and rain on a metal roof.

Stable Audio can produce unique and hard-to-find sounds such as a dragon roaring with wing flaps.

The tool can generate creative sound effects like an underwater city with distant whale songs.

For music generation, Stable Audio excels at creating jazz tunes with prominent saxophone.

The tool struggles with classical piano, producing less satisfying results.

Stable Audio is better suited for music production, with a focus on instrumentals and stems.

The tool can generate unique and interesting music styles like a Western lo-fi track.

Epic cinematic battle music for space-age themes can be generated, although the results may vary.

The tool is fun to use and can produce powerful results in music generation.

For sound effects, it might be more efficient to use existing stock audio providers rather than Stable Audio.

The reviewer plans to continue learning and eventually create a comprehensive guide for using Stable Audio.

The verdict is that Stable Audio has value for musicians and is worth exploring for music production.