AI News: Amazing New Tools You Can Use NOW!

Matt Wolfe
14 Jun 202433:20

TLDRThis week in AI brought a plethora of creative tools, from Luma AI's Dream Machine competing with Sora and Veo in the AI video generation space, to updates on Stable Diffusion 3.0 and Apple's AI integrations across its devices. Despite some initial frustrations with wait times and errors, Dream Machine showed promise in image-to-video generation. Stable Diffusion 3.0's release and the custom Leonardo Phoenix model by Mid Journey demonstrated advancements in image quality and prompt adherence. Additionally, Apple's AI unveiling at WWDC included AI-powered features for iOS, iPad, and Mac, emphasizing privacy and on-device intelligence. The video also touched on Adobe's terms of service clarification and the integration of OpenAI's Chat GPT in Siri, sparking controversy among tech giants like Elon Musk.

Takeaways

  • ๐Ÿ˜ฒ Luma AI released 'Dream Machine', a video generation tool that faces competition from other AI video tools like Sora, Veo, and Runway.
  • ๐Ÿ” Users initially experienced long wait times for video generation with Dream Machine, but the service has since improved its speed.
  • ๐Ÿ“น Dream Machine has shown potential in image-to-video generation, excelling in scenarios that involve transforming still images into short video clips.
  • ๐ŸŽจ Other AI image generation tools like Pika have also updated their models, but specifics on improvements remain unclear.
  • ๐Ÿ“š Stable Diffusion 3 by Stability AI has been released, offering improved text-to-image capabilities and is available for download on Hugging Face.
  • ๐Ÿ›  Users can now experiment with Stable Diffusion 3 through Hugging Face Spaces, although generating images may require waiting due to high demand.
  • ๐ŸŒŸ Leonardo AI introduced the 'Phoenix' model, a custom foundational model that is not based on Stable Diffusion and offers enhanced features like prompt adherence and coherent text in images.
  • ๐ŸŽผ Mid Journey AI introduced 'Model Personalization', allowing users to create images tailored to their preferences based on their past voting history on generated images.
  • ๐ŸŽต Soono unveiled a new feature that extends short audio clips into full songs, offering users a creative way to generate music with AI.
  • ๐Ÿ“ Adobe is revising its terms of service following concerns about AI training on customer work, clarifying that they will not use customer content for AI training without permission.
  • ๐ŸŽ Apple's WWDC event highlighted the integration of AI across all their devices, introducing new features like Image Playground for image generation, personalized Emoji creation, and enhanced Siri capabilities.

Q & A

  • What is Luma AI's Dream Machine and how does it compare to other AI video tools like Sora and Veo?

    -Luma AI's Dream Machine is a video generation tool that competes with other AI video tools such as Sora, Veo, Cling, and Pika. It allows users to generate videos based on text prompts. While some claim it is on par with Sora, the script suggests that it may not have fully reached that level yet, though it shows promise in certain scenarios.

  • What issues were experienced during the initial use of Luma AI's Dream Machine?

    -During the initial use, the Dream Machine had long wait times, with one request taking 7 hours to start after being queued due to high demand. Additionally, there were instances of video generation failures, resulting in error messages without any output.

  • How has Luma AI addressed the long wait times for video generation?

    -The script suggests that Luma AI has scaled up their service to reduce wait times, making the process faster and more efficient for users.

  • What are some of the limitations observed in Luma AI's Dream Machine video generation?

    -Luma AI's Dream Machine has limitations such as generating videos where elements from the prompt do not appear correctly in the video, like a wolf that looks three-legged or a teddy bear that morphs into an unrelated object.

  • What is the current status of Luma AI's Dream Machine in terms of cost and availability?

    -As of the script's recording, Luma AI's Dream Machine is in research preview and offers 30 free generations per month. After the free limit, the cost is approximately 25 cents per video generation.

  • What updates did Pika make to their image to video model, and how does it compare to Luma AI's Dream Machine?

    -Pika made updates to their image to video model, but specifics were not provided in the script. It is suggested that Dream Machine might be performing slightly better in image to video generation, but users are encouraged to try both tools as alternatives.

  • What is the significance of the release of Stable Diffusion 3 by Stability AI?

    -The release of Stable Diffusion 3 is significant as it provides improved capabilities in AI image generation. The model is now available for download and use, offering better text-to-image generation compared to its predecessors.

  • How does Stable Diffusion 3 perform in generating images from simple prompts?

    -Stable Diffusion 3 can generate images from simple prompts, but it appears to perform better when given more detailed and advanced prompts, suggesting that 'prompt engineering' is still an important skill for achieving quality results.

  • What is Leonardo Phoenix and how does it differ from Stable Diffusion 3?

    -Leonardo Phoenix is a new custom foundational model developed by Leonardo, an AI company. Unlike Stable Diffusion 3, which is based on the Stable Diffusion model, Leonardo Phoenix is trained specifically for Leonardo and offers enhanced prompt adherence, coherent text in images, and superior image quality.

  • What new features did Mid Journey introduce with their model personalization?

    -Mid Journey introduced a feature called Model Personalization, which allows users to create images tailored to their preferences based on their past voting history on images. This personalization code helps the model generate images closer to what the user likes.

  • What is the purpose of the new feature 'Gen Type' by Google Labs?

    -Gen Type by Google Labs is a tool that generates letters or text in various styles as specified by the user. It allows for creative typography and can be used for designing custom alphabets or text in styles like electronic circuitry.

  • What updates did Adobe announce regarding their AI tools and terms of service?

    -Adobe announced that they are overhauling their terms of service to clarify that they will not train AI on their customers' work. This was in response to concerns raised by a previous terms of service update that suggested otherwise.

  • What are some of the AI features Apple introduced during their WWDC event?

    -Apple introduced several AI features during their WWDC event, including AI-powered text summarization, smart replies, image generation with Image Playground, personalized emojis with Gen Emoji, and updates to Siri for on-device AI processing and contextual awareness.

  • What is the collaboration between Apple and OpenAI, and how will it affect Siri?

    -Apple and OpenAI have partnered to integrate Chat GPT with Siri. When Siri encounters a question it cannot answer as effectively, it will ask the user if it can send the question to Chat GPT for a better response. This collaboration aims to enhance Siri's capabilities without compromising user privacy.

  • What was Elon Musk's reaction to Apple's integration with OpenAI, and what was the outcome?

    -Elon Musk tweeted that if Apple integrates OpenAI at the OS level, then Apple devices would be banned at his companies due to security concerns. However, Apple clarified that OpenAI operates separately and user data is not shared without explicit permission. Musk later dropped a lawsuit against OpenAI regarding a breach of contract and fiduciary duty.

  • What is the significance of the new open-source model Quinn 2, and how does it compare to other models like LLaMA 3?

    -Quinn 2 is a new open-source AI model that outperforms other models like LLaMA 3 in various benchmarks. Despite having fewer parameters than the previous Quinn model, it achieves higher scores in human eval, C eval, and 2 cmml, indicating superior performance.

  • What incident involved a photographer being disqualified from an AI image contest for using a real photo?

    -A photographer named Miles Estay was disqualified from an AI image contest after winning with a real photo. The incident highlights the ongoing debate about the value of human creativity versus AI-generated art in contests.

Outlines

00:00

๐ŸŽจ AI Video Tools and Creative Experiments

The script discusses the excitement of new AI video tools like Luma AI's Dream Machine, which is a competitor to other AI video generators. The author shares experiences with the platform, including the initial frustration with long wait times and generation errors. However, after learning from the AI community on Twitter, the author finds success with image-to-video generation, showcasing examples and noting the tool's potential despite some inconsistencies in character morphing and realism.

05:01

๐Ÿ–ผ๏ธ Advancements in AI Image Generation

This paragraph delves into the updates in AI image generation, particularly the release of stable diffusion 3 by Stability AI, which has been made available for public use. The author tests the model with various prompts, noting the need for detailed prompts to achieve better results. The paragraph also mentions the new Leonardo Phoenix model by Leonardo, which is not based on stable diffusion and offers enhanced features. The author, as an adviser for Leonardo, tests the model and finds the image quality impressive.

10:02

๐ŸŽผ AI Music Creation and Adobe's AI Integration

The script introduces AI music creation capabilities, with a focus on Sunno's feature that extends short musical pieces into full songs. The author shares personal experiences with the tool, demonstrating its ability to generate lyrics and music based on simple input. Additionally, the paragraph covers Adobe's update to its terms of service, which initially caused concern among users regarding AI training on their work, but Adobe later clarified that they will not use customer work for AI training without permission.

15:03

๐Ÿ“ฑ Apple's AI Integration and Industry Impact

The author summarizes Apple's WWDC event, highlighting the company's commitment to integrating AI across all its devices and services. Apple's AI will offer features like text summarization, smart replies, and image recognition. The introduction of Image Playground for image generation, Gen Emoji for custom emoji creation, and updates to Siri and Apple Cloud are also discussed. The script touches on the industry response to Apple's AI announcements, including a rise in Apple's market cap and Elon Musk's criticism of Apple's partnership with OpenAI.

20:03

๐Ÿค– OpenAI's Updates and AI Model Developments

This paragraph covers OpenAI's hiring of new executives, the discontinuation of custom GPTs in Microsoft's CoPilot, and the introduction of Quinn 2, a new open-source AI model that outperforms its predecessors and other models like LLM 3 in various benchmarks. The author also mentions a case of a photographer disqualified from an AI image contest for using a real photo, emphasizing the ongoing debate about the value of human creativity versus AI-generated art.

25:05

๐ŸŒ AI News Curation and Community Engagement

The script concludes with a call to action for viewers to stay informed about AI developments through the author's AI news page and newsletter. The author promotes the benefits of subscribing, such as access to the AI income database, and highlights the importance of community engagement through likes, subscriptions, and participation in giveaways. The author expresses gratitude for the audience's interest and support.

Mindmap

Keywords

๐Ÿ’กAI News

AI News refers to the latest developments and updates in the field of artificial intelligence. In the context of the video, it serves as an introduction to the various AI tools and updates that are currently available for use, emphasizing the rapid pace of innovation in this area.

๐Ÿ’กDream Machine

The Dream Machine is an AI video generation tool developed by Luma AI, which competes with other similar AI tools like Sora and Veo. It is designed to create videos based on textual prompts, showcasing the advancement in AI's ability to understand and visualize concepts.

๐Ÿ’กStable Diffusion 3

Stable Diffusion 3 is an AI image generation model developed by Stability AI. It is significant because it has recently become available for public use, allowing users to generate images with improved text-to-image capabilities, as demonstrated in the video with various prompts.

๐Ÿ’กImage to Video

Image to video refers to the process of converting still images into video format, often with added animations or transitions. In the script, it is highlighted as a strength of the Luma AI's Dream Machine, especially when starting with a provided thumbnail or image.

๐Ÿ’กLeonardo Phoenix

Leonardo Phoenix is a new custom AI model developed by Leonardo, an AI company. It is distinguished by its enhanced prompt adherence and improved image quality, as opposed to previous models that were based on Stable Diffusion.

๐Ÿ’กMid Journey

Mid Journey is an AI platform that has introduced a feature called Model Personalization. This allows the AI to learn from a user's preferences in image ranking, tailoring future image generations to align more closely with the user's tastes.

๐Ÿ’กGen Type

Gen Type is a tool developed by Google Labs that generates letters or text in various styles, such as electronic circuitry. It demonstrates the application of AI in creative typography and design.

๐Ÿ’กSoono

Soono is an AI platform that has introduced a feature allowing users to create songs by uploading or recording audio, which the AI then uses to generate music and lyrics. It exemplifies the application of AI in the music composition process.

๐Ÿ’กApple Intelligent

Apple Intelligent refers to the integration of AI across Apple's ecosystem, including iOS, iPad, and Mac devices. The video discusses how Apple is leveraging AI for various functionalities like email summarization, smart replies, and photo editing.

๐Ÿ’กImage Playground

Image Playground is an image generation feature introduced by Apple. It allows users to create animated, illustrative, and sketch-style images, marking Apple's entry into the AI-generated art space with a focus on non-realistic styles.

๐Ÿ’กChat GPT

Chat GPT is an AI chatbot developed by OpenAI that is known for its conversational abilities. In the context of the video, it is highlighted as being integrated with Apple's Siri, offering an option for users to receive answers generated by Chat GPT when appropriate.

Highlights

Introduction of various creative AI tools available for use.

Luma AI's release of Dream Machine as a competitor to other AI video tools.

Dream Machine's initial high demand leading to long wait times for video generation.

Personal experience with Dream Machine's video generation, including successes and frustrations.

Comparison of Dream Machine's capabilities with other AI video tools like Sora.

Discovery of optimal use cases for Dream Machine in image to video generation.

Announcement of free access to 30 video generations per month during Luma Labs' research preview.

Updates to Pika's image to video model and its comparison with Dream Machine.

Release of Stable Diffusion 3 by Stability AI and its availability for public use.

Testing Stable Diffusion 3 with various prompts and the model's performance.

Introduction of Leonardo Phoenix, a custom model by Leonardo with enhanced features.

Demonstration of text in image capabilities with Leonardo Phoenix.

Mid Journey's new feature, Model Personalization, and its customization based on user preferences.

Google Labs' Gen Type tool that generates letters in various styles.

Sunno's new song generation feature that extends and improves user-provided music.

Adobe's clarification on their terms of service regarding AI training on customer work.

Apple's WWDC event unveiling AI integration across all their devices and services.

Apple's new Image Playground for image generation with a focus on animations, illustrations, and sketches.

Elon Musk's criticism of Apple's integration with Open AI and its potential security implications.

Introduction of Quinn 2, a new open-source AI model outperforming previous benchmarks.

Photographer disqualified from AI image contest for using a real photo to highlight human creativity.