AI News: Amazing New Tools You Can Use NOW!

Matt Wolfe
14 Jun 2024 (33:20)

TLDR

This week in AI brought new tools and updates. Luma AI's Dream Machine produced mixed results in text-to-video generation but excelled at image-to-video conversion. Stable Diffusion 3 offers improved text-to-image capabilities, while Leonardo Phoenix stands out for its prompt adherence and image quality. Midjourney introduced model personalization, and Google Labs' GenType generates stylized alphabet designs. At WWDC, Apple unveiled AI integration across its devices, including new Siri capabilities and Image Playground for image generation, a significant step toward making AI more accessible and useful.

Takeaways

  • 🆕 Luma AI released 'Dream Machine', a competitor to other AI video tools like Sora and Veo, with mixed results and some initial frustrations due to high demand and long wait times.
  • 📹 Dream Machine's image-to-video feature shows promise, with some impressive results in generating short video clips from still images.
  • 🎨 Stability AI released Stable Diffusion 3, which offers improved text-to-image capabilities and is available for download on Hugging Face for local use.
  • 🔍 Leonardo AI introduced the 'Phoenix' model, a custom foundational model not based on Stable Diffusion, with enhanced features like prompt adherence and coherent text in images.
  • 🎶 Suno, a music creation AI tool, now allows users to extend short musical pieces into full songs with added lyrics and beats.
  • 🤖 Apple announced the integration of AI across all their devices and services, focusing on on-device intelligence and user privacy.
  • 🎨 Apple's new 'Image Playground' feature generates images in animation, illustration, and sketch styles, which limits the potential for deepfake misuse.
  • 🖼️ Adobe is revising their terms of service after a misunderstanding about training AI on customer work, clarifying they won't use customer content for AI training without consent.
  • 🧑‍💼 OpenAI welcomed new C-level executives, Sarah Friar as CFO and Kevin Weil as CPO, both with significant industry experience.
  • 🚀 A new open-source model, Qwen2, outperformed previous models like Llama 3 in various benchmarks, showcasing the rapid advancement in AI capabilities.
  • 🏆 A photographer was disqualified from an AI image contest for using a real photo, highlighting ongoing discussions about the value of human creativity versus AI-generated art.

Q & A

  • What is the main focus of the AI News video transcript?

    -The main focus of the AI News video transcript is to discuss and showcase various newly released AI tools, particularly those related to video and image generation, and to provide insights into their capabilities and user experiences.

  • What is Luma AI's Dream Machine and how does it compare to its competitors?

    -Luma AI's Dream Machine is an AI video generation tool that competes with other AI video tools like Sora, Veo, Kling, Pika, and Runway. While it has some impressive capabilities in certain scenarios, the transcript suggests that it is not yet on par with Sora in most cases, but it has potential, especially in image-to-video generation.

  • What issues were encountered during the initial use of Luma AI's Dream Machine?

    -During the initial use of Luma AI's Dream Machine, there were long wait times for video generation, with one request taking 7 hours to start. Additionally, there were instances of video generation failure, resulting in error messages without any output.

  • How has Luma AI addressed the initial scaling issues with Dream Machine?

    -Luma AI appears to have scaled up their system to eliminate the huge wait times that were initially experienced. The platform now seems to be able to handle requests more efficiently, reducing the generation time to around 120 seconds.

  • What are some of the scenarios where Luma AI's Dream Machine performed well?

    -Luma AI's Dream Machine performed well in generating videos from images, such as flyover shots, aerial shots of a lighthouse, and an animated version of a YouTube thumbnail showing a man's head exploding with color.

  • What is the current pricing model for using Luma AI's Dream Machine during its research preview phase?

    -During the research preview phase, users of Luma AI's Dream Machine get 30 free generations per month. After that, the cost is approximately 25 cents per video generated.

  • What updates did Pika, one of the competitors, make to its image-to-video model?

    -Pika updated its image-to-video model to improve its capabilities. The transcript does not specify what changed beyond suggesting that the improvements were noticeable.

  • What is the significance of the release of Stable Diffusion 3 by Stability AI?

    -The release of Stable Diffusion 3 by Stability AI is significant because it makes the model's weights available for public use. This allows users to download the weights and run the model locally on their computers or cloud servers for image generation.

  • How does the Leonardo Phoenix model differ from previous models used by Leonardo AI?

    -The Leonardo Phoenix model is a new foundational model developed specifically for Leonardo AI. Unlike previous models, which were based on Stable Diffusion, the Phoenix model is trained from the ground up for Leonardo AI, offering enhanced prompt adherence, coherent text in images, superior image quality, and more creative control.

  • What is the new feature introduced by Midjourney called and what does it do?

    -The new feature introduced by Midjourney is called 'Model Personalization'. It allows the AI to learn the user's preferences based on their past voting on images. Once the user has a personalization code, the AI attempts to generate images closer to the styles the user has shown a preference for.

  • What is GenType by Google Labs and what can it do?

    -GenType is a tool by Google Labs that generates letters or text in various styles as requested by the user. It can create an entire alphabet or specific words in styles such as colorful electronic circuitry, offering a fun and creative way to visualize text.

  • What updates did Apple announce regarding AI integration in their devices during the WWDC event?

    -During the WWDC event, Apple announced that it is integrating AI across its devices, including iPhone, iPad, and Mac. It introduced features such as text summarization, smart replies, image generation with Image Playground, custom Genmoji, photo editing features, and Siri updates with on-device AI capabilities. It also announced a partnership with OpenAI that lets Siri hand certain questions to ChatGPT.

  • What was the controversy surrounding Apple's partnership with OpenAI and how was it resolved?

    -The controversy arose after Elon Musk's tweets raised privacy concerns about the integration. Apple clarified its approach to data privacy and control: any use of OpenAI's technology, such as ChatGPT, requires explicit user permission.

  • What is the significance of the new Qwen2 model in the AI community?

    -The Qwen2 model is significant because it outperforms other models like Llama 3 and Mixtral 8x22B in various benchmarks, despite having fewer parameters. This suggests that Qwen2 is highly efficient and could offer improved performance in AI applications.

  • What was the outcome of the case between Elon Musk and OpenAI?

    -Elon Musk dropped his case against OpenAI, which centered on claims of breach of contract and OpenAI's transformation into a for-profit entity. The case was likely dropped for lack of strong legal standing after emails showed that Musk had previously agreed with OpenAI's plans.

  • What new executives did OpenAI bring on board and what are their backgrounds?

    -OpenAI brought on Sarah Friar as CFO, who was previously the CEO of Nextdoor and CFO of Square, and Kevin Weil as CPO, who has held positions at Planet Labs, Facebook, Instagram, and Twitter, and worked on the Libra cryptocurrency project.

  • Why did Microsoft decide to remove the custom GPT feature from their Copilot Pro tool?

    -Microsoft decided to remove the custom GPT feature from Copilot Pro because it did not gain enough traction among users, indicating that the feature was not as valuable or necessary as initially thought.

  • What was the unusual incident involving a photographer disqualified from an AI image contest?

    -A photographer was disqualified from an AI image contest after winning with a real photo. The photographer, Miles Astray, intended to demonstrate that human creativity is still valued and preferred over AI-generated art.
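
The Dream Machine pricing mentioned in the Q & A above (30 free generations per month, then roughly 25 cents per video) can be turned into a quick cost estimate. The following is a minimal sketch, not Luma AI's actual billing logic: the function name and constants are hypothetical, the 25-cent figure is the approximate number quoted in the video, and real pricing may be tiered or may have changed since the research preview.

```python
# Rough monthly cost estimate for Dream Machine during its research preview.
# Assumptions (taken from the summary, not from Luma AI's billing docs):
#   - 30 free generations per month
#   - about 25 cents for each generation beyond the free allowance
FREE_GENERATIONS_PER_MONTH = 30
APPROX_PRICE_PER_EXTRA_VIDEO_CENTS = 25


def estimated_monthly_cost_cents(videos_generated: int) -> int:
    """Return the estimated cost in cents for one month of generations."""
    if videos_generated < 0:
        raise ValueError("videos_generated must be non-negative")
    # Only generations beyond the free allowance are billed.
    paid_videos = max(0, videos_generated - FREE_GENERATIONS_PER_MONTH)
    return paid_videos * APPROX_PRICE_PER_EXTRA_VIDEO_CENTS


if __name__ == "__main__":
    for count in (10, 30, 50):
        dollars = estimated_monthly_cost_cents(count) / 100
        print(f"{count} videos: about ${dollars:.2f}")
```

Working in whole cents avoids floating-point rounding until the value is formatted for display.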

Outlines

00:00

🎨 AI Video Tools and Creative Experiments

The paragraph discusses developments in AI video generation tools, highlighting the release of Luma AI's 'Dream Machine' as a competitor to other AI video tools like Sora and Veo. The author shares personal experiences with Dream Machine, noting the initial frustrations with long wait times and generation errors, but also showcasing successful video outputs. The paragraph emphasizes the tool's potential, especially in image-to-video generation, and notes that a free tier is available for now but may not last.

05:01

🖼️ Advancements in AI Image Generation

This paragraph delves into the updates in AI image generation, including the highly anticipated release of Stable Diffusion 3 by Stability AI. The author provides insights into the capabilities of Stable Diffusion 3, noting its improved text-to-image generation and the availability of its weights for download on Hugging Face. The paragraph also touches on the user experience with the model, the need for detailed prompts to achieve better results, and compares it to other models like Leonardo Phoenix for image quality and prompt adherence.

10:02

🎼 AI Music Composition and Adobe's AI Integration

The focus of this paragraph is on the new AI music composition feature by Suno and the controversy surrounding Adobe's terms of service update. The author describes the process of creating music with Suno by uploading or recording audio and extending it with AI-generated elements and lyrics. Additionally, the paragraph discusses Adobe's clarification that they will not train AI on customer work, following concerns raised by the community.

15:03

🍎 Apple's AI Integration and Ecosystem Updates

This paragraph summarizes Apple's AI announcements during their WWDC event, highlighting the company's strategy to integrate AI across all their devices and services. The author mentions new features such as AI-powered email summarization, smart replies, and enhanced search capabilities. Also covered are Apple's Image Playground for image generation, the new Genmoji feature, and updates to Siri, including the option to use ChatGPT for certain queries.

20:03

🤖 OpenAI's Strategic Partnerships and Executive Appointments

The paragraph covers OpenAI's partnership with Apple to offer ChatGPT as an option within Siri and the subsequent controversy sparked by Elon Musk's tweets, which led to a clarification from Apple about data privacy and control. It also discusses the end of Elon Musk's lawsuit against OpenAI and the appointment of new C-level executives at OpenAI, indicating the organization's growth and development.

25:05

🏆 AI Model Benchmarks and Qwen2's Superior Performance

This paragraph reports on the release and benchmark testing of the Qwen2 AI model, which has been shown to outperform other models like Llama 3 in various tests despite having fewer parameters. The author suggests that Qwen2's results are significant within the AI community and could impact the development and adoption of AI models.

30:05

🎉 Celebrating Creativity and Staying Updated with AI News

The final paragraph wraps up the video script by celebrating human creativity in AI-generated art contests and encouraging viewers to stay updated with AI news through the author's 'Future Tools' platform. The author promotes their newsletter, AI income database, and the benefits of subscribing, including access to the latest AI tools and news.

Keywords

Luma AI

Luma AI is the company behind Dream Machine, the AI video generator introduced in the video. Dream Machine is presented as a competitor to other AI video tools like Sora and Veo and is noted for its image-to-video capabilities, though its text-to-video generation has some limitations.

Dream Machine

Dream Machine is Luma AI's video generator, which creates video content from text prompts or still images. The video discusses its performance and issues, such as long wait times for video generation and occasional failures, and notes that it is stronger at image-to-video than text-to-video.

Stable Diffusion 3

Stable Diffusion 3 is the latest version of the AI image generation tool from Stability AI. The video highlights its improved ability to incorporate text into images and the availability of its weights for download on Hugging Face, allowing users to run the model locally.

Image-to-Video

Image-to-Video is a feature of Luma AI’s Dream Machine that transforms static images into dynamic video content. The video shows various examples, demonstrating its effectiveness in creating realistic animations and its advantage over the text-to-video feature.

Midjourney

Midjourney is an AI image generation tool that recently introduced a model personalization feature, allowing users to create images that align more closely with their preferences based on past interactions. The video explains how users can rank images to train Midjourney to better understand their tastes.

GenType

GenType is a tool from Google Labs that generates text in various artistic styles. The video demonstrates how it can create letters in unique designs, such as 'colorful electronic circuitry,' and mentions its similarity to Adobe Firefly’s text generation capabilities.

Suno

Suno is an AI tool that allows users to create songs by uploading or recording audio snippets. The video showcases how it can extend and enhance basic guitar riffs into full songs with added beats and lyrics, making music creation more accessible.

Apple Intelligence

Apple Intelligence refers to the suite of AI features announced by Apple at WWDC, aimed at integrating AI across its devices. These include capabilities for proofreading, summarizing text, and enhancing Siri with on-device AI processing, enhancing user experience without compromising privacy.

Hugging Face

Hugging Face is a platform mentioned in the video where users can access and run AI models like Stable Diffusion 3. It allows for testing and generating images for free, though there might be wait times due to high demand.

Leonardo Phoenix

Leonardo Phoenix is a new foundational model released by Leonardo AI, designed to improve prompt adherence, image quality, and provide more creative control. The video highlights its superiority over other models like Stable Diffusion 3, especially in generating coherent text within images.

Highlights

Introduction to a variety of new AI tools available for use, including video generators and image generation models.

Luma AI's release of Dream Machine, a competitor to other AI video tools like Sora, Veo, and Runway.

Dream Machine's initial high demand and long wait times for video generation.

Examples of generated videos using Dream Machine, showcasing its capabilities and limitations.

Comparison of Dream Machine's text-to-video capabilities with other AI video tools.

Luma AI's image-to-video feature, which shows more promising results than text-to-video.

Pika's updated image-to-video model and its comparison with Dream Machine.

Release of Stable Diffusion 3 by Stability AI, now available for public use.

Examples of images generated by Stable Diffusion 3 and its prompt requirements for better results.

Introduction of Leonardo Phoenix, a new custom model by Leonardo with enhanced features.

Demonstration of image generation using Leonardo Phoenix and its auto-generated prompts.

Midjourney's new feature 'Model Personalization' based on user preferences.

Google Labs' 'GenType' tool for generating letters in custom styles.

Suno's new feature for creating songs from uploaded audio or sound effects.

Adobe's clarification on their terms of service regarding AI training on customer work.

Apple's WWDC event announcements, focusing on integrating AI across all their devices and services.

Details on Apple's new features like Image Playground, Genmoji, and Siri updates.

Elon Musk's reaction to Apple's integration with OpenAI and his concerns over privacy.

OpenAI's addition of new executives and updates on their partnership with Microsoft.

Introduction of Qwen2, a new open-source AI model outperforming others in benchmark tests.

A photographer's disqualification from an AI image contest for submitting a real photo, highlighting human creativity.