MidJourney Version 6, finally worth the price?

VISULA by TOBY
3 Jan 202406:36

TLDRMidJourney Version 6 is reviewed by Toby from Visual Toby, who explores its new features for AI-generated images. The new version allows for more natural language prompts, eliminating the need for specific keywords. In a comparison with the industry-leading Stable Diffusion model Juggernaut XL, MidJourney's images are found to be more detailed and lifelike, with better adherence to the prompts. Toby is particularly impressed with the quality and reliability of MidJourney's output, noting that it allows users to bring their ideas to life with minimal adjustments. However, he suggests that the MidJourney team could improve by offering more customization options and a dedicated website for the generator. He also criticizes the need to pay extra for privacy, which he believes should not be a premium feature. Despite these points, Toby concludes that MidJourney's advancements are beneficial for the AI image generation industry as a whole.

Takeaways

  • πŸŽ‰ MidJourney Version 6 introduces a new way of prompting images with natural language, eliminating the need for generic keywords.
  • πŸ“Έ The new version provides high-quality image generation that rivals industry leaders like Juggernaut XL based on Stable Diffusion.
  • πŸ” MidJourney's images are detailed and lifelike, with excellent texture and clarity, even capturing subtle features like hair and dress details.
  • πŸš€ The prompt example given, 'a beautiful red curly-haired woman in a blue dress by the sea,' resulted in images that closely matched the description.
  • πŸ†š In a comparison with Stable Diffusion, MidJourney's images were often more aligned with the prompt and had better overall quality.
  • 🧸 For a prompt involving a 'photorealistic ice bear baby,' both models struggled with certain aspects, but MidJourney's interpretation was slightly more accurate.
  • πŸš— A prompt for a 'photorealistic black Porsche' was perfectly captured by MidJourney, showing a high level of detail and adherence to the description.
  • βœ… MidJourney's reliability allows users to bring their ideas to life with minimal prompt adjustments, making the process more efficient.
  • πŸ’­ The video suggests that while MidJourney is impressive, there is room for improvement in customizability and freedom with the generator.
  • πŸ’¬ The platform currently operates on Discord, and there is a suggestion for a dedicated site and presets or templates for more control.
  • πŸ”’ Privacy is a concern as images are visible to all unless a higher, more expensive plan is purchased, which the video argues should not be a premium feature.
  • 🌟 Despite some drawbacks, the advancements in MidJourney are beneficial for the industry, as they push other software to adopt similar technologies.

Q & A

  • What is the main topic of the video?

    -The main topic of the video is the review and comparison of MidJourney Version 6, an AI image generation tool, and its features and performance against another industry-leading model, Juggernaut XL based on Stable Diffusion.

  • How has MidJourney Version 6 changed the way of prompting for images?

    -MidJourney Version 6 has introduced a new way of prompting for images that allows users to describe their desired image in natural language, as if explaining it to a human, without the need for generic keywords or terms.

  • What is the significance of using natural language in prompting for MidJourney Version 6?

    -Using natural language in prompting allows for a more human-like interaction with the AI, which can lead to more accurate and detailed image generation that closely matches the user's intended concept.

  • How does the video compare MidJourney Version 6 to Stable Diffusion XL?

    -The video compares MidJourney Version 6 to Stable Diffusion XL by using specific prompts and evaluating the resulting images based on their quality, detail, and adherence to the prompt.

  • What are some of the advantages of MidJourney Version 6 over Stable Diffusion XL as per the video?

    -MidJourney Version 6 is praised for its ability to generate highly detailed and realistic images that closely follow the user's natural language prompts. It is also noted for its reliability and the reduced need for multiple iterations to achieve the desired image.

  • What are the areas where the MidJourney team is suggested to improve?

    -The video suggests that the MidJourney team should work on improving customizability and freedom with their generator, consider creating a dedicated site for the generator instead of running it on Discord, and reconsider the premium feature for privacy as it should not be an extra cost.

  • What is the reviewer's opinion on the image quality of MidJourney Version 6?

    -The reviewer is highly impressed with the image quality of MidJourney Version 6, describing the images as extremely alive, with great detail in hair, dress texture, and background scenes.

  • How does the video address the issue of privacy with MidJourney's image generation?

    -The video points out that unless users upgrade to a higher plan, their images can be seen by everyone, which is considered a downside and suggested to be a standard feature rather than a premium one.

  • What does the reviewer suggest for the future of AI image generation tools?

    -The reviewer suggests that while MidJourney is currently the best, it does not mean it will stay that way forever. They anticipate that other software will adapt similar technologies, leading to further advancements in the field.

  • What is the reviewer's final verdict on whether it's worth making the switch to MidJourney Version 6?

    -The reviewer concludes that MidJourney Version 6 is extremely reliable and allows users to bring their exact ideas to life with minimal prompt adjustments, suggesting that it may be worth making the switch.

  • How does the video engage with the audience for further interaction?

    -The video encourages audience interaction by inviting viewers to leave a thumbs up if they enjoyed the content and to write any questions in the comment section for the reviewer to answer.

  • What is the significance of the 'stable diffusion' term mentioned in the video?

    -Stable diffusion refers to a type of AI model used for image generation, and in the context of the video, it is used to compare the performance of MidJourney Version 6 against a leading model, Juggernaut XL based on Stable Diffusion.

Outlines

00:00

πŸ–ΌοΈ Mid Journey Version 6: AI Image Generation Enhancements

The video introduces Mid Journey's version 6, an AI model for generating images with significant updates. The new version allows for more natural language prompts, eliminating the need for generic keywords. It is compared to other AI models like Stable Diffusion and Deli. The video demonstrates how the new prompting system works with an example and compares the quality of generated images to those of Stable Diffusion's Juggernaut XL. The comparison shows that Mid Journey's images are more detailed and closer to real-life, even though some details like a birthmark were not captured. The video concludes with a discussion on the strengths of Mid Journey and potential areas for improvement, such as customizability and privacy concerns.

05:01

πŸ” Mid Journey's Image Quality and Customization Concerns

This paragraph discusses the impressive image quality produced by Mid Journey's AI model, noting that it often requires less tweaking to achieve desired results. The video acknowledges that while Mid Journey is currently leading in image generation, it is not without areas for improvement. The paragraph raises concerns about the lack of customizability in the generator and the necessity of a dedicated website instead of relying on Discord. It also mentions the need for presets or templates similar to those available for Stable Diffusion. The paragraph ends with a critique of the premium feature for privacy, suggesting it should not be an additional cost. The video concludes by encouraging viewers to engage with the content through likes and comments, and promises to address any questions in the comments section.

Mindmap

Keywords

πŸ’‘MidJourney Version 6

MidJourney Version 6 refers to the latest iteration of an AI image generation software. It is the main subject of the video, which discusses its new features and improvements over previous versions. The video explores whether this version is worth the investment compared to other AI models.

πŸ’‘Prompting

In the context of AI image generation, 'prompting' is the process of providing a description or set of instructions to the AI to guide the creation of an image. MidJourney Version 6 introduces a new, more natural language approach to prompting, which is a significant change from previous methods.

πŸ’‘Natural Language

Natural language is the way humans communicate with each other, as opposed to formal programming languages. The video emphasizes that MidJourney Version 6 allows users to prompt the AI using natural language, making it easier to communicate their vision to the software.

πŸ’‘Stable Diffusion

Stable Diffusion is an industry-leading AI model used for image generation, specifically mentioned as Juggernaut XL in the script. It serves as a comparison point to evaluate the performance and quality of images generated by MidJourney Version 6.

πŸ’‘Photorealistic

Photorealistic refers to the quality of an image that closely resembles a photograph. It is a desired outcome in the context of AI image generation, with the video comparing the photorealism of images produced by MidJourney Version 6 and Stable Diffusion.

πŸ’‘Customizability

Customizability is the ability to modify or adapt features to suit individual preferences or needs. The video suggests that MidJourney Version 6 has room for improvement in terms of customizability, indicating that users may want more control over the image generation process.

πŸ’‘Discord

Discord is a communication platform where the MidJourney generator operates. The video questions why the generator is hosted on Discord instead of having a dedicated site, hinting at potential limitations in user experience.

πŸ’‘Presets or Templates

Presets or templates are pre-defined settings or configurations that users can choose to quickly generate images with specific characteristics. The video suggests that having such options could enhance the user experience with MidJourney Version 6.

πŸ’‘Privacy

Privacy, in the context of the video, refers to the visibility of generated images to others. The video criticizes the need to pay extra for privacy, as users' images are visible to everyone unless they upgrade to a higher, more expensive plan.

πŸ’‘Reliability

Reliability, in the context of AI image generation, means the consistency and dependability of the AI to produce high-quality images. The video praises MidJourney Version 6 for its reliability, stating that it can bring users' ideas to life with minimal adjustments.

πŸ’‘AI Model

An AI model is a specific instance or version of an artificial intelligence system designed for a particular task, such as image generation. The video discusses switching from other AI models to MidJourney Version 6, highlighting the advancements in AI technology.

Highlights

MidJourney Version 6 introduces a new way of prompting for images using natural language, similar to explaining to a human.

Using generic keywords like '4K' or 'photorealistic' in prompts is now considered harmful for MidJourney Version 6.

An example prompt demonstrates a detailed scene with a red curly-haired woman by a vintage Mercedes-Benz by the sea.

MidJourney's images are of such high quality that they could be mistaken for real photographs.

The detail in hair and texture of clothing in MidJourney's images are significantly better than those from Stable Diffusion models.

MidJourney's image generation is more reliable and requires less prompt adjustment to achieve desired results.

MidJourney's generator is criticized for its lack of customizability and reliance on Discord.

The video suggests that a dedicated website for the generator and presets/templates would be beneficial.

Privacy is a concern as images are visible to everyone unless a higher, more expensive plan is purchased.

The presenter is amazed by the quality and detail of a photorealistic black Porsche image generated by MidJourney.

MidJourney's ability to pick up on the entire prompt with great detail is praised.

The presenter suggests that other software will likely adopt similar technologies, keeping the field competitive.

Despite its current superiority, MidJourney is not expected to remain the best forever due to competition.

The video encourages viewers to leave a thumbs up and ask questions in the comments for further interaction.

Stable Diffusion's image of a scene with a car on a motorway was criticized for not accurately following the prompt.

MidJourney's image of a plushy teddy bear wearing a cowboy hat was favored over Stable Diffusion's version.

The presenter's favorite prompt resulted in an 'insane' level of detail in MidJourney's image, exceeding expectations.