Was NOT Expecting This! Midjourney V6 Competes with DALL-E 3 | Comparison & Review

MattVidPro AI
21 Dec 202319:33

TLDRMidjourney V6, an AI art generator, has made significant strides to compete with DALL-E 3, impressing even skeptics with its capabilities. Despite being in its Alpha version, V6 has demonstrated its ability to generate realistic and detailed images, including text that is often more accurate than DALL-E 3's. The video compares various outputs from both AIs, highlighting Midjourney V6's strengths in photorealism and its potential to improve with further development. While DALL-E 3 is currently free on certain platforms, Midjourney V6 offers a more controlled and less censored environment, with a subscription-based model starting at $10 per month. The review concludes that Midjourney V6 is a strong contender in the AI art space, showing promise to challenge DALL-E 3.

Takeaways

  • 🚀 Midjourney V6 has made significant advancements and is now competing with DALL-E 3 in terms of AI art generation.
  • 🔍 The development time for Midjourney V6 was nearly twice as long as the previous longest development cycle, indicating a major update.
  • 🆕 Midjourney V6 is currently in its Alpha version, suggesting that its capabilities will improve over time.
  • 🎨 The AI can generate more realistic and good-looking words compared to DALL-E 3, although this is subjective.
  • 📈 Midjourney V6 has improved in photorealism and prompt understanding, showing better results in certain areas than V5.
  • 📜 Text generation in Midjourney V6 has improved, but it still requires specific prompting to achieve accurate results.
  • 📉 DALL-E 3 sometimes outperforms Midjourney V6 in text accuracy and character generation, but not consistently.
  • 💲 Midjourney V6 requires a subscription to access, while DALL-E 3 is available for free on certain platforms, offering a different value proposition.
  • 🌐 Midjourney V6 offers more control, less censorship, and better understanding of pop culture characters, which is a significant advantage.
  • 📱 There is a noted preference for a web interface over Discord for generating images, indicating a desire for a more user-friendly experience.
  • ✅ Midjourney V6 has passed the 'cigarette test,' demonstrating its ability to generate complex and detailed images.

Q & A

  • What is the significance of the development time for Midjourney V6?

    -The development time for Midjourney V6 was nearly twice as long as the previous longest development cycle, indicating a significant investment in improvements and updates to compete with other AI art platforms like DALL-E 3.

  • How does Midjourney V6 compare to DALL-E 3 in terms of text generation?

    -Midjourney V6 has made strides in text generation and is now competitive with DALL-E 3, although it sometimes produces text that appears more 'Photoshop-esque' compared to the more natural text generated by DALL-E 3.

  • What are some of the strengths of Midjourney V6?

    -Midjourney V6 excels in photorealism, has improved prompt understanding, and offers more control with less censorship. It also has a better understanding of pop culture characters and provides more aspect ratios to choose from.

  • How does the community perceive the quality of images generated by Midjourney V6?

    -The community has reacted positively to the images generated by Midjourney V6, noting that they are realistic, detailed, and in some cases, indistinguishable from actual photographs.

  • What are the current access options for DALL-E 3?

    -DALL-E 3 can currently be accessed for free on platforms like Bing Image Creator and Microsoft Designer Image Creator, which utilize the DALL-E 3 API.

  • What is the author's conspiracy theory regarding the differences in text generation between Midjourney V6 and DALL-E 3?

    -The author theorizes that Midjourney V6 may be synthetically trained to produce text, while DALL-E 3 is naturally trained, which could explain the differences in text quality and character generation between the two platforms.

  • What are the limitations of Midjourney V6 when generating images with multiple characters?

    -Midjourney V6 sometimes struggles with generating images featuring multiple characters, as they may blend together or appear distorted, resulting in a less coherent final image.

  • How does the pricing for Midjourney V6 compare to DALL-E 3?

    -Midjourney V6 requires a minimum subscription fee of $10 a month to access, whereas DALL-E 3 can be accessed for free on certain platforms, although a more censored version is available through Chat GPT for a minimum of $20 a month.

  • What is the current status of Midjourney V6?

    -Midjourney V6 is currently in its Alpha version, which means it is still in development and has the potential for further improvements and updates.

  • What is the 'cigarette test' mentioned in the script, and how does Midjourney V6 perform on this test?

    -The 'cigarette test' is a challenge for AI image generators to accurately depict a cigarette in someone's mouth or hand. Midjourney V6 successfully passes this test, demonstrating its ability to generate detailed and realistic images.

  • How does the author suggest improving the text generation capabilities of Midjourney V6?

    -The author suggests using better prompting techniques, such as switching to Raw mode, to improve the text generation capabilities of Midjourney V6.

Outlines

00:00

🚀 Mid Journey V6's Impressive Development

The video discusses the significant advancements in Mid Journey V6, highlighting its development time being nearly twice as long as the previous longest development cycle. It emphasizes the competitive edge Mid Journey V6 has gained against Dolly 3, especially after the release of a free competitor, SDXL, and the impressive capabilities of Dolly 3. The community's initial reactions are also shared, noting the subjective beauty of the generated words and the cinematic and realistic feel of the images produced by Mid Journey V6 compared to Dolly 3 and SDXL.

05:02

📈 Comparing Mid Journey V6 and Dolly 3

The script provides a detailed comparison between Mid Journey V6 and Dolly 3, showcasing examples of text and image generation. It notes that while Dolly 3 has a slight edge in text accuracy, Mid Journey V6 offers superior aesthetics and photorealism. The video also discusses the community's feedback, including a direct comparison from V5.2 to V6, and the importance of prompt accuracy in achieving better results with AI image generators.

10:03

🤖 Dolly 3's Market Impact and Mid Journey's Response

The video script theorizes that Dolly 3's influence on the market pushed Mid Journey to improve its text generation capabilities. It suggests that Dolly 3 might be naturally trained to produce text, while Mid Journey V6 is synthetically trained. The video also includes a test comparing photorealistic image generation between Mid Journey V6 and Dolly 3, where Mid Journey V6 excels in producing images that resemble real Instagram photos, while Dolly 3 shows a slight lag in text accuracy.

15:04

🎨 Mid Journey V6's Strengths and Future Prospects

The script acknowledges that Mid Journey V6, despite being an alpha version, has made significant strides in photorealism and text generation, positioning it as a strong contender against Dolly 3. It discusses the challenges faced by Mid Journey in competing with well-funded entities like Open AI and Microsoft. The video concludes by stating that Mid Journey V6 has managed to impress and regain its position in the market, suggesting that with further improvements, it could effectively compete with Dolly 3.

Mindmap

Keywords

Midjourney V6

Midjourney V6 refers to the sixth version of the AI art generator, Midjourney. It is a significant update that has taken nearly twice as long to develop as the previous longest development cycle. In the video, it is compared with DALL-E 3 and is shown to have made considerable advancements in text generation and photorealism, positioning it as a strong competitor in the AI art landscape.

DALL-E 3

DALL-E 3 is an advanced AI image generator developed by OpenAI. It is known for its high level of coherence, prompt understanding, and impressive scale. The video discusses how Midjourney V6 is competing with DALL-E 3, particularly in areas such as text generation within images and photorealistic outputs.

Photorealism

Photorealism in the context of the video refers to the ability of AI art generators to produce images that closely resemble real photographs. Midjourney V6 is praised for its strong performance in this area, with images that are described as 'legit' and 'professional looking,' which is a significant aspect of the comparison with DALL-E 3.

Prompt Understanding

Prompt understanding is the ability of an AI to accurately interpret and generate images based on textual descriptions provided by users. The video highlights that Midjourney V6 has improved significantly in this area, allowing it to better compete with DALL-E 3, although DALL-E 3 is still considered to have an edge.

Text Generation

Text generation is the process by which AI art generators create and incorporate text into images. The video discusses how Midjourney V6 has made strides in text generation, making it more competitive with DALL-E 3, although there are still areas for improvement in terms of accuracy and naturalness.

AI Art Landscape

The AI art landscape refers to the current state and advancements in the field of AI-generated art. The video provides an overview of how Midjourney V6's development has been influenced by changes in this landscape, particularly the introduction of competitors like DALL-E 3 and the demand for more advanced features.

Cinematic

Cinematic, in the context of the video, describes the quality of AI-generated images that resemble scenes from movies, with a high level of detail and realism. Midjourney V6 is noted to produce images with a more cinematic and realistic vibe compared to DALL-E 3 in certain comparisons.

Pop Culture Characters

Pop culture characters refer to well-known figures from popular culture, such as those from movies, TV shows, or comic books. The video tests Midjourney V6's ability to generate images of these characters and discusses the level of accuracy and realism achieved, which is an important factor in evaluating the AI's capabilities.

Upscaling

Upscaling is the process of enhancing the resolution of an image without losing quality. The video demonstrates Midjourney V6's upscaling feature, comparing it with other upscaling tools to show how well it can enlarge images while maintaining detail and realism.

In-Painting

In-painting is a feature in AI art generators that allows users to fill in or modify parts of an image. The video mentions that Midjourney V6 has an in-painting feature, which is not available in DALL-E 3, giving it an advantage in certain creative tasks.

Discord

Discord is a communication platform where the AI art generator Midjourney V6 is currently accessible. The video expresses frustration with the use of Discord for this purpose, suggesting that a web interface would be more user-friendly for generating and manipulating images.

Highlights

Midjourney V6 has made significant advancements, competing with DALL-E 3 in the AI art landscape.

The development time for Midjourney V6 was nearly twice as long as the previous longest development cycle.

Midjourney V6 is currently in its Alpha version, with potential for further improvements.

Community reactions suggest that Midjourney V6 can generate more beautiful and realistic words compared to DALL-E 3.

A side-by-side comparison reveals that while DALL-E 3 has a more photoshop-esque vibe, Midjourney V6 offers a more cinematic and realistic output.

Midjourney V6's text generation is competitive, with some images appearing more photo-realistic and detailed.

DALL-E 3 sometimes struggles with text accuracy, as seen in the comparison where 'Organic snacks' was misspelled in some generations.

Midjourney V6 has superior aesthetics that compensate for slightly less accuracy in text generation compared to DALL-E 3.

Midjourney V6 has made strides in photo-realistic image generation, leading the field in this area.

The text in Midjourney V6 images can sometimes appear unnatural, possibly due to synthetic training methods.

DALL-E 3 has a more natural text generation, possibly due to natural training methods.

Midjourney V6 offers more control, less censorship, and a better understanding of pop culture characters.

DALL-E 3 is currently accessible for free on certain platforms, while Midjourney V6 requires a subscription.

Midjourney V6 has a built-in upscaling feature that provides a decent upscale of realistic movie scenes.

Midjourney V6 passes the 'cigarette test' for AI image generators, successfully generating a cigarette in someone's mouth or hand.

The reviewer has resumed their Midjourney subscription due to the impressive capabilities of V6.

Midjourney V6 is seen as a strong contender in the AI art generator market, potentially competing head-to-head with DALL-E 3.