BEST AI Art Generator? Dall E 2 vs Midjourney vs Stable Diffusion

Wade McMaster - Creator Impact
22 Dec 202207:04

TLDRThis video compares three leading AI art platforms: Dall-E 2, Midjourney, and Stable Diffusion. The comparison is based on the outputs generated from basic prompts to evaluate the styles and quality of the images produced by each platform. Dall-E 2 is noted for its photorealistic images, although sometimes with minor quirks. Midjourney is praised for its artistic and well-composed images, which often stand out for their style and appeal. Stable Diffusion, while capable of decent results, is generally considered to be the least impressive of the three in this context. The video also discusses the user interfaces and ease of use for each platform, with Dall-E 2 having a more user-friendly interface and Midjourney offering a more complex but rewarding experience. The narrator shares a preference for Midjourney's artistic style and invites viewers to share their thoughts and preferences in the comments.

Takeaways

  • ๐ŸŽจ Dall-E 2, Midjourney, and Stable Diffusion are three leading AI art platforms used to generate creative images based on prompts.
  • ๐Ÿ” When comparing the platforms using basic prompts, Dall-E 2 produced photorealistic images, though with some quirks.
  • ๐ŸŒŸ Midjourney's results were consistently artistic and visually appealing, often standing out for their style.
  • ๐Ÿ“ˆ Stable Diffusion's outputs were generally good but did not match the artistic flair of Midjourney or the photorealism of Dall-E 2.
  • ๐Ÿ–ผ๏ธ Dall-E 2's interface is user-friendly with additional features like in-painting and out-painting for enhancing AI art.
  • ๐Ÿค– Midjourney, while more complex to use, offers more artistic and better-composed images.
  • ๐Ÿ†“ Stable Diffusion is available for free but has a steeper learning curve and is more complex to set up.
  • ๐Ÿ“ฑ Dall-E 2 and Midjourney both support creating images at a resolution of 1024 by 1024 pixels, while Stable Diffusion's resolution can be adjusted.
  • ๐Ÿ† Dall-E 2 is favored for its photorealistic capabilities, while Midjourney is preferred for artistic composition.
  • ๐Ÿ“ˆ Midjourney's results are often more striking and artistic, making it a favorite for users who value unique and expressive styles.
  • ๐Ÿ’ฌ The choice between these platforms depends on the user's preference for photorealism, artistic style, or ease of use.

Q & A

  • What are the three main AI art platforms mentioned in the transcript?

    -The three main AI art platforms mentioned are Dall-E 2, Midjourney, and Stable Diffusion.

  • Which platform is noted to have the most photorealistic results?

    -Dall-E 2 is noted to have the most photorealistic results.

  • In the comparison, which platform seems to have the most artistic and better composed images?

    -Midjourney tends to have more artistic and better composed images.

  • What is the user's preferred platform for artistic style?

    -The user's preferred platform for artistic style is Midjourney.

  • How does the user describe the interface of Dall-E 2?

    -The user describes the interface of Dall-E 2 as much nicer, easy to use, and having great features like in-painting and out-painting.

  • What is the main advantage of Midjourney according to the user?

    -The main advantage of Midjourney is its ability to produce high-quality artistic images, despite its more complex usage compared to Dall-E 2.

  • Which platform is mentioned as being free to use?

    -Stable Diffusion is mentioned as being free to use.

  • What is the user's opinion on the photorealism of Stable Diffusion?

    -The user believes that Stable Diffusion is better at creating photorealistic images than Midjourney but still considers it second to Dall-E 2.

  • What is the user's view on the complexity of using Stable Diffusion?

    -The user views Stable Diffusion as the most complex to set up, unless one opts for an online interface.

  • What are the features of Dall-E 2 that the user appreciates?

    -The user appreciates Dall-E 2's user-friendly interface and features like in-painting and out-painting, which allow for the addition of AI art into or outside of designated areas.

  • What is the user's recommendation for someone who needs to create more AI art?

    -The user recommends leaving a comment below to share thoughts and preferences, suggesting an engagement with the community for further insights.

Outlines

00:00

๐ŸŽจ AI Art Platform Comparison: Dolly 2, Mid-Journey, and Stable Diffusion

The video script discusses three prominent AI art platforms: Dolly 2, Mid-Journey, and Stable Diffusion. The narrator compares these platforms by inputting basic prompts to evaluate the styles and quality of the generated images. Dolly 2 is noted for its photorealistic results, although with some quirks like funny-looking teeth in one example. Mid-Journey is praised for its stunning and artistic outputs, while Stable Diffusion's results are considered decent but not the best among the three. The platforms are tested with various prompts, including images of a woman, an oil painting, a Shaolin monk, a sunny outdoor scene, a busy city street, a cyborg, a puppy, a 3D render of a turtle, and an ink sketch of a dragon. The narrator expresses a preference for Mid-Journey for its artistic style, although acknowledges the strengths of each platform in different contexts.

05:01

๐Ÿ“ท Photorealism and Artistic Composition in AI Art Platforms

The second paragraph focuses on the photorealism and artistic composition capabilities of the AI art platforms. Dolly 2 is recognized for producing highly realistic images, making it a winner in the photo realism category. Mid-Journey, while not accurately replicating a photo style in one instance, is favored for its artistic and well-composed imagery. Stable Diffusion is noted to be free and capable of creating photorealistic images, but it falls short in comparison to the other two platforms. The narrator also discusses the user interfaces of the platforms, with Dolly 2 having a more user-friendly interface and Mid-Journey being more complex but offering superior image quality. The video concludes with a prompt for viewers to share their preferences and thoughts on the platforms.

Mindmap

Keywords

๐Ÿ’กAI art platforms

AI art platforms refer to digital services or software that utilize artificial intelligence to generate artwork based on user prompts. In the video, three such platforms are compared: Dall E 2, Midjourney, and Stable Diffusion. These platforms are central to the video's theme as they are the subjects being evaluated for their performance in creating various styles of art.

๐Ÿ’กPhotorealistic

Photorealistic refers to the quality of an image or artwork that closely resembles a photograph in terms of detail and realism. In the context of the video, Dall E 2 is noted for producing images that are almost photorealistic, indicating a high level of detail and lifelike appearance, especially in the prompt for a 'beautiful woman with blue eyes'.

๐Ÿ’กOil painting

An oil painting is a type of painting that uses oil paints, which are pigments suspended in a carrier medium and applied to a canvas or panel. The video discusses the platforms' ability to generate images in the style of an oil painting, particularly when creating an image of a 'Shaolin monk', showcasing the diversity of styles these AI platforms can emulate.

๐Ÿ’กStable Diffusion

Stable Diffusion is one of the AI art platforms mentioned in the video. It is noted for producing standard-looking oil paintings and photorealistic images, although it is sometimes considered less impressive compared to the other platforms. The term is significant as it represents one of the technologies being evaluated for its artistic output.

๐Ÿ’กMidjourney

Midjourney is another AI art platform featured in the video. It is praised for creating images with a more artistic and striking style, such as the 'sunny outdoor scene' and 'cyborg with glowing eyes'. The term is significant as it represents a platform that often excels in producing artistically composed images.

๐Ÿ’กDall E 2

Dall E 2 is an AI art platform that is highlighted for its photorealistic capabilities. The term is used throughout the video to refer to the platform that generated images which closely resemble real photographs, making it a key player in the comparison of the platforms' abilities.

๐Ÿ’กArtistic composition

Artistic composition refers to the arrangement of visual elements within an artwork to create a coherent and aesthetically pleasing image. The video discusses the artistic composition of the images produced by the platforms, with Midjourney being favored for its artistic and well-composed images, particularly in the 'busy city street' and 'cute puppy wearing sunglasses and headphones' prompts.

๐Ÿ’ก3D render

A 3D render is a two-dimensional representation of a three-dimensional object or scene, created using computer graphics. In the video, a '3D render of a turtle' is used as a prompt to demonstrate the platforms' ability to generate 3D-like images, with Midjourney producing a more impressive 3D effect compared to Dall E 2 and Stable Diffusion.

๐Ÿ’กInk sketch

An ink sketch is a drawing made using ink, often characterized by bold lines and minimal color. The video includes an 'ink sketch of a dragon' to test the platforms' ability to mimic the style of traditional ink drawings. The term is important as it represents a specific artistic style that the AI platforms are challenged to reproduce.

๐Ÿ’กBusinessman photograph

A businessman photograph refers to a professional image of a businessman, often used in corporate or commercial contexts. The video uses this prompt to evaluate the platforms' ability to generate photorealistic images, with Dall E 2 being noted for its success in creating a realistic photograph, despite some issues with the facial elements.

๐Ÿ’กUser interface

A user interface (UI) is the point of interaction between a user and a digital device or software, including the design and layout of the controls and displays. The video mentions the user interfaces of the AI art platforms, noting that Dall E 2 has a more user-friendly interface with features like in-painting and out-painting, which affects the ease of use and overall experience for the user.

Highlights

Dall-E 2, Midjourney, and Stable Diffusion are three main AI art platforms used for generating impressive art.

Dall-E 2 creates a photorealistic image of a woman with blue eyes, though the teeth appear slightly odd.

Midjourney's image of a woman is stunning and artistic, but not as photorealistic as Dall-E 2's.

Stable Diffusion's result is decent but not as strong as the other two platforms in the initial comparison.

Vision prep is noted as the best-looking image, while Dall-E 2 has the most realistic photorealistic image.

An oil painting of a Shaolin monk by Dall-E 2 looks like a standard oil painting.

Midjourney's Shaolin monk image is described as next-level, sharp, and exciting.

Stable Diffusion's oil painting maintains a standard look but still appears cool.

Midjourney is favored for its cooler style and appeal to the presenter's taste.

Dall-E 2's outdoor scene looks very much like a photo, although not the most appealing.

Midjourney's outdoor scene is artistic and more like a painting style.

Stable Diffusion opts for a more photo-like look for the outdoor scene.

Dall-E 2's depiction of a busy city street combines a painted and photographic look.

Midjourney's city street image is striking, artistic, and has nice use of color.

Stable Diffusion's city street maintains a photorealistic style, differing from Midjourney's artistic approach.

Dall-E 2's cyborg image is simple and basic, not fully meeting the presenter's expectations but still cool.

Midjourney's cyborg image is impressive with a video game style appearance.

Stable Diffusion's cyborg has glowing eyes that don't fully meet the brief but still looks cool.

A cute puppy wearing sunglasses and headphones by Dall-E 2 has a photographic look but a boring background.

Midjourney's puppy image is more artistic, with more depth and impressiveness.

Stable Diffusion's puppy image is photorealistic and on par with Dall-E 2's, but with a more interesting background.

Dall-E 2's 3D render of a turtle is plain but maintains a 3D look.

Midjourney's 3D render is superior to Dall-E 2's, with a more impressive scene.

Stable Diffusion's 3D render is better than Dall-E 2's but has a plain background.

Dall-E 2's ink sketch of a dragon is rough but cool for the style.

Midjourney's ink sketch is described as next level, with more detail and a fun style.

Stable Diffusion's ink sketch is neater and more cohesive than Dall-E 2's.

Dall-E 2 is noted for its superior interface with features like in-painting and out-painting.

Midjourney is more complex to use but produces better imagery.

Stable Diffusion is free but can be complex to set up without using an online interface.

Dall-E 2 is favored for photorealism, Midjourney for artistic composition, and Stable Diffusion for a balance between the two.