Stable Diffusion 3 HANDS ON! How Good Is It Really?
TLDRStability AI has recently launched Stable Diffusion 3 and its Turbo version, accessible only via API through a partnership with Fireworks AI. The company plans to make model weights available for self-hosting with a Stability AI membership soon. Despite the high API pricing, the beta version of Stable Diffusion 3 was successfully implemented on Pixel Doo within three hours, allowing users to generate images with prompts and optional negative prompts. The quality of the generated images is generally consistent with those displayed on the company's website, with prompt adherence being notably good. However, text generation within images remains a challenge. The Turbo model is faster but produces lower resolution images. The video concludes that Stable Diffusion 3 mostly lives up to its hype, with most images generated closely resembling the examples provided by the company.
Takeaways
- 🚀 Stable Diffusion 3 and Stable Diffusion 3 Turbo have been released by Stability AI, but are only available via API.
- 🤝 Stability AI has partnered with Fireworks AI, an API platform that provides hosting and fast access to models like Stable Diffusion.
- 📚 They plan to make the model weights available for self-hosting with a Stability AI membership in the near future.
- ⏱️ The reviewer had Stable Diffusion 3 beta up and running on Pixel Doo within 3 hours.
- 💰 The pricing for the API is relatively high, with credits costing about $10 per thousand and image generation costs varying between the models.
- 🔍 The reviewer tested the model's image generation without cherry-picking, using prompts from press releases to assess the quality.
- 📸 The generated images were generally in line with the prompts and similar to those displayed on the Stability AI website.
- 📝 Text coherence in images generated by Stable Diffusion 3 was a challenge, with some text appearing mashed up or incorrect.
- 🔧 The reviewer noted that the Turbo model was quicker but resulted in lower quality images compared to the standard model.
- 🎨 The adherence to complex prompts, including those with text, was generally good, although not perfect.
- 📈 Stable Diffusion 3 seems to live up to the hype, with most images generated closely matching the examples on the website.
- 💡 Negative prompts were not used in the tests, but they could be an option for users to experiment with for better results.
Q & A
What is the main difference between Stable Diffusion 3 and Stable Diffusion 3 Turbo?
-Stable Diffusion 3 and Stable Diffusion 3 Turbo are both available via API, but the Turbo version is quicker to return results, although the quality might be lower compared to the standard model.
How can one access Stable Diffusion 3 and Stable Diffusion 3 Turbo?
-They are accessible via an API provided by Fireworks AI, an API platform that offers hosting and fast access to these models.
What is the pricing structure for the API that hosts Stable Diffusion 3 models?
-The API operates on a credit system where users need to purchase credits, with each image generated costing 6 to 12 credits, making it about 32 times more expensive than generating an image with Stable Diffusion XL 1.0.
What is the cost per thousand credits for using the Stable Diffusion 3 API?
-The cost is approximately $10 per thousand credits.
What is the commitment Stability AI has towards open generative AI?
-Stability AI has committed to making the model weights available for self-hosting to those with a Stability AI membership in the near future.
How long did it take to set up Stable Diffusion 3 beta on Pixel Doo after its release?
-It took about 3 hours to set up Stable Diffusion 3 beta on Pixel Doo.
What are the options available to users when generating an image with Stable Diffusion 3?
-Users can input a prompt, optionally provide a negative prompt, and choose between Stable Diffusion 3 and Stable Diffusion 3 Turbo.
How does the quality of images generated by Stable Diffusion 3 compare to those displayed on the Stability AI website?
-The quality of images generated by Stable Diffusion 3 is quite good and does not appear to be significantly cherry-picked compared to the images on the Stability AI website.
What challenges do most AI generators face when generating images with text?
-AI generators often struggle with text coherence, ensuring that the text in the generated image is legible and accurately reflects the input prompt.
What is the monthly cost for a Pro Plan on Pixel Doo?
-The Pro Plan on Pixel Doo starts at $9.95 a month, which includes unlimited image generations.
What additional features are available to Pro Plan members on Pixel Doo?
-Pro Plan members have access to a creative upscaler and all the other Stable Diffusion models that are integrated into Pixel Doo.
How does the user feel about the prompt adherence of Stable Diffusion 3 compared to previous versions?
-The user feels that the prompt adherence for Stable Diffusion 3 is significantly better compared to previous versions, to the point where negative prompts might not be as necessary.
Outlines
🚀 Stable Diffusion 3 and Turbo Release via API
Stability AI has released Stable Diffusion 3 and its Turbo version, but with a catch—they are only available through an API. The company has partnered with Fireworks AI, an API platform that offers hosting and quick access to AI models. Despite the high API pricing of about $10 per thousand credits, with Stable Diffusion 3 costing 6 to 12 credits per image generated, the presenter managed to set up Stable Diffusion 3 beta on Pixel Dojo within 3 hours. This allows users to generate images by providing a prompt and optionally a negative prompt, choosing between the two versions of the model, and viewing examples. The presenter also discusses the cost of the Pro Plan for unlimited usage and shares initial generated images to demonstrate the model's adherence to prompts and quality, comparing them to those displayed on Stability AI's website.
🎨 Testing Image Generation with Stable Diffusion 3
The video script continues with a detailed examination of the image generation process using Stable Diffusion 3 and its Turbo model. The presenter tests various prompts to assess the model's adherence to the given instructions and its ability to handle text within images—a challenge for many AI generators. Examples include generating images of an anthropomorphic tortoise on a subway, a man with a retro TV for a head in a desert, and a cardboard box with text on it. The presenter notes that while the standard model generally performs well, the Turbo model, which is faster, produces lower quality and more cartoonish results. The script also includes tests with more complex prompts, such as a kangaroo wearing ski goggles and holding a beer, and an entire universe inside a bottle at Walmart. The presenter concludes that Stable Diffusion 3 mostly lives up to its hype, producing images that are similar to those on the Stability AI website without excessive cherry-picking. The summary ends with an invitation for viewers to try the model on Pixel Dojo with a Pro membership and to share their thoughts on the new models.
Mindmap
Keywords
💡Stable Diffusion 3
💡API
💡Fireworks AI
💡Model Weights
💡Pixel Doo
💡Prompt
💡Negative Prompt
💡Credits
💡Pro Plan
💡Text Coherence
💡Cherry-Picking
Highlights
Stability AI has released Stable Diffusion 3 and Stable Diffusion 3 Turbo, available only via API.
Stable Diffusion 3 has partnered with Fireworks AI for hosting and fast access.
Model weights for self-hosting will be available for Stability AI members in the near future.
Stable Diffusion 3 Beta was set up on Pixel Doo within 3 hours.
Users can generate images with a prompt, optionally a negative prompt, and choose between Stable Diffusion 3 and Turbo.
Pricing for the API is high, at about $10 per thousand credits.
Stable Diffusion 3 is 32 times more expensive to generate an image than Stable Diffusion XL 1.0.
A Pro Plan starts at $9.95 a month for unlimited usage of Pixel Dojo.
The quality of images generated by the model is comparable to those displayed on the website, suggesting no cherry-picking.
The model's prompt adherence is strong, with generated images closely following the input prompts.
Text coherence in generated images is generally good, although some issues were noted.
Stable Diffusion 3 Turbo model is quicker but produces lower quality images compared to the standard model.
The Turbo model struggled slightly with complex text in images but still provided reasonable results.
The standard model demonstrated better adherence to complex prompts with detailed elements.
Stable Diffusion 3 seems to live up to the hype, with most generated images being of high quality.
Negative prompts were not used in the tests, but could be an area for further exploration.
Pixel Doo offers a Pro membership for $9.95 a month, which includes unlimited generations and access to other Stable Diffusion models.
More features and models will be added to Pixel Doo as time progresses.
Casual Browsing
Stable Diffusion 3 is out! How to start using it!
2024-06-13 15:10:00
Stable Diffusion 3 Announced! How can you get it?
2024-06-13 12:55:01
A.I. wrote me an Oreo Cake Recipe ... is it any good?! GPT-3
2024-05-04 14:25:00
Stable Diffusion 3 - How to use it today! Easy Guide for ComfyUI
2024-06-13 12:00:01
Stable Diffusion 3 IS FINALLY HERE!
2024-06-13 11:30:00