3D Optimism | Midjourney Office Hours Recap April 3rd 2024 | Midjourney News

Future Tech Pilot
3 Apr 2024 | 03:42

TLDR: The Midjourney office hours recap from April 3rd, 2024, provides updates on the company's progress and upcoming features. There are no major announcements, but the team is working on website enhancements, including new social features to be stress-tested with guides and mods. Personalization is a focus, though it is progressing slowly because the team works across multiple time zones. An algorithm is being developed to improve hands and bodies, as well as text accuracy, which should make bad images less frequent. A speed update is also anticipated, potentially making processes 25-50% faster and cheaper. A caption party is planned to teach the version 7 model about the connection between images and language, with potential rewards for participants. The team is considering a new class of trusted users for rating and captioning. Video features are still under discussion, with optimism for a version 7 model. The feedback leaderboard on the Midjourney website will see more ideas added periodically, and the company is not planning to expand on any not-safe-for-work (NSFW) features. The recap also mentions the possibility of multiple consistent characters in future versions and provides a prompt for creating a serene double exposure image.

Takeaways

  • **Medium Website Recommendation**: Creatives are encouraged to check out Medium for customizable prompts that can save time at work.
  • **Vacation Impact**: Recent vacations have slowed progress, with no major announcements to report.
  • **Website and Social Features**: The team is working on the website, including new social features, with initial testing involving a limited number of spaces.
  • **Personalization Efforts**: Personalization is a work in progress, facing challenges due to multiple time zones and a slower pace than desired.
  • **Style Random Feature**: The 'style random' feature is set to return, although specifics on how it will be implemented are unclear.
  • **Algorithm Improvements**: An algorithm is being developed to enhance hand and body depictions, as well as text accuracy, although it has been finicky.
  • **Image Quality Enhancements**: Efforts are being made to improve image quality, particularly regarding small pixel artifacts.
  • **Performance Update**: A potential speed update could make processes 25-50% faster and cheaper, but it's on hold until other updates are completed.
  • **Caption Party**: An upcoming event aims to teach the version 7 model about the connection between images and language, with possible future rewards.
  • **New User Class**: A new class of trusted users may be introduced for rating and captioning, potentially leading to larger rewards.
  • **Video and 3D Model Development**: A version 6 video model may not materialize, but confidence is high for a robust version 7 model. For 3D, the focus is on quality over exportability.
  • **Feedback Leaderboard**: The company plans to add more ideas to the feedback leaderboard and may incorporate demographic data to understand feature requests better.
  • **Content Policy**: There are no plans to expand on not-safe-for-work (NSFW) features, and user manipulation of images with the Midjourney model is not yet supported.
  • **Character Consistency**: Multiple consistent characters in generation are not available in version 6 but may be possible in version 7.
  • **Art Prompt Example**: An example of a successful art prompt is provided, showcasing how to create a serene double exposure image.

Q & A

  • What is the main topic discussed in the Midjourney Office Hours recap from April 3rd, 2024?

    - The main topic discussed is the progress and updates on the Midjourney platform, including new social features, personalization efforts, and improvements in image quality and text accuracy.

  • What is the recommended website for creative professionals to check out?

    - The recommended website for creative professionals is 'Medium', which offers customizable prompts that can save time at work.

  • What new feature is being tested with guides and mods?

    - The new social features on the Midjourney website are being tested with guides and mods.

  • How many social spaces are expected to be available at the start of the new feature rollout?

    - At the start, there will be a small number of social spaces, each with many people, because the team wants to stress test the system.

  • What is the current status of the personalization feature?

    - Personalization is being worked on, but it is moving slower than desired due to the team working across multiple time zones.

  • When is the 'style random' feature expected to show up again?

    - The 'style random' feature will show up again, but the exact timing is not specified; it is expected to come from dial tuning.

  • What is the team working on to improve the quality of hands and bodies in generated images?

    - The team is working on an algorithm to help with hands and bodies, as well as text accuracy, which has been finicky but is expected to reduce the occurrence of bad images.

  • What is the potential speed update that might be implemented?

    - There might be a small speed update that could make things 25-50% faster and cheaper.

  • What is the goal of the upcoming 'Caption Party'?

    - The goal of the 'Caption Party' is to help teach the version 7 model the connection between images and language.

  • What new class of users is briefly mentioned in the recap?

    - The recap mentions a new class of users who would be trusted with rating and captioning. These users might have to qualify first, which could allow for larger rewards.

  • What is the current stance on the development of a video feature?

    - David is not satisfied with the current video results, so a version 6 video model is unlikely. However, there is confidence in a version 7 model.

  • What is the focus of the 3D model development?

    - The focus is on producing high-quality 3D models rather than exportable ones, although plans are not set in stone.

Outlines

00:00

Midjourney Office Hours Recap

This section summarizes the Midjourney office hours held on April 3rd. It opens with a suggestion for creative professionals to check out Medium, a website offering customizable prompts. The speaker notes slower-than-usual progress due to vacations and highlights the main focus areas: the development of the website with new social features, testing with guides and mods, and the creation of social spaces. Personalization is also being worked on, albeit at a slower pace. The return of 'style random' is teased, and an algorithm to improve hands, bodies, and text accuracy is under development. There is mention of potential improvements in image quality and processing speed. A caption party is planned to help teach the version 7 model about the connection between images and language. The idea of a new class of users for rating and captioning is introduced. The speaker also discusses the feedback leaderboard, the potential for adding more ideas, and the focus on high-quality 3D models. Lastly, the speaker mentions the possibility of including demographics in the feedback, and notes that multiple consistent characters are absent from version 6 but may be included in version 7.

Keywords

creative

In the context of the video, 'creative' refers to individuals who are involved in creative work, such as artists, designers, writers, etc. The video suggests that these individuals might find the website Medium useful due to its customizable prompts that can save time in their work. It is a key term as it sets the target audience for the website's utility.

social features

The term 'social features' in the video script refers to the new functionalities being developed for the website that allow for social interaction. This is a core part of the update, as it indicates a shift towards a more community-oriented platform. The script mentions testing these features with guides and mods, suggesting a focus on user engagement and community building.

personalization

Personalization in the video script denotes the customization of user experiences on the website. It is a key concept as it is something the developers are working hard to improve, aiming to tailor the website's content and interactions to individual users' preferences. However, the process is moving slower than desired, indicating it as a complex and ongoing challenge.

style random

The phrase 'style random' is mentioned in the context of a feature that will reappear, possibly as a result of dial tuning. It suggests a randomization element in the style of the creative outputs generated by the website, which could add an element of surprise or novelty to the user experience. However, the specifics of how it will work are not detailed in the script.

algorithm

An 'algorithm' in the script refers to a set of rules or procedures for solving problems or performing tasks. The developers are working on an algorithm to improve the accuracy of hands and bodies in generated images, as well as text accuracy. This is significant as it indicates a technical focus on enhancing the quality and realism of the outputs, directly impacting the user's creative process.

image quality

The term 'image quality' is used to describe the ongoing efforts to improve the visual output of the website. The script mentions addressing small pixel artifacts to make the images better, which is crucial for users who rely on high-quality visuals for their creative work. It is a key aspect of the video's theme as it relates to the technical advancements being made.

speed update

A 'speed update' refers to improvements in the processing speed of the website's services. The script suggests that there might be an update that makes things 25-50% faster and cheaper, which is important for users who value efficiency and cost-effectiveness in their creative tools.

caption party

The 'caption party' is an upcoming event mentioned in the video script with the goal of teaching the version 7 model the connection between images and language. It represents a novel approach to improving the AI's understanding and could potentially lead to official activities where users can earn rewards, thus adding a gamified element to the user experience.

new class of users

The phrase 'new class of users' is introduced as a concept where certain users might be trusted with rating and captioning. This suggests a potential shift in the community dynamics, where a select group of users could have additional responsibilities and possibly rewards, thus adding a layer of user engagement and contribution to the platform.

3D model

A '3D model' in the context of the video refers to the development of a three-dimensional representation or simulation. The script mentions optimism about having a really good 3D model thanks to progress in hardware capture, indicating a significant advancement in the technology that could lead to more realistic and immersive creative outputs.

feedback leaderboard

The 'feedback leaderboard' is a feature on the Midjourney website where ideas are added and rated by the community. It serves as a tool for gauging user interest and feedback, allowing the developers to prioritize features and updates based on community input. It is an important concept as it demonstrates the company's commitment to user-centric development.

Highlights

Medium is recommended for creatives as a website selling customizable prompts that can save time at work.

There were no exciting announcements due to slower progress while people were on vacation.

The main focus is on the website, including new social features, with testing to be done with guides and mods.

Initially, there will be a limited number of social spaces to stress test the system.

Users will eventually be able to create both public and private spaces.

Personalization is being worked on, albeit at a slower pace than desired.

Style random is expected to return, possibly from dial tuning, but without user access to the tuning part.

An algorithm to improve hands and bodies in images, as well as text accuracy, is being developed.

Bad images will still occur, but less frequently with the new algorithm.

Enhancements are being made to image quality, addressing small pixel artifacts.

A potential speed update could make processes 25-50% faster and cheaper.

A caption party is planned to teach the version 7 model the connection between images and language.

There's a possibility of implementing the caption party as an official activity with rewards in the future.

A new class of users might be introduced, trusted with rating and captioning, potentially leading to larger rewards.

David is not satisfied with the current video results and is leaning towards a version 7 video model instead of version 6.

There is optimism for a really good 3D model, thanks to progress in hardware capture.

The focus is on producing high-quality 3D models rather than just exportable ones.

Feedback Leaderboard on the Midjourney website will receive more ideas periodically for community rating.

Multiple consistent characters in a generation might be possible in version 7.

A serene double exposure image prompt is shared for artistic inspiration.

The speaker can be followed on Instagram and Twitter for more pictures and prompts.