3D Optimism | Midjourney Office Hours Recap April 3rd 2024 | Midjourney News

Future Tech Pilot
3 Apr 202403:42

TLDRIn the mid-April Journey office hours recap, it's mentioned that while there are no major announcements, progress on the website's new social features is ongoing, albeit slower due to vacations. The focus is on stress-testing social spaces, improving personalization, and refining text and image accuracy. A caption party is planned to enhance the model's understanding of the connection between images and language. There's optimism about a high-quality 3D model in the future, and the feedback leaderboard on the website encourages community involvement in shaping the platform.

Takeaways

  • ๐ŸŒŸ Medium, a website for selling customizable prompts, is recommended for employed creatives to save time.
  • ๐Ÿ“… Progress has been slower than usual due to vacations, but work continues on the website's new social features.
  • ๐Ÿ› ๏ธ Social spaces will start limited in number but aim to allow users to create public and private spaces eventually.
  • ๐ŸŽจ Personalization is being improved, albeit at a slower pace due to the involvement of multiple time zones.
  • ๐Ÿ”„ Style, random feature might return, although specifics are unclear and it may not include the tuning part.
  • ๐Ÿค– An algorithm is under development to enhance text accuracy for hands, bodies, and overall image quality.
  • ๐Ÿš€ A potential speed update of 25-50% is in the works, but it will be released after other updates are completed.
  • ๐Ÿฅณ A caption party is planned to help the version 7 model learn the connection between images and language, with possible rewards in the future.
  • ๐Ÿ† A new class of trusted users may be introduced for rating and captioning tasks.
  • ๐ŸŽฅ Video features are still in development, with no version 6 model expected but confidence in a version 7 model.
  • ๐Ÿž๏ธ 3D models are being focused on high quality rather than just exportable models, though plans may change.

Q & A

  • What is the primary recommendation for creatives mentioned in the recap?

    -The primary recommendation is for creatives to check out Medium, a website selling customizable prompts that can save time at work.

  • What is the current status of the social features on the website?

    -The social features are under development and will be tested with guides and mods. Initially, there will be a limited number of social spaces with many people to stress test the system.

  • How is the team addressing the issue of personalization?

    -The team is working hard on personalization, but it's progressing slower than desired due to having team members across multiple time zones.

  • What is the expected outcome of the algorithm being developed for hands, bodies, and text accuracy?

    -The algorithm is expected to improve the quality of images, making them more accurate and reducing the frequency of bad images, although it has been finicky during development.

  • Are there any updates planned to improve image quality and speed?

    -Yes, there are plans to improve image quality to reduce pixel artifacts and potentially increase speed by 25-50%, but the speed update will be released after other updates are completed.

  • What is the purpose of the upcoming caption party?

    -The caption party aims to help teach the version 7 model the connection between images and language, and if successful, it might become an official activity where participants can earn rewards in the future.

  • What is the potential new class of users mentioned in the recap?

    -The potential new class of users are those who will be trusted with rating and captioning, possibly needing to qualify for rewards, which could lead to larger rewards.

  • What is the current stance on video features in the development?

    -The development team is not super happy with the current state of video features, and it's uncertain if a version 6 model will include them. However, there is confidence in a version 7 model.

  • What is the focus of the 3D model development?

    -The focus is on producing high-quality 3D models rather than exportable ones, but plans are not set in stone and could change.

  • How can users contribute to the development through the feedback leaderboard?

    -Users can contribute by rating ideas added to the feedback leaderboard on the Mid Journey website, helping the team prioritize features.

  • What is the stance on adding demographics to the feedback system?

    -Adding demographics to the feedback system is a possibility in the future to better understand user preferences and feature requests.

Outlines

00:00

๐Ÿ“ฐ Mid-Journey Office Hours Recap

The paragraph begins with a brief introduction to the Mid-Journey Office Hours from April 3rd, highlighting the importance of Medium for creative professionals. It then moves on to discuss the lack of major announcements due to the slower progress caused by vacations. The main focus has been on developing the website with new social features, which will initially have a limited number of spaces for stress testing purposes. Personalization is also being worked on, albeit at a slower pace due to the challenges of coordinating across multiple time zones. David mentions that style and randomness will be reintroduced, and efforts are being made to improve the accuracy of hands, bodies, and text through an algorithm. Despite some issues, the team is optimistic about its potential. Efforts are also being made to enhance image quality and reduce pixel artifacts. A speed update is planned, but it will be released after other updates. The caption party is upcoming, aiming to improve the connection between images and language for the version 7 model. There are also plans for a new class of users who will be trusted with rating and captioning tasks. Lastly, David discusses the feedback leaderboard on the Mid-Journey website and the potential for future updates and moderation.

Mindmap

Keywords

๐Ÿ’กMedium

Medium is a platform where creative individuals can find and purchase customizable prompts to enhance their work efficiency. In the context of the video, it is recommended as a resource for those employed in creative fields. The mention of Medium illustrates the importance of utilizing available tools to streamline creative processes and stay updated with industry trends.

๐Ÿ’กSocial Features

Social features refer to the new interactive elements being integrated into the website that allows users to engage with each other. This is a significant part of the development update in the video, as it indicates a move towards fostering a community and enhancing user experience. The social features are being tested with guides and mods to ensure a smooth and positive user interaction.

๐Ÿ’กPersonalization

Personalization in the context of the video refers to the customization of user experiences based on individual preferences and behaviors. It is a key focus of the development team, aiming to make the platform more tailored and relevant to each user. The process is moving slower than desired due to the complexity of coordinating across multiple time zones, highlighting the challenges of global teams in software development.

๐Ÿ’กAlgorithm

An algorithm in this context is a set of rules or instructions for solving problems, particularly in the realm of computer programming and data processing. The video discusses an algorithm being developed to improve the accuracy of hands, bodies, and text in images. This reflects the ongoing efforts to enhance the quality and reliability of the platform's outputs based on user feedback.

๐Ÿ’กImage Quality

Image quality pertains to the clarity, sharpness, and overall visual appeal of the images produced by the platform. The video mentions that efforts are being made to improve image quality, specifically addressing small pixel artifacts. This indicates a commitment to providing higher visual standards for users and enhancing the overall product offering.

๐Ÿ’กSpeed Update

A speed update refers to improvements made to increase the efficiency and processing speed of the platform. The video suggests that there might be a small speed update that could make things 25-5% faster and cheaper. However, this update is contingent on completing other updates first, showing a strategic approach to releasing new features and improvements.

๐Ÿ’กCaption Party

A caption party is an event or initiative where users are engaged in the process of teaching the AI model the connection between images and language. The goal is to improve the AI's understanding and accuracy in generating relevant captions. In the video, it is mentioned as an upcoming event with the possibility of future rewards for participants, indicating a community-driven approach to refining the platform's capabilities.

๐Ÿ’กUser Trust

User trust refers to the confidence and reliability placed in certain users by the platform to perform tasks such as rating and captioning. The video briefly mentions a new class of trusted users, suggesting a system of earned privileges and responsibilities within the community. This concept underscores the platform's intention to involve its user base in content moderation and quality assurance.

๐Ÿ’ก3D Model

A 3D model refers to a digital representation of a three-dimensional object or character. In the video, the development team is optimistic about creating a high-quality 3D model, indicating advancements in hardware capture technology. This reflects a focus on enhancing the platform's capabilities to produce more realistic and detailed outputs, although the plans are subject to change.

๐Ÿ’กFeedback Leaderboard

The feedback leaderboard is a system where user-submitted ideas are ranked based on community ratings. This tool is used to gauge the popularity and demand for certain features and to guide development priorities. The video mentions plans to add more ideas to the leaderboard, demonstrating the platform's commitment to user engagement and incorporating community feedback into its development process.

๐Ÿ’กConsistent Characters

Consistent characters refer to the ability of the platform to generate images or content that maintain a uniform and recognizable identity across multiple generations. The video mentions that this feature might be possible in version 7, indicating ongoing efforts to improve the coherence and continuity of the platform's outputs, which is particularly important for storytelling and branding purposes.

Highlights

Medium as a resource for creatives, offering customizable prompts and time-saving potential.

The recap reveals a slower progress due to vacations and the main focus on website development, including new social features.

Initial social spaces will be limited in number but high in user engagement, aiming for a stress test of the system.

Personalization is being worked on, albeit at a slower pace due to multiple time zones and complexity.

Style, random feature is expected to return, possibly from dial tuning, without user access to the tuning part.

Efforts are being made to improve hands and bodies as well as text accuracy through a specialized algorithm.

Despite challenges, the team is optimistic about reducing the frequency of bad images with the help of user feedback.

Image quality improvements are in the works, specifically targeting small pixel artifacts.

A potential speed update is planned, aiming for 25-50% faster and cheaper performance.

The speed update release is contingent on completing other updates first.

An upcoming caption party aims to enhance the version 7 model's understanding of the connection between images and language.

There are plans for a new class of users who will be trusted with rating and captioning tasks, potentially linked to rewards.

Video features are being reconsidered, with version 7 model sounding more promising than version 6.

The focus for 3D models is on quality over exportability, with hardware capture advancements.

Feedback leaderboard on the Mid Journey website will receive regular updates and community ratings.

The idea of user manipulation of images with the Mid Journey model is not currently feasible due to moderation concerns.

Expansion into not safe for workplace features is unlikely, with a humorous mention of potential sway if public opinion shifts.

Demographic targeting for feedback features may be considered in the future.

Multiple consistent characters in a generation may be possible in version 7, not version 6.

A serene double exposure image prompt is showcased, demonstrating the creative potential of the Mid Journey website.