画像生成AI『Stable Diffusion』で表情だけを変える方法。呪文集一覧やコツもご紹介

AI FREAK - 最新のAIツールをご紹介
23 May 202308:13

TLDRThe video script introduces techniques and tips for altering facial expressions using Stable Diffusion, a cutting-edge AI tool. It guides viewers through the process of generating images with desired expressions by adjusting prompts and utilizing the Reference Only feature for maintaining the original image's atmosphere. The demonstration showcases the impact of different prompts and intensity levels on the resulting expressions, highlighting the importance of fine-tuning both for achieving the most accurate representation of emotions. The video serves as a practical guide for those interested in exploring the capabilities of AI in creating expressive images.

Takeaways

  • 🎨 The video introduces methods and tips for changing facial expressions using Stable Diffusion.
  • 🖼️ Start by preparing a base image, in this case, a portrait of a woman, and use the Reference Only feature for validation.
  • 🛠️ Reference Only is a convenient Control Net function that allows for inheriting the atmosphere of the base image while making changes.
  • 😄 Add the word 'smile' to the prompt and generate the image to see the woman smiling while maintaining her original facial atmosphere.
  • 🔢 Emphasize the smile by adding a colon and a number after the word 'smile' to control the intensity of the expression.
  • 🔄 Experiment with different numbers (e.g., 1.5, 2.0) to find the right balance for the desired expression.
  • 👀 Compare the original and modified images to ensure the same facial atmosphere is maintained with the new expression.
  • 😌 Try other expressions like 'smirk' and 'happy wolf filter' to see how the AI adjusts the expressions accordingly.
  • 😞 For sad expressions, increase the intensity value to capture the emotion more effectively, even with subtle prompts.
  • 🤬 When using prompts like 'angry', adjust the intensity value and possibly the prompt itself to accurately represent the emotion.
  • 📝 Provide a list of expression-changing prompts with a 1.4 emphasis on each, leaving room for adjustments based on the user's needs.
  • 🌐 The video script is a guide and should be adapted based on the specific model and main prompts used, as the impact can vary.

Q & A

  • What is the main topic of the video?

    -The main topic of the video is about changing facial expressions using Stable Diffusion and the techniques involved.

  • How does the video demonstrate altering facial expressions?

    -The video demonstrates altering facial expressions by adjusting prompts within the Stable Diffusion tool, using a base image of a woman and applying the Reference Only feature.

  • What is the purpose of the Reference Only feature?

    -The Reference Only feature allows users to maintain the original atmosphere or vibe of the base image while making changes, providing a convenient function within ControlNet.

  • How can users adjust the intensity of the facial expressions?

    -Users can adjust the intensity of the facial expressions by adding a colon and a number after the prompt word, such as 'Smile:2', to emphasize the expression more.

  • What happens when the emphasis value is set too high?

    -When the emphasis value is set too high, the generated image may become grotesque or overly exaggerated, such as an excessively wide smile.

  • How can users fine-tune the facial expressions?

    -Users can fine-tune the facial expressions by adjusting the numerical value after the prompt word and experimenting with different prompts to achieve the desired result.

  • What is the significance of the XYZ plot mentioned in the video?

    -The XYZ plot is a feature that allows users to view a list of different expressions generated using various prompts, helping them understand the impact of single words on the final image.

  • What are some of the emotions explored in the video?

    -The video explores emotions such as happiness (smiling), sadness, and anger, demonstrating how different prompts and emphasis values can generate images with these expressions.

  • How does the video address prompts that don't yield the expected results?

    -The video suggests adjusting the numerical value or modifying the prompt itself to better convey the intended emotion, such as changing 'Upset' to 'Very Upset' for a more pronounced effect.

  • What is the advice given for users trying to generate specific facial expressions?

    -The advice given is to use the XYZ plot as a reference, adjust the emphasis values, and experiment with different prompts to find the right combination that generates the desired expression.

  • How does the video conclude regarding the use of Stable Diffusion for facial expression changes?

    -The video concludes that by skillfully adjusting prompts and values, users can potentially generate facial expressions that meet their expectations, but it emphasizes that results may vary depending on the model and main prompts used.

Outlines

00:00

🎨 Customizing Expressions with Stable Diffusion

This paragraph introduces the method and tips for changing expressions using Stable Diffusion. It starts by showcasing how adjusting prompts can add various expressions to an image. The video then proceeds to demonstrate the process using a base image of a woman, utilizing the 'Reference Only' feature to maintain the original atmosphere while making changes. The audience is guided through the steps of adding prompts like 'smile' and adjusting the intensity with numbers. The results show how different expressions can be achieved by tweaking the prompts and intensity levels. The video also addresses how to handle prompts that don't yield the desired result and suggests adjusting the numbers or the prompt itself for better outcomes.

05:01

😠 Exploring Angry Expressions and Fine-Tuning

This paragraph delves into creating angry expressions using the 'Angry' prompt at different intensity levels. The video demonstrates that setting 'Angry' to 1.5 results in a non-distorted image that conveys anger effectively. It also explores other expressions with subtle differences and discusses the limitations of certain prompts, such as 'Upset', which doesn't fully convey the intended emotion at 1.5. The video suggests increasing the intensity or adding emphasis to the prompt, like 'Very Upset', to achieve a more accurate representation of the desired emotion. The paragraph concludes with a list of expression-changing prompts, encouraging viewers to use them as a reference and adjust the intensity and prompts to create the ideal expression, while noting that results may vary depending on the model and main prompt used.

Mindmap

Keywords

💡Stable Diffusion

Stable Diffusion is a type of artificial intelligence (AI) model used for generating images. In the context of the video, it is used to demonstrate how various expressions can be applied to a base image to achieve different visual outcomes. The video showcases how prompts and adjustments within the Stable Diffusion model can result in a range of facial expressions, from smiles to more complex emotions.

💡Expression

In the video, 'expression' refers to the facial emotions or moods that are conveyed through the generated images. The main theme revolves around altering these expressions using AI tools, such as Stable Diffusion, to create a variety of looks and feelings. The video provides techniques for changing expressions to achieve desired results, like a smile or a sad look.

💡Prompt

A 'prompt' in the context of the video is a command or a word that guides the AI model to generate a specific output. It is used to instruct the Stable Diffusion model on what kind of expression or characteristic to apply to the base image. The video demonstrates how different prompts can lead to different expressions, and how adjusting the intensity of the prompts can refine the output.

💡Reference Only

Reference Only is a feature mentioned in the video that allows users to maintain the overall atmosphere or vibe of the base image while making changes. This function in the AI model is convenient for keeping the essence of the original image intact while still allowing for modifications, such as altering facial expressions.

💡XYZ Plot

The XYZ Plot is a tool or feature used in the video to display a range of expressions or prompts in a list format. It helps users visualize the different outcomes that can be achieved by adjusting the prompts and settings in the AI model. The script uses the XYZ Plot to compare and contrast the effects of various prompts on the generated images.

💡Emphasis

In the context of the video, 'emphasis' refers to the intensity or degree to which a particular prompt or characteristic is applied to the image. Adjusting the emphasis can alter the final look of the generated expression, whether it's making a smile more pronounced or modifying the intensity of a sad expression.

💡Angry

The term 'angry' is used in the video to describe one of the expressions that can be generated using the AI model. It is one of the emotional states that the video aims to demonstrate can be achieved through the careful use of prompts and adjustments in Stable Diffusion.

💡Sad

The word 'sad' represents a specific emotional expression that the video focuses on creating with the AI model. The process involves adjusting the prompts and emphasis levels to capture the desired degree of sadness in the generated image, illustrating the model's capability to depict a range of human emotions.

💡Upset

In the video, 'upset' is another emotional state that is discussed as something that can be conveyed through the AI-generated images. It is used to demonstrate the versatility of the AI in capturing subtle differences in human emotions, even beyond the primary focus on happiness or sadness.

💡Control Net

The Control Net is a feature of the AI model that allows for fine-tuning of the generated images by adjusting various parameters. It is an essential tool for achieving the desired expressions and ensuring that the generated images align with the user's intentions, as demonstrated in the video.

💡Adjustment

Adjustment refers to the process of fine-tuning the AI-generated images by changing the values or prompts associated with certain expressions. In the video, adjustments are made to the emphasis levels and prompts to achieve the desired look, such as a more natural smile or a more intense sad expression.

Highlights

The video introduces methods and tips on how to change facial expressions using Stable Diffusion.

An image of a woman is used as a base to demonstrate the process.

The Reference Only feature is used in conjunction to maintain the original atmosphere of the woman's face while making adjustments.

Adding the word 'smile' to the prompt generates a smiling image while keeping the woman's facial atmosphere.

By adding a colon and a number after 'smile', the emphasis on the expression can be adjusted.

Setting the number to 2 for 'smile' results in a more pronounced smile, but also creates a slightly grotesque image.

Adjusting the number to 1.5 achieves a more natural and appropriate smile.

Different words can be used to generate a variety of expressions, as shown in the XYZ Plot feature.

Even with the same smile expression, varying the single word can result in significant differences.

The video also demonstrates how to adjust expressions for sadness using different prompts and emphasis levels.

An expression with a 1.7 emphasis still feels restrained, but increasing it to 2.0 generates a more intense sad expression.

The video shows that not all sad expressions require a 2.0 emphasis, and adjustments should be based on the specific prompt.

Anger expressions are explored by setting 'Angry' to 1.5, resulting in a non-chaotic and acceptable image.

The video highlights that even subtle changes in prompts and emphasis levels can recreate different expressions accurately.

The word 'Upset' is used to demonstrate how increasing the emphasis to 1.8 can convey a sense of agitation.

Using the phrase 'Very Upset' instead of just 'Upset' with a 1.5 emphasis generates a more dramatic expression.

The video concludes with a list of expression-changing prompts, each with an emphasis of 1.4.

The video suggests that adjusting both the prompt and the emphasis level is key to generating the desired expression.

The video notes that the effectiveness of these methods can vary depending on the model and main prompt used.

The video provides a practical guide for users interested in experimenting with Stable Diffusion for facial expression generation.