Refining Midjourney Images with Stable Diffusion: A Step-by-Step Guide

ByWei Mao January 3, 2024January 3, 2024

Midjourney V6 has ushered in a significant advancement in texture quality for generated photos. The four images below, crafted by Midjourney V6, demonstrate this leap in detail, especially when scrutinizing the faces of people and animals like tigers.

A meticulous examination, however, reveals flaws in at least two hands among these images. This issue isn’t unique to Midjourney; it’s a common hurdle for all AI art generators.

Interestingly, when characters dominate the visual space, such imperfections are less frequent. Rerolling often yields flawless images in these scenarios.

But, when characters are smaller elements within the picture, detailing issues, particularly in faces and hands, become more pronounced. Rerolling here often falls short in addressing these glitches.

Midjourney does offer an inpaint function, designed to rectify such flaws, but it’s not yet implemented in V6. And even when available, its effectiveness is limited due to the prompt-driven control which struggles with intricacies, like the precise posture of a hand or the curvature of fingers.

Consider the image below, also generated with Midjourney V6. Its smaller-scale character highlights the issue with detail loss.

Zooming in, the flaws in the face and hands are evident.

Without the inpaint tool in V6, the optimal solution lies in Stable Diffusion. It’s a better tool for achieving the desired effects.

Let’s tackle the most challenging part first: the left hand. In the image above, the hand appears clenched and tense, detracting from the overall elegance. To address this, I used DesignDoll, a 3D modeling software available for free at terawell.net, to create the desired hand pose.

After incorporating the newly posed hand into the original image via Photoshop, the result may appear artificial.

But, worry not, as Stable Diffusion will seamlessly blend it in. The key here is ensuring the silhouette and pose are accurate.

Next, I added a white tiger’s ear, found online, into the image for a more cohesive look.

With these preparations, I moved on to repainting the image using Stable Diffusion to restore the naturalness of the blemished areas.

I began at the img2img section, uploading the image to the inpaint interface. I selected the majicMIX realistic model, focusing first on the left hand using a paintbrush tool.

The critical parameters I adjusted were “Inpaint area” (set to “Only masked”) and “Denoising strength” (kept at a lower value).

For precise hand pose control, I employed ControlNet’s OpenPose.

To enhance the hand’s details, I used the Adetailer plugin.

The right hand and face followed, but these were simpler tasks not requiring ControlNet. For the face, I used the Adetailer plugin with a “face” starting model, adjusting the Denoising strength for a more refined look.

Upon completing the three parts of the inpaint, the following image emerged:

This image, at 768×768 pixels, lacked some detail. Therefore, I utilized ControlNet’s Tile model for enlargement, adjusting the magnification to the required level and keeping the repainting magnitude minimal.

The final result, showcased below, vividly demonstrates the transformative power of Stable Diffusion in enhancing and refining AI-generated imagery.

Stable Diffusion

Best Way to Use LoRA: A Detailed Guide on LoRA + ADetailer Face Swap

ByWei Mao March 27, 2024March 27, 2024

In my previous exploration (Real-Life LoRA Training), I embarked on a fascinating journey to train a LoRA model, closely resembling the Hollywood icon, Scarlett Johansson. By focusing on her headshots, I aimed to equip the AI with the prowess to intricately learn her facial features, enabling it to generate images that mirror her appearance remarkably….

Stable Diffusion

Enhancing Clothing Details with DeepFashion (ADetailer) in Stable Diffusion

ByWei Mao January 25, 2024January 25, 2024

In my previous article, I explored the fascinating world of ADetailer, a powerful extension for Stable Diffusion. 👉 Mastering ADetailer (After Detailer) in Stable Diffusion Primarily focused on refining facial features and hands, ADetailer encompasses 14 distinct models, each serving a unique function. While I have delved into most of these models, one, in particular,…

Stable Diffusion

The Ultimate Guide to Train Your Face with Text Inversion Training in Stable Diffusion

ByWei Mao February 28, 2024February 28, 2024

The allure of Stable Diffusion lies in its unparalleled capacity for customization. It doesn’t just generate images out of thin air; the true enchantment unfolds when we tailor it to conjure visuals that align precisely with our vision. Let’s delve into how we can fine-tune its outputs to resonate with our individual preferences. Among the…

Stable Diffusion

Midjourney vs DALL-E vs Stable Diffusion: Which One Nails the Human Pose?

ByWei Mao December 23, 2023December 23, 2023

The recent release of Midjourney V6 has sparked a wave of excitement, its hyper-realistic images almost making traditional photography seem obsolete. However, those who have dabbled in the realm of AI art creation are familiar with a persistent shortcoming that isn’t disappearing anytime soon. Today, let’s embark on an exploratory project, putting the leading AI…

Stable Diffusion

ComfyUI Clothing Swapping: IP-Adapter V2 + FaceDetailer (DeepFashion)

ByWei Mao May 12, 2024May 12, 2024

Today, we’re diving into the innovative IP-Adapter V2 and ComfyUI integration, focusing on effortlessly swapping outfits in portraits. This tutorial simplifies the entire process, requiring just two images: one for the outfit and one featuring a person. By utilizing ComfyUI’s node operations, not only is the outfit swapped, but any minor discrepancies are resolved with…

Stable Diffusion

Unlock Midjourney’s Artistic Magic Using Stable Diffusion

ByWei Mao July 3, 2024July 3, 2024

In this tutorial, we’ll delve into using various LoRAs (Low-Rank Adaptations) to bring the artistic flair of Midjourney to images generated by Stable Diffusion. By combining these LoRAs, you can achieve a variety of artistic effects. We’ll build workflows in ComfyUI to combine these LoRAs, but you can also implement them in A1111 or Fooocus….

Similar Posts

Leave a Reply Cancel reply