This is an automated archive made by the Lemmit Bot.
The original was posted on /r/stablediffusion by /u/Enshitification on 2025-10-15 21:21:08+00:00.
Since faceCLIP was removed, I made a workflow with the next best thing (maybe better). Also, I’m tired of people messaging me to re-upload the faceCLIP models. They are unusable without the unreleased inference code anyway.
So what this does is use Hyper-Lora to create a fast SDXL lora from a few images of the body. It also does the face, but it tends to lack detail. Populate however many or few full body images of your subject on the left side. On the right side, input good quality face images of the subject. Enter an SDXL positive and negative prompt to create the initial image. Do not remove the “fcsks fxhks fhyks” from the beginning of the positive prompts. Hyper-Lora won’t work without it. Hyper-Lora is picky about which SDXL models it likes. RealVis v4.0 and Juggernaut v9 work well in my tests so far. That image is sent to InfiniteYou and the Flux model. Only stock Flux1.D makes accurate faces from what I’ve tested so far. If you want ոsfw, keep the Mystic v7 lora. You should keep it anyway because it seems to make InfiniteYou work better for some reason. The chin-fix lora is also recommended for obvious reasons. JoyCaption takes the SDXL image and makes a Flux-friendly prompt.
The output is only going to be as good as your input, so use high-quality images.
You might notice a lot of VRAM Debug nodes. This workflow will use nearly every byte of a 24GB card. If you have more, use the fp16 T5 instead of the fp8 for better results.
Are the settings in this workflow optimized? Probably not. I leave it to you to fiddle around with it. If you improve it, it would be nice if you would comment your improvements.
No, I will not walk you through installing Hyper-Lora and InfiniteYou.