This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/miaoshouai on 2024-11-05 16:56:31+00:00.


I’m deeply grateful for the feedback from this community. After much effort, PromptGen v2.0 has officially launched! Here’s what to expect in this new version (if you want to know what PromptGen is, please read this post):

  • Enhanced image caption quality across all instructions
  • Better recognition of explicit content
  • Improved image composition abilities
  • A new “analyze” mode designed to complement mixed_caption

With the new analyze capability, PromptGen is able to understand more of the details and composition in an image.

Comparison with analyze on and analyze off

v2.0 has a better understanding of character positions in the image

Here are some comparisons of images generated from PromptGen v2.0 captions vs. Joy Caption Alpha 2 captions

With v2.0 you still get the same fast speed, and it is the perfect model for captioning images in batch.

So please give the new version a try. I’m looking forward to getting your feedback and working more on the model.

Huggingface Page:

Github Page for ComfyUI MiaoshouAI Tagger:

Flux workflow download: