this post was submitted on 31 Mar 2024

73 points (92.9% liked)

AI Generated Images

7286 readers

76 users here now

Community for AI image generation. Any models are allowed. Creativity is valuable! It is recommended to post the model used for reference, but not a rule.

No explicit violence, gore, or nudity.

This is not a NSFW community although exceptions are sometimes made. Any NSFW posts must be marked as NSFW and may be removed at any moderator's discretion. Any suggestive imagery may be removed at any time.

Refer to https://lemmynsfw.com/ for any NSFW imagery.

No misconduct: Harassment, Abuse or assault, Bullying, Illegal activity, Discrimination, Racism, Trolling, Bigotry.

AI Generated Videos are allowed under the same rules. Photosensitivity warning required for any flashing videos.

To embed images type:

“![](put image url in here)”

Follow all sh.itjust.works rules.

Community Challenge Past Entries

Related communities:

!auai@programming.dev
Useful general AI discussion
!aiphotography@lemmings.world
Photo-realistic AI images
!stable_diffusion_art@lemmy.dbzer0.com Stable Diffusion Art
!share_anime_art@lemmy.dbzer0.com Stable Diffusion Anime Art
!botart@lemmy.dbzer0.com AI art generated through bots
!degenerate@lemmynsfw.com
NSFW weird and surreal images
!aigen@lemmynsfw.com
NSFW AI generated porn

founded 2 years ago

MODERATORS

thelsim@sh.itjust.works

noodle@sh.itjust.works

theUnlikely@sopuli.xyz

M0oP0o@mander.xyz

Deceptichum@quokk.au

Heartbreaker - Looking for model suggestions! (sopuli.xyz)

submitted 10 months ago by theUnlikely@sopuli.xyz to c/imageai@sh.itjust.works

22 comments fedilink hide all child comments

I've been trying to transition from SD1.5 to SDXL for a while, but old prompting habits die hard.

Even more difficult was finding a model to produce the look I prefer. Most seem to love realism, 3D pixar style, anime, and super airbrushed.

If you have any tips for SDXL models, loras, prompting for this style, etc. please let me know!

parameters:

1girl, demon woman with bangs and ram horns and wings, holding bloody human heart, solo, sitting with knees up, (full body:0.6), looking at viewer, (dark fantasy theme:1.1) (glowing eyes:1.05), thick lines, flat shading

Negative prompt: low quality, deformed, embedding:negativeXL_D, embedding:unaestheticXL_Sky3.1,

Steps: 10, Sampler: dpmpp_3m_sde_simple, CFG scale: 1.0, Seed: 212036223314935, Size: 1024x1024, Model hash: 268a170aa6, Model: grogmixTURBO_v10

(and some inpainting/upscaling)

top 22 comments

sorted by: hot top controversial new old

[–] boblemmy@lemmy.world 7 points 10 months ago* (last edited 10 months ago)

1girl, demon woman with bangs and ram horns and wings, holding bloody human heart, solo, sitting with knees up, (full body:0.6), looking at viewer, (dark fantasy theme:1.1) (glowing eyes:1.05), thick lines, flat shading <lora:color_vector:1> <lora:The_Simplest:0.5> <lora:GNXL-Line Art:0.6> Negative prompt: low quality, deformed, bad anatomy, (extra feet:2), extra ears, extra tails, signature, artist name Steps: 10, Sampler: DPM++ 3M SDE, CFG scale: 1, Seed: 212036223314935, Size: 1024x1024, Model hash: 7f63ddc0d8, Model: zavychromaxl_v50, VAE hash: 235745af8d, VAE: fixFP16ErrorsSDXLLowerMemoryUse_v10.safetensors, Denoising strength: 0.3, Hypertile VAE: True, Hires upscale: 1.5, Hires upscaler: 4x-AnimeSharp, Lora hashes: "color_vector: 244e3944f033, The_Simplest: a482d04bf144, GNXL-Line Art: 13ce5f42806b", Postprocess upscale by: 2, Postprocess upscaler: 4x-AnimeSharp, Version: v1.7.0

1girl, demon woman with bangs and ram horns and wings, holding bloody human heart, solo, sitting with knees up, (full body:0.6), looking at viewer, (dark fantasy theme:1.1) (glowing eyes:1.05), thick lines, flat shading <lora:The_Simplest:0.5> Negative prompt: low quality, deformed, (bad anatomy:2), (extra limbs:2), (extra feet:2), tits, fire, extra ears, extra tails, signature, artist name Steps: 10, Sampler: DPM++ 3M SDE, CFG scale: 1, Seed: 212036223314935, Size: 1024x1024, Model hash: 7f63ddc0d8, Model: zavychromaxl_v50, VAE hash: 235745af8d, VAE: fixFP16ErrorsSDXLLowerMemoryUse_v10.safetensors, Denoising strength: 0.3, Hypertile VAE: True, Hires upscale: 1.5, Hires upscaler: 4x-AnimeSharp, Lora hashes: "The_Simplest: a482d04bf144", Postprocess upscale by: 2, Postprocess upscaler: 4x-AnimeSharp, Version: v1.7.0

[–] tal@lemmy.today 4 points 10 months ago (1 children)

I don't know if it's what you want, but: using the same prompt terms, and grabbing my favorite in a batch of 20:

1girl, demon woman with bangs and ram horns and wings, holding bloody human heart, solo, sitting with knees up, (full body:0.6), looking at viewer, (dark fantasy theme:1.1) (glowing eyes:1.05), thick lines, flat shading

Negative prompt: low quality, deformed, embedding:negativeXL_D, embedding:unaestheticXL_Sky3.1,

Steps: 20, Sampler: Euler a, CFG scale: 7, Seed: 13, Size: 1024x1024, Model hash: ebf42d1fae, Model: realmixXL_v15, Token merging ratio: 0.5, Version: v1.7.0-270-g04a005f0

I'm guessing that, relative to that, you're looking for something that looks more like a painting?

Looking at civitai, grogmixTURBO, which you're using, seems to typically produce more anime-looking images than you're generating.

[–] theUnlikely@sopuli.xyz 2 points 10 months ago (1 children)

That's actually a good example of the overly airbrushed look I was trying to describe. So many models have output like that.

As for GrogMixTURBO, I found the keywords "thick lines, flat shading" from one of the creator's replies to a comment on the model page, and these magically gave me what I wanted with that model.

[–] tal@lemmy.today 3 points 10 months ago (1 children)

Have you tried finding an artist who has similar work and trying "by "?

[–] theUnlikely@sopuli.xyz 3 points 10 months ago (1 children)

I did that with RealCartoon-XL, and soooometimes it gave good results for the style, but inconsistent. I guess I just got too used to SD1.5 models specializing in a certain aesthetic.

[–] tal@lemmy.today 2 points 10 months ago* (last edited 10 months ago) (1 children)

Gotcha. Hmm.

Have you tried browsing images under XL models on civitai for anything that is similar to the aesthetic you like, and then swiping their prompt terms?

If you're gonna use the base model, you can try Clip Interrogator on your image to find terms that will generate similar images, but that didn't do much for me. I don't think the base model is trained much on succubi.

[–] theUnlikely@sopuli.xyz 1 points 10 months ago

Definitely! That's how I was able to find the ones I listed in another comment. Without this, I'd be doomed 😆

[–] Deceptichum@sh.itjust.works 2 points 10 months ago (1 children)

If you can avoid the turbo/lightning models, they're not as good as regular XL.

Likewise with XL the less negative prompts the better, I usually operate on none or only 'text,watermark'

As for style, LORA or prompting an artists name is your best bet.

If you're after anime/cartoon, Ponydiffuser is the best. It operates on booru tags, so it can't imagine much outside of that range but the flipside is it can create basically anything within those confines. Pony also has an extremely wide range of artist/style LORA to recreate any sort of look you're after.

For realism you've got a few options of models, none are stand-out better than the other top rated ones.

[–] theUnlikely@sopuli.xyz 1 points 10 months ago (1 children)

I usually go with the full models, but for some reason GrogMix only has turbo and can nail the look I'm after.

I keep seeing positive sentiments about Pony everywhere I look, so I'll keep trying with that one. Right now, I'm having trouble with any character that isn't very close. Anything slight back from super closeup portrait loses a ton of detail, even with hires fix and multiple rounds of upscaling.

[–] Deceptichum@sh.itjust.works 2 points 10 months ago (1 children)

What front end are you using? I can try to whip up something and share a json if you’re using comfyui.

[–] theUnlikely@sopuli.xyz 1 points 10 months ago (1 children)

A comfyui json would be much appreciated!

[–] Deceptichum@sh.itjust.works 2 points 10 months ago (2 children)

One last thing, can you share an example of the sort of art style you're after? All I know is not realism, 3D pixar style, anime, and super airbrushed.

[–] theUnlikely@sopuli.xyz 1 points 10 months ago

This account on civitai has many many good examples, but all SD1.5 https://civitai.com/user/TxcTrtl/images?sort=Most+Reactions

[–] theUnlikely@sopuli.xyz 1 points 10 months ago (1 children)

Sure! The main image of this post is one example.
And here are a few more (all from SD1.5 models):

spoiler

[–] Deceptichum@sh.itjust.works 2 points 10 months ago* (last edited 10 months ago) (1 children)

Need a bit more work on the outlines aspect and will probably want to prompt for more muted/natural colours, but this is as close as I can get for 2am.

https://pastebin.com/DPcz8fpb

The second group is optional, but it'll more often than not fix up faces and hand errors.

[–] theUnlikely@sopuli.xyz 1 points 10 months ago

Thanks a lot for making this! I'm starting to have some luck with the Pony model finally. It's crazy how many resources that model has already.

[–] theUnlikely@sopuli.xyz 1 points 10 months ago* (last edited 10 months ago) (1 children)

Ones I've found so far are:

GrogMix TURBO
_CHEYENNE_
RealCartoon-XL (sometimes?)
PixelPaint
PixelWaveTurbo

I've seen good images using Pony XL, but my attempts with it have so far produced messy slop so I'm not sure what I'm doing wrong there.

[–] Even_Adder@lemmy.dbzer0.com 3 points 10 months ago* (last edited 10 months ago) (1 children)

Check out the important information section on Pony Diffusion V6 XL's page. It says: "Make sure you load this model with clip skip 2 (or -2 in some software), otherwise you will be getting low quality blobs.". Find some images you like and work off of their prompts to learn to make your own,

[–] theUnlikely@sopuli.xyz 1 points 10 months ago (1 children)

Thanks for the reply, but yeah, I've definitely tried that. In ComfyUI for this model, setting it to -2 doesn't actually do anything compared to no node for that at all. Setting it to -1 of course ruins it though.

It seems really temperamental with the styles. It seems to vastly change just by changing a few words that aren't related to style at all.

This is the best I've gotten with it after hours if fiddling. Most of the image is good, but the face and hands never are.

[–] Even_Adder@lemmy.dbzer0.com 2 points 10 months ago (1 children)

Try using some LoRA. You have to use LoRA trained on Pony Diffusion, since the training process moved it far away enough from regular SDXL that Controlnets aren't even compatible with it,

[–] theUnlikely@sopuli.xyz 1 points 10 months ago

Yeah it seems like loras are necessary. For that image I used the concept art twilight lora.

[–] Grandwolf319@sh.itjust.works 1 points 10 months ago

Looks more like a heart harvester