AI image generation is getting crazy good

Darren_Garrison · August 9, 2025, 8:07pm

Imgur treats even a single image upload as an “album”. You linked to the album. To get the correct aspect ratio you need to link to the individual jpeg.

Wincerind · August 9, 2025, 8:25pm

Chronos · August 9, 2025, 9:36pm

I think it converted some leaves in the original into toy mice in the new one… but yeah, it’s a good job.

Darren_Garrison · August 9, 2025, 9:46pm

Those are difficult to interpret even for a human. It appears to use some sort of coiled wire to make (unpumpkinlike) tendrils, and possibly cloth leaves?

I also had Sora do a pair, and got these

Darren_Garrison · August 10, 2025, 1:17am

Best homemade snake evah (not the exact prompt used).

The model still has problems with snakes, tending to make closed loops

Or worse

madmonk28 · August 10, 2025, 2:21am

There’s this barber shops near my house, the owner uses AI to make pics that she posts to her facebook page, with mixed results. This is an actual post of hers.

[Album] imgur.com

Darren_Garrison · August 11, 2025, 7:04am

I thought to try making a fire made out of water. I wondered if Copilot would struggle with the concept but it understood right off the bat.

I then tried for candles

And putting out the water with fire

Sample prompt

Summary

Realistic photo of a campfire made out of clear, colorless water. (The “flames” are flame-shaped, but are water.) A girlscout is dousing it with a bucket full of fire. It is day, not night. Smokey the bear is standing to the side with his arms crossed, nodding in satisfaction. Iphone 15 photo with shallow dof and forced perspective.

The same prompt in SDXL, Flux Schnell, HiDream, and Ideogram, which didn’t get it

And Imagin (via Gemini) which mostly got the concept of firewater but for some reason made firefire as clumps of cloth or something.

Ponderoid · August 11, 2025, 11:39am

Notice how Smokey is always sized similarly to the presumably short girl scouts? That’s one of my pet peeves about all of the image generators I’ve tried; it’s hard to make them show realistic contrasting sizes between characters who should be significantly different heights.

Chronos · August 11, 2025, 12:16pm

That might explain the issue with one of the D&D portraits I made. It should have been a dwarf riding an ox, but it made both of them about the same size, to ludicrous effect.

And shouldn’t that be Smokey’s brother Drippy Bear? Only you can prevent forest waters.

DCnDC · August 11, 2025, 12:49pm

These were from a “reverse Pinocchio” idea I was working with, where a real boy exists in a world where everything else is made of wood.

Darren_Garrison · August 11, 2025, 12:59pm

I tried the same prompt, but added that Smokey is twice as tall as the Girl Scout. It worked, this time.

Maserschmidt · August 11, 2025, 12:59pm

A lot of cool images, still some struggle with hands.

Jophiel · August 11, 2025, 1:16pm

I mean, this image made Smokey well larger than the girl scout so I don’t see what the problem is.

Relative sizes has always been a weak point for these generators in my experience. If you have access to ControlNet, that would be the easiest way to handle it or else img2img to use as a guide. From a sizes standpoint, Midjourney didn’t do great making a dwarf riding an ox…

SDXL (GonzalomoDMD) got it pretty good but the quality leaves something to be desired…

SDXL (SplashedJourney) looks better from a classic art perspective but the sizes aren’t as good. If I was in the market for such an image, I’d try to get it out of Gonzalomo then run it through SplashedJourney (or Midjourney) as an img2img for better art quality.

There’s all sorts of issues with those samples; I was literally just throwing in a one or two line prompt and grabbing the first results for sake of example.

Edit: Late entry using a custom SDXL model merge looked pretty good as a start. Wants to put horns on my dwarf but that’s what inpainting is for.

Jophiel · August 11, 2025, 1:34pm

Couple more dwarfs with a little more effort

Edit: Playing with the prompt and changing “ox” to “yak” didn’t really change the beast but made my dwarfs look sort of Mongolian.

Chronos · August 11, 2025, 1:45pm

All of those are better than this:

Which is otherwise pretty close to what I wanted. The ox shouldn’t be quite so spectral-looking, but it is celestial, and it apparently doesn’t know what a “sheaf” of wheat is, but those are minor issues. The relative sizes, though…

Prompt

Second character: A male knight in shining full plate armor. He’s clean-shaven, but otherwise appears to be a dwarf. He looks trustworthy, dependable, and honorable. He’s wielding a flaming sword and a shield. His shield and armor are liberally adorned with symbols of a sheaf of wheat and a rose, and he’s also wearing a silver sheaf of wheat on a thin chain necklace. Instead of a horse, he’s riding a faintly-glowing ox wearing barding.

LSLGuy · August 11, 2025, 7:28pm

Those are real interesting. Thank yuo. But …

I’m struck by how much the flame-of-water examples look like either ice or blown glass. It looks inappropriately rigid, while the still images of fire somehow seem to leave a sensation of motion or at least of insubstantiality.

I’m wondering how much of that is in the image versus in my perception of the image? And how much is the legit difference between depicting a gas versus a liquid?

Darren_Garrison · August 11, 2025, 8:26pm

They do look kind of glassy/icy. But on the other hand, how would water behaving like fire actually look? Not exactly like that, definitely, but it is hard to get a mental picture of. (The first image with just the campwater is the most dynamic.)

Jophiel · August 11, 2025, 8:33pm

I think the house fire is the most fiery of them but it has the advantage of being further away rather than staring right into it.

Chronos · August 11, 2025, 8:45pm

My thought was that they looked like plastic Lego fire pieces, but same idea.

Then again, I sure couldn’t draw waterflames any better. That’s one of those prompts where you judge the AI on the fact that it was able to do it at all, not how well it did it.

Jophiel · August 12, 2025, 1:16am

The OpenAI LLM based bots (Copilot, ChatGPT) are excellent for prompt understanding due to their LLM nature as compared to single-nature image models. Unfortunately, they tend to look very stereotypically “AI” to me though that could just be a result of people not trying to go beyond that and a touch of toupee fallacy.

Topic		Replies	Views
Digital art creator algorithm website Cafe Society arts-crafts , ai	2368	39074	April 30, 2026
Funniest AI Legos Miscellaneous and Personal Stuff I Must Share ai	433	18146	March 30, 2025
Share your mental picture of your fellow doper(s) Miscellaneous and Personal Stuff I Must Share	182	30139	April 7, 2013
Misinterpreted avatars Miscellaneous and Personal Stuff I Must Share	109	4784	August 5, 2024
Favorite internet pics In My Humble Opinion	128	9277	March 8, 2008

AI image generation is getting crazy good

Related topics