One more thought on the waterfires: The best examples all appear to have disconnected bits. This makes it look more like both water and fire, since water will have loose droplets, and fire will have embers. Plastic or blown glass, though, will all be one connected piece, so the ones with one connected piece look more plasticky or glassy.
I had a sudden inspiration from the title of the Pet pictures thread…
A middle-aged woman in a sweatsuit power-walking on a sidewalk in a suburban neighborhood. She has three framed photos on leashes (like small dogs on leashes running with her). Iphone 15 photo with shallow dof.
It understood everything pretty well except for the number three.
Was that entirely AI-generated, in one pass? I’m a little surprised that it made the sign identical in both views. That’s the sort of thing that these generators often have difficulty with.
Back on the relative scale issue, the ChatGPT/Copilot/Sora renderer seems to have a pretty good grasp that elephants are big.
Sample prompt
Summary
Realistic profile photo of an elephant standing in a crowded room. A wholesome 1950s businessman is writing a mailing address on the elephant. Iphone 15 photo. 16:9
(I also find it interesting what it comes up with for mailing addresses when left with making that decision.)
Out of curiosity, I asked CoPilot to estimate how much energy it was using to generate one picture, quoted to me in the units of “time a standard microwave is on.” It danced around quite a bit about not being sure and data not being public, but eventually its answer was that depending on image complexity, generating one picture for me was like running the microwave for 30-50 seconds.
I then asked it how much energy it used in that calculation, and it said 1-7 microwave seconds.
Midjourney did pretty well with this, just prompting “Six year old in a phone booth”
Not exactly “nailed it” but two or three of my single test came out with pretty normal sized booths. I like the kid talking on a cell phone but, hey, he couldn’t reach the pay phone!
I used a couple local SDXL models and they made much smaller booths
I like the kid wearing headphones and trying to dial a door keypad.
Unrelated to children and phone booths, here’s my Mutants & Masterminds character, Molly Dynamite