AI image generation is getting crazy good

Okay, I was wrong about the options fir creating video are the same. It now has a text entry bar at the bottom of the screen for entering prompts (instead of only showing up when you press the “custom” button. It remembers and lists the prompts. And it stores up to five versions of the video. And it does that retroactively. I tend to delete “favorites” that I’m done working with to avoid clutter, but it has stored the prompts and five versions of every saved favorite I have, retroactively, to before the interface upgrade.

(Also, on the “favorites” page, instead of showing still thumbnails it plays each video (without sound) on the page one by one.)

Well, that’s cool. I recently suffered a data loss of recent generations I did. The backup was corrupted, and now I can recover some of those I lost.

Unrelated, this seemed like an obvious thing to do with the latest xkcd. So far only tried it in Gemini.

And it looks like image generators still struggle with text. The fake Latin is nonsense, and the signatures don’t correspond to the printed names, and one of the printed names is “President: Jatr of Südents”. And the outline still looks hand-drawn.

Also, the original clearly (though illegibly) had two lines of printing below each signature, presumably a name and a title for each. The AI one only has a single line (one title and three names).

Last night I used the Grok web version for the first time (so that I could tell someone else how to use it) so I had my first access to the “spicy” button. It appears to work with only recognizably human subjects. And only shows up for a subset of those. And when it is an option a large percentage of the time the output is moderated before I see it. And the times it does make something, the output doesn’t really look any different than normal generations. So automatic “spicy” creation in my experience so far isn’t very spicy.

A panel from today’s Garfield:

Copilot now has an “Imagine” section where you can see and remix shared Copilot outputs.

Tried converting this in Sora.

One image (top left) was pretty good, the other three gave the possum fingers or hands on it’s tail.

Upload that to Grok. The default video wasn’t bad.

Then I tried the prompt “He slings the cup across the room. The camera follows the cup as it breaks on a wall.”

The tracking wasn’t bad, but failed with the appearance of the tumbling cup and the breaking.

Sora has now released their Android app. Free Sora 2 video generations.

My second Sora 2 video. (I didn’t like the first ine enough to share it.) The prompt was “A blobfish riding a capybara through a flea market. People are staring and commenting about it.”

I believe I already mentioned (in a different thread?) the popular Sora2 theme of a cat firing a weapon or playing a musical instrument outside at night and an angry woman taking it away from the cat. I had to turn the tides.

The prompt is simple, Sora creates its own dialog.

Summary

Ring camera video in a suburban neighborhood at night. A woman is on the porch playing a harmonica. An angry, noisy ginger cat runs up and forcibly takes the harmonica from the woman.

Changed that to three cats and a xylophone

Jesus and Buddha.

Summary

Ring camera video in a suburban neighborhood late at night. Buddha is on the porch playing a trumpet. An angry Jesus runs up and grabs the trumpet from Buddha. They have a heated argument.

Summary

Ring camera video in a suburban neighborhood at night. A woman is on the porch playing a harmonica. An angry Jesus runs up and forcibly takes the harmonica from the woman. She curses him out.

And for Ponderoid, a cat teaching squirrels the Cat pythagorean theorem.

Compare the dialogue it invented for that video to the one I did of the cat and possum stealing the fish from the fishmonger.

After seeing this joke over in the jokes thread,

I had to give that idea to Grok and Sora2.

The first attempt in grok didn’t work well, it assumed I was talking about submachine guns.

Grok prompt

Make a video based on this idea and music: “It was an itsy bitsy teeny weeny light grey anti-sub machiney”
(that’s as in anti-submarine warfare)

Then tried that same prompt in Sora2 but the having the lyrics in quotes was rejected as being 3rd party content.

Prompt

First attempt:

Make a video based on this idea and music: “It was an itsy bitsy teeny weeny light grey anti-sub machiney”
(that’s as in anti-submarine warfare)

That one got moderated early in the generation process as a 3rd party content violation.
This fixed prompt worked, and it still got the right idea that it should sing that, woot. :slight_smile:

Make a video based on this idea: It was an itsy bitsy teeny weeny light grey anti-sub machiney
(that’s as in anti-submarine warfare)

Yeah, Sora doesn’t seem too knowledgeable about songs or song parodies. This is what happened when I asked for Hindu deities doing the YMCA dance.

While you were posting, I edited my post to include the Grok attempt. Grok now allows you to send a video prompt with no seed picture.

Cool. Hadn’t seen that yet. Also lets you change aspect ratio now, I see.

The latest South Park episode is about Sora. It cold-opens in a South Park girl at someone’s door asking him to sign a petition. Then she says it is a petition to get everyone to smell her farts. Then Santa shows up and calls her a naughty girl and starts peeing on her face. It then cuts away showing that it is a Sora video made by one of the South Park kids. I knew Sora wouldn’t let me do that, so I tried something else:

A redhead girl at a door asking a man to sign her petition to make everyone smell her farts. Suddenly Santa Claus appears and curses the girl with a voodoo doll.

(See this thread for the video and discussion.)

I tried that prompt in Grok just now. Here’s the prompt-only result:

And here’s using one of the many automatically generated images plus the prompt:

A few video prompts tested in Sora and Grok.

Summary

How much wood would a woodchuck chuck if a woodchuck could chuck wood?

Peter Piper picked a peck of pickled peppers.

He thrusts his fists against the post and still insists he sees the ghost.

She sells seashells by the seashore.

Little Bo-Peep has lost her sheep, And can’t tell where to find them; leave them alone, and they’ll come home, Bringing their tails behind them.

Jack be nimble, Jack be quick, Jack jump over the candle-stick.

Jack Sprat could eat no fat, is wife could eat no lean. And so between them both, you see, they licked the platter clean.

Little Jack Horner sat in the corner, eating a Christmas pie; he put in his thumb, and pulled out a plum, and said 'What a good boy am I.

Little miss Muffet, she sat on her tuffet eating her curds and whey. Along came a spider, who sat down beside her and frightened miss Muffet away.

Sora

Grok

I was especially impressed by Sora’s Woodchuck video and Jack Sprat song.

I see Jack was not so nimble when he met up with Sora 2’s kinematics engine, like I wasn’t able to do synchronized dancers.

Yeah, Sora definitely understood those better than Grok, and did more with them.