Midjourney is an AI based image and video generation tool. I’ve recently subscribed to it and it’s genuinely one of the most fun creative tools I have ever seen in my life. I’m having a blast with it.
I want to state up front that this isn’t a thread for the discussion of “AI slop” or whether can be a tool to create art or any of that. Here is a better place for that kind of discussion.
I wanted to share how cool it was with people (this post may sound like I’m an evangelist but I’m honestly just really enjoying it), share some of the things I’ve made, and invite others to do the same.
So I’ve tried other image generators before - DALL-E 3, imageGPT 1.5, google gemini. They’re all very good. They can create very interesting photos. But the workflow isn’t that fun. I’ll usually create 2 or 3 images and then I’m all done with that. If the image isn’t quite what you want, you have to try to master your prompt engineering to get it right. Which is a little bit frustrating.
Midjourney works quite differently. It always outputs 4 “takes” on the same idea. You can control how similar or different or creative these takes are. But the interesting thing is that you’re no longer trying to perfect one prompt to get the right picture. Rather, you’re generating 4 pictures and selecting and building off the one that most matches your intent or that you find interesting. And the tools are designed to easily iterate those images.
So you generate an image. Tropical sunset on a Hawaii beach. It gives you 4 photorealistic candidates (or you can make them oil paintings if you prefer - whatever). You pick the one you like the most, and you click “vary strong” which means that it generates 4 candidates with similar elements but are differently composed images. Ah, you like the #3 variation the most. So now you “remix” that image and you add the prompt “add some sailboats” - now you get 4 images of your chosen image that now have sailboats in them. You pick which of the 4 images you like the most and you click “animate” and it automatically animates your image so now the waves are rolling in and the sailboats are moving along. There are, as always, 4 different animations, all of which have a different take on what to animate. Some may look like a time lapse with the clouds moving fast across the screen. Some may be real time and show the waves coming in gently and the sailboats moving very slowly.
You started with a very simple idea and now you’ve refined several times, added elements, and even animated it. And that’s only using a fraction of the tools it has available. That’s what makes Midjourney more fun than the other image generators. It also outputs very high quality images. Quite possibly the best of any image generator.
And what’s even more remarkable is that it’s a team of like 100 guys at a company that funds through user subscriptions. No trillion dollar google, no venture capital billions - a small business that has a traditional business model that somehow beat out the tech giants. And that means it’s not free. They’re not trading venture capital for user share. They’re funded like a normal business, and quite frankly I appreciate the honesty of it. You can get 5 hours of GPU time (which would last for a few hundred images or video) for $10 a month or much more for $30 a month, including unlimited low priority image generation (may have to wait a minute), 15 hours of GPU time. You can pay the $10 to try it out, and if you like it, upgrade to the $30. That’s what I did.
It’s a very sophisticated tool and I don’t know how to use half the options it offers - I’ve only been using it for 3 days - but I’ll show you some of the sort of stuff I’ve created with it in that time. I’ll also share the prompts and tools I used - I think it’s more interesting that way and recommend you guys do the same if you share your own stuff.
This is a real place I’ve been and so I was trying to recreate an experience. I think that photo is excellent and could be mistaken for a real photo.
Let me show you an example of the workflow. I deliberately used a vague and absurd prompt to see what it would do and I got this result:
Four different takes on the same idea, 3 of which are genuinely interesting. There’s picture number 3, the literal kitten made out of cheesecake that’s begging “… kill me. kill… me”
Number 1, the adorable pixar-ish cartoon cat that’s a garnish/topping on the cheesecake (the little dollop of whip cream on her head means she’s part of the cheesecake), and image 4, which is sort of a pastry chef’s take on what a kitten-shaped cheesecake would look like. Two is a little lazy, just sort of… kitten+cheesecake. One simple and silly idea - 3 genuinely interesting results.
I left poor #3 alone because I didn’t want to create any more kittens begging for death but iterated on #1 and #4 to really good effect.
I’ll also share an example of the (rather incredible) video it creates. This is the cheesecake garnish kitten. You can manually animate (describe what you want to be animated) or just click animate and see what it does. This was an auto-animation. It gave the character life and cartoon-kitten behavior automatically. Really impressive.
You also get 4 video outputs with different ideas - I also animated that whale image. Two of the variations correctly made a pretty good looking whale jump - out of the water and back in. But two of them seemed to think that whales defied gravity and created a sky whale. But even that failure mode was genuinely interesting and pretty funny.
So - if anyone wants to share their stuff, I recommend carefully curating it. Share 1-4 images with us, tell us your prompt, your idea behind it, don’t just dump 20 images with no description.
If anyone uses midjourney, we can also follow each other. this is a link to my profile. Feel free to post yours.