In the age of generative AI, the best prompts will rule the world. Good prompts will net you the AI results that you want. Bad prompts will net you bad results and in most cases results that are unusable. In this article we will talk about how to write killer stable diffusion prompts.
Stable Diffusion is a text to image AI model by Stability.ai, used to generate AI images. Stable Diffusion was the first text to image model made open source which allowed consumers to run on their own hardware versus accessing the model through web services like Midjourney and Dall-E. With that said, Stability.ai does also offer their own web app called DreamStudio.
For those looking to try Stable Diffusion, check out DreamStudio which is the official Stable Diffusion app. I’ve also built a website powered by Stable Diffusion.
If you’re using the official Stable Diffusion app you’ll notice that the way they handle a prompt is different than Dall-E or Midjourney. Dall-E and Midjourney allows the user to set all image parameters in the prompt whereas parameters such as styles, negative prompts, image dimensions, and etc are decoupled from the prompt and instead displayed as configurable settings in the left navigation.
I like this better because it presents the parameters up front versus hidden away from view allowing you to simply just focus on the prompt itself.
A prompt can be crafted in infinite ways but I break it down to three components.
[Core Prompt] , [Style] , [Finishing Touches]
Your prompt can be as simple as one word. What is the subject or theme of the image that you’re trying to create. Its easier to envision what your ideal subject looks like and in what environment the subject is in and then describe it using words. Add specific adjectives to further give the subject character.
If you submit your prompt without a style, the model will default to commonalities of the core prompt you’ve described. Adding in styles is enough to steer the model to realized your concepts. Multiple styles can be chained together separated by commas. Additionally, consider including literal art styles (oil painting, anime, pencil) and famous artists (Vincent Van Gogh, Michelangelo) to make your style more specific.
Add finishing touches towards the end of your prompt for more fine tune control of how you want your image to look. This is the part where prompters are pushing the boundaries.
Think of negative prompts as exclude statements. Whatever you don’t want to see in the generated image, include it in the negative prompt. Like the prompt itself, you can chain them together with commas.
Here are some negative prompts to consider:
In conclusion, it’s easy to get caught up on all the technicalities of a prompt. At the end of the day this is my best advice, imagine that you are a photographer. Unfortunately you’re working offsite and is unable to be on location to capture a specific photo. Instead you have to send one of your disciples to capture the photo. They are an extension of your consciousness. In order to capture the exact photo that you want, you’ll have to describe what your photographer to capture in an explicit and concise way. This description is your prompt. Good luck and have fun!