Can ChatGPT generate high-quality prompts for Stable Diffusion? The answer is yes. But you will need special methods to tease them out. Once you learn how to do it, the results will surprise you. In this article, I will show you how to do that step-by-step.
- What is ChatGPT?
- How was ChatGPT trained?
- Does ChatGPT know anything about Stable Diffusion?
- How to use ChatGPT
- Asking for prompts
- Close-up illustration
- Full-body illustration
- Ask for more
What is ChatGPT?
ChatGPT is a chatbot released by Open AI in November 2022. It is built on the GPT-3 family of Large Language Model which was trained with text containing hundreds of billions of words on the internet.
How long is a book with hundreds of billions words? It would take a person more than five thousand years to read it all out.
As a result, ChatGPT contains vast amount of human knowledge. it can generate essays, poems, fictions and even computer codes.
How was ChatGPT trained?
ChatGPT was fine-tuned on top of GPT-3.5. GPT-3 itself is a powerful large language model. It operates like autocomplete. You type in a few words or a few sentences, the model predicts the most likely next words. Because the large amount of data it was trained on, the autocomplete of GPT-3 is supercharged. It can produce an essay of coherent and (sometimes) accurate content.
Human trainers played an important role in fine-tuning ChatGPT’s response. The model was trained to provide answers human trainers would give. Human trainers also ranked ChatGPT’s answers during the training phase so that it would prioritize answers humans prefer.
Does ChatGPT know anything about Stable Diffusion?
Unfortunately, the answer is no. You can easily check by asking ChatGPT the question “What is Stable Diffusion?”. Its answer is
Stable diffusion refers to a type of stochastic process that describes the spread of a certain quantity, such as information or a disease, through a network of individuals or nodes. The term “stable” is used to indicate that the process reaches a steady state, in which the rate at which the quantity spreads through the network is balanced by the rate at which it decays. The most common example of stable diffusion is the spread of a rumor through a social network.
We can debate whether this is complete nonsense, but we should all agree this is NOT Stable Diffusion. Its training data likely predates the release of Stable Diffusion.
Luckily, it knows what text-to-image models and DALL·E are (You can verify). So we can piggy-back on them in our prompt engineering.
How to use ChatGPT
To start using ChatGPT, go to chat.openai.com. Type a question in the input box at the bottom to start a conversation.
ChatGPT is aware of the history of your current conversation. Press “+ New Chat” button on the left panel to start a new conversation.
Asking for prompts
I am going to test ChatGPT by asking prompts for illustration portrait, one of the hardest to get right. Let’s see how it does.
There are three important techniques to tease out high-quality prompts for Stable Diffusion from ChatGPT:
- Specify image style. You need to ask for a specific kind of image. In my example, I will ask for “photorealistic close-up illustration”.
- Fix the subject. I will fix the subject as the actress “Ana de Armas”. I am not (too) obsessed with Ana de Armas. Reason for doing this is actress name is a strong keyword. Not only will it change the face (intended) but it will also heavily influence the style (not intended). If the subject is not fixed, ChatGPT will use a different actress in each prompt. The style and composition can be changed simply due to that.
- Prime with examples. Finally, give two example prompts.
Putting these together, the ChatGPT prompt is
Examples of high quality prompt for stunning close-up photorealistic illustration of Ana de Armas for text-to-image models (Stable Diffusion, midjourney or Dalle2) are
– portrait of beautiful happy young ana de armas, ethereal, realistic anime, trending on pixiv, detailed, clean lines, sharp lines, crisp lines, award winning illustration, masterpiece, 4k, eugene de blaas and ross tran, vibrant color scheme, intricately detailed
– alberto seveso and geo2099 style, A highly detailed and hyper realistic portrait of a gorgeous young ana de armas, lisa frank, trending on artstation, butterflies, floral, sharp focus, studio photo, intricate details, highly detailed, by Tvera and wlop and artgerm
Give me more examples.
- A stunning close-up illustration of Ana de Armas in a dramatic, dark and moody style, inspired by the work of Simon Stålenhag, with intricate details and a sense of mystery.
- A photorealistic, highly detailed illustration of Ana de Armas in a vintage Hollywood style, reminiscent of the golden age of cinema, with a focus on glamour and elegance.
- A close-up illustration of Ana de Armas in a neo-noir style, reminiscent of the film noir genre, with a focus on shadows, contrasts, and a sense of danger.
Let’s put these prompts in Stable Diffusion v1.5. I will use 25 steps of DPM++ 2M Karras sampler, CFG scale 7, image size 512×704. I will use the following universal negative prompt.
ugly, tiling, poorly drawn hands, poorly drawn feet, poorly drawn face, out of frame, extra limbs, disfigured, deformed, body out of frame, bad anatomy, watermark, signature, cut off, low contrast, underexposed, overexposed, bad art, beginner, amateur, distorted face
ChatGPT Prompt #1: A stunning close-up illustration of Ana de Armas in a dramatic, dark and moody style, inspired by the work of Simon Stålenhag, with intricate details and a sense of mystery.
ChatGPT Prompt #2: A photorealistic, highly detailed illustration of Ana de Armas in a vintage Hollywood style, reminiscent of the golden age of cinema, with a focus on glamour and elegance.
ChatGPT Prompt #3: A close-up illustration of Ana de Armas in a neo-noir style, reminiscent of the film noir genre, with a focus on shadows, contrasts, and a sense of danger.
These are all good stuffs! I am pleasantly surprised by how creative the prompts are, and how diverse the styles it gives.
And I surely learned some new keywords that I am going to put in the prompt generator.
Now let’s refine the ChatGPT prompt to tease out prompts for full-body portrait. All you need to do is to add “full-body” in your question and prompt examples.
Examples of high quality prompt for stunning photorealistic full body illustration of ana de armas for text-to-image models (Stable Diffusion, midjourney or Dalle2) are
- full body portrait of beautiful happy young ana de armas, ethereal, realistic anime, trending on pixiv, detailed, clean lines, sharp lines, crisp lines, award winning illustration, masterpiece, 4k, eugene de blaas and ross tran, vibrant color scheme, intricately detailed
- full body portrait of a gorgeous young ana de armas, A highly detailed and hyper realistic lisa frank, trending on artstation, butterflies, floral, sharp focus, studio photo, intricate details, highly detailed, by Tvera and wlop and artgerm, alberto seveso and geo2099 style,
Give me more examples
There are the prompts ChatGPT generated.
ChatGPT Prompt #4: Full body illustration of Ana de Armas in a stunning red dress, posing elegantly against a beautiful sunset background, with realistic lighting and shadows, by a master digital artist known for their photorealistic style, such as Jomaro Kindred or Gleb Alexandrov
ChatGPT Prompt #5: A full body illustration of Ana de Armas as a powerful warrior queen, with intricate armor and weapons, in a dynamic and action-packed pose, rendered in a highly detailed and photorealistic style by top fantasy artists such as Simon Stålenhag or Jora Vision
ChatGPT Prompt #6: A full body illustration of Ana de Armas as a sultry and mysterious sorceress, with detailed and realistic clothing and accessories, in a dramatic and captivating pose, by leading fantasy and sci-fi artists such as Yoshitaka Amano or Dave Rapoza.
These are good, stylish full-body portraits!
Ask for more
If you like a particular prompt ChatGPT gave, you can ask for more by something like
I like the third one. Can you give me more examples like that?
ChatGPT will give you more variants of the same style.
Can ChatGPT generate high-quality prompts for Stable Diffusion? The answer is a solid YES. But you need to follow these three guidelines: (1) Specify the image style, (2) fix the subject, and (3) prime with examples.
ChatGPT is still free to use as of time of writing. Go generate some prompts before they start charging!
Get updates on latest tutorials, prompts and exclusive contents.
Several generative AI models, including ChatGPT and an image generator called Stable Diffusion, can now be accessed online for free or for a low-cost subscription, which means people across the world can do everything from assemble a children's book to produce computer code in just a few clicks.What is stable diffusion model? ›
Stable Diffusion is a latent diffusion model that is capable of generating detailed images from text descriptions. It can also be used for tasks such as inpainting, outpainting, text-to-image and image-to-image translations.Is stable diffusion free? ›
Stable Diffusion 2.0 is now live on Playground AI, and it's completely free to use. Playground AI is one of my favorite AI image-generator web apps for its feature-rich tools and fast generation. Each user is limited to 1000 images per day, but you can choose to pay for credits if you want more.How big is stable diffusion? ›
Stable Diffusion is recommended to be run with 10 GB or more VRAM, however users with less VRAM may opt to load the weights in float16 precision instead of the default float32 to tradeoff model performance with lower VRAM usage.How do you set up Stable Diffusion? ›
- Step 1: Download and extract the installer. Click the download button for your operating system: On Windows: ...
- Step 2: Run the program. On Windows: Double-click Start Stable Diffusion UI.cmd . ...
- Step 3: There is no Step 3. It's that simple!
Then, there's Stability AI's open-source image generation model Stable Diffusion, which has been used on pop music videos, Hollywood movies and by more than 10 million people on a daily basis. Stability's brash CEO Emad Mostaque predicts the “dot-AI bubble” is coming.What is the best AI art generator? ›
- Best AI art generators in 2023. DALL-E 2. Jasper Art AI. Lensa AI. Dream by Wombo. NightCafe AI. Stable Diffusion AI. Midjourney. Fotor. Deep Dream Generator. Artbreeder. Runway ML. MyHeritage AI Time Machine. Dawn AI. Reface App. Bonus: Google Muse AI.
- Welcome to the AI-driven world.
Users Are Calling It 'Censorship' The latest update to Stable Diffusion also includes an adult content filter limiting the generation of NSFW images.What resolution is best for Stable Diffusion? ›
Width and Height for a generated image. By default, Stable Diffusion generates images in sizes 512 to 512 pixels. You will get the most consistent result when you use this size.Is there a NSFW AI image generator? ›
AI image generators are already being used to generate NSFW output, including both photorealistic and anime-style pictures. However, these models can also be used to generate NSFW imagery resembling specific individuals (known as non-consensual pornography) and images of child abuse.
We wanted to know how much time (and money) it would cost to train a Stable Diffusion model from scratch using our Streaming datasets, Composer, and MosaicML platform. Our results: it would take us 79,000 A100-hours in 13 days, for a total training cost of less than $160,000.What app is like Stable Diffusion? ›
The best alternatives to Stable Diffusion - DreamStudio are Insense , Playground AI and Pixelixe. If these 3 options don't work for you, we've listed over 10 alternatives below.Is Stable Diffusion an app? ›
Thanks to Stable Diffusion's open-source code, AI images can also be generated on a powerful PC rather than in the cloud. Developer Liu Liu has now packaged Stable Diffusion's algorithm into an iPhone app. The app does not require an Internet connection after downloading the roughly two-gigabyte resources.How many images for Stable Diffusion training? ›
Discovering instances of memorization in Stable Diffusion required 175 million image generations for testing and preexisting knowledge of trained images.How long does Stable Diffusion take to generate an image? ›
Currently, Stable Diffusion generates images fastest on high-end GPUs from Nvidia when run locally on a Windows or Linux PC. For example, generating a 512×512 image at 50 steps on an RTX 3060 takes about 8.7 seconds on our machine.How many images are in Stable Diffusion? ›
Stableboost generates up to 200 images in one go with Stable Diffusion. Jonathan works as a technology journalist who focuses primarily on how easily AI can already be used today and how it can support daily life.How do you make AI art prompts? ›
To write a good text-to-image prompt, it's recommended to be clear with the main idea and to define the details and styles you want to replicate. A prompt with at least 3 to 7 words is ideal, as it provides the AI with a clear context, and using multiple adjectives can infuse multiple feelings into the artwork.Can you make art with ChatGPT? ›
We have explored several easy steps on how to use ChatGPT and access Open AI through Delphi. Not only can you make amazing art, but you also have the chance to submit your work in the Delphi Fan Art Contest! Explore different ideas and look at other entries submitted by other creators.How many steps are in Stable Diffusion? ›
Around 25 sampling steps are usually enough to achieve high-quality images. Using more may produce a slightly different picture, but not necessarily better quality.Can you train Stable Diffusion? ›
To get good results training Stable Diffusion with Dreambooth, it's important to tune the learning rate and training steps for your dataset. High learning rates and too many training steps will lead to overfitting. The model will mostly generate images from your training data, no matter what prompt is used.
Stable Diffusion WebUI
It's a browser interface based on the Gradio library for Stable Diffusion that you set up from individual libraries. There are two great things about this option: WebUI works on all systems (Linux, Windows, mac)
History. Midjourney was founded by David Holz, co-founder of Leap Motion. It first entered open beta on July 12, 2022. However, on March 14, 2022, the discord server launched with a request to post high-quality photographs to Twitter/Reddit for system's training.Can I sell AI-generated art? ›
You can submit AI images you've generated to earn royalties!
I'm excited to tell you — it's official — that AI-generated images can now be submitted and sold on Adobe Stock!
In conclusion both Stable Diffusion 2.0 and Midjourney V4 are great choices when it comes to AI-generated imagery. However, if you're looking for the best results “straight out of the box”, Midjourney V4 is the better choice.What is the best free AI art generator app? ›
- Lensa – AI Art Image Generator.
- Wonder – An App that Generates Art from Your Text.
- Fotor – AI Art Generator.
- DeepArt – AI Art Image Generator.
- StarryAI – Create Art with AI.
- TikTok – Turn Your Photo into AI Art.
- PicsArt – Text to Image AI generator.
Tap on the profile icon. Select Settings. Tap the Blur NSFW Images toggle switch to turn it off.What celebrities are on Stable Diffusion? ›
Among the most frequently-appearing were Donald Trump, Charlize Theron, Marilyn Monroe, Kate Moss, Blake Lively, and Lupita Nyong'o, Rosie Huntington-Whiteley, and Scarlett Johansson. Unlike DALL-E 2, Stable Diffusion also lets you generate images of famous trademarked characters, so we searched for 600 of them.What's the best resolution setting? ›
For video editors, we recommend 1920 x 1080 as a minimum. This is because 1080p is the standard high-definition resolution for videos, so if you're working with 1080p video, you'll want a screen that can display that natively.Which resolution is enough? ›
It's best to look at the pixel dimension of your images as you're making them. As long as they're at least about 1024 pixels wide (for a horizontal image) they should be fine for teaching. The standard resolution for web images is 72 PPI (often called “screen resolution”).What resolution should you set for the best quality? ›
Often referred to as “full HD,” 1080 (1920 x 1080 pixels) has become the industry standard for a crisp HD digital video that doesn't break your storage space. This is also a common screen resolution for smartphones.
Lensa has been climbing the app store hit lists with its avatar-generating AI that is making artists wave the red flag. Now there's another reason to fly the flag: As it turns out, it's possible — and way too easy — to use the platform to generate non-consensual soft porn.Can you own AI generated images? ›
“ This means that, under the current rules, AI-generated art has no owner. Copyright protection in art is given as soon as the work is created, so the creator has exclusive right to decide the future use of the work.How does Stable Diffusion work? ›
The true power of the Stable Diffusion model is that it can generate images from text prompts. This is done by modifying the inner diffusion model to accept conditioning inputs. The inner diffusion model is turned into a conditional image generator by augmenting its denoising U-Net with the cross-attention mechanism.How long does it take a train to slow down to stop? ›
The average freight train is about 1 to 1¼ miles in length (90 to 120 rail cars). When it's moving at 55 miles an hour, it can take a mile or more to stop after the locomotive engineer fully applies the emergency brake. An 8-car passenger train moving at 80 miles an hour needs about a mile to stop.What is the time diffusion equation? ›
The time for diffusion is linear in y/x for 3 dimensions; proportional to log(y/x) for 2 dimensions; and independent of y/x for 1 dimension. For example, when y/x = 0.1 (e.g., target diameter 1 nm, diffusion distance 10 nm), q3 = 0.35 and q2 = 1.22.Is there an app like deep nostalgia? ›
About: PhotoGlory is an old photo restoration software that uses artificial intelligence and neural networks to repair damaged photos and colorise black & white shots.Is Dall E 2 free to use? ›
DALL. E 2 is currently free to use, but there is a catch. You're allotted 50 free credits during your first month's use and 15 free credits after that.How do I access Stable Diffusion AI? ›
- Download Git from the Windows x64 installer on the Git website. Follow all the prompts to install Git. ...
- Download the Miniconda3 Windows 64-bit from the Miniconda3 download page. Start the installation process, but select 'All Users' before clicking Next and finishing the process.
Anyone can use the Stable Diffusion model to generate customised images. They may also use the code to create art-related apps, such as those that enhance the drawing of cartoons or even making collages.Where can I get Stable Diffusion? ›
To run Stable Diffusion locally on your PC, download Stable Diffusion from GitHub and the latest checkpoints from HuggingFace.co, and install them.
ChatGPT can be applied to a range of NLP applications, such as Generating Text, Dialogue Generation, Language Translation, Text Summarization, Text Classification, Question Answering, and Text Completion among others.Is ChatGPT Pro better than free? ›
Performance: ChatGPT-professional delivers better performance than ChatGPT in terms of generating more human-like text and providing better context understanding. Cost: ChatGPT-professional is a paid service, while ChatGPT is a free to use model.How does chatbot work? ›
A chatbot performs routine automated tasks based on specific triggers and algorithms, simulating human conversation. A bot is designed to interact with a human via a chat interface or voice messaging in a web or mobile application, the same way a user would communicate with another person.What is the meaning of ChatGPT? ›
ChatGPT is an artificial intelligence chatbot developed by OpenAI and launched in November 2022. It is built on top of OpenAI's GPT-3 family of large language models and has been fine-tuned (an approach to transfer learning) using both supervised and reinforcement learning techniques.What are the limitations of ChatGPT? ›
Limitations in Training Data and Bias Issues
Like many AI models, ChatGPT has limitations in its training data. Both the constraints in training data and bias in the data can create a negative impact on the model's output. In fact, this AI tool has shown bias when it comes to training minority data groups.
ChatGPT runs on a language model architecture created by OpenAI called the Generative Pre-trained Transformer (GPT), specifically GPT-3. Generative AI models of this type are trained on vast amounts of information from the internet including websites, books, news articles and more.How does ChatGPT app work? ›
Chat bots like GPT are powered by large amounts of data and computing techniques to make predictions to string words together in a meaningful way. They not only tap into a vast amount of vocabulary and information, but also understand words in context.Will ChatGPT become a paid service? ›
OpenAI stressed in the waitlist document that access and potential use of the premium version are still experimental, subject to changes, and would only be available to a few select people for now. Update: On February 1, 2023, OpenAI announced the launch of a $20 pilot subscription plan for ChatGPT Plus.What is the cost of ChatGPT? ›
The new subscription plan, ChatGPT Plus, will be available for $20/month, and subscribers will receive a number of benefits: General access to ChatGPT, even during peak times. Faster response times. Priority access to new features and improvements.How much is ChatGPT pro? ›
OpenAI, the company behind the technology, has now announced that it will introduce a ChatGPT Pro subscription plan. This will cost $20 (£16) a month, giving paying users a host of key benefits to make the chatbot easier to use.
Chatbots are essentially smart robots that are programmed to answer questions. They understand what you want and then give you the answer you are looking for. Intelligent conversational chatbots are built on machine learning and become more “knowledgeable” the more you feed it data .Which algorithm is best for chatbot? ›
- Naïve Bayes Algorithm.
- Support vector Machine.
- Natural language processing (NLP)
- Recurrent neural networks (RNN)
- Long short-term memory (LSTM)
- Markov models for text generation.
- Grammar and Parsing Algorithms.
ChatGPT grew in popularity quickly, according to Adam Conner, vice president for technology Policy at the Center for American Progress, because it was one of the first AI technologies of its kind to be made available to the public in a way that the public could understand.What is the best Chat AI? ›
- EBI.AI. EBI.AI have created their own advanced conversational AI platform that comes with a free trial. ...
- ProProfs Chat. ...
- Chatfuel. ...
- MobileMonkey. ...
- Aivo. ...
- ItsAlive. ...
- Imperson. ...
It was trained on a massive corpus of text data, around 570GB of datasets, including web pages, books, and other sources. 11. GPT-3 has been fine-tuned for a variety of language tasks, such as translation, summarization, and question-answering.