
Stable Diffusion 3.5 Medium
Stable Diffusion 3.5 Medium is 2.5 billion parameter image model with improved MMDiT-X architecture
Avg Run Time: 8.000s
Model Slug: stable-diffusion-3-5-medium
Category: Text to Image
Input
Enter an URL or choose a file from your computer.
Click to upload or drag and drop
image/jpeg, image/png, image/jpg, image/webp (Max 50MB)
Output
Example Result
Preview and download your result.

Create a Prediction
Send a POST request to create a new prediction. This will return a prediction ID that you'll use to check the result. The request should include your model inputs and API key.
Get Prediction Result
Poll the prediction endpoint with the prediction ID until the result is ready. The API uses long-polling, so you'll need to repeatedly check until you receive a success status.
Overview
Stable Diffusion 3.5 Medium is an advanced image generation model designed to create highly detailed and visually appealing content based on textual prompts. The model enables users to transform their ideas into images by leveraging state-of-the-art diffusion techniques. Its flexibility allows for a wide range of creative possibilities, including artwork, photorealistic images, and stylized designs.
Technical Specifications
- Prompt Strength: Determines how strongly the model adheres to the given prompt. Lower values allow for more abstract outputs, while higher values enforce stricter adherence.
- CFG (Classifier-Free Guidance): Controls the balance between creativity and prompt adherence. Higher values produce outputs closely tied to the prompt, while lower values add more randomness.
- Steps: Specifies the number of iterations for the diffusion process. Higher values yield better details but increase generation time.
- Aspect Ratio: Supports multiple formats for different visual needs.
- Output Quality: Fine-tune the quality slider to control image resolution and file size.
Key Considerations
Prompt Length: Avoid excessively long prompts as they may confuse the model. Aim for concise yet descriptive instructions.
Style Consistency: When generating multiple images, use the same seed to maintain consistency across outputs.
Legal Information
By using this model, you agree to:
- Stability AI API agreement
- Stability AI Terms of Service
Tips & Tricks
Experiment with CFG:
Start with moderate values (e.g., 7-10) and adjust based on your needs. Use higher values for precise outputs and lower values for creative explorations.
Use Prompt Strength Wisely:
- For creative or abstract results, set prompt strength around 0.5-0.7.
- For exact representations, increase it to 0.8 or above.
Optimize Steps and Quality:
- For quick previews, use lower steps (e.g., 20-30) and moderate quality.
- For final outputs, increase steps (e.g., 50-70) and maximize quality.
Seed Exploration:
Generate multiple images with different seeds to explore diverse variations of the same prompt.
Prompt Clarity:
Use detailed and descriptive prompts to achieve the desired results. For example, instead of "a cat," try "a fluffy white cat sitting on a sunny windowsill with a garden in the background."
Aspect Ratio Selection:
Adjust the aspect ratio to match the intended use of the output. For instance, use a 16:9 ratio for landscapes and a 1:1 ratio for social media posts.
Capabilities
Generates high-quality images based on textual descriptions.
Offers flexibility in style, format, and aspect ratio.
Supports reproducible outputs using seed values.
Balances creative and literal interpretations through prompt strength and CFG.
What Can I Use It For?
Art and Design: Create custom artwork or illustrations.
Content Creation: Generate unique images for blogs, social media, or marketing campaigns.
Storytelling: Visualize characters, scenes, or settings for creative writing.
Prototyping: Produce quick visual concepts for design projects.
Things to Be Aware Of
Stylized Imagery:
Experiment with descriptive prompts like "a futuristic city skyline at sunset, cyberpunk style" to explore different aesthetics.
Photorealistic Results:
Use prompts with clear specifications, e.g., "a close-up of a golden retriever lying on a wooden floor with sunlight streaming through the window."
Iterative Refinement:
Start with a broad concept, then refine prompts and settings to perfect the output.
Creative Variations:
Adjust the seed, aspect ratio, and CFG to produce diverse versions of the same idea.
Limitations
Complex Scenes: The model may struggle with highly intricate scenes or overlapping elements. Simplify prompts if needed.
Abstract Prompts: Results can be unpredictable for abstract or vague instructions. Be specific to achieve better outcomes.
Fine Details: Extremely fine details may require higher steps and CFG values, increasing generation time.
Output Format:JPG,PNG,WEBP
Pricing Detail
This model runs at a cost of $0.035 per execution.
Pricing Type: Fixed
The cost remains the same regardless of which model you use or how long it runs. There are no variables affecting the price. It is a set, fixed amount per run, as the name suggests. This makes budgeting simple and predictable because you pay the same fee every time you execute the model.
Related AI Models
You can seamlessly integrate advanced AI capabilities into your applications without the hassle of managing complex infrastructure.