Stability AI’s open-weight image model family. The flagship as of May 2026 is SD 3.5 Large (8.1B parameters, MMDiT architecture), alongside SD 3.5 Large Turbo (4-step distilled) and SD 3.5 Medium (2.5B, consumer-hardware optimized).
Model weights are publicly downloadable under the Stability AI Community License: free for anyone earning under $1M/year, including commercial use. Paid access runs through the Stability API (3-8 credits per image) or third-party hosts like Replicate and Fal. SD 4 has not launched.
System Verdict
Pick Stable Diffusion if you need open weights, deep customization, or unlimited volume. Nothing else lets you self-host the model, fine-tune with LoRA or DreamBooth, condition with ControlNet, and generate without caps or content filters. The Civitai checkpoint ecosystem and ComfyUI node graph together form a moat no closed tool has matched. SD 3.5 Large with NVIDIA NIM and TensorRT optimizations is roughly 2x faster than SDXL with ~40% less VRAM.
Skip it if you need best-in-class default photoreal or minimal setup. Flux 2 Pro and Midjourney v7 beat SD 3.5 Large on out-of-the-box aesthetic quality. ChatGPT’s GPT Image 2 is faster to reach for casual use. Hand and text rendering remain weak across the SD family. Self-hosting demands an 8GB+ NVIDIA GPU and comfort with Python environments.
Who pays which tier: Self-host free on any 8GB+ VRAM GPU (sub-$1M revenue). Stability API for occasional cloud calls at $0.03-$0.08 per image. Third-party hosts (Replicate, Fal) for managed infrastructure without the Stability credit system. Enterprise license only above $1M annual revenue.
Key Facts
| Flagship model | Stable Diffusion 3.5 Large (8.1B parameters, MMDiT, 1MP) |
| Family members | SD 3.5 Large · SD 3.5 Large Turbo (4-step) · SD 3.5 Medium (2.5B, 0.25-2MP) |
| Prior generations | SDXL (still widely used in community) · SD 1.5 (legacy, low-VRAM workflows) |
| SD 4 status | Not launched as of May 2026 |
| License | Stability AI Community License · free under $1M annual revenue (commercial OK) |
| Enterprise license | Required above $1M annual revenue · contact Stability AI |
| Weights host | Hugging Face (stabilityai/stable-diffusion-3.5-large) · Civitai |
| Stability API pricing | Core: 3 credits ($0.03) · SD 3.5 Large: 6.5 ($0.065) · Turbo: 4 ($0.04) · Ultra: 8 ($0.08) |
| Free API credit | 25 credits on signup |
| Local minimum VRAM | 8GB NVIDIA (SD 3.5 Large) · 4-6GB SDXL/1.5 with optimizations |
| AMD / Apple support | ONNX for Radeon and Ryzen AI · CoreML for Apple Silicon |
| ControlNet (SD 3.5) | Blur, Canny, Depth (released Jan 29, 2026) |
| Companion models | Stable Video 4D 2.0 (video) · Stable Audio · Stable 3D |
Every data point above was verified against vendor sources on 2026-05-13. See Sources.
What it actually is
An open-weight text-to-image model family published by Stability AI, paired with a paid cloud API and a deep community stack. The locally-runnable weights are the product; the API is a convenience layer for teams that do not want to manage GPUs.
Three moats compound over time:
- Open weights. Every other frontier image tool (Midjourney, Flux Pro, GPT Image 2, Firefly) gates access behind an API. SD weights are downloadable, forkable, and fine-tunable by anyone. This is the reason SD survived Stability AI’s 2024 leadership upheaval: the community kept shipping regardless.
- Civitai checkpoint ecosystem. Tens of thousands of community-trained checkpoints and LoRAs cover photorealism, anime, architecture, product photography, and niches no vendor would build. Downloadable free. This library is the real competitive advantage vs. Flux or closed tools. No one has replicated it.
- ComfyUI workflow flexibility. The node-based graph UI lets users chain ControlNet, IP-Adapter, regional prompting, upscaling, and inpainting into reproducible pipelines. Production studios use ComfyUI workflows as serializable assets. Automatic1111 and InvokeAI cover less technical users but lose pipeline-as-code depth.
When to pick Stable Diffusion
- Self-hosting is a hard requirement. Air-gapped environments, privacy-sensitive workloads, regulatory constraints, or teams unwilling to send prompts to a third-party API.
- Fine-tuning on a custom subject or style. LoRA training on 10-20 images produces a consistent face, product, or aesthetic that no closed tool exposes. DreamBooth for heavier personalization.
- ControlNet-style spatial conditioning. Depth maps, Canny edges, OpenPose skeletons, and lineart conditioning for precise composition control, unmatched by Midjourney or GPT Image 2.
- High-volume generation without per-image cost. Studios producing thousands of variants a day: self-host amortizes faster than API billing past a few hundred images.
- Unrestricted content (within legal limits). No built-in content filter when self-hosted. Users retain responsibility under the Acceptable Use Policy.
- Production pipelines in ComfyUI. Reproducible node graphs beat prompt-only workflows for clients and teams that need deterministic output.
When to pick something else
- Default photoreal quality: Flux. FLUX.2 Pro beats SD 3.5 Large on out-of-the-box realism and hand/text rendering.
- Fastest path to polished output: Midjourney. v7’s aesthetic baseline is higher with zero setup.
- Text rendering and typography in images: Ideogram. Still the leader for poster, logo, and UI text generation.
- Balanced web UI with training built in: Leonardo. Hosted UI with LoRA training and style consistency for creators who want SD-adjacent control without local setup.
- Bundled with chat and broad ecosystem: ChatGPT. GPT Image 2 is adequate for most casual generation and integrated with text workflows.
- Commercial indemnification: Adobe Firefly. Trained on licensed content, comes with IP safe-harbor terms Stability does not offer.
Pricing
Self-host (open weights)
| Resource | Cost | Notes |
|---|---|---|
| SD 3.5 Large / Turbo / Medium weights | Free | Download from Hugging Face or Civitai under Community License |
| SDXL and prior weights | Free | Still supported by Automatic1111, ComfyUI, InvokeAI |
| Automatic1111 / ComfyUI / InvokeAI | Free | Open-source interfaces |
| Civitai checkpoints and LoRAs | Free | Community-trained add-ons; terms per model page |
| Hardware (NVIDIA) | One-time | 8GB+ VRAM recommended for SD 3.5 Large (RTX 3070/4070+) |
| Hardware (AMD / Apple) | One-time | ONNX runtime for Radeon/Ryzen AI; CoreML for Apple Silicon |
| Commercial use under $1M revenue | Free | Community License covers this tier |
| Commercial use above $1M revenue | Enterprise license | Contact Stability AI for terms |
Stability API (managed cloud)
| Model | Credits / image | USD / image | Best for |
|---|---|---|---|
| Stable Image Core | 3 | $0.03 | Fast iteration, social graphics, bulk |
| Stable Image Ultra | 8 | $0.08 | Flagship-quality cloud generation |
| SD 3.5 Large (API) | 6.5 | $0.065 | Hosted SD 3.5 Large without self-hosting |
| SD 3.5 Large Turbo | 4 | $0.04 | 4-step distilled for speed |
| Signup free credits | 25 | $0.25 | Trial use |
Third-party managed hosts
| Provider | Typical range | Notes |
|---|---|---|
| Replicate | ~$0.003-$0.01 per image | Pay-as-you-go per-second GPU billing; SD 3.5 Large Turbo available |
| Fal.ai | Similar range | Low-latency inference endpoints |
Prices verified 2026-05-13 via Stability platform pricing, Stability news, and the Stability Community License. 1 credit = $0.01. Self-hosting remains free indefinitely under the Community License for qualifying users.
Who’s it for: Self-host for high-volume, customization, privacy, or sub-$1M commercial use. Stability API for occasional cloud calls without managing GPUs. Third-party hosts (Replicate, Fal) for managed infrastructure billed by GPU-second.
Against the alternatives
| Stable Diffusion (SD 3.5 Large) | Flux (FLUX.2 Pro) | Midjourney v7 | |
|---|---|---|---|
| Photoreal quality (default) | Good, below Flux/MJ | Strongest open-weight photoreal | Strongest aesthetic baseline |
| Customizability | Highest (LoRA, DreamBooth, ControlNet, ComfyUI) | High (LoRA available) | Low (sref, style tuner only) |
| Open weights | Yes (Community License) | Yes (Dev weights open; Pro is API-only) | No (closed) |
| API cost | 3-8 credits ($0.03-$0.08) | Higher per image | Included in subscription |
| Ecosystem (checkpoints, workflows) | Largest (Civitai + ComfyUI) | Growing, smaller than SD | None (closed platform) |
| Hand / text rendering | Weak (SD family limitation) | Improved over SD | Stronger |
| Best viewed as | Open-weight workhorse with the deepest customization stack | Open-weight photoreal leader | Closed aesthetic leader |
Failure modes
- Default photoreal quality trails Flux 2 Pro and Midjourney v7. SD 3.5 Large is competitive with the right checkpoint and LoRA stack, but the base model alone loses to Flux and MJ on realism, skin, and lighting. Budget time for checkpoint shopping if photoreal is the goal.
- Hand, finger, and text rendering remain weak. Structural issue across the SD family, only partially fixed in 3.5. Workarounds: ControlNet pose conditioning for hands, post-hoc inpainting for text, or switch models for typography (Ideogram).
- Self-host requires real technical setup. Python environment management, CUDA drivers, model downloads, interface choice (A1111 / ComfyUI / InvokeAI), checkpoint selection. Not a 5-minute onboarding. Docker images help but do not eliminate the learning curve.
- GPU requirements gate access. 8GB+ VRAM NVIDIA is the practical minimum for SD 3.5 Large; AMD and Apple Silicon work via ONNX/CoreML but with reduced tooling maturity. Users without a GPU must rent cloud instances (Vast.ai, RunPod) or use the API.
- Built-in safety checker is trivial to disable. Self-hosters can turn off the content filter by editing a single line. Responsibility for outputs shifts entirely to the user under the Acceptable Use Policy. Commercial deployments need their own moderation.
- Stability AI corporate instability. Leadership changes and layoffs in 2024 slowed Stability’s model release cadence. The community ecosystem kept shipping regardless (SD 3.5 ControlNets released Jan 29, 2026), but the company’s solvency is not guaranteed, and no SD 4 release date has been announced as of May 2026.
- License tier jump at $1M revenue. The Community License’s free commercial tier ends at $1M annual revenue. Growing teams must budget for the Enterprise license conversion or migrate to fully permissive alternatives.
- Copyright ambiguity. Like all web-scraped-trained models, outputs carry unresolved IP exposure. Adobe Firefly’s indemnification offer is a meaningful differentiator for enterprise legal teams; Stability does not match it.
Methodology
This page was produced by the aipedia.wiki editorial pipeline, an automated system that ingests vendor documentation, verifies pricing and model details against primary sources, and generates the editorial analysis you are reading. No individual human wrote this review. Scoring follows the four-dimension rubric at /about/scoring/ (Utility × Value × Moat × Longevity, unweighted average). Last verified 2026-05-13 against Stability AI news, Stability AI license, Stability platform pricing, and the SD 3.5 Large Hugging Face model card.
FAQ
Is Stable Diffusion free? Yes, under the Stability AI Community License. SD 1.5, SDXL, SD 3.5 Large, Turbo, and Medium weights are all downloadable at no cost, and commercial use is free for individuals and organizations earning under $1M annual revenue. The only costs for self-hosting are hardware (8GB+ VRAM GPU recommended for SD 3.5 Large) and electricity. Above $1M annual revenue, an Enterprise license is required: contact Stability AI.
Has SD 4 launched? No. As of May 2026, SD 3.5 Large remains Stability AI’s flagship. The SD 3.5 family (Large, Large Turbo, Medium) was released October 29, 2024, and received three new ControlNets (Blur, Canny, Depth) on January 29, 2026, plus NVIDIA NIM microservice integration and TensorRT optimizations through early 2026. No SD 4 release date has been announced.
What GPU do I need to run SD 3.5 Large locally? NVIDIA GPU with 8GB+ VRAM is the practical minimum (RTX 3070 / 4070 and above). TensorRT optimizations reduce VRAM usage by roughly 40% on RTX GPUs. AMD Radeon and Ryzen AI are supported via ONNX runtime. Apple Silicon Macs run via CoreML backends. SD 3.5 Medium (2.5B parameters) targets consumer hardware and runs on lower-VRAM setups. SDXL and SD 1.5 remain viable for 4-6GB VRAM GPUs with optimizations.
What is the Stability API pricing? Credits on the Stability API are $0.01 each. Stable Image Core costs 3 credits ($0.03) per image, SD 3.5 Large Turbo 4 credits ($0.04), SD 3.5 Large 6.5 credits ($0.065), and Stable Image Ultra 8 credits ($0.08). New users get 25 free credits on signup. Third-party hosts like Replicate and Fal offer per-second GPU billing that can be cheaper for specific workloads.
Can I use Stable Diffusion commercially? Yes. Commercial use is free under the Community License for individuals and organizations earning under $1M annual revenue. Above $1M, an Enterprise license is required. Outputs can be sold, used in products, or licensed further, subject to the Acceptable Use Policy. Note: Stability does not offer Adobe Firefly-style IP indemnification; teams requiring legal safe harbor should evaluate Firefly as an alternative.
How does SD 3.5 Large compare to Flux and Midjourney? Out of the box, Flux 2 Pro and Midjourney v7 produce more polished photoreal and aesthetic outputs. SD 3.5 Large’s advantage is customization: LoRA fine-tuning, ControlNet conditioning, and the Civitai checkpoint library close or exceed the gap for specific niches (consistent characters, product photography, architectural rendering). Midjourney is closed-weight with no self-hosting option. Flux is partially open (Dev weights) but with a smaller ecosystem than SD. Pick SD for depth of control; pick Flux for default photoreal; pick Midjourney for aesthetic baseline.
Is SDXL still useful in 2026? Yes. SDXL remains widely used in the community. Stability AI no longer positions it as flagship, but the library of SDXL checkpoints, LoRAs, and ControlNets on Civitai is larger and more mature than the SD 3.5 ecosystem. Automatic1111 and ComfyUI fully support both. If a community checkpoint fits your use case, SDXL is still a valid production choice.
Related
- Flux · open-weight photoreal leader; weaker ecosystem, stronger defaults
- Midjourney · closed aesthetic leader, no self-host
- Ideogram · text-in-image specialist
- Leonardo · hosted SD-adjacent UI with LoRA training
- Adobe Firefly · commercial indemnification alternative
- ChatGPT · GPT Image 2 for bundled text+image workflows
- Category: AI Image · AI Design
Sources
- Stability AI news and updates (verified 2026-05-13)
- Stability AI Community License (verified 2026-05-13)
- Stability AI developer platform pricing (verified 2026-05-13)
- Introducing Stable Diffusion 3.5 (verified 2026-05-13)
- Stability AI Stable Image product page (verified 2026-05-13)
- SD 3.5 Large Hugging Face model card (verified 2026-05-13)
- SD 3.5 Large ControlNets release, Jan 29, 2026 (verified 2026-05-13)
- Civitai community checkpoint repository (verified 2026-05-13)