Stable Diffusion

Welcome to the Stable Diffusion community, dedicated to the exploration and discussion of the open source deep learning model known as Stable Diffusion.

Introduced in 2022, Stable Diffusion uses a latent diffusion model to generate detailed images based on text descriptions and can also be applied to other tasks such as inpainting, outpainting, and generating image-to-image translations guided by text prompts. The model was developed by the startup Stability AI, in collaboration with a number of academic researchers and non-profit organizations, marking a significant shift from previous proprietary models that were accessible only via cloud services.
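
For anyone new here, this is roughly what the text-to-image path looks like with the Hugging Face diffusers library; a minimal sketch, with an illustrative checkpoint id and prompt:

```python
# Minimal text-to-image sketch with Hugging Face diffusers.
# Checkpoint id and prompt are illustrative, not prescriptive.
import torch
from diffusers import StableDiffusionPipeline

pipe = StableDiffusionPipeline.from_pretrained(
    "CompVis/stable-diffusion-v1-4",  # any SD 1.x checkpoint works here
    torch_dtype=torch.float16,
).to("cuda")

image = pipe(
    "a watercolor lighthouse at dawn",  # the text description to condition on
    num_inference_steps=20,
    guidance_scale=7.0,
).images[0]
image.save("lighthouse.png")
```

Image-to-image and inpainting reuse the same sampling loop with an extra image input; diffusers exposes them as separate pipeline classes (StableDiffusionImg2ImgPipeline, StableDiffusionInpaintPipeline).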

Without paywall: https://archive.ph/QD9v1

Excerpt from the relevant “ComfyUI dev” Matrix room:

matt3o
and what is it then?

comfyanonymous
"safety training"

matt3o
why does it trigger on certain keywords and it's like it's scrambling the image?

comfyanonymous
the 2B wasn't the one I had been working on so I don't really know the specifics

matt3o
I was even able to trick it by sending certain negatives

comfyanonymous
I was working on a T5-only 4B model which would ironically have been safer without breaking everything
because T5 doesn't know any image data, so it was only able to generate images in the distribution of the filtered training data

comfyanonymous
but they canned my 4B and I wasn't really following the 2B that closely

[…]

comfyanonymous
yeah they did something with the weights
the model arch of the 2B was never changed at all

BVH
weights directly?
oh boy, abliteration, the worst kind

comfyanonymous
also they apparently messed up the pretraining on the 2B so it was never supposed to actually be released

[…]

comfyanonymous
yeah the 2B apparently was a bit of a failed experiment by the researchers that left
but there was a strong push by the top of the company to release the 2B instead of the 4B and 8B

Additional excerpt (after the Reddit post) from Stable Diffusion Discord “#sd3”:

comfy
Yes I resigned over 2 weeks ago and Friday was my last day at stability
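
An aside on the T5 point above, since it's easy to miss in chat: CLIP's text encoder is trained jointly against images, while a T5 encoder has only ever seen text, so a T5-conditioned model can only map prompts into the image distribution it was actually trained on. A rough sketch of the two encoder families via the transformers library (checkpoint ids are common public ones, chosen purely for illustration):

```python
# Contrast the two text-encoder families discussed above.
# CLIP's text tower was trained against images; T5 was trained on text only.
from transformers import (CLIPTextModel, CLIPTokenizer,
                          T5EncoderModel, T5Tokenizer)

prompt = "a photo of an astronaut riding a horse"

# CLIP text encoder (SD 1.x conditions on this model)
clip_tok = CLIPTokenizer.from_pretrained("openai/clip-vit-large-patch14")
clip_enc = CLIPTextModel.from_pretrained("openai/clip-vit-large-patch14")
clip_emb = clip_enc(**clip_tok(prompt, return_tensors="pt")).last_hidden_state

# T5 encoder (text-only pretraining; larger variants condition newer models)
t5_tok = T5Tokenizer.from_pretrained("google/t5-v1_1-small")
t5_enc = T5EncoderModel.from_pretrained("google/t5-v1_1-small")
t5_emb = t5_enc(**t5_tok(prompt, return_tensors="pt")).last_hidden_state

print(clip_emb.shape, t5_emb.shape)  # both are per-token embedding sequences
```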

Basic ComfyUI workflows without 10k custom nodes or impossible-to-follow graphs.
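
In the same spirit, here is what a truly basic txt2img graph looks like in ComfyUI's API (JSON) format, queued over its local HTTP endpoint. Only built-in nodes; the checkpoint filename and prompt are placeholders, so treat this as a sketch rather than a definitive recipe:

```python
# A minimal ComfyUI txt2img graph in API format, sent to a local server.
# Node ids map to {"class_type", "inputs"}; ["1", 0] means "output 0 of node 1".
import json, urllib.request

workflow = {
    "1": {"class_type": "CheckpointLoaderSimple",
          "inputs": {"ckpt_name": "sd_xl_base_1.0.safetensors"}},  # placeholder
    "2": {"class_type": "CLIPTextEncode",
          "inputs": {"text": "a cozy cabin in a snowy forest", "clip": ["1", 1]}},
    "3": {"class_type": "CLIPTextEncode",
          "inputs": {"text": "", "clip": ["1", 1]}},  # empty negative prompt
    "4": {"class_type": "EmptyLatentImage",
          "inputs": {"width": 1024, "height": 1024, "batch_size": 1}},
    "5": {"class_type": "KSampler",
          "inputs": {"model": ["1", 0], "positive": ["2", 0], "negative": ["3", 0],
                     "latent_image": ["4", 0], "seed": 42, "steps": 20, "cfg": 7.0,
                     "sampler_name": "euler", "scheduler": "normal", "denoise": 1.0}},
    "6": {"class_type": "VAEDecode",
          "inputs": {"samples": ["5", 0], "vae": ["1", 2]}},
    "7": {"class_type": "SaveImage",
          "inputs": {"images": ["6", 0], "filename_prefix": "basic"}},
}

req = urllib.request.Request(
    "http://127.0.0.1:8188/prompt",
    data=json.dumps({"prompt": workflow}).encode(),
    headers={"Content-Type": "application/json"},
)
urllib.request.urlopen(req)
```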

Without paywall: https://archive.ph/8QkSl

I created a custom SDXL LoRA using my own dataset. I built the dataset with a generative art tool I previously made to visualize Factorio blueprints: https://github.com/piebro/factorio-blueprint-visualizer. I like using the LoRA to create interesting patterns.
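
For anyone wanting to try a LoRA like this, a minimal sketch of applying custom SDXL LoRA weights with diffusers; the LoRA path and prompt below are placeholders, not the poster's actual files:

```python
# Sketch: load SDXL, apply a custom LoRA, and generate.
import torch
from diffusers import StableDiffusionXLPipeline

pipe = StableDiffusionXLPipeline.from_pretrained(
    "stabilityai/stable-diffusion-xl-base-1.0", torch_dtype=torch.float16
).to("cuda")

# Hypothetical path to the trained LoRA weights.
pipe.load_lora_weights("path/to/factorio_blueprint_lora.safetensors")

image = pipe("intricate factory blueprint pattern, top-down schematic").images[0]
image.save("pattern.png")
```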

I don't know why it's cracking me up so much; I don't know what I was expecting. But I also can't find any easy reason why this is happening. Maybe an artist name I can't find now? A product name translation?

~~Stable Diffusion XL doesn't seem to replicate the same results~~ (actually it kind of does at 768px instead of 512px). Any Stable Diffusion ~~1.x~~ model with only the positive prompt:

turbotits
Settings:
Steps: 20, Sampler: Euler a, CFG scale: 7, Seed: 912261969, Size: 512x512, Model hash: f968fc436a, Model: analogMadness_v50, VAE hash: 735e4c3a44, VAE: vae-ft-mse-840000-ema-pruned.safetensors, Version: v1.6.0-2-g4afaaf8a
Steps: 20, Sampler: Euler a, CFG scale: 7, Seed: 1212726575, Size: 512x512, Model hash: 9aba26abdf, Model: deliberate_v2_sd1.5, VAE hash: 735e4c3a44, VAE: vae-ft-mse-840000-ema-pruned.safetensors, Version: v1.6.0-2-g4afaaf8a
Steps: 20, Sampler: DPM++ 2M, CFG scale: 7, Seed: 1020297828, Size: 512x512, Model hash: af9ab7cc05, Model: realcartoonPixar_v2_sd1.5, VAE hash: 735e4c3a44, VAE: animevae.pt, ADetailer model: yolov8n.pt, ADetailer confidence: 0.3, ADetailer dilate erode: 4, ADetailer mask blur: 4, ADetailer denoising strength: 0.4, ADetailer inpaint only masked: True, ADetailer inpaint padding: 32, ADetailer ControlNet model: control_v11p_sd15_inpaint [ebff9138], ADetailer ControlNet module: inpaint_global_harmonious, ADetailer version: 23.11.1, ControlNet 0: "Module: inpaint_global_harmonious, Model: control_v11p_sd15_inpaint [ebff9138], Weight: 1.0, Resize Mode: ResizeMode.INNER_FIT, Low Vram: False, Guidance Start: 0.0, Guidance End: 1.0, Pixel Perfect: True, Control Mode: ControlMode.BALANCED, Save Detected Map: True", Version: v1.6.0-2-g4afaaf8a
Steps: 20, Sampler: DPM++ 2M, CFG scale: 7, Seed: 2311419033, Size: 512x512, Model hash: cc6cb27103, Model: v1-5-pruned-emaonly, VAE hash: 735e4c3a44, VAE: vae-ft-mse-840000-ema-pruned.safetensors, Version: v1.6.0-2-g4afaaf8a
Steps: 20, Sampler: Euler a, CFG scale: 7, Seed: 2505533199, Size: 512x512, Model hash: fc2511737a, Model: chilloutmix_NiPrunedFp32Fix.sd1.5, VAE hash: 735e4c3a44, VAE: vae-ft-mse-840000-ema-pruned.safetensors, Clip skip: 2, Version: v1.6.0-2-g4afaaf8a
Steps: 20, Sampler: Euler a, CFG scale: 7, Seed: 3851799520, Size: 512x512, Model hash: fe4efff1e1, Model: HuggingFace_SD1.4, Clip skip: 2, Version: v1.6.0-2-g4afaaf8a
Steps: 20, Sampler: Euler a, CFG scale: 7, Seed: 2906017088, Size: 768x768, Model hash: e6bb9ea85b, Model: sdXL_v10VAEFix, Version: v1.6.0-2-g4afaaf8a
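
If you want to replay the experiment yourself, here is a sketch of driving the same one-word prompt through the AUTOMATIC1111 web UI API (launch the UI with --api); swap in each checkpoint and the matching seed from the settings above for an exact comparison:

```python
# Sketch: replay one of the generations above via the A1111 HTTP API.
# Assumes the web UI is running locally with --api and the right model loaded.
import base64, json, urllib.request

payload = {
    "prompt": "turbotits",
    "steps": 20,
    "sampler_name": "Euler a",
    "cfg_scale": 7,
    "seed": 912261969,  # seed from the first settings line above
    "width": 512,
    "height": 512,
}
req = urllib.request.Request(
    "http://127.0.0.1:7860/sdapi/v1/txt2img",
    data=json.dumps(payload).encode(),
    headers={"Content-Type": "application/json"},
)
result = json.loads(urllib.request.urlopen(req).read())

# The API returns images as base64 strings.
with open("out.png", "wb") as f:
    f.write(base64.b64decode(result["images"][0]))
```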


Some models seem to show better results using clip skip 1 (vs. 2). Anyway, I got a chuckle out of it. On a more utilitarian note, I suppose things like this could be used to gauge how "far" a checkpoint is from a base model across unknown merge points, like tracing DNA mutations (wording it poorly).
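
On that "DNA distance" idea, a rough sketch of what measuring it could look like: compare a derived checkpoint against a base model tensor by tensor. The filenames are illustrative, and this only works across checkpoints that share an architecture and key layout:

```python
# Sketch: estimate how far a derived checkpoint has drifted from a base
# model by averaging per-tensor cosine similarity over shared weights.
import torch
from safetensors.torch import load_file

base = load_file("v1-5-pruned-emaonly.safetensors")              # base model
merge = load_file("aZovyaRPGArtistTools_v3VAE_sd1.5.safetensors")  # derived model

total, n = 0.0, 0
for key, w in base.items():
    if key in merge and merge[key].shape == w.shape:
        a, b = w.float().flatten(), merge[key].float().flatten()
        total += torch.nn.functional.cosine_similarity(a, b, dim=0).item()
        n += 1

print(f"mean cosine similarity over {n} shared tensors: {total / n:.4f}")
```

Lower similarity would suggest more merging/fine-tuning between the two, though it can't tell you which intermediate merges happened.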

settings for post/linked image:

Steps: 20, Sampler: Euler a, CFG scale: 7, Seed: 366961196, Size: 512x704, Model hash: 8c4042921a, Model: aZovyaRPGArtistTools_v3VAE_sd1.5, VAE hash: 735e4c3a44, VAE: vae-ft-mse-840000-ema-pruned.safetensors, Denoising strength: 0.52, Hires upscale: 1.5, Hires upscaler: Latent, Refiner: centerflex_v28.sd1.5_safetensors [02e248bf1b], Refiner switch at: 0.8, Version: v1.6.0-2-g4afaaf8a

Using ParchArtXL civitai.com/models/141471/parchartxl
