AI-generated images - Stable Diffusion on NixOS
In this blog post, I share my journey of discovering AI-generated images, starting with generating an image for my blog using Craiyon and moving on to DALL·E 2 and Stable Diffusion. While DALL·E 2 required credits for each run, Stable Diffusion was an open-source and more affordable option that I could run locally. I also share the steps I followed to run Stable Diffusion on NixOS.
§ AI-generated images
AI-generated images became interesting to me when I started thinking about creating this blog. I wanted to display my GitHub avatar at the top, but the original I was using was recklessly copied from a copyrighted site.
I got the idea that if I generated an image myself, I could keep using the "alien octopus" and avoid the copyright issue. I am actually still not fully sure how copyright for AI-generated images works, but I imagine that the AI is a tool like Photoshop: the images created with the tool belong not to the software, but to the human "author".
My path of discovery started with Craiyon. The first image I chose from the generated batch was this one:
On the same day I found the DALL·E 2 subreddit with a fresh post announcing that DALL·E 2 registration no longer requires a special invite from the waitlist and has gone fully public. So I jumped in and generated some new octopuses:
How gorgeous! But a bitter realization followed: each run requires credits. Some free credits are given at sign-up and a few more each month; extra credits are paid. Of course, this model makes sense for professional writers. But for a hobbyist who just wants to play, it feels like a sword of Damocles: it does not allow you to experiment freely with your imagination, because you are going to run out of credits any time soon.
So the journey continued. Xe recently wrote some posts about Stable Diffusion: a competitor to DALL·E 2, but an open-source one 💕. And, surprisingly, it originates from Munich, which made me personally happy.
My first experience with Stable Diffusion was via Dream Studio. It also offered an option I hadn't seen in DALL·E 2 before: you can pass a reference image together with a prompt, and a new image will be generated in the same style. For example, I drew the image on the left a long time ago, and the image on the right is the one generated via Dream Studio.
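This image-to-image mode also exists in the open-source diffusers library, so a rough idea of what Dream Studio does here can be sketched as follows. Treat this only as an illustration under my assumptions: the file name and prompt are made up, and the parameter names vary between diffusers versions (older releases called the argument init_image instead of image).
import torch
from PIL import Image
from diffusers import StableDiffusionImg2ImgPipeline

# load the img2img variant of the pipeline
pipe = StableDiffusionImg2ImgPipeline.from_pretrained(
    "CompVis/stable-diffusion-v1-4",
    torch_dtype=torch.float16,
).to("cuda")

# hypothetical reference image; Stable Diffusion works best at 512x512
init_image = Image.open("my-old-drawing.png").convert("RGB").resize((512, 512))

# strength controls how far the result may drift from the reference
result = pipe(
    prompt="alien octopus, digital art",
    image=init_image,
    strength=0.75,
).images[0]
result.save("restyled.png")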
And the credits again. Ah, no problem anymore: we can run it locally and forget the whole credit nightmare, as long as we have a super-duper GPU with 10 GB of free VRAM. Which we have, right? And if not, there is an optimized version that is satisfied with 2.4 GB of VRAM. These are the miracles that start happening when things go open source.
§ What worked
I am enchanted by Nix, and if there is a solution, then this solution will be running on NixOS. And the best place for finding NixOS solutions is the NixOS Discourse.
Stable Diffusion using nix flakes by Collin Arnett
These steps led me to a locally running Stable Diffusion (the whole flow is sketched after the list):
- clone the repository
- register an account on Hugging Face
- generate an access token
- accept the terms and conditions for the Stable Diffusion model
- run
nix run --impure
- update the token and run again
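Put together, the flow looks roughly like this. This is only a sketch under my assumptions: the repository placeholder stands for the flake linked in the Discourse post, and HF_TOKEN matches the environment variable that the script shown below reads, which is also why nix run needs the --impure flag.
git clone <repository>    # the flake repository from the Discourse post
cd <repository>
export HF_TOKEN="hf_..."  # the access token generated on Hugging Face
nix run --impure          # --impure lets the script read HF_TOKEN from the environment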
The first run resulted in a lot of downloading, but subsequent runs spent time only on image generation. Later I updated the script to generate 4 images at once and to save them in PNG format. The final version before I decided to write this post looked like this:
import os
import torch
from torch import autocast
from diffusers import StableDiffusionPipeline
from IPython.display import display

# the Hugging Face access token is read from the environment
HF_TOKEN = os.environ.get("HF_TOKEN")

pipe = StableDiffusionPipeline.from_pretrained(
    "CompVis/stable-diffusion-v1-4",
    revision="fp16",         # half-precision weights to save VRAM
    torch_dtype=torch.float16,
    use_auth_token=HF_TOKEN,
)
pipe = pipe.to("cuda")
# compute attention in slices to further reduce VRAM usage
pipe.enable_attention_slicing()

prompt = "Octopus symmetry geometry math"

# generate four images for the same prompt
with autocast("cuda"):
    image1 = pipe(prompt).images[0]
    image2 = pipe(prompt).images[0]
    image3 = pipe(prompt).images[0]
    image4 = pipe(prompt).images[0]

display(image1)
display(image2)
display(image3)
display(image4)

image1.save(prompt + "1.png")
image2.save(prompt + "2.png")
image3.save(prompt + "3.png")
image4.save(prompt + "4.png")
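As a side note, depending on the diffusers version, the four separate calls could be collapsed into one batched call via the num_images_per_prompt argument, assuming the GPU has enough VRAM for the whole batch; a minimal sketch:
with autocast("cuda"):
    images = pipe(prompt, num_images_per_prompt=4).images

# keep the same file naming as above
for i, image in enumerate(images, start=1):
    image.save(prompt + str(i) + ".png")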
And here are some images I generated for the prompt "Octopus symmetry geometry math":
Update: Created repo github:PrimaMateria/stable-diffusion.
§ What didn't work
Just for the record, here is what I tried before and what didn't work for me.
First, I tried to follow Xe's blog post:
nix shell nixpkgs#conda
conda-shell
But running this resulted first in the error message
*** stack smashing detected ***: terminated
and later in a failed creation of the Conda environment because git was not working. Running git in the Conda shell reported:
git: error while loading shared libraries: __vdso_gettimeofday: invalid mode for dlopen(): Invalid argument
I also tried to install Conda via the old nix-shell:
nix-shell -p conda
Git was still not working, now with a slightly different message:
git: /usr/lib/libc.so.6: version `GLIBC_2.34' not found (required by git)
I tried to look for help on the NixOS wiki, but unfortunately it didn't help at all. When I searched Discord for the keyword Conda, I got NobbZ's recent message, "The most reliable way of using conda I have heard about was through a VM", which ended my further efforts involving Conda.
And then I turned to Discourse, where I found the working solution without Conda, as described above.