GitHub - caniko/stable_diffusion_arc: Stable Difussion inference on Intel Arc dGPUs

Stable Diffusion inference on Intel Arc GPUs

Now that we have our Arc discrete GPU setup on Linux, let's try to run Stable Diffusion model using it. The gpu we are using is an Arc A770 16 GB card.

A quick recap / updated steps to set up Arc on Linux

Intel has now published documentation on how to set up Arc on Linux. I tried it today and it works beautifully.

Steps to configure Arc

Install the 5.7 OEM kernel
Install kernel mode drivers, gpu firmware
Install usermod drivers for compute, 3d graphics and media
Add user to render group
Install oneAPI 2022.3 (latest as of this writeup)

The version of the usermod drivers that I have tested with are:

intel-level-zero-gpu  (1.3.23937+i449~u22.04).
intel-media-va-driver-non-free  (22.5.1+i449~u22.04).
intel-opencl-icd  (22.32.23937+i449~u22.04).
level-zero  (1.8.5+i449~u22.04).
libegl-mesa0  (22.2.0.20220729.1.2+2046).
libegl1-mesa  (22.2.0.20220729.1.2+2046).
libegl1-mesa-dev  (22.2.0.20220729.1.2+2046).
libgbm1  (22.2.0.20220729.1.2+2046).
libgl1-mesa-dev  (22.2.0.20220729.1.2+2046).
libgl1-mesa-dri  (22.2.0.20220729.1.2+2046).
libglapi-mesa  (22.2.0.20220729.1.2+2046).
libgles2-mesa-dev  (22.2.0.20220729.1.2+2046).
libglx-mesa0  (22.2.0.20220729.1.2+2046).
libigdgmm12  (22.1.7+i449~u22.04).
libmfx1  (22.5.1+i449~u22.04).
libmfxgen1  (22.5.1+i449~u22.04).
libvpl2  (2022.1.6.0+i449~u22.04).
libxatracker2  (22.2.0.20220729.1.2+2046).
mesa-va-drivers  (22.2.0.20220729.1.2+2046).
mesa-vdpau-drivers  (22.2.0.20220729.1.2+2046).
mesa-vulkan-drivers  (22.2.0.20220729.1.2+2046).
va-driver-all  (2.15.0.2-36).

Stable Diffusion

Stable Diffusion is a fully open-source (thank you Stability.ai) deep learning text to image and image to image model. For more information on the model, checkout the wikipedia entry for the same.

PyTorch

To use PyTorch on Intel GPUs, we need to install, the Intel extensions for PyTorch or ipex. Let's get the latest release for pyTorch and ipex.

Create a conda environment with Python 3.9 and install both of the wheels.

~ → conda create -n ipex python=3.9 -y

~ → conda activate ipex
~ → pip install ~/Downloads/*.whl

Install diffusers library and dependencies

~ → pip install diffusers ftfy transformers Pillow

Run stable diffusion

We will use a model from 🤗 maintained by runwayml, runwayml/stable-diffusion-v1-5. To use the model, you will have to generate a User access token for the 🤗 model hub. Once generated we can easily download the model using diffusers API. Now that we have installed all the required packages and have the user token, lets try it out:

import intel_extension_for_pytorch
import torch
from diffusers import StableDiffusionPipeline

model_id="runwayml/stable-diffusion-v1-5"
prompt = "vivid red hot air ballons over paris in the evening"
pipe = StableDiffusionPipeline.from_pretrained(
    model_id,
    torch_dtype=torch.float16,  # this can be torch.float32 as well
    revision="fp16",
    use_auth_token="<the token you generated>")
pipe = pipe.to("xpu")
image = pipe(prompt).images[0]
image.save(f"{prompt[:5]}.png")

Executing this, we get the result:

In [8]: image = pipe(prompt).images[0]
   ...: 
100%|██████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 51/51 [00:35<00:00,  1.43it/s]
In [9]: image = pipe(prompt).images[0]
100%|██████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 51/51 [00:09<00:00,  5.20it/s]

As you can see the first time you run the model, it takes about 35 seconds, subsequent runs take about 10 seconds, you can expect this number to double when using fp32.

TensorFlow

Moving on to TensorFlow, we have this awesome repo from divamgupta:

Create a conda environment with Python 3.9 and install tensorflow and intel_extension_for_tensorflow wheels.

~ → conda create -n itex python=3.9 -y

~ → conda activate itex
~ → pip install tensorflow==2.10.0
~ → pip install --upgrade intel-extension-for-tensorflow[gpu]

Let's see how to run the model using PyTorch first,

Install stable_diffusion_tensorflow package and dependencies

~ → pip install git+https://github.com/divamgupta/stable-diffusion-tensorflow ftfy pillow tqdm regex tensorflow-addons

Run stable diffusion

Running the TensorFlow model is straightforward as there are no user tokens or anything like that required.

import intel_extension_for_tensorflow
import tensorflow
from stable_diffusion_tf.stable_diffusion import StableDiffusion
from PIL import Image

prompt = "vivid red hot air ballons over paris in the evening"
generator = StableDiffusion(
    img_height=512,
    img_width=512,
    jit_compile=False,
)

img = generator.generate(
    prompt,
    num_steps=50,
    unconditional_guidance_scale=7.5,
    temperature=1,
    batch_size=1,
)
Image.fromarray(img[0]).save("sd_tf_fp32.png")

Executing this, we get the result:

2022-11-06 23:00:51.948547: I tensorflow/core/grappler/optimizers/custom_graph_optimizer_registry.cc:114] Plugin optimizer for device_type XPU is enabled.
  0   1: 100%|█████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 50/50 [01:00<00:00,  1.21s/it]
2022-11-06 23:01:55.103111: I tensorflow/core/grappler/optimizers/custom_graph_optimizer_registry.cc:114] Plugin optimizer for device_type XPU is enabled.
  0   1: 100%|█████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 50/50 [00:29<00:00,  1.67it/s]

As you can see the first time you run the model, it takes about 60 seconds, subsequent runs take about 30 seconds. One thing to note here is that, for the TensorFlow version we used FP32 and not FP16 as in the case of pyTorch.

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
images		images
pytorch		pytorch
tensorflow		tensorflow
LICENSE		LICENSE
Readme.md		Readme.md
sd_pytorch.py		sd_pytorch.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Stable Diffusion inference on Intel Arc GPUs

A quick recap / updated steps to set up Arc on Linux

Steps to configure Arc

Stable Diffusion

PyTorch

TensorFlow

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Stable Diffusion inference on Intel Arc GPUs

A quick recap / updated steps to set up Arc on Linux

Steps to configure Arc

Stable Diffusion

PyTorch

TensorFlow

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages