DiffBIR: Super-Resolution and Restoration of Degraded Images

October 7, 2023 · Programming

1 Introduction
2 Implementation
3 Conclusion

Introduction

DiffBIR uses a pretrained Stable Diffusion model to restore degraded images, including super-resolution. (‘◇’)ゞ

Implementation

Test environment

Windows 11 Pro (Windows Insider Program)
Mouse Computer G-Tune E5-144
CPU: Intel(R) Core(TM) i7-10875H processor
Memory: 32 GB
SSD (M.2): 512 GB NVMe SSD
Graphics: NVIDIA GeForce RTX 2060 / 6 GB

Python: 3.10 (via pyenv)
CUDA: 11.7

Procedure

Git Clone

git clone https://github.com/XPixelGroup/DiffBIR.git


Change the current directory

cd DiffBIR


Create and activate a virtual environment (Python 3.10.5 via pyenv)

pyenv local 3.10.5
python -m venv venv
venv/scripts/activate
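
As a quick sanity check after activation, the sketch below reports whether the interpreter is a Python 3.10.x build and whether a virtual environment is active. It is not part of the original steps, and the helper names are made up for illustration.

```python
import sys


def in_virtualenv() -> bool:
    """Return True when the interpreter runs inside a venv."""
    # Inside a venv, sys.prefix points at the venv directory while
    # sys.base_prefix still points at the base installation.
    return sys.prefix != sys.base_prefix


def is_python_310(version_info=sys.version_info) -> bool:
    """Return True for any Python 3.10.x interpreter."""
    return tuple(version_info[:2]) == (3, 10)


if __name__ == "__main__":
    print("Python:", sys.version.split()[0])
    print("Python 3.10.x:", is_python_310())
    print("Inside a virtual environment:", in_virtualenv())
```

If either check prints False, the later pip commands would probably install into a different interpreter than the one intended here.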


Install the required libraries

Edit requirements.txt and install

Comment out part of requirements.txt as shown below.

# --extra-index-url https://download.pytorch.org/whl/cu116
# torch==1.13.1+cu116
# torchvision==0.14.1+cu116
xformers==0.0.16
pytorch_lightning==1.4.2
einops
open-clip-torch
omegaconf
torchmetrics==0.6.0
# triton==2.0.0
opencv-python-headless
scipy
matplotlib
lpips
gradio
chardet
transformers
facexlib
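
Editing the file by hand works fine, but the same change can also be scripted. The following is a minimal sketch that assumes the four lines to disable are exactly the ones commented out in the listing above. The function name and the prefix tuple are illustrative and do not come from DiffBIR.

```python
from pathlib import Path

# Prefixes of the lines that are commented out in the listing above.
PREFIXES_TO_DISABLE = ("--extra-index-url", "torch==", "torchvision==", "triton==")


def comment_out(text: str, prefixes=PREFIXES_TO_DISABLE) -> str:
    """Prefix matching requirement lines with '# ', leaving others untouched."""
    result = []
    for line in text.splitlines():
        # Lines that are already commented start with '#', so they never match.
        if line.strip().startswith(prefixes):
            result.append("# " + line)
        else:
            result.append(line)
    return "\n".join(result) + "\n"


if __name__ == "__main__":
    path = Path("requirements.txt")
    if path.exists():
        # Dry run: print the edited text instead of overwriting the file.
        print(comment_out(path.read_text(encoding="utf-8")))
```

Note that torchmetrics==0.6.0 stays active because it does not start with torch==, and running the function twice leaves the text unchanged.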


Install with pip.

pip install -r requirements.txt


Install Triton

Download triton-2.0.0-cp310-cp310-win_amd64.whl from the page below and save it in the same folder (the current DiffBIR directory).
https://huggingface.co/r4ziel/xformers_pre_built/commit/22505e3edeead471f3801ff2c3d478ffa51be755

Install it with pip.

pip install triton-2.0.0-cp310-cp310-win_amd64.whl
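
The wheel filename itself says which interpreter it targets: cp310 means CPython 3.10 and win_amd64 means 64-bit Windows, which is why the Python 3.10 environment above matters. A small sketch of reading those tags follows. The helper names are hypothetical, and the parsing only covers the standard dash-separated wheel naming.

```python
import sys


def parse_wheel_filename(filename: str) -> dict:
    """Split a wheel filename into its dash-separated tags."""
    if not filename.endswith(".whl"):
        raise ValueError("not a wheel file: " + filename)
    parts = filename[: -len(".whl")].split("-")
    if len(parts) not in (5, 6):
        raise ValueError("unexpected wheel filename: " + filename)
    # The last three fields are always python tag, ABI tag and platform tag;
    # an optional build tag may sit between the version and the python tag.
    return {
        "name": parts[0],
        "version": parts[1],
        "python_tag": parts[-3],
        "abi_tag": parts[-2],
        "platform_tag": parts[-1],
    }


def matches_running_python(filename: str) -> bool:
    """Check whether the wheel's CPython tag matches this interpreter."""
    expected = f"cp{sys.version_info.major}{sys.version_info.minor}"
    return parse_wheel_filename(filename)["python_tag"] == expected


if __name__ == "__main__":
    wheel = "triton-2.0.0-cp310-cp310-win_amd64.whl"
    print(parse_wheel_filename(wheel))
    print("Matches this Python:", matches_running_python(wheel))
```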


Install PyTorch

First, uninstall the existing torch and torchvision.

pip uninstall torch torchvision


Then install the PyTorch build for CUDA 11.7.

pip install torch torchvision --index-url https://download.pytorch.org/whl/cu117
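
To confirm that the reinstall picked up a CUDA build, something like the following can be run inside the virtual environment. This is only a sketch: describe_torch is a made-up helper, and it falls back gracefully when torch is not installed.

```python
import importlib.util


def describe_torch() -> dict:
    """Report the installed torch build, or note that torch is missing."""
    if importlib.util.find_spec("torch") is None:
        return {"installed": False}
    import torch  # imported lazily so the script also runs without torch

    return {
        "installed": True,
        "torch_version": torch.__version__,
        "cuda_build": torch.version.cuda,  # None for CPU-only builds
        "cuda_available": torch.cuda.is_available(),
    }


if __name__ == "__main__":
    for key, value in describe_torch().items():
        print(f"{key}: {value}")
```

With a build from the cu117 index, cuda_build would normally read 11.7. If cuda_available is False, the GPU driver is a likely place to look before continuing.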


Place the pretrained models

Download the pretrained models from the URLs below, create a folder named weights, and save them there.

https://huggingface.co/lxq007/DiffBIR/resolve/main/general_full_v1.ckpt
https://huggingface.co/lxq007/DiffBIR/resolve/main/general_swinir_v1.ckpt
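
The download can also be scripted with the standard library. The sketch below assumes the target folder is weights, the name the launch command expects. The function names and the --download switch are inventions for this example. By default it only prints where each file would go.

```python
import sys
import urllib.request
from pathlib import Path

MODEL_URLS = [
    "https://huggingface.co/lxq007/DiffBIR/resolve/main/general_full_v1.ckpt",
    "https://huggingface.co/lxq007/DiffBIR/resolve/main/general_swinir_v1.ckpt",
]


def plan_downloads(urls, dest_dir="weights"):
    """Map each URL to the local path it should be saved to."""
    dest = Path(dest_dir)
    return [(url, dest / url.rsplit("/", 1)[-1]) for url in urls]


def download_missing(urls, dest_dir="weights"):
    """Download every file that is not already present in dest_dir."""
    Path(dest_dir).mkdir(parents=True, exist_ok=True)
    for url, target in plan_downloads(urls, dest_dir):
        if target.exists():
            print("already present:", target)
            continue
        print("downloading:", url)
        urllib.request.urlretrieve(url, target)


if __name__ == "__main__":
    for url, target in plan_downloads(MODEL_URLS):
        print(target, "<-", url)
    # Downloading only starts when --download is passed explicitly,
    # since model checkpoints tend to be large.
    if "--download" in sys.argv:
        download_missing(MODEL_URLS)
```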

Launch command

Start the app with the following command.

python gradio_diffbir.py --ckpt weights/general_full_v1.ckpt --config configs/model/cldm.yaml --reload_swinir --swinir_ckpt weights/general_swinir_v1.ckpt --device cuda
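
A missing checkpoint or a misnamed folder is an easy mistake at this step, so a pre-flight check can save a failed start. The sketch below only verifies that the files named in the command exist and prints the command. It does not launch anything, and the helper names are illustrative.

```python
import sys
from pathlib import Path

CKPT = "weights/general_full_v1.ckpt"
CONFIG = "configs/model/cldm.yaml"
SWINIR_CKPT = "weights/general_swinir_v1.ckpt"


def missing_files(paths, root="."):
    """Return the paths that do not exist as files under root."""
    return [p for p in paths if not (Path(root) / p).is_file()]


def build_command(device="cuda"):
    """Assemble the launch command shown above as an argument list."""
    return [
        sys.executable, "gradio_diffbir.py",
        "--ckpt", CKPT,
        "--config", CONFIG,
        "--reload_swinir",
        "--swinir_ckpt", SWINIR_CKPT,
        "--device", device,
    ]


if __name__ == "__main__":
    missing = missing_files(["gradio_diffbir.py", CKPT, CONFIG, SWINIR_CKPT])
    if missing:
        print("Missing files:", ", ".join(missing))
    else:
        print("Ready to run:", " ".join(build_command()))
```

Run it from the DiffBIR directory. The list returned by build_command() could be handed to subprocess.run to start the app from Python instead of typing the command.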


Open http://127.0.0.1:7860 in your browser.
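
If the page does not load, it helps to know whether anything is listening on that port yet, since the models can take a while to load. Here is a minimal check using only the standard library. is_listening is a hypothetical helper, and a successful TCP connection only shows that something is bound to the port, not that it is the DiffBIR UI.

```python
import socket


def is_listening(host="127.0.0.1", port=7860, timeout=1.0) -> bool:
    """Return True if a TCP connection to host:port succeeds."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections and timeouts alike.
        return False


if __name__ == "__main__":
    print("Something is listening on 127.0.0.1:7860:", is_listening())
```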

I tried it with the sample images bundled with the repository.

Left: original image; right: generated image

Conclusion

Looks like there is plenty to play with here～(´▽｀)～

About the author

さぷりぺんたん

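Earned a PhD in chemistry, then one day woke up to the idea that programming is the future. Got absorbed in data analysis and machine learning with Python. Lately busy building web services that run PyTorch-made ONNX models on Nuxt 3, and hooked on ChatGPT and Stable Diffusion ☆('ω')☆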
