A First Look at the Diffusers Library and How to Use It

Posted by iSherryZhang on 2023-02-23

The goals of the diffusers library are to:

  • Consolidate diffusion models into a single, long-term maintained project
  • Reproduce high-impact machine learning systems such as DALL-E and Imagen in a way that is accessible to the public
  • Make it easy for developers to use the API to train new models or run inference with existing ones

The core of diffusers is made up of three components (a minimal sketch of how Models and Schedulers fit together follows this list):

  • Pipelines: high-level classes for quickly generating samples from popular diffusion models in a user-friendly way
  • Models: popular architectures for training new diffusion models, such as UNet
  • Schedulers: the various techniques for generating an image from noise at inference time, or for producing the noisy images used during training
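
Before moving on to installation and the Pipeline API, here is a minimal sketch of how the Models and Schedulers components fit together on their own. The repo id google/ddpm-cat-256 and the classes UNet2DModel and DDPMScheduler are illustrative choices, not something the rest of this article depends on:

import torch
from diffusers import UNet2DModel, DDPMScheduler

# Model: the network that predicts the noise present in a sample
model = UNet2DModel.from_pretrained("google/ddpm-cat-256")
# Scheduler: turns that prediction into the next, slightly less noisy sample
scheduler = DDPMScheduler.from_pretrained("google/ddpm-cat-256")

sample = torch.randn(1, 3, model.config.sample_size, model.config.sample_size)
for t in scheduler.timesteps:  # the full DDPM schedule; slow on CPU but easy to follow
    with torch.no_grad():
        noise_pred = model(sample, t).sample
    sample = scheduler.step(noise_pred, t, sample).prev_sample

A Pipeline bundles exactly these pieces (plus, for Stable Diffusion, a text encoder and a VAE) behind a single call.
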
Installing diffusers
pip install diffusers
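
Note that the Stable Diffusion pipeline used below also relies on the transformers package for its CLIP text encoder and tokenizer, so in practice you will usually install both:

pip install diffusers transformers
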
First, inference

Import the pipeline class and load the model with from_pretrained(); this can be a local model, or one downloaded automatically from the Hugging Face Hub.

from diffusers import StableDiffusionPipeline

image_pipe = StableDiffusionPipeline.from_pretrained("CompVis/stable-diffusion-v1-4")
# Load a local model instead:
# image_pipe = StableDiffusionPipeline.from_pretrained("./models/Stablediffusion/stable-diffusion-v1-4")
image_pipe.to("cuda")

prompt = "a photograph of an astronaut riding a horse"
pipe_out = image_pipe(prompt)

image = pipe_out.images[0]
# you can save the image with
# image.save(f"astronaut_rides_horse.png")

Let's take a look at what image_pipe contains:
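
Printing the pipeline object is one simple way to see this (a one-liner, using the image_pipe loaded above):

print(image_pipe)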

StableDiffusionPipeline {
  "_class_name": "StableDiffusionPipeline",
  "_diffusers_version": "0.10.2",
  "feature_extractor": [
    "transformers",
    "CLIPFeatureExtractor"
  ],
  "requires_safety_checker": true,
  "safety_checker": [
    "stable_diffusion",
    "StableDiffusionSafetyChecker"
  ],
  "scheduler": [
    "diffusers",
    "PNDMScheduler"
  ],
  "text_encoder": [
    "transformers",
    "CLIPTextModel"
  ],
  "tokenizer": [
    "transformers",
    "CLIPTokenizer"
  ],
  "unet": [
    "diffusers",
    "UNet2DConditionModel"
  ],
  "vae": [
    "diffusers",
    "AutoencoderKL"
  ]
}

And the structure of pipe_out:

StableDiffusionPipelineOutput(
images=[<PIL.Image.Image image mode=RGB size=512x512 at 0x1A14BDD7730>], 
nsfw_content_detected=[False])

From this we can see that pipe_out has two parts: the first is the list of generated images (if there is only one image, pipe_out.images[0] retrieves it), and the second, nsfw_content_detected, flags for each image whether the safety checker filtered it.
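
A minimal sketch that uses both fields (assuming the single-prompt pipe_out from above; the filename is arbitrary):

image = pipe_out.images[0]
if not pipe_out.nsfw_content_detected[0]:  # skip images the safety checker blanked out
    image.save("astronaut_rides_horse.png")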

What if we want to generate several images at once? Simply pass a list of prompts of the desired length, as in the code below.

from diffusers import StableDiffusionPipeline

image_pipe = StableDiffusionPipeline.from_pretrained("CompVis/stable-diffusion-v1-4")

image_pipe.to("cuda")
prompt = ["a photograph of an astronaut riding a horse"] * 3
out_images = image_pipe(prompt).images
for i, out_image in enumerate(out_images):
    out_image.save("astronaut_rides_horse" + str(i) + ".png")

When generating images with image_pipe, float32 precision is used by default. If GPU memory is insufficient, this may cause an out-of-memory error; in that case, you can load a float16 version of the model instead.

Note: If you are limited by GPU memory and have less than 10GB of GPU RAM available, please make sure to load the StableDiffusionPipeline in float16 precision instead of the default float32 precision as done above.

You can do so by loading the weights from the fp16 branch and by telling diffusers to expect the weights to be in float16 precision:

import torch
image_pipe = StableDiffusionPipeline.from_pretrained("CompVis/stable-diffusion-v1-4", revision="fp16", torch_dtype=torch.float16)

Each pipeline also has its own specific arguments. For example, besides the required prompt, StableDiffusionPipeline accepts arguments such as:

  • num_inference_steps: int = 50
  • guidance_scale: float = 7.5
  • generator: Optional[torch.Generator] = None
  • etc.

Example: if you want to get the same result every time, set the same seed on each run:

generator = torch.Generator("cuda").manual_seed(1024)
prompt = ["a photograph of an astronaut riding a horse"] * 3
out_images = image_pipe(prompt, generator=generator).images
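
The other arguments are passed in the same way; the values below are purely illustrative:

out_images = image_pipe(
    prompt,
    num_inference_steps=30,  # fewer denoising steps: faster, potentially lower quality
    guidance_scale=9.0,      # higher values follow the prompt more closely
    generator=generator,
).images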
Next, training
