kelseye commited on
Commit
233ead0
·
verified ·
1 Parent(s): 3dbdc51

Upload folder using huggingface_hub

Browse files
.gitattributes CHANGED
@@ -33,3 +33,30 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ _cover_images_/11_enhance_2.jpg filter=lfs diff=lfs merge=lfs -text
37
+ _cover_images_/12_enhance_2.jpg filter=lfs diff=lfs merge=lfs -text
38
+ _cover_images_/15_enhance_2.jpg filter=lfs diff=lfs merge=lfs -text
39
+ _cover_images_/5_enhance_2.jpg filter=lfs diff=lfs merge=lfs -text
40
+ _cover_images_/6_enhance_2.jpg filter=lfs diff=lfs merge=lfs -text
41
+ _cover_images_/9_enhance_2.jpg filter=lfs diff=lfs merge=lfs -text
42
+ assets/0_enhance_0.jpg filter=lfs diff=lfs merge=lfs -text
43
+ assets/0_enhance_1.jpg filter=lfs diff=lfs merge=lfs -text
44
+ assets/0_enhance_2.jpg filter=lfs diff=lfs merge=lfs -text
45
+ assets/11_enhance_1.jpg filter=lfs diff=lfs merge=lfs -text
46
+ assets/11_enhance_2.jpg filter=lfs diff=lfs merge=lfs -text
47
+ assets/12_enhance_0.jpg filter=lfs diff=lfs merge=lfs -text
48
+ assets/12_enhance_1.jpg filter=lfs diff=lfs merge=lfs -text
49
+ assets/12_enhance_2.jpg filter=lfs diff=lfs merge=lfs -text
50
+ assets/15_enhance_1.jpg filter=lfs diff=lfs merge=lfs -text
51
+ assets/15_enhance_2.jpg filter=lfs diff=lfs merge=lfs -text
52
+ assets/4_enhance_0.jpg filter=lfs diff=lfs merge=lfs -text
53
+ assets/4_enhance_1.jpg filter=lfs diff=lfs merge=lfs -text
54
+ assets/4_enhance_2.jpg filter=lfs diff=lfs merge=lfs -text
55
+ assets/5.jpg filter=lfs diff=lfs merge=lfs -text
56
+ assets/5_enhance_0.jpg filter=lfs diff=lfs merge=lfs -text
57
+ assets/5_enhance_1.jpg filter=lfs diff=lfs merge=lfs -text
58
+ assets/5_enhance_2.jpg filter=lfs diff=lfs merge=lfs -text
59
+ assets/6_enhance_1.jpg filter=lfs diff=lfs merge=lfs -text
60
+ assets/6_enhance_2.jpg filter=lfs diff=lfs merge=lfs -text
61
+ assets/9_enhance_1.jpg filter=lfs diff=lfs merge=lfs -text
62
+ assets/9_enhance_2.jpg filter=lfs diff=lfs merge=lfs -text
README.md ADDED
@@ -0,0 +1,65 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ ---
4
+ # Aesthetic Enhancement - Kontext Image Editing LoRA
5
+
6
+ ## Model Introduction
7
+
8
+ This LoRA model is trained based on the [Kontext](https://www.modelscope.cn/models/black-forest-labs/FLUX.1-Kontext-dev) model and [DiffSynth-Studio](https://github.com/modelscope/DiffSynth-Studio). After using this model, you can input the instruction `Enhance the aesthetic quality of this image.` to enhance the aesthetic quality of an image. **The model automatically analyzes aspects such as lighting and composition based on the image content and adjusts them to make the image more visually appealing.** This model can be applied repeatedly, meaning you can perform aesthetic enhancement multiple times on an already enhanced image.
9
+
10
+ ## Model Results
11
+
12
+ ||Example 1|Example 2|Example 3|
13
+ |-|-|-|-|
14
+ |Original|![](./assets/9.jpg)|![](./assets/6.jpg)|![](./assets/12.jpg)|
15
+ |1st Enhancement|![](./assets/9_enhance_0.jpg)|![](./assets/6_enhance_0.jpg)|![](./assets/12_enhance_0.jpg)|
16
+ |2nd Enhancement|![](./assets/9_enhance_1.jpg)|![](./assets/6_enhance_1.jpg)|![](./assets/12_enhance_1.jpg)|
17
+ |3rd Enhancement|![](./assets/9_enhance_2.jpg)|![](./assets/6_enhance_2.jpg)|![](./assets/12_enhance_2.jpg)|
18
+
19
+ ||Example 4|Example 5|Example 6|
20
+ |-|-|-|-|
21
+ |Original|![](./assets/15.jpg)|![](./assets/5.jpg)|![](./assets/11.jpg)|
22
+ |1st Enhancement|![](./assets/15_enhance_0.jpg)|![](./assets/5_enhance_0.jpg)|![](./assets/11_enhance_0.jpg)|
23
+ |2nd Enhancement|![](./assets/15_enhance_1.jpg)|![](./assets/5_enhance_1.jpg)|![](./assets/11_enhance_1.jpg)|
24
+ |3rd Enhancement|![](./assets/15_enhance_2.jpg)|![](./assets/5_enhance_2.jpg)|![](./assets/11_enhance_2.jpg)|
25
+
26
+ ## Usage Instructions
27
+
28
+ This model is built on the [DiffSynth-Studio](https://github.com/modelscope/DiffSynth-Studio/tree/main/examples/flux) framework. Please install it first:
29
+
30
+ ```
31
+ git clone https://github.com/modelscope/DiffSynth-Studio.git
32
+ cd DiffSynth-Studio
33
+ pip install -e .
34
+ ```
35
+
36
+ ```python
37
+ import torch
38
+ from diffsynth.pipelines.flux_image_new import FluxImagePipeline, ModelConfig
39
+ from PIL import Image
40
+ from modelscope import snapshot_download
41
+ ```
42
+
43
+ ```python
44
+ snapshot_download("DiffSynth-Studio/FLUX.1-Kontext-dev-lora-ArtAug", cache_dir="./models")
45
+ pipe = FluxImagePipeline.from_pretrained(
46
+ torch_dtype=torch.bfloat16,
47
+ device="cuda",
48
+ model_configs=[
49
+ ModelConfig(model_id="black-forest-labs/FLUX.1-Kontext-dev", origin_file_pattern="flux1-kontext-dev.safetensors"),
50
+ ModelConfig(model_id="black-forest-labs/FLUX.1-dev", origin_file_pattern="text_encoder/model.safetensors"),
51
+ ModelConfig(model_id="black-forest-labs/FLUX.1-dev", origin_file_pattern="text_encoder_2/"),
52
+ ModelConfig(model_id="black-forest-labs/FLUX.1-dev", origin_file_pattern="ae.safetensors"),
53
+ ],
54
+ )
55
+ pipe.load_lora(pipe.dit, "models/DiffSynth-Studio/FLUX.1-Kontext-dev-lora-ArtAug/model.safetensors", alpha=1)
56
+
57
+ image = Image.open("your_image.jpg")
58
+ image = pipe(
59
+ prompt="Enhance the aesthetic quality of this image.",
60
+ kontext_images=image,
61
+ embedded_guidance=2.5,
62
+ seed=0,
63
+ )
64
+ image.save("output.jpg")
65
+ ```
README_from_modelscope.md ADDED
@@ -0,0 +1,104 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ base_model: MusePublic/FLUX.1-Kontext-Dev@v1
3
+ cover_images:
4
+ - _cover_images_/9_enhance_2.jpg
5
+ - _cover_images_/6_enhance_2.jpg
6
+ - _cover_images_/12_enhance_2.jpg
7
+ - _cover_images_/15_enhance_2.jpg
8
+ - _cover_images_/5_enhance_2.jpg
9
+ - _cover_images_/11_enhance_2.jpg
10
+ frameworks:
11
+ - Pytorch
12
+ license: Apache License 2.0
13
+ tags:
14
+ - LoRA
15
+ - text-to-image
16
+ tasks:
17
+ - text-to-image-synthesis
18
+ vision_foundation: FLUX_1
19
+
20
+ #model-type:
21
+ ##如 gpt、phi、llama、chatglm、baichuan 等
22
+ #- gpt
23
+
24
+ #domain:
25
+ ##如 nlp、cv、audio、multi-modal
26
+ #- nlp
27
+
28
+ #language:
29
+ ##语言代码列表 https://help.aliyun.com/document_detail/215387.html?spm=a2c4g.11186623.0.0.9f8d7467kni6Aa
30
+ #- cn
31
+
32
+ #metrics:
33
+ ##如 CIDEr、Blue、ROUGE 等
34
+ #- CIDEr
35
+
36
+ #tags:
37
+ ##各种自定义,包括 pretrained、fine-tuned、instruction-tuned、RL-tuned 等训练方法和其他
38
+ #- pretrained
39
+
40
+ #tools:
41
+ ##如 vllm、fastchat、llamacpp、AdaSeq 等
42
+ #- vllm
43
+ ---
44
+
45
+ # 美学提升 - Kontext 图像编辑 LoRA
46
+
47
+ ## 模型介绍
48
+
49
+ 本 LoRA 模型是基于 [Kontext](https://www.modelscope.cn/models/black-forest-labs/FLUX.1-Kontext-dev) 模型和 [DiffSynth-Studio](https://github.com/modelscope/DiffSynth-Studio) 训练的 LoRA 模型,使用本模型后,可输入指令 `Enhance the aesthetic quality of this image.` 增强图像的美学质量。**模型会自动根据画面内容判断光照、构图等方面的调整,使其更美观。** 这个模型可以重复使用,也就是对美学增强后的图再次进行美学增强。
50
+
51
+ ## 模型效果
52
+
53
+ ||样例 1|样例 2|样例 3|
54
+ |-|-|-|-|
55
+ |原图|![](./assets/9.jpg)|![](./assets/6.jpg)|![](./assets/12.jpg)|
56
+ |第1次美学增强|![](./assets/9_enhance_0.jpg)|![](./assets/6_enhance_0.jpg)|![](./assets/12_enhance_0.jpg)|
57
+ |第2次美学增强|![](./assets/9_enhance_1.jpg)|![](./assets/6_enhance_1.jpg)|![](./assets/12_enhance_1.jpg)|
58
+ |第3次美学增强|![](./assets/9_enhance_2.jpg)|![](./assets/6_enhance_2.jpg)|![](./assets/12_enhance_2.jpg)|
59
+
60
+ ||样例 4|样例 5|样例 6|
61
+ |-|-|-|-|
62
+ |原图|![](./assets/15.jpg)|![](./assets/5.jpg)|![](./assets/11.jpg)|
63
+ |第1次美学增强|![](./assets/15_enhance_0.jpg)|![](./assets/5_enhance_0.jpg)|![](./assets/11_enhance_0.jpg)|
64
+ |第2次美学增强|![](./assets/15_enhance_1.jpg)|![](./assets/5_enhance_1.jpg)|![](./assets/11_enhance_1.jpg)|
65
+ |第3次美学增强|![](./assets/15_enhance_2.jpg)|![](./assets/5_enhance_2.jpg)|![](./assets/11_enhance_2.jpg)|
66
+
67
+ ## 使用说明
68
+
69
+ 本模型基于框架 [DiffSynth-Studio](https://github.com/modelscope/DiffSynth-Studio/tree/main/examples/flux) 训练,请先安装
70
+
71
+ ```
72
+ git clone https://github.com/modelscope/DiffSynth-Studio.git
73
+ cd DiffSynth-Studio
74
+ pip install -e .
75
+ ```
76
+
77
+ ```python
78
+ import torch
79
+ from diffsynth.pipelines.flux_image_new import FluxImagePipeline, ModelConfig
80
+ from PIL import Image
81
+ from modelscope import snapshot_download
82
+
83
+ snapshot_download("DiffSynth-Studio/FLUX.1-Kontext-dev-lora-ArtAug", cache_dir="./models")
84
+ pipe = FluxImagePipeline.from_pretrained(
85
+ torch_dtype=torch.bfloat16,
86
+ device="cuda",
87
+ model_configs=[
88
+ ModelConfig(model_id="black-forest-labs/FLUX.1-Kontext-dev", origin_file_pattern="flux1-kontext-dev.safetensors"),
89
+ ModelConfig(model_id="black-forest-labs/FLUX.1-dev", origin_file_pattern="text_encoder/model.safetensors"),
90
+ ModelConfig(model_id="black-forest-labs/FLUX.1-dev", origin_file_pattern="text_encoder_2/"),
91
+ ModelConfig(model_id="black-forest-labs/FLUX.1-dev", origin_file_pattern="ae.safetensors"),
92
+ ],
93
+ )
94
+ pipe.load_lora(pipe.dit, "models/DiffSynth-Studio/FLUX.1-Kontext-dev-lora-ArtAug/model.safetensors", alpha=1)
95
+
96
+ image = Image.open("your_image.jpg")
97
+ image = pipe(
98
+ prompt="Enhance the aesthetic quality of this image.",
99
+ kontext_images=image,
100
+ embedded_guidance=2.5,
101
+ seed=0,
102
+ )
103
+ image.save("output.jpg")
104
+ ```
_cover_images_/11_enhance_2.jpg ADDED

Git LFS Details

  • SHA256: b41b556bdeab20fa385041e29152c6985932d3ad992b52e1ecefa018803d30ea
  • Pointer size: 131 Bytes
  • Size of remote file: 121 kB
_cover_images_/12_enhance_2.jpg ADDED

Git LFS Details

  • SHA256: d332a786c4328019c1641825b49ab24d18131debd42d393a1141f429cd1b6634
  • Pointer size: 131 Bytes
  • Size of remote file: 133 kB
_cover_images_/15_enhance_2.jpg ADDED

Git LFS Details

  • SHA256: e966c88e4695f69450169c315c451bc089a4f9bccf41144d5a3bad0fd061c106
  • Pointer size: 131 Bytes
  • Size of remote file: 128 kB
_cover_images_/5_enhance_2.jpg ADDED

Git LFS Details

  • SHA256: ca220e8c648b80ced18aa1121f30af63e693ef591ac02a988024fac2dfae34e1
  • Pointer size: 131 Bytes
  • Size of remote file: 149 kB
_cover_images_/6_enhance_2.jpg ADDED

Git LFS Details

  • SHA256: 41dd0deca3005593f4d04904a06789bdd17ed6293d05d398ff354ed02f530aa8
  • Pointer size: 131 Bytes
  • Size of remote file: 118 kB
_cover_images_/9_enhance_2.jpg ADDED

Git LFS Details

  • SHA256: 3ed40d7f98c943f4ab378ed010cb3186428a518aaa628cb1ebe3aa6a189d76e4
  • Pointer size: 131 Bytes
  • Size of remote file: 114 kB
assets/0.jpg ADDED
assets/0_enhance_0.jpg ADDED

Git LFS Details

  • SHA256: 7a51d5fd0111a1d66f45e93f254ca6b3e7145002abaa09d12a4b9cfa1d47a9ee
  • Pointer size: 131 Bytes
  • Size of remote file: 111 kB
assets/0_enhance_1.jpg ADDED

Git LFS Details

  • SHA256: 351a677b59146689cee059c34dc9c7d269b7abc72bfb8120bc49b44019b75ef5
  • Pointer size: 131 Bytes
  • Size of remote file: 122 kB
assets/0_enhance_2.jpg ADDED

Git LFS Details

  • SHA256: 1d71f91af077617b1976bc63b263039f88fbb6f98be88765b6e646a5bcb115ba
  • Pointer size: 131 Bytes
  • Size of remote file: 133 kB
assets/11.jpg ADDED
assets/11_enhance_0.jpg ADDED
assets/11_enhance_1.jpg ADDED

Git LFS Details

  • SHA256: 3738d099e9d4c2092304ac627e60261160a3dfd30f43ac50fc193b4f5d16f35f
  • Pointer size: 131 Bytes
  • Size of remote file: 106 kB
assets/11_enhance_2.jpg ADDED

Git LFS Details

  • SHA256: b41b556bdeab20fa385041e29152c6985932d3ad992b52e1ecefa018803d30ea
  • Pointer size: 131 Bytes
  • Size of remote file: 121 kB
assets/12.jpg ADDED
assets/12_enhance_0.jpg ADDED

Git LFS Details

  • SHA256: a9f5a03264a6ae0253a9deda7302dbddc644132a136e5f129df4eb57667ef198
  • Pointer size: 131 Bytes
  • Size of remote file: 115 kB
assets/12_enhance_1.jpg ADDED

Git LFS Details

  • SHA256: 6ed389ecfea1e2ec5cb7ad786d157cffd3f59da03a4173f6314917bc6cb9b2e0
  • Pointer size: 131 Bytes
  • Size of remote file: 123 kB
assets/12_enhance_2.jpg ADDED

Git LFS Details

  • SHA256: d332a786c4328019c1641825b49ab24d18131debd42d393a1141f429cd1b6634
  • Pointer size: 131 Bytes
  • Size of remote file: 133 kB
assets/15.jpg ADDED
assets/15_enhance_0.jpg ADDED
assets/15_enhance_1.jpg ADDED

Git LFS Details

  • SHA256: 0743c7bcaa73af77ee604c1eecd3ab45d2d5cddb8050d8c2cf6898805f63e01f
  • Pointer size: 131 Bytes
  • Size of remote file: 117 kB
assets/15_enhance_2.jpg ADDED

Git LFS Details

  • SHA256: e966c88e4695f69450169c315c451bc089a4f9bccf41144d5a3bad0fd061c106
  • Pointer size: 131 Bytes
  • Size of remote file: 128 kB
assets/4.jpg ADDED
assets/4_enhance_0.jpg ADDED

Git LFS Details

  • SHA256: 86ff70bd6ca9dd3ed40a636b3dd70c00638178c1375895576af1004d1bafb32c
  • Pointer size: 131 Bytes
  • Size of remote file: 102 kB
assets/4_enhance_1.jpg ADDED

Git LFS Details

  • SHA256: 0bfb356a6cfd38de89479b2b3c565864399a79c2a8f2eb1fede1152fb0ee91fe
  • Pointer size: 131 Bytes
  • Size of remote file: 114 kB
assets/4_enhance_2.jpg ADDED

Git LFS Details

  • SHA256: e052a1d187708ef6213baf493e1d7965cfdbf41e47669f3d0b6d1a77f9a22d98
  • Pointer size: 131 Bytes
  • Size of remote file: 122 kB
assets/5.jpg ADDED

Git LFS Details

  • SHA256: 0447e7633037114cacbe65d7d63197770db472c7ae9fd752412081e3dcefb4fa
  • Pointer size: 131 Bytes
  • Size of remote file: 107 kB
assets/5_enhance_0.jpg ADDED

Git LFS Details

  • SHA256: e2b466bcce8cb70fa947c789bb17afeeb8da0ba20d267316202d89b0b5af298e
  • Pointer size: 131 Bytes
  • Size of remote file: 127 kB
assets/5_enhance_1.jpg ADDED

Git LFS Details

  • SHA256: ea82fd48843ff87e2ce96b5390e0c5a2a7cf92df9f069282d0c4303ec7ecb572
  • Pointer size: 131 Bytes
  • Size of remote file: 137 kB
assets/5_enhance_2.jpg ADDED

Git LFS Details

  • SHA256: ca220e8c648b80ced18aa1121f30af63e693ef591ac02a988024fac2dfae34e1
  • Pointer size: 131 Bytes
  • Size of remote file: 149 kB
assets/6.jpg ADDED
assets/6_enhance_0.jpg ADDED
assets/6_enhance_1.jpg ADDED

Git LFS Details

  • SHA256: b6d93e09eda9ce5ad7cd087643014b199e3ade9f370582d25708f08dc1003bf0
  • Pointer size: 131 Bytes
  • Size of remote file: 110 kB
assets/6_enhance_2.jpg ADDED

Git LFS Details

  • SHA256: 41dd0deca3005593f4d04904a06789bdd17ed6293d05d398ff354ed02f530aa8
  • Pointer size: 131 Bytes
  • Size of remote file: 118 kB
assets/9.jpg ADDED
assets/9_enhance_0.jpg ADDED
assets/9_enhance_1.jpg ADDED

Git LFS Details

  • SHA256: a8070035eb2387f353e14c46aa72e2d3b9376e9eb766b700b4acd4f09a3fce2d
  • Pointer size: 131 Bytes
  • Size of remote file: 105 kB
assets/9_enhance_2.jpg ADDED

Git LFS Details

  • SHA256: 3ed40d7f98c943f4ab378ed010cb3186428a518aaa628cb1ebe3aa6a189d76e4
  • Pointer size: 131 Bytes
  • Size of remote file: 114 kB
configuration.json ADDED
@@ -0,0 +1,6 @@
 
 
 
 
 
 
 
1
+ {
2
+ "aigc_model": true,
3
+ "model_file_location": "model.safetensors",
4
+ "framework": "Pytorch",
5
+ "task": "other"
6
+ }
model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ec6083c7c0771cef2436fc69fe95882470bba0c678e5006327d023c917ce9904
3
+ size 306423720