updated
This commit is contained in:
parent
4df4bb8dc1
commit
4f15fb8d01
36
README.md
36
README.md
@ -1,32 +1,50 @@
|
|||||||
# Stable Diffusion for Remote Sensing Image Generation
|
# Stable Diffusion for Remote Sensing Image Generation
|
||||||
|
|
||||||
#### Author: Zhiqiang yuan @ AIR CAS, [Send a Email](yuan_zhi_qiang@sina.cn)
|
#### Author: Zhiqiang yuan @ AIR CAS, [Send a Email](yuan_zhi_qiang@sina.cn)
|
||||||
|
#### Welcome :+1:_<big>`Fork and Star`</big>_:+1:, then we'll let you know when we update
|
||||||
|
#### -------------------------------------------------------------------------------------
|
||||||
|
|
||||||
A simple project for text-to-image remote sensing image generation,
|
A simple project for `text-to-image remote sensing image generation`,
|
||||||
and we will release the code of `using text to control regions for super-large RS image generation` later.
|
and we will release the code of `using multiple text to control regions for super-large RS image generation` later.
|
||||||
Also welcome to see the project of [image-condition fake sample generation](https://github.com/xiaoyuan1996/Controllable-Fake-Sample-Generation-for-RS) in [TGRS, 2023](https://ieeexplore.ieee.org/abstract/document/10105619/).
|
Also welcome to see the project of [image-condition fake sample generation](https://github.com/xiaoyuan1996/Controllable-Fake-Sample-Generation-for-RS) in [TGRS, 2023](https://ieeexplore.ieee.org/abstract/document/10105619/).
|
||||||
|
|
||||||
## Environment configuration
|
## Environment configuration
|
||||||
|
|
||||||
Follow [original training repo](https://github.com/justinpinkney/stable-diffusion.git) .
|
Follow and thanks [original training repo](https://github.com/justinpinkney/stable-diffusion.git) .
|
||||||
|
|
||||||
|
|
||||||
## Pretrained weights
|
## Pretrained weights
|
||||||
|
|
||||||
We used [RSITMD](https://github.com/xiaoyuan1996/AMFMN) as training data and fine-tuned stable diffusion for 10 epochs with 1 x A100 GPU.
|
We used RS image-text dataset [RSITMD](https://github.com/xiaoyuan1996/AMFMN) as training data and fine-tuned stable diffusion for 10 epochs with 1 x A100 GPU.
|
||||||
When the batchsize is 4, the GPU memory consumption is about 40+ Gb during training, and about 20+ Gb during sampling.
|
When the batchsize is 4, the GPU memory consumption is about 40+ Gb during training, and about 20+ Gb during sampling.
|
||||||
The pretrain weights is realesed at [last-pruned.ckpt](https://github.com/xiaoyuan1996/AMFMN).
|
The pretrain weights is realesed at [last-pruned.ckpt](https://github.com/xiaoyuan1996/AMFMN).
|
||||||
|
|
||||||
## Using
|
## Using
|
||||||
Download the pretrain weights to current dir, and run with:
|
|
||||||
|
### Samling
|
||||||
|
Download the pretrain weights `last-pruned.ckpt` to current dir, and run with:
|
||||||
```commandline
|
```commandline
|
||||||
bash sample.sh
|
python scripts/txt2img.py \
|
||||||
|
--prompt 'Some boats drived in the sea' \
|
||||||
|
--outdir 'outputs/RS' \
|
||||||
|
--H 512 --W 512 \
|
||||||
|
--n_samples 4 \
|
||||||
|
--config 'configs/stable-diffusion/RSITMD.yaml' \
|
||||||
|
--ckpt './last-pruned.ckpt'
|
||||||
```
|
```
|
||||||
|
|
||||||
|
### Traing
|
||||||
We will update the train code ASAP.
|
We will update the train code ASAP.
|
||||||
|
|
||||||
## Examples
|
## Examples
|
||||||
**Caption:** Some boats drived in the sea.
|
**Caption:** `Some boats drived in the sea.`
|
||||||

|

|
||||||
|
|
||||||
**Caption:** A lot of cars parked in the airport.
|
**Caption:** `A lot of cars parked in the airport.`
|
||||||

|

|
||||||
|
|
||||||
|
**Caption:** `A large number of vehicles are parked in the parking lot, next to the bare desert.`
|
||||||
|

|
||||||
|
|
||||||
|
**Caption:** `There is a church in a dark green forest with two table tennis courts next to it.`
|
||||||
|

|
||||||
|
|||||||
BIN
assets/shows3.png
Normal file
BIN
assets/shows3.png
Normal file
Binary file not shown.
|
After Width: | Height: | Size: 1.7 MiB |
BIN
assets/shows4.png
Normal file
BIN
assets/shows4.png
Normal file
Binary file not shown.
|
After Width: | Height: | Size: 2.0 MiB |
Loading…
Reference in New Issue
Block a user