| Component | Size |
|---|---|
| Text Encoder 1 (safe tensors) | 246 MB |
| Text Encoder 2 (safe tensors) | 1.39 GB |
| Tokenizer 1 (vocab.json) | 1.06 MB |
| Tokenizer 2 (vocab.json) | 1.06 MB |
| UNet (safe tensors) | 5.14 GB |
| VAE (safe tensors) | 167 MB |
| LCM LoRA (safe tensors) | 787 MB |
| Component | Size |
|---|---|
| pytorch_lora_weights.safetensors | 354.54 MiB |
| text_embeddings.safetensors | 16.15 KiB |
| Component | Size |
|---|---|
| pytorch_lora_weights.safetensors | 177.35 MiB |
| text_embeddings.safetensors | 16.15 KiB |
Monitoring resource usage during image generation
| LoRA Rank | Model | First Generation (with model loading) | Second Generation (steady state) | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Time (s) | Avg GPU % | Max GPU % | Avg Mem (GB) | Max Mem (GB) | Time (s) | Avg GPU % | Max GPU % | Avg Mem (GB) | Max Mem (GB) | ||
| Rank 64 | hugo-boss-good-2 | 24.18 | 4.9 | 90.0 | 11.24 | 11.49 | 6.91 | 17.3 | 87.0 | 11.11 | 11.12 |
| Rank 32 | miu-miu | 17.65 | 7.0 | 88.0 | 11.15 | 11.31 | 6.49 | 16.4 | 87.0 | 11.10 | 11.10 |
Duration: 60 seconds
Note: Jobs are processed through Azure Queue, which handles requests sequentially.
| LoRA Rank | Model | Total Requests | Jobs/minute | Total Time (s) |
|---|---|---|---|---|
| Rank 64 | hugo-boss-good-2 | 9 | 8.92 | 60.53 |
| Rank 32 | miu-miu | 10 | 9.21 | 65.14 |
The first generation for each model includes the time needed to load the model into GPU memory.
| LoRA Rank | Model | Pass | First Generation (s) | Avg Subsequent (s) | Loading Overhead (s) |
|---|---|---|---|---|---|
| Rank 32 | miu-miu | 1 | 6.39 | 6.58 | -0.19 |
| Rank 32 | miu-miu | 2 | 17.28 | 6.41 | 10.86 |
| Rank 32 | miu-miu | Mean | 11.83 | 6.50 | 5.34 |
| Rank 64 | hugo-boss-good-2 | 1 | 24.49 | 6.47 | 18.02 |
| Rank 64 | hugo-boss-good-2 | 2 | 34.63 | 6.74 | 27.89 |
| Rank 64 | hugo-boss-good-2 | Mean | 7.01 | 6.60 | 0.40 |
Statistics for each LoRA rank (excluding first generations)
For each model, we will do 2 pass over each aspect ratio and steps
| LoRA Rank | Model | Pass | Count | Mean (s) | Std (s) | Min (s) | Max (s) |
|---|---|---|---|---|---|---|---|
| Rank 32 | miu-miu | 1 | 63.0 | 6.578 | 0.704 | 4.905 | 8.797 |
| Rank 32 | miu-miu | 2 | 63.0 | 6.414 | 0.873 | 4.859 | 8.343 |
| Rank 64 | hugo-boss-good-2 | 1 | 63.0 | 6.474 | 0.717 | 4.856 | 8.208 |
| Rank 64 | hugo-boss-good-2 | 2 | 62.0 | 6.736 | 1.604 | 4.876 | 17.366 |
| Aspect Ratio | Steps | Time (s) | Generated Image |
|---|---|---|---|
| 1:1 | 6 | 5.81 |
|
| 1:1 | 7 | 6.54 |
|
| 1:1 | 8 | 9.21 |
|
| 1:1 | 9 | 6.52 |
|
| 1:1 | 10 | 6.58 |
|
| 1:1 | 11 | 7.42 |
|
| 1:1 | 12 | 8.10 |
|
| 4:3 | 6 | 5.06 |
|
| 4:3 | 7 | 5.87 |
|
| 4:3 | 8 | 6.72 |
|
| 4:3 | 9 | 6.57 |
|
| 4:3 | 10 | 6.55 |
|
| 4:3 | 11 | 6.74 |
|
| 4:3 | 12 | 7.54 |
|