Some RGBA images cause broken backgrounds due to incorrect RGB conversion

Currently, the `load_image` function in `train_util` uses PIL's `image.convert("RGB")` method to convert loaded 4-channel RGBA images to the expected 3-channel RGB format.

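This is incorrect for some images: `convert("RGB")` discards the alpha channel instead of compositing it, so any color data stored under transparent pixels becomes visible and the background is distorted.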
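The failure mode can be reproduced without any particular file. The snippet below is a minimal sketch using a synthetic 2x1 image (not the image from this report): a fully transparent pixel that still stores red in its color channels sits next to an opaque blue pixel.

```python
from PIL import Image

# Synthetic 2x1 RGBA image: a fully transparent pixel that still stores red
# in its color channels, next to an opaque blue pixel.
img = Image.new("RGBA", (2, 1), (255, 0, 0, 0))
img.putpixel((1, 0), (0, 0, 255, 255))

# convert("RGB") keeps the stored color values and throws the alpha away.
rgb = img.convert("RGB")
hidden = rgb.getpixel((0, 0))   # color stored under the transparent pixel
visible = rgb.getpixel((1, 0))  # opaque pixel, unchanged
print(hidden, visible)
```

The transparent pixel comes out as solid red rather than as a background color, which is the kind of artifact described here.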

As an example, loading [this RGBA image](https://img3.gelbooru.com/images/b6/63/b663ee70d6ef873c1c97fee64079dda6.png) results in [heavy background artifacts](https://github.com/kohya-ss/sd-scripts/assets/125218114/0518ec51-c26e-4122-8ca9-a9f8e71eac46) when it is run through the test code below:

```py
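from PIL import Image
from library.train_util import load_image

# load_image returns the image as an array after the RGB conversion
img = load_image("test.png")
# Save the converted result so the background can be inspected
Image.fromarray(img, 'RGB').save("test_convert.png")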
```

To verify this issue, a LoRA was trained using that single image as the dataset, with a caption that includes the words "white background". The seed and all other settings were held fixed between the two tests.

In the comparison below, the top row was generated with the latest code in a fresh venv. The bottom row uses the proposed fix, which I will create a PR for shortly. The fix uses the `Image.alpha_composite` function with a new blank white image to handle the alpha channel.
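A minimal sketch of that approach, for illustration only: the helper name `rgba_to_rgb_on_white` is hypothetical, and this is not necessarily the code that will go into the PR. It assumes the image should be flattened onto opaque white before the alpha channel is dropped.

```python
from PIL import Image


def rgba_to_rgb_on_white(image: Image.Image) -> Image.Image:
    """Flatten an image onto an opaque white background, then drop alpha.

    Hypothetical helper for illustration; not part of sd-scripts.
    """
    if image.mode != "RGBA":
        # Assumption: other modes are normalized to RGBA first, because
        # alpha_composite only accepts RGBA inputs.
        image = image.convert("RGBA")
    # New blank white image with the same size as the input.
    white = Image.new("RGBA", image.size, (255, 255, 255, 255))
    # Blend the input over white using its own alpha channel.
    flattened = Image.alpha_composite(white, image)
    # Every pixel is now opaque, so dropping alpha loses nothing.
    return flattened.convert("RGB")
```

With this helper, a fully transparent pixel becomes white regardless of the color stored underneath it, while opaque pixels keep their original color. The result can be passed to `np.array(...)` if an array is needed.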

![image](https://github.com/kohya-ss/sd-scripts/assets/125218114/9a8b3114-a004-4fc0-ae01-ed513d51a009)


Some RGBA images cause broken backgrounds due to incorrect RGB conversion #1269
