Extracted the discussion from:
In order for the bbox to be correct, I'm thinking about:
- Implement the correct
max_pixels / min_pixels from the original config
- Pad right/bottom corner of the image if we require to upscale (to multiple of
2*patch_size). This will guarantee that the x/y coordinates stay unchanged. But we also need to check if the original implementation actually use this strategy or not.
CC @broadbit-hu @theo77186
Extracted the discussion from:
In order for the bbox to be correct, I'm thinking about:
max_pixels/min_pixelsfrom the original config2*patch_size). This will guarantee that the x/y coordinates stay unchanged. But we also need to check if the original implementation actually use this strategy or not.CC @broadbit-hu @theo77186