Add `NearestFrom` for faster `fast_blur` by RunDevelopment · Pull Request #2868 · image-rs/image

RunDevelopment · 2026-03-16T13:33:55Z

After talking about it here, I removed the f32::round in the hot path of fast_blur.

I did this by adding a new private trait: NearestFrom. This has similar semantics to NumCast::from, but it will round to nearest for float->int conversions and saturate on numeric bounds instead of returning Option. This makes it a natural fit for performance-sensitive code that needs to convert f32 to subpixels.

For now, I only used the trait in fast_blur, but other operations can use it as well for both correctness and performance. I specifically designed the trait to have uses beyond fast_blur.

This PR does make fast_blur significantly faster, even for the u8 case. Here are the benchmark results from my machine (Intel i7-8700K):

bench	before	after	change
fast blur: sigma 3.0	125.60 ms	17.616 ms	-85.975%
fast blur: sigma 7.0	112.48 ms	17.462 ms	-84.475%
fast blur: sigma 50.0	108.44 ms	18.044 ms	-83.360%

That's between 6-7 times faster.

Note that this is no competition for #2846 and its fixed-point implementation for u8. On my machine, #2846 reaches 8ms on the same benchmark.

I also want to mention that some implementations of NearestFrom still leave performance on the table. See this comment for example. I just optimized the important primitives for now. Everything else can follow later.

fintelia · 2026-03-17T07:02:09Z

We may want to spin everything related to PrimitiveSealed into a separate source file. I suspect we're going to end up with a bunch

RunDevelopment · 2026-03-17T10:59:46Z

Depends. For example, I also want to add a trait to make RGB->Luma conversions faster. This would also need to be a super trait of PrimitiveSealed, but I'd like to keep its definition and implementation right next to the (internal) code that uses it for local reasoning.

So I'm not too sure that the code in traits.rs is going to grow a lot, but I'm also not against moving things into a separate source file if it does.

src/traits.rs

RunDevelopment · 2026-03-23T22:14:16Z

We may want to spin everything related to PrimitiveSealed into a separate source file. I suspect we're going to end up with a bunch

After thinking about it a bit more, I implemented your suggestion. I don't see it growing a lot now, but there's also no harm in doing so.

Add NearestFrom for faster fast_blur

11a7371

RunDevelopment mentioned this pull request Mar 22, 2026

Fix brighten API and f32 behavior #2886

Open

mstoeckl reviewed Mar 22, 2026

View reviewed changes

src/traits.rs Outdated Show resolved Hide resolved

RunDevelopment added 3 commits March 22, 2026 23:25

Document approximation for rounding

960d575

Merge branch 'main' into nearest-from

ad8b297

Move everything PrimitiveSealed into separate file

e16cbeb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add `NearestFrom` for faster `fast_blur`#2868

Add `NearestFrom` for faster `fast_blur`#2868
RunDevelopment wants to merge 4 commits intoimage-rs:mainfrom
RunDevelopment:nearest-from

RunDevelopment commented Mar 16, 2026 •

edited

Loading

Uh oh!

fintelia commented Mar 17, 2026

Uh oh!

RunDevelopment commented Mar 17, 2026

Uh oh!

Uh oh!

RunDevelopment commented Mar 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

RunDevelopment commented Mar 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

fintelia commented Mar 17, 2026

Uh oh!

RunDevelopment commented Mar 17, 2026

Uh oh!

Uh oh!

RunDevelopment commented Mar 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

RunDevelopment commented Mar 16, 2026 •

edited

Loading