ggml-cpu : add basic RVV support for vector f32 ops #15057

xctan · 2025-08-03T16:55:03Z

This PR introduces RVV support for several f32 vector kernels.

The implementation required refactoring the vectorization logic. Due to RVV's flexible vector length, its intrinsic types are sizeless, which prevents the compiler from creating arrays of vector registers (a similar limitation also can be found in Arm's SVE). This makes traditional loop unrolling techniques incompatible, necessitating a rewrite of the code to support RVV's architecture.

ggml-cpu : add basic RVV support for vector f32 ops

835397f

github-actions bot added the ggml changes relating to the ggml tensor library for machine learning label Aug 3, 2025

xctan requested a review from ggerganov August 3, 2025 18:05

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

ggml-cpu : add basic RVV support for vector f32 ops #15057

ggml-cpu : add basic RVV support for vector f32 ops #15057

xctan commented Aug 3, 2025

Uh oh!

Uh oh!

ggml-cpu : add basic RVV support for vector f32 ops #15057

Are you sure you want to change the base?

ggml-cpu : add basic RVV support for vector f32 ops #15057

Conversation

xctan commented Aug 3, 2025

Uh oh!

Uh oh!