Skip to content

[9.0] Optimize sparse vector stats collection (#128740) #128806

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 1 commit into from
Jun 3, 2025

Conversation

jimczi
Copy link
Contributor

@jimczi jimczi commented Jun 3, 2025

Backport of #128740 to 9.0

This change improves the performance of sparse vector statistics gathering by using the document count of terms directly, rather than relying on the field name field to compute stats.
By avoiding per-term disk/network reads and instead leveraging statistics already loaded into leaf readers at index opening, we expect to significantly reduce overhead.

Relates to elastic#128583
@jimczi jimczi added >enhancement :Data Management/Stats Statistics tracking and retrieval APIs backport v9.0.3 auto-merge-without-approval Automatically merge pull request when CI checks pass (NB doesn't wait for reviews!) labels Jun 3, 2025
@elasticsearchmachine elasticsearchmachine merged commit bab6e83 into elastic:9.0 Jun 3, 2025
16 checks passed
@jimczi jimczi deleted the backport_128740_90 branch June 3, 2025 09:18
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
auto-merge-without-approval Automatically merge pull request when CI checks pass (NB doesn't wait for reviews!) backport :Data Management/Stats Statistics tracking and retrieval APIs >enhancement v9.0.3
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants