[ML] Remove Voyageai request manager classes #124512

jonathan-buttner · 2025-03-10T20:01:21Z

This PR refactors some of the VoyageAI logic:

Remove the request manager classes
Remove the account class
Refactor the URI logic so that the URIs are exposed via the VoyageAIModel base class

Follows the same pattern as the OpenAI PR: #124144

jonathan-buttner · 2025-03-10T20:10:12Z

...inference/src/main/java/org/elasticsearch/xpack/inference/external/request/RequestUtils.java

@@ -34,5 +34,9 @@ public static URI buildUri(URI accountUri, String service, CheckedSupplier<URI,
        }
    }

+    public static URI buildUri(String service, CheckedSupplier<URI, URISyntaxException> uriBuilder) {
+        return buildUri(null, service, uriBuilder);


This is just a helper that converts a URISyntaxException into an ElasticsearchStatusException.
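For readers unfamiliar with the pattern, here is a minimal sketch of what such a helper might look like. It is not the code from this PR: `StatusExceptionSketch` and `UriSupplierSketch` are hypothetical stand-ins for `ElasticsearchStatusException` and `CheckedSupplier<URI, URISyntaxException>`, and the error message, the 400 status, and the example URLs are assumptions made for illustration.

```java
import java.net.URI;
import java.net.URISyntaxException;

// Hypothetical stand-in for ElasticsearchStatusException so the sketch compiles on its own.
class StatusExceptionSketch extends RuntimeException {
    final int status;

    StatusExceptionSketch(String message, int status, Throwable cause) {
        super(message, cause);
        this.status = status;
    }
}

// Hypothetical stand-in for CheckedSupplier<URI, URISyntaxException>.
@FunctionalInterface
interface UriSupplierSketch {
    URI get() throws URISyntaxException;
}

class RequestUtilsSketch {
    // Runs the builder and rethrows the checked URISyntaxException as an unchecked
    // exception that carries a status code (400 is an assumption here).
    static URI buildUri(String service, UriSupplierSketch uriBuilder) {
        try {
            return uriBuilder.get();
        } catch (URISyntaxException e) {
            throw new StatusExceptionSketch("Failed to construct " + service + " URL", 400, e);
        }
    }

    public static void main(String[] args) {
        // A well-formed URI passes straight through.
        System.out.println(buildUri("voyageai", () -> new URI("https://api.example.com/v1/embeddings")));

        // A malformed URI (space in the host) surfaces as the unchecked exception.
        try {
            buildUri("voyageai", () -> new URI("https://exa mple.com"));
        } catch (StatusExceptionSketch e) {
            System.out.println(e.getMessage() + " (status " + e.status + ")");
        }
    }
}
```

The point of the wrapper is that callers building a URI do not each need their own try/catch for the checked exception.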

jonathan-buttner · 2025-03-10T20:10:22Z

...a/org/elasticsearch/xpack/inference/external/request/voyageai/VoyageAIEmbeddingsRequest.java

@@ -77,11 +71,7 @@ public boolean[] getTruncationInfo() {
        return null;
    }

-    public VoyageAIEmbeddingsTaskSettings getTaskSettings() {


jonathan-buttner · 2025-03-10T20:10:55Z

...ference/src/main/java/org/elasticsearch/xpack/inference/services/voyageai/VoyageAIModel.java

 import java.util.Map;
 import java.util.Objects;

-public abstract class VoyageAIModel extends Model {
+public abstract class VoyageAIModel extends RateLimitGroupingModel {
+    private static final String DEFAULT_MODEL_FAMILY = "default_model_family";


Moving the rate limiting logic inside the model class

jonathan-buttner · 2025-03-10T20:11:20Z

...ference/src/main/java/org/elasticsearch/xpack/inference/services/voyageai/VoyageAIModel.java

@@ -73,22 +88,20 @@ public SecureString apiKey() {
        return apiKey;
    }

-    public VoyageAIRateLimitServiceSettings rateLimitServiceSettings() {


jonathan-buttner · 2025-03-10T20:13:49Z

...ference/src/main/java/org/elasticsearch/xpack/inference/services/voyageai/VoyageAIModel.java


-    public abstract ExecutableAction accept(VoyageAIActionVisitor creator, Map<String, Object> taskSettings, InputType inputType);
+        return Objects.hash(modelFamily, apiKey);


This is a notable change. Previously the request manager did not include the apiKey in the grouping. That didn't seem right to me, because it would mean that all users of the same model family would be rate limited together. The VoyageAI docs suggest that you can have more granular rate limits per project. This change doesn't accomplish that, but it's at least a step in that direction, because we shouldn't be grouping all users together.
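The effect of adding the API key to the hash can be illustrated with a small, self-contained sketch. This is not the Elasticsearch code: `ModelSketch` is a hypothetical, stripped-down model with a plain `String` key (the real class uses `SecureString`), and the key values are made up. Only the `Objects.hash(modelFamily, apiKey)` expression is taken from the diff.

```java
import java.util.List;
import java.util.Objects;
import java.util.function.ToIntFunction;

// Hypothetical, simplified model holding only the two fields that feed the grouping hash.
final class ModelSketch {
    final String modelFamily;
    final String apiKey;

    ModelSketch(String modelFamily, String apiKey) {
        this.modelFamily = modelFamily;
        this.apiKey = apiKey;
    }

    // Grouping as described for the old request manager: the API key is ignored.
    int groupByFamilyOnly() {
        return Objects.hash(modelFamily);
    }

    // Grouping from the diff: the API key is part of the hash.
    int groupByFamilyAndKey() {
        return Objects.hash(modelFamily, apiKey);
    }
}

class RateLimitGroupingSketch {
    // Counts how many distinct rate-limit groups a list of models falls into.
    static long countGroups(List<ModelSketch> models, ToIntFunction<ModelSketch> grouping) {
        return models.stream().mapToInt(grouping).distinct().count();
    }

    public static void main(String[] args) {
        List<ModelSketch> models = List.of(
            new ModelSketch("default_model_family", "key-a"),
            new ModelSketch("default_model_family", "key-b")
        );
        // Two users with different keys share a single group when the key is ignored...
        System.out.println(countGroups(models, ModelSketch::groupByFamilyOnly));
        // ...and get separate groups once the key is included.
        System.out.println(countGroups(models, ModelSketch::groupByFamilyAndKey));
    }
}
```

With the family-only grouping, both endpoints land in the same group and would share one rate limit. Including the key separates them, which matches the intent described in the comment.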

jonathan-buttner · 2025-03-10T20:14:19Z

.../org/elasticsearch/xpack/inference/services/voyageai/embeddings/VoyageAIEmbeddingsModel.java

    // should only be used for testing
    VoyageAIEmbeddingsModel(
-        String modelId,
+        String inferenceId,


Fixing naming

jonathan-buttner · 2025-03-10T20:14:39Z

.../org/elasticsearch/xpack/inference/services/voyageai/embeddings/VoyageAIEmbeddingsModel.java

        String service,
+        String url,


For testing we allow a string url.

jonathan-buttner · 2025-03-10T20:15:26Z

...rg/elasticsearch/xpack/inference/external/action/voyageai/VoyageAIEmbeddingsActionTests.java

@@ -339,26 +341,6 @@ public void testExecute_ThrowsElasticsearchException_WhenSenderOnFailureIsCalled
        MatcherAssert.assertThat(thrownException.getMessage(), is("Failed to send VoyageAI embeddings request. Cause: failed"));
    }

-    public void testExecute_ThrowsElasticsearchException_WhenSenderOnFailureIsCalled_WhenUrlIsNull() {


I think this was copied from OpenAI. The VoyageAI URL cannot be specified in the service settings, so it should never be null.

elasticsearchmachine · 2025-03-12T12:48:38Z

Pinging @elastic/ml-core (Team:ML)

davidkyle

LGTM

jonathan-buttner · 2025-03-13T18:05:47Z

💚 All backports created successfully

Status	Branch	Result
✅	8.x

Questions ?

Please refer to the Backport tool documentation

jonathan-buttner added 2 commits March 7, 2025 17:12

Removing voyage request managers

fb25581

Fixing tests

22639c4

jonathan-buttner added the >non-issue, :ml, Team:ML, auto-backport, Feature:GenAI, v8.19.0, and v9.1.0 labels Mar 10, 2025

Merge branch 'main' of github.com:elastic/elasticsearch into ml-inference-remove-voyage-request-managers

34593f6

jonathan-buttner commented Mar 10, 2025

jonathan-buttner marked this pull request as ready for review March 12, 2025 12:48

davidkyle approved these changes Mar 13, 2025

jonathan-buttner merged commit 1bee2cc into elastic:main Mar 13, 2025
17 checks passed

jonathan-buttner deleted the ml-inference-remove-voyage-request-managers branch March 13, 2025 17:47

jonathan-buttner mentioned this pull request Mar 13, 2025

[8.x] [ML] Remove Voyageai request manager classes (#124512) #124795

Merged

jonathan-buttner added a commit to jonathan-buttner/elasticsearch that referenced this pull request Mar 13, 2025

[ML] Remove Voyageai request manager classes (elastic#124512)

2a6d5d7

* Removing voyage request managers
* Fixing tests

(cherry picked from commit 1bee2cc)

elasticsearchmachine pushed a commit that referenced this pull request Mar 13, 2025

[ML] Remove Voyageai request manager classes (#124512) (#124795)

60bb770

* Removing voyage request managers
* Fixing tests

(cherry picked from commit 1bee2cc)

jonathan-buttner mentioned this pull request May 6, 2025

[Inference API] Add "rerank" task type to "elastic" provider #126022

Merged

jonathan-buttner mentioned this pull request May 19, 2025

Implemented ChatCompletion task for Google VertexAI with Gemini Models #128105

Merged
