[ML] Integrate with DeepSeek API #122218
Conversation
Integrates the Chat Completion and Completion task types; both call DeepSeek's chat completion API.
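A minimal sketch of how both task types can funnel into one chat-completion request body, with a plain Completion prompt wrapped as a single user message. The names below (`buildRequestBody`, `Message`, `completionToChatBody`) are illustrative, not the actual Elasticsearch classes:

```java
import java.util.List;
import java.util.stream.Collectors;

public class DeepSeekRequestSketch {
    public record Message(String role, String content) {}

    // Serialize messages into a minimal chat-completion JSON body.
    public static String buildRequestBody(String model, List<Message> messages, boolean stream) {
        String msgs = messages.stream()
            .map(m -> String.format("{\"role\":\"%s\",\"content\":\"%s\"}", m.role(), m.content()))
            .collect(Collectors.joining(","));
        return String.format("{\"model\":\"%s\",\"stream\":%b,\"messages\":[%s]}", model, stream, msgs);
    }

    // A plain Completion input becomes a one-message chat request.
    public static String completionToChatBody(String model, String prompt) {
        return buildRequestBody(model, List.of(new Message("user", prompt)), false);
    }
}
```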
Hi @prwhelan, I've created a changelog YAML for you.
Pinging @elastic/ml-core (Team:ML)
...ticsearch/xpack/inference/external/request/deepseek/DeepSeekChatCompletionRequestEntity.java
...g/elasticsearch/xpack/inference/external/request/deepseek/DeepSeekChatCompletionRequest.java
...ticsearch/xpack/inference/external/request/deepseek/DeepSeekChatCompletionRequestEntity.java
var validationException = new ValidationException();

var model = extractRequiredString(serviceSettingsMap, MODEL_ID, ModelConfigurations.SERVICE_SETTINGS, validationException);
var uri = createOptionalUri(
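A hypothetical sketch of the validation pattern shown in the snippet above: errors accumulate in a `ValidationException`-style collector so a single response can report every bad setting at once, rather than failing on the first one. The names here are illustrative, not the actual Elasticsearch implementations:

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

public class SettingsValidationSketch {
    // Stand-in for the real ValidationException error collector.
    public static class ValidationErrors {
        private final List<String> errors = new ArrayList<>();
        public void add(String error) { errors.add(error); }
        public List<String> errors() { return List.copyOf(errors); }
    }

    // Mirrors extractRequiredString: record an error instead of throwing.
    public static String extractRequiredString(Map<String, Object> settings, String key, ValidationErrors validation) {
        Object value = settings.get(key);
        if (value instanceof String s && s.isEmpty() == false) {
            return s;
        }
        validation.add("[" + key + "] must be a non-empty string");
        return null;
    }
}
```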
Do we want to allow users to set the URL in the create inference endpoint request? If this is only for testing, then I think we can allow the child classes to override a method to return it, or pass it in via the constructor. Then in the tests we can set it via a constructor or a setter or something.
I only left it exposed (but optional) because anyone can, in theory, run the model locally behind an OpenAI-compatible endpoint and call it that way.
I can remove it if that's not something we want to support; I'm neutral on it.
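A minimal sketch of the optional-URL behavior discussed above: when the user supplies no `url`, fall back to the hosted default; when they do, the same code path can target a locally running OpenAI-compatible server. The default URL and method shape are assumptions for illustration, not the actual `createOptionalUri` implementation:

```java
import java.net.URI;
import java.net.URISyntaxException;

public class OptionalUriSketch {
    // Assumed default endpoint for illustration.
    private static final URI DEFAULT_URI = URI.create("https://api.deepseek.com/chat/completions");

    public static URI createOptionalUri(String url) {
        if (url == null || url.isEmpty()) {
            return DEFAULT_URI; // no override: use the hosted endpoint
        }
        try {
            return new URI(url); // user override, e.g. a local OpenAI-compatible server
        } catch (URISyntaxException e) {
            throw new IllegalArgumentException("[url] is malformed: " + url, e);
        }
    }
}
```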
...in/java/org/elasticsearch/xpack/inference/services/deepseek/DeepSeekChatCompletionModel.java
...in/java/org/elasticsearch/xpack/inference/services/deepseek/DeepSeekChatCompletionModel.java
Verified:
- max_completion_tokens does not work; we need to map it to max_tokens
- n is benign
- temperature works
- tool_choice works
- strict tools is benign
- top_p works
- stream works
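The verified rename above can be sketched as a small parameter-mapping step: `max_completion_tokens` is rewritten as `max_tokens` for DeepSeek, while pass-through parameters (`temperature`, `top_p`, `stream`, ...) keep their names. The method name and map-based shape are illustrative, not the actual request-entity code:

```java
import java.util.LinkedHashMap;
import java.util.Map;

public class ParamMappingSketch {
    public static Map<String, Object> toDeepSeekParams(Map<String, Object> unified) {
        Map<String, Object> out = new LinkedHashMap<>();
        unified.forEach((key, value) -> {
            if ("max_completion_tokens".equals(key)) {
                out.put("max_tokens", value); // DeepSeek rejects the unified name
            } else {
                out.put(key, value); // temperature, top_p, stream, ... pass through
            }
        });
        return out;
    }
}
```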