@saiatmakuri (Contributor) commented Feb 13, 2024

Pull Request Summary

Bring over the `include_stop_str_in_output` flag added in vllm-project/vllm#1976 to the LLM engine.
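For context, this flag corresponds to vLLM's `SamplingParams.include_stop_str_in_output` field (introduced in vllm-project/vllm#1976). A minimal sketch of the vLLM side, assuming the request option is forwarded straight through (parameter values taken from the test below):

```python
from vllm import SamplingParams

# Sketch only: how include_stop_str_in_output maps onto vLLM's
# SamplingParams. When True, the matched stop string is kept in the
# generated text instead of being stripped.
sampling_params = SamplingParams(
    max_tokens=10,           # maps from max_new_tokens
    temperature=0.1,
    stop=["\n"],             # maps from stop_sequences
    include_stop_str_in_output=True,
)
```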

Test Plan and Usage Guide

Tested locally.

Without the flag:

```
> curl -H "Content-Type: application/json" -H "Authorization: Basic <AUTH>" -d '{ "prompt": "Whats 2+2? ", "stop_sequences": ["\n"], "max_new_tokens": 10, "temperature": 0.1 }' http://localhost:5000/v1/llm/completions-sync?model_endpoint_name=llama-7b-vllm-test-temp
{"request_id":"907c759f-e704-4d32-98a5-4dbe388ec04a","output":{"text":"4.","num_prompt_tokens":9,"num_completion_tokens":3,"tokens":null}}
```

note: "4." above

with the flag:

```
> curl -H "Content-Type: application/json" -H "Authorization: Basic <AUTH>" -d '{ "prompt": "Whats 2+2? ", "stop_sequences": ["\n"], "max_new_tokens": 10, "temperature": 0.1, "include_stop_str_in_output": true }' http://localhost:5000/v1/llm/completions-sync?model_endpoint_name=llama-7b-vllm-test-temp
{"request_id":"7f465cd7-8d42-4470-96ab-6b8cdfb2b5b0","output":{"text":"4.\n","num_prompt_tokens":9,"num_completion_tokens":3,"tokens":null}}
```

note: "4.\n" above

@saiatmakuri added the enhancement (New feature or request) label Feb 13, 2024
@saiatmakuri self-assigned this Feb 13, 2024
@saiatmakuri merged commit 5bff345 into main Feb 13, 2024
@saiatmakuri deleted the saiatmakuri/include_stop_str_in_output branch Feb 13, 2024 22:15
@yunfeng-scale mentioned this pull request Mar 6, 2024