
Conversation

@thameem-abbas (Collaborator)

Adds non-streaming batch request support to the OpenAI plugin.

@thameem-abbas changed the title from "ADD: Batch request for OpenAI Plugin" to "ADD: Batch request (Non-streaming) for OpenAI Plugin" on Nov 4, 2024
@thameem-abbas (Collaborator, Author)

@sjmonson @dagrayvid @npalaska
It would be great if someone could review the PR.

@npalaska (Collaborator) commented Nov 4, 2024

Thanks @thameem-abbas 👍 Initial tests with the changes from this PR work well. I'll add more tests in the afternoon.

Member (review comment):

I still don't like this approach. It duplicates too much code; that's fine for a one-off, but to merge this it would be preferable to have all request methods take a list of queries and return a list of Results (see later comment).
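For illustration, a minimal sketch of the unified interface being suggested here; the list-based signature, the `Result` fields, and the stub body are assumptions, not the plugin's actual code:

```python
from dataclasses import dataclass


@dataclass
class Result:
    """Stand-in for llm-load-test's per-request result object."""
    user_id: int
    output: str = ""
    error: str | None = None


class OpenAIPlugin:
    def request_http(self, queries: list[dict], user_id: int,
                     test_end_time: float = 0) -> list[Result]:
        """Single entry point for both cases: callers always pass a
        list of queries (length 1 when not batching) and always get a
        list of Results back, so there is no separate batch method and
        no signature change based on calling args."""
        results = []
        for query in queries:
            # A real implementation would POST to the OpenAI-compatible
            # endpoint here; this stub just echoes the prompt text.
            results.append(Result(user_id=user_id,
                                  output=query.get("text", "")))
        return results
```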

@@ -177,6 +185,88 @@ def request_http(self, query: dict, user_id: int, test_end_time: float = 0):

         return result

+    def request_batch_http(self, queries, user_id, test_end_time: float = 0):
Member (review comment):

Changing the interface of request_func() based on calling args is bad practice. See above comment.

Comment on lines -40 to -41
-    # if timeout passes, queue.Empty will be thrown
-    # User should continue to poll for inputs
Member (review comment):

Why was this comment removed?
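For context, the removed comments documented a polling pattern along these lines (a minimal sketch; the function name and timeout value are illustrative, not the project's actual code):

```python
import queue


def get_next_query(q: queue.Queue, timeout: float = 1.0):
    """Poll the work queue; if the timeout passes, queue.Empty is
    raised and the caller should simply continue polling for inputs."""
    while True:
        try:
            return q.get(timeout=timeout)
        except queue.Empty:
            continue  # nothing available yet; keep polling
```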

    concurrency = load_options.get("concurrency")
    duration = load_options.get("duration")

    plugin_type = config.get("plugin")
    if plugin_type == "openai_plugin":
        plugin = openai_plugin.OpenAIPlugin(
-            config.get("plugin_options")
+            config.get("plugin_options"), batch_size
Member (review comment):

Again, this should be handled in a way that the plugin does not need to know the batch size in advance. However, if you must... just set config["plugin_options"]["batch_size"] = batch_size to avoid changing the interface.
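A minimal sketch of that suggestion, assuming plugin_options is a plain dict in the parsed config (the batch_size key name is illustrative):

```python
plugin_type = config.get("plugin")
if plugin_type == "openai_plugin":
    # Thread batch_size through the existing options dict rather than
    # widening the OpenAIPlugin constructor signature.
    plugin_options = config.get("plugin_options", {})
    plugin_options["batch_size"] = batch_size
    plugin = openai_plugin.OpenAIPlugin(plugin_options)
```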

thameem-abbas and others added 5 commits November 4, 2024 12:38
Agreeing on change of condition.
Co-authored-by: Samuel Monson <[email protected]>

Don't know what was running through my head when I wrote that. Thanks
Co-authored-by: Samuel Monson <[email protected]>

Fix redundant annotation
Co-authored-by: Samuel Monson <[email protected]>
@thameem-abbas changed the title from "ADD: Batch request (Non-streaming) for OpenAI Plugin" to "WIP : ADD: Batch request (Non-streaming) for OpenAI Plugin" on Nov 11, 2024
@rgreenberg1

Has this work been picked back up/re-reviewed?

@thameem-abbas (Collaborator, Author)

@rgreenberg1 This hasn't been picked up for a while now, and it's out of sync with main by quite a bit. I can prioritize it if there is a need for this to land in llm-load-test.
