Questions About VLLM Parallel Inference #7

@tulvgengenr

Description

In evaluate_vllm.sh and evaluate_72B_vllm.sh, max_workers is set to 1. This configuration does not take advantage of vLLM's parallel inference. However, when I set max_workers to a value greater than 1, an error occurs. Have you encountered this issue?

Labels

help wanted (Extra attention is needed)
