Skip to content

Qwen3-Embedding-8B和Qwen3-Reranker-8B报错 #3641

@noahachao

Description

@noahachao

System Info / 系統信息

mac os
Embedding错信息Server error: 400 - [address=127.0.0.1:47065, pid=95140] Model Qwen3-Embedding-8B cannot be run on engine sentence_transformers.
Reranker报错信息 error: 500 - [address=127.0.0.1:58592, pid=1019] Error while deserializing header: MetadataIncompleteBuffer

Running Xinference with Docker? / 是否使用 Docker 运行 Xinfernece?

  • docker / docker
  • pip install / 通过 pip install 安装
  • installation from source / 从源码安装

Version info / 版本信息

1.7.0post1

The command used to start Xinference / 用以启动 xinference 的命令

XINFERENCE_ENABLE_VIRTUAL_ENV=1 xinference-local

Reproduction / 复现过程

1.选择Qwen3-Reranker-8B或者 Qwen3-Embedding-8B
2.默认为副本1、设备cpu
3.点击小火箭启动

Expected behavior / 期待表现

希望能够正常运行

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions