-
Notifications
You must be signed in to change notification settings - Fork 735
Closed as not planned
Description
System Info / 系統信息
mac os
Embedding错信息Server error: 400 - [address=127.0.0.1:47065, pid=95140] Model Qwen3-Embedding-8B cannot be run on engine sentence_transformers.
Reranker报错信息 error: 500 - [address=127.0.0.1:58592, pid=1019] Error while deserializing header: MetadataIncompleteBuffer
Running Xinference with Docker? / 是否使用 Docker 运行 Xinfernece?
- docker / docker
- pip install / 通过 pip install 安装
- installation from source / 从源码安装
Version info / 版本信息
1.7.0post1
The command used to start Xinference / 用以启动 xinference 的命令
XINFERENCE_ENABLE_VIRTUAL_ENV=1 xinference-local
Reproduction / 复现过程
1.选择Qwen3-Reranker-8B或者 Qwen3-Embedding-8B
2.默认为副本1、设备cpu
3.点击小火箭启动
Expected behavior / 期待表现
希望能够正常运行