fix: pass rope_scaling=None for Qwen3 to avoid unhashable dict error by CruxZhou · Pull Request #198 · GeeeekExplorer/nano-vllm

CruxZhou · 2026-04-09T13:46:18Z

Summary

This PR fixes a runtime error when loading Qwen3 models with newer versions of transformers (>5).

Problem

Qwen3Attention forwards config.rope_scaling into get_rope(...), but get_rope() does not currently implement RoPE scaling and assumes rope_scaling is None.

In newer transformers versions, config.rope_scaling may be parsed as a dict, which causes:

TypeError: unhashable type: 'dict'

because get_rope() is cached with lru_cache, which hashes its arguments to build the cache key, and a dict is not hashable.
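
The failure mode can be reproduced outside the repository with a minimal sketch. Here `get_rope_stub` and `try_call` are hypothetical stand-ins, not the project's code; the stub only mimics the fact that get_rope() is wrapped in lru_cache, and the dict contents are made up for illustration.

```python
from functools import lru_cache


@lru_cache(maxsize=1)
def get_rope_stub(head_size, max_position, base, rope_scaling=None):
    # Hypothetical stand-in for get_rope(): returns a tuple instead of
    # building a rotary embedding module.
    return (head_size, max_position, base)


def try_call(rope_scaling):
    # lru_cache hashes every argument to build its cache key, so an
    # unhashable argument fails before the function body runs.
    try:
        get_rope_stub(128, 4096, 1000000.0, rope_scaling)
        return "ok"
    except TypeError as exc:
        return f"TypeError: {exc}"


print(try_call(None))                      # hashable argument, call succeeds
print(try_call({"rope_type": "default"}))  # dict argument, raises TypeError
```

The error comes from the cache lookup itself, so it occurs even though the stub never reads rope_scaling.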

Fix

Pass rope_scaling=None explicitly in Qwen3Attention.

This keeps behavior aligned with the current implementation status and avoids the runtime error without changing existing RoPE logic.
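
As a rough illustration of the change at the call site, the sketch below passes rope_scaling=None regardless of what the config holds. The names `get_rope_stub`, `build_rope`, and the config fields are assumptions chosen for the example and may not match the actual Qwen3Attention code.

```python
from functools import lru_cache
from types import SimpleNamespace


@lru_cache(maxsize=1)
def get_rope_stub(head_size, rotary_dim, max_position, base, rope_scaling=None):
    # Hypothetical stand-in: like the description of get_rope(), it does not
    # implement scaling and expects rope_scaling to be None.
    assert rope_scaling is None, "RoPE scaling is not implemented in this stub"
    return ("rope", head_size, rotary_dim, max_position, base)


def build_rope(config):
    # Instead of forwarding config.rope_scaling (possibly a dict),
    # pass None explicitly so the cached call only sees hashable arguments.
    return get_rope_stub(
        config.head_dim,
        config.head_dim,
        config.max_position_embeddings,
        config.rope_theta,
        rope_scaling=None,
    )


# Illustrative config whose rope_scaling was parsed as a dict.
config = SimpleNamespace(
    head_dim=128,
    max_position_embeddings=40960,
    rope_theta=1000000.0,
    rope_scaling={"rope_type": "default"},
)

rope = build_rope(config)
print(rope)
```

With this approach any scaling settings in the config are ignored, which matches a get_rope() that does not implement scaling in the first place.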

Fixes #189

fix: pass rope_scaling=None for Qwen3 to avoid unhashable dict error

8ef7030

CruxZhou wants to merge 1 commit into GeeeekExplorer:main from CruxZhou:rope-fix
