[v0.5.10] Fix mask_utils for transformers >=5.0 #925
yueming-yuan merged 2 commits into bump-sglang-v0.5.10
Conversation
Code Review
This pull request introduces a helper function _apply_chat_template_ids in miles/utils/mask_utils.py to ensure that apply_chat_template consistently returns a list of token IDs, addressing changes in transformers >= 5.0. The existing logic in get_system_message_length, gen_multi_turn_loss_mask_qwen, and gen_multi_turn_loss_mask_qwen3 has been updated to use this new helper. Feedback suggests simplifying the helper function by using the return_dict=False parameter or utilizing an existing utility function.
```python
def _apply_chat_template_ids(tokenizer, messages, **kwargs) -> list[int]:
    """Wrapper that always returns list[int] from apply_chat_template(tokenize=True).

    transformers >=5.0 returns BatchEncoding instead of list[int]."""
    result = tokenizer.apply_chat_template(messages, tokenize=True, **kwargs)
    if isinstance(result, list):
        return result
    return result["input_ids"]
```
The implementation of `_apply_chat_template_ids` can be simplified by passing `return_dict=False`. This is the standard way in the transformers library to make `apply_chat_template` return a list of token IDs instead of a `BatchEncoding` object, and it is more idiomatic and robust across versions. Alternatively, consider using the existing `apply_chat_template` utility from `miles.utils.chat_template_utils.template`, which already handles this logic and additionally normalizes tools and messages.
Suggested change:

```diff
 def _apply_chat_template_ids(tokenizer, messages, **kwargs) -> list[int]:
-    """Wrapper that always returns list[int] from apply_chat_template(tokenize=True).
-    transformers >=5.0 returns BatchEncoding instead of list[int]."""
-    result = tokenizer.apply_chat_template(messages, tokenize=True, **kwargs)
-    if isinstance(result, list):
-        return result
-    return result["input_ids"]
+    """Wrapper that always returns list[int] from apply_chat_template(tokenize=True).
+    transformers >=5.0 returns BatchEncoding instead of list[int] by default."""
+    return tokenizer.apply_chat_template(
+        messages, tokenize=True, return_dict=False, **kwargs
+    )
```
Summary
transformers 5.x changed `apply_chat_template(tokenize=True)` to return `BatchEncoding` instead of `list[int]`. `mask_utils.py` used direct slicing on the result, which broke (the dict-like result has `len() == 2`, one per key). Added an `_apply_chat_template_ids()` wrapper that normalizes the return type.

Test plan
- test_loss_mask_qwen3_simple passes
- test_loss_mask_qwen3_tools passes