dlwh (Member) commented on Oct 15, 2025

Several fields in DecodeState duplicated state that was already tracked in PageTable, and it was a big mess. Now it is a somewhat smaller mess.

decode_state = decode_state.assign_seq(
    local_slot_id=slot_val,
    tokens=work.prompt_tokens["seq", i],
    seq_len=prompt_len,
Member Author

We actually want seq_len to start out as 0 and get set during prefill.
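
To illustrate the intent, here is a minimal sketch. It assumes a toy `ToyDecodeState` class that is purely hypothetical and is not the real `DecodeState` API; only the argument names `local_slot_id`, `tokens`, and `seq_len` come from the snippet above, and the `prefill` method is invented for illustration. The point is only that assignment records the prompt tokens with a length of 0, and prefill is the step that sets the length.

```python
from dataclasses import dataclass, field, replace


@dataclass(frozen=True)
class ToyDecodeState:
    """Hypothetical stand-in for DecodeState; not the real API."""

    tokens: dict = field(default_factory=dict)    # slot id -> prompt tokens
    seq_lens: dict = field(default_factory=dict)  # slot id -> tokens processed so far

    def assign_seq(self, local_slot_id, tokens, seq_len=0):
        # A freshly assigned sequence has processed nothing yet,
        # so seq_len defaults to 0 rather than the prompt length.
        return replace(
            self,
            tokens={**self.tokens, local_slot_id: list(tokens)},
            seq_lens={**self.seq_lens, local_slot_id: seq_len},
        )

    def prefill(self, local_slot_id):
        # Prefill consumes the prompt and only then records its length.
        n = len(self.tokens[local_slot_id])
        return replace(self, seq_lens={**self.seq_lens, local_slot_id: n})


state = ToyDecodeState().assign_seq(local_slot_id=0, tokens=[11, 12, 13])
len_after_assign = state.seq_lens[0]   # nothing processed yet

state = state.prefill(0)
len_after_prefill = state.seq_lens[0]  # now equals the prompt length
```

With this shape, the length has a single writer (prefill), which seems to match the goal of removing duplicated bookkeeping.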

outputs = _DecodeOutputs.init(
    max_tokens=gen_state.decode_state.max_seqs * 2,
    max_seqs=gen_state.decode_state.max_seqs,
    max_tokens=decode_state.max_seqs * 2,
Collaborator

RP: review this in more detail.

rjpower merged commit cc11732 into main on Oct 15, 2025
10 of 11 checks passed
rjpower deleted the generation_cleanups branch on October 15, 2025 at 18:39
rjpower added a commit that referenced this pull request Oct 15, 2025
rjpower added a commit that referenced this pull request Oct 15, 2025

3 participants