fix: prevent prompt overflow by rangehow · Pull Request #157 · gusye1234/nano-graphrag

rangehow · 2025-07-11T09:58:45Z

The previous logic for truncation by length was too coarse-grained, failing to account for the extra characters used in various connections such as chat templates, templates, headers, and lists. Additionally, it did not consider the measurement discrepancies between different tokenizers (e.g., Qwen and GPT-4o can differ by up to 2k tokens in some cases). To address all these issues, we have proposed a preliminary solution.

We introduced a more refined, albeit more cumbersome, length budget allocation. At the same time, we allow users to pass a Hugging Face tokenizer via GraphRAG parameters to serve as the metric for token counting.

close #146

rangehow added 2 commits July 11, 2025 17:54

fix: prevent prompt overflow by truncating long inputs

644b277

typo

e025429

rangehow merged commit 0cea71b into main Jul 11, 2025
1 check failed

rangehow deleted the fix/prompt-length branch July 11, 2025 10:05

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: prevent prompt overflow#157

fix: prevent prompt overflow#157
rangehow merged 2 commits intomainfrom
fix/prompt-length

rangehow commented Jul 11, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

rangehow commented Jul 11, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant