Fixed `chunk_code_text`'s `UnboundLocalError` crash on empty code file #965

jamesbraza · 2025-06-11T18:35:06Z

If a code file is empty (e.g. a py.typed file), the Text-creation loop inside chunk_code_text will not be entered, so we hit an UnboundLocalError:

UnboundLocalError: cannot access local variable 'i' where it is not associated with a value

This PR:

Fixes that crash, with test coverage
Improves variable names and documents the purpose of conditional logic

Copilot

Pull Request Overview

This PR fixes an UnboundLocalError in chunk_code_text when processing empty files and improves the code’s variable naming.

Fixes a bug where an empty file (e.g. py.typed) would trigger an UnboundLocalError.
Enhances variable names and updates the logic in the chunking loop to improve clarity.
Adds test coverage to verify the handling of empty file content.

Reviewed Changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated no comments.

File	Description
tests/test_paperqa.py	Added a test case for a py.typed file to cover empty content scenarios.
paperqa/readers.py	Updated variable names in the chunking logic to prevent crashes and improve clarity.

Comments suppressed due to low confidence (2)

paperqa/readers.py:225

[nitpick] Consider renaming 'last_line_i' to 'start_line_index' or a similarly clear name to better indicate its purpose in marking the start of a chunk.

line_i = last_line_i = 0

paperqa/readers.py:222

[nitpick] Consider updating the function docstring to mention the use of the 'text_buffer' variable and the revised variable names for better clarity.

'''Parse a document into chunks, based on line numbers (for code).'''

paperqa/readers.py

jamesbraza added 3 commits June 11, 2025 11:24

Renamed split to text_buffer and i to line_i to be understandable

9d951f4

Documented final if statement's purpose

8126ea2

Added empty code artifact (py.typed) to the stub data

abb689b

jamesbraza requested review from whitead, sremo, mskarlin, maykcaldas, ludomitch, Eddie-MG and nadolskit June 11, 2025 18:35

jamesbraza self-assigned this Jun 11, 2025

Copilot AI review requested due to automatic review settings June 11, 2025 18:35

jamesbraza added the bug Something isn't working label Jun 11, 2025

dosubot bot added the size:S This PR changes 10-29 lines, ignoring generated files. label Jun 11, 2025

Copilot AI reviewed Jun 11, 2025

View reviewed changes

Eddie-MG approved these changes Jun 12, 2025

View reviewed changes

maykcaldas approved these changes Jun 12, 2025

View reviewed changes

paperqa/readers.py Show resolved Hide resolved

dosubot bot added the lgtm This PR has been approved by a maintainer label Jun 12, 2025

nadolskit approved these changes Jun 12, 2025

View reviewed changes

jamesbraza merged commit 1993745 into main Jun 12, 2025
17 of 20 checks passed

jamesbraza deleted the working-with-py-typed branch June 12, 2025 22:21

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fixed `chunk_code_text`'s `UnboundLocalError` crash on empty code file #965

Fixed `chunk_code_text`'s `UnboundLocalError` crash on empty code file #965

Uh oh!

jamesbraza commented Jun 11, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Fixed chunk_code_text's UnboundLocalError crash on empty code file #965

Fixed chunk_code_text's UnboundLocalError crash on empty code file #965

Uh oh!

Conversation

jamesbraza commented Jun 11, 2025

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Fixed `chunk_code_text`'s `UnboundLocalError` crash on empty code file #965

Fixed `chunk_code_text`'s `UnboundLocalError` crash on empty code file #965