Fix invalid "Runner" export in __all__ and incorrect metric aggregation by Muhammad-Ikhwan-Fathulloh · Pull Request #660 · rllm-org/rllm

Muhammad-Ikhwan-Fathulloh · 2026-06-16T17:59:39Z

Fixes #659

Summary

This PR fixes two issues related to package exports and metric aggregation:

Removes the invalid "Runner" entry from rllm/__init__.py::__all__.
Fixes reward aggregation in reduce_metrics_by_trajectory_name() so that all rewards for a trajectory name are collected instead of overwriting previous values.

These changes improve API consistency and ensure metrics are computed from the complete set of trajectory rewards.

Problem

Invalid Public Export

rllm/__init__.py exposes "Runner" through __all__, but no corresponding symbol is imported or defined. This results in an inconsistent public API and can mislead users relying on exported package symbols.

Incorrect Metric Aggregation

reduce_metrics_by_trajectory_name() replaces previously stored rewards when multiple trajectories share the same name.

Current behavior:

trajectory_rewards[name] = reward

This causes only the last reward to be retained.

Expected behavior:

trajectory_rewards[name].append(reward)

All rewards should be collected so that downstream statistics are calculated correctly.

Changes

rllm/init.py

Remove "Runner" from __all__
Ensure exported symbols accurately reflect available imports

rllm/trainer/algorithms/metrics.py

Aggregate rewards into lists instead of overwriting values
Preserve all rewards associated with a trajectory name
Add safe handling for None rewards during aggregation

Before

For trajectories:

[
    {"trajectory_name": "task_a", "reward": 0.5},
    {"trajectory_name": "task_a", "reward": 0.8},
]

Stored result:

{"task_a": 0.8}

After

Stored result:

{"task_a": [0.5, 0.8]}

This allows metrics to be computed using the full reward distribution.

Type of Change

Testing

Verified package exports after removing invalid symbol.
Verified reward aggregation preserves all rewards for identical trajectory names.
Verified metric reduction works correctly with multiple trajectories.
Verified None rewards do not break aggregation logic.

Impact

Fixes inconsistent package exports.
Prevents loss of reward information during metric aggregation.
Produces more accurate training and evaluation metrics.
Maintains backward compatibility.

1. Remove 'Runner' from __all__ in rllm/__init__.py since it doesn't exist 2. Fix reduce_metrics_by_trajectory_name to collect all rewards instead of replacing

Muhammad-Ikhwan-Fathulloh and others added 6 commits June 7, 2026 23:41

Fix variable shadowing, missing timeouts, and add EvalResult.load()

afa85ed

Merge branch 'rllm-org:main' into main

f862001

fix: resolve ruff linting and formatting issues

9a8b796

Merge branch 'rllm-org:main' into main

92a428c

Merge branch 'rllm-org:main' into main

df4d81f

fix: remove invalid 'Runner' from __all__ and fix metric collection

0fb9d26

1. Remove 'Runner' from __all__ in rllm/__init__.py since it doesn't exist 2. Fix reduce_metrics_by_trajectory_name to collect all rewards instead of replacing

jeffreysijuntan merged commit e2d8d10 into rllm-org:main Jun 17, 2026
2 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix invalid "Runner" export in all and incorrect metric aggregation#660

Fix invalid "Runner" export in all and incorrect metric aggregation#660
jeffreysijuntan merged 6 commits into
rllm-org:mainfrom
Muhammad-Ikhwan-Fathulloh:main

Muhammad-Ikhwan-Fathulloh commented Jun 16, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Muhammad-Ikhwan-Fathulloh commented Jun 16, 2026

Summary

Problem

Invalid Public Export

Incorrect Metric Aggregation

Changes

rllm/init.py

rllm/trainer/algorithms/metrics.py

Before

After

Type of Change

Testing

Impact

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants