-
Notifications
You must be signed in to change notification settings - Fork 19
feat(sdk): propagate OCI layer-based metadata to the launcher #523
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
agronskiy
merged 77 commits into
main
from
agronskiy/experimental/launcher/retrieve-oci-based-metadata-storage
Dec 16, 2025
Merged
feat(sdk): propagate OCI layer-based metadata to the launcher #523
agronskiy
merged 77 commits into
main
from
agronskiy/experimental/launcher/retrieve-oci-based-metadata-storage
Dec 16, 2025
Conversation
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
|
Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually. Contributors can view more details about this message here. |
d39e224 to
5063446
Compare
Collaborator
Author
|
/ok to test 5063446 |
878087e to
5d7ba78
Compare
Collaborator
Author
|
/ok to test 5d7ba78 |
79d647b to
7bfcd76
Compare
Signed-off-by: Alex Gronskiy <[email protected]>
…f_eval) Signed-off-by: Marta Stepniewska-Dziubinska <[email protected]>
Signed-off-by: Marta Stepniewska-Dziubinska <[email protected]>
Signed-off-by: Marta Stepniewska-Dziubinska <[email protected]>
Signed-off-by: Alex Gronskiy <[email protected]>
Signed-off-by: Alex Gronskiy <[email protected]>
Signed-off-by: Marta Stepniewska-Dziubinska <[email protected]>
Signed-off-by: Alex Gronskiy <[email protected]>
Signed-off-by: Marta Stepniewska-Dziubinska <[email protected]>
Signed-off-by: Marta Stepniewska-Dziubinska <[email protected]>
Signed-off-by: Marta Stepniewska-Dziubinska <[email protected]>
Signed-off-by: Alex Gronskiy <[email protected]>
Signed-off-by: Marta Stepniewska-Dziubinska <[email protected]>
Signed-off-by: Alex Gronskiy <[email protected]>
Signed-off-by: Alex Gronskiy <[email protected]>
Signed-off-by: Alex Gronskiy <[email protected]>
Signed-off-by: Alex Gronskiy <[email protected]>
Signed-off-by: Alex Gronskiy <[email protected]>
Signed-off-by: Alex Gronskiy <[email protected]>
Signed-off-by: Alex Gronskiy <[email protected]>
Signed-off-by: Alex Gronskiy <[email protected]>
e0973bd to
80de472
Compare
ko3n1g
approved these changes
Dec 16, 2025
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
CI
documentation
Improvements or additions to documentation
nemo-evaluator
nemo-evaluator-launcher
scripts
tests
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
(still under active self-review, some AI gen remnants)
Implemented OCI layer inspection to extract framework definitions from container images without pulling entire images. The system now:
framework.ymlfiles from containers using OCI layer inspection (partial_pull.py)nemo_evaluator.core.input.get_framework_evaluations()TaskIntermediateRepresentation,HarnessIntermediateRepresentation)all_tasks_irs.yamlwithmapping.tomlchecksum validationls task,ls tasks) with--fromflag for on-the-fly container inspectionmapping.tomlchanges are reflected inall_tasks_irs.yamlApprox set of changes
1. OCI Layer Inspection
partial_pull.pymodule for OCI layer inspection~/.docker/config.json) as fallback~/.nemo-evaluator/docker-meta/)2. Framework Extraction Script
-
load_framework_definitions.pyscriptframework.ymlfrom containers using OCI layer inspectionfind_file_matching_pattern_in_image_layers()for pattern-based searchnemo_evaluator.core.input.get_framework_evaluations()all_tasks_irs.yamlwith checksum metadatamapping.tomland stores it inall_tasks_irs.yamlmetadatamapping.tomlchanges,all_tasks_irs.yamlmust be regenerated locally by running this scripttest_packaged_mapping_toml_checksum_match()test which fails if checksums don't matchmapping.tomlare always reflected inall_tasks_irs.yamlbefore merging (checksum checks)3. IR-Based Loading System
all_tasks_irs.yaml(single YAML document withmetadataandtaskssections)nemo_evaluator.core.input.get_framework_evaluations()to parse framework.yml filesTaskIntermediateRepresentationandHarnessIntermediateRepresentationdataclassesmapping.tomlchecksum stored inall_tasks_irs.yamlmetadata and validated on loadall_tasks_irs.yamlis in sync withmapping.tomltest_packaged_mapping_toml_checksum_match()ensures packaged artifacts match4. CLI Commands
ls_task.pycommand (new file)all_tasks_irs.yamlviaload_tasks_from_tasks_file()mapping_verified=False--from <container>flag for on-the-fly container inspectionls_tasks.pycommand--from <container>flag for on-the-fly container inspectionmapping.tomlwhen--fromnot provided--from <container>flag to both commandsframework.ymlfrom container using OCI layer inspection5. Documentation Generation
autogen_task_yamls.pyscriptload_tasks_from_tasks_file()to load IRsdocs/task_catalog/harnesses/*.md)docs/task_catalog/benchmarks-table.md)mapping_verified=Falsedocs/conf.pysetup()function)---) between tasks in harness pagesSummary by CodeRabbit
New Features
Documentation
Tests
✏️ Tip: You can customize this high-level summary in your review settings.