Add sim tutorial, fix lekiwi motor config, add notebook links #1275

pkooij · 2025-06-12T13:57:01Z

What this does

Adds a tutorial on using gym-hil to do imitation learning in sim
Adds a link to a collab notebook for training ACT and SmolVLA
Adds a fix for configuring motors for Lekiwi (: str type the LeKiwiConfig port) + docs on how to configure lekiwi motors
Fixes a minor HF papers link for diffusion policy

- Changed the device assignment for tensors in the ReplayBuffer class from `device` to `storage_device` for consistency and improved resource management.

for more information, see https://pre-commit.ci

…e calls - Introduced a new "task" field in frame_dict to meet the requirements of LeRobotDataset. - Removed task_name parameter from save_episode calls for consistency.

…management

- gym_manipulator - find_joint_limits - end_effector_utils

Add ManiSkill environment configuration and wrappers - Introduced `VideoRecordConfig` for video recording settings. - Added `ManiskillEnvConfig` to encapsulate environment-specific configurations. - Implemented various wrappers for the ManiSkill environment, including observation and action scaling. - Enhanced the `make_maniskill` function to create a wrapped ManiSkill environment with video recording and observation processing. - Updated the `actor_server` and `learner_server` to utilize the new configuration structure. - Refactored the training pipeline to accommodate the new environment and policy configurations.

- Reduced frame rate in `ManiskillEnvConfig` from 400 to 200. - Enhanced `SACConfig` with new dataclasses for actor, learner, and network configurations. - Improved input and output feature management in `SACConfig`. - Refactored `actor_server` and `learner_server` to access configuration properties directly. - Updated training pipeline to validate configurations and handle dataset repo IDs more robustly.

Added support for hil_serl classifier to be trained with train.py run classifier training by python lerobot/scripts/train.py --policy.type=hilserl_classifier fixes in find_joint_limits, control_robot, end_effector_control_utils

…rties - Introduced `WrapperConfig` dataclass for environment wrapper configurations. - Updated `ManiskillEnvConfig` to include a `wrapper` field for enhanced environment management. - Modified `SACConfig` to return `None` for `observation_delta_indices` and `action_delta_indices` properties. - Refactored `make_robot_env` function to improve readability and maintainability.

Moved HilSerl env config to configs/env/configs.py fixes in actor_server and modeling_sac and configuration_sac added the possibility of ignoring missing keys in env_cfg in get_features_from_env_config function

- Implemented process-specific logging for actor and learner servers to improve traceability. - Created a dedicated logs directory and ensured it exists before logging. - Initialized logging with explicit log files for each process, including actor transitions, interactions, and policy. - Updated the actor CLI to validate configuration and set up logging accordingly.

- Simplified the `image_features` property to directly iterate over `input_features`. - Removed unused imports and unnecessary code related to main execution, enhancing clarity and maintainability.

- Rearranged import statements for better readability. - Removed unused imports and streamlined the code structure.

- Removed unused imports and streamlined the code structure. - Consolidated logging initialization and enhanced logging for training processes. - Improved handling of training state loading and resume logic. - Refactored transition and interaction message processing for better readability and maintainability. - Added detailed comments and documentation for clarity.

- Consolidated logging initialization and enhanced logging for actor processes. - Streamlined the handling of gRPC connections and process management. - Improved readability by organizing core algorithm functions and communication functions. - Added detailed comments and documentation for clarity. - Ensured proper queue management and shutdown handling for actor processes.

…onality - Updated the `forward` method in `SACPolicy` to handle loss computation for actor, critic, and temperature models. - Replaced direct calls to `compute_loss_*` methods with a unified `forward` method in `learner_server`. - Enhanced batch processing by consolidating input parameters into a single dictionary for better readability and maintainability. - Removed redundant code and improved documentation for clarity.

- Enhanced type annotations for variables in the `SACPolicy` class to improve code clarity. - Updated method calls to use keyword arguments for better readability. - Streamlined the extraction of batch components, ensuring consistent typing across the class methods.

…f gamepad Minor modifications in gym_manipulator to quantize the gripper actions clamped the observations after F.resize in ConvertToLeRobotObservation wrapper due to a bug in F.resize, images were returned exceeding the maximum value of 1.0

for more information, see https://pre-commit.ci

…918)

…ed divergence

for more information, see https://pre-commit.ci

- Implemented grasp critic to evaluate gripper actions - Added corresponding config parameters for tuning

…e/lerobot into feat/add_sim_tutorial

docs/source/il_sim.mdx

michel-aractingi

lgtm, good work

AdilZouitine and others added 30 commits April 18, 2025 15:06

Update tensor device assignment in ReplayBuffer class

cdcf346

- Changed the device assignment for tensors in the ReplayBuffer class from `device` to `storage_device` for consistency and improved resource management.

[pre-commit.ci] auto fixes from pre-commit.com hooks

1c8daf1

for more information, see https://pre-commit.ci

Removed depleted files and scripts

2abbd60

[pre-commit.ci] auto fixes from pre-commit.com hooks

0ea2770

for more information, see https://pre-commit.ci

Handle multi optimizers

bb5a958

Handle new config with sac

80d566e

Add task field to frame_dict in ReplayBuffer and simplify save_episod…

38e8864

…e calls - Introduced a new "task" field in frame_dict to meet the requirements of LeRobotDataset. - Removed task_name parameter from save_episode calls for consistency.

Add .devcontainer to .gitignore for improved development environment …

26ee8b6

…management

Change config logic in:

114ec64

- gym_manipulator - find_joint_limits - end_effector_utils

Add wandb run id in config

0b5b62c

Change HILSerlRobotEnvConfig to inherit from EnvConfig

b69132c

Added support for hil_serl classifier to be trained with train.py run classifier training by python lerobot/scripts/train.py --policy.type=hilserl_classifier fixes in find_joint_limits, control_robot, end_effector_control_utils

Added gripper control mechanism to gym_manipulator

05a237c

Moved HilSerl env config to configs/env/configs.py fixes in actor_server and modeling_sac and configuration_sac added the possibility of ignoring missing keys in env_cfg in get_features_from_env_config function

fix

8fb373a

Refactor SACConfig properties for improved readability

c0ba4b4

- Simplified the `image_features` property to directly iterate over `input_features`. - Removed unused imports and unnecessary code related to main execution, enhancing clarity and maintainability.

Refactor imports in modeling_sac.py for improved organization

3beab33

- Rearranged import statements for better readability. - Removed unused imports and streamlined the code structure.

[pre-commit.ci] auto fixes from pre-commit.com hooks

eb44a06

for more information, see https://pre-commit.ci

Fix: Prevent Invalid next_state References When optimize_memory=True (#…

70d4189

…918)

Fix cuda graph break

0185a0b

Fix convergence of sac, multiple torch compile on the same model caus…

5b49601

…ed divergence

[pre-commit.ci] auto fixes from pre-commit.com hooks

334cf81

for more information, see https://pre-commit.ci

Add grasp critic

6669396

- Implemented grasp critic to evaluate gripper actions - Added corresponding config parameters for tuning

Merge branch 'feat/add_sim_tutorial' of https://github.com/huggingfac…

681f41c

…e/lerobot into feat/add_sim_tutorial

AdilZouitine reviewed Jun 13, 2025

View reviewed changes

docs/source/il_sim.mdx Outdated Show resolved Hide resolved

michel-aractingi and others added 2 commits June 13, 2025 18:09

(docs) Moved configs from examples directory to the hub

55e82c7

Merge branch 'main' into feat/add_sim_tutorial

c7d982a

imstevenpmwork added the bug Something isn’t working correctly label Jun 13, 2025

Merge branch 'main' into feat/add_sim_tutorial

e99f4f5

pkooij unassigned michel-aractingi Jun 13, 2025

pkooij requested a review from michel-aractingi June 13, 2025 16:42

michel-aractingi approved these changes Jun 13, 2025

View reviewed changes

pkooij merged commit 438334d into main Jun 13, 2025
8 checks passed

pkooij deleted the feat/add_sim_tutorial branch June 13, 2025 16:48

This was referenced Jun 30, 2025

Feature implementation from commits edfebd5..ce6a26d codeOwlAI/lerobot#2

Open

Feature implementation from commits 37748c8..35e6758 codeOwlAI/lerobot#3

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add sim tutorial, fix lekiwi motor config, add notebook links #1275

Add sim tutorial, fix lekiwi motor config, add notebook links #1275

Uh oh!

pkooij commented Jun 12, 2025 •

edited

Loading

Uh oh!

Uh oh!

michel-aractingi left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

8 participants

Add sim tutorial, fix lekiwi motor config, add notebook links #1275

Add sim tutorial, fix lekiwi motor config, add notebook links #1275

Uh oh!

Conversation

pkooij commented Jun 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What this does

Uh oh!

Uh oh!

michel-aractingi left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

8 participants

pkooij commented Jun 12, 2025 •

edited

Loading