
Adds Anymal-C navigation rsl_rl example with obstacle and camera in the scene #2027

Open · wants to merge 12 commits into main from fea/navigation-obstacle-camera
Conversation


@KingsleyLiu-NV commented Mar 6, 2025

Description


This PR adds the Anymal-C navigation rsl_rl example with an obstacle and a camera in the scene:

  • Extends the rsl_rl libraries to support convolutional networks and a PPO runner
  • Extends the Anymal-C navigation environment with a rigid-object obstacle and a tiled camera sensor
    • Adds an event reset function that samples positions within a sector of a ring area
    • Adds an observation function for flattened, normalized RGB data
    • Adds a pose3 command resample function that determines the target based on the obstacle position
  • Tunes the RL reward terms so that training converges
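The ring-sector reset sampling mentioned above can be sketched as follows. This is a minimal illustration, not the PR's actual implementation; the function name, argument names, and ranges are all assumptions.

```python
import numpy as np

def sample_ring_sector(num, r_min, r_max, angle_min, angle_max, rng=None):
    """Uniformly sample (x, y) positions in an annular sector around the origin.

    Sketch of the reset-event idea (assumed interface): radii are drawn with a
    square-root transform so samples are uniform over the sector's area, not
    biased toward the inner radius.
    """
    rng = np.random.default_rng() if rng is None else rng
    # sqrt of a uniform draw over [r_min^2, r_max^2] gives area-uniform radii
    r = np.sqrt(rng.uniform(r_min**2, r_max**2, size=num))
    theta = rng.uniform(angle_min, angle_max, size=num)
    return np.stack([r * np.cos(theta), r * np.sin(theta)], axis=-1)
```

A reset event would then add these offsets to the environment origins to place the obstacle.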

Type of change

  • New feature (non-breaking change which adds functionality)
  • This change requires a documentation update

It can be run with the following commands:

./isaaclab.sh -p scripts/reinforcement_learning/rsl_rl/train.py --task Isaac-Navigation-Flat-Obstacle-Anymal-C-v0 --headless --num_envs 4096 --max_iterations 1501 --enable_cameras

./isaaclab.sh -p scripts/reinforcement_learning/rsl_rl/play.py --task Isaac-Navigation-Flat-Obstacle-Anymal-C-Play-v0 --headless --num_envs 16 --video

Screenshots

The training results are shown below:
[image: training results]

Memory footprint (num_envs=4096, image_input_shape=(3, 64, 64)):
[image: memory footprint]

Training speed:
[image: training speed]

Play with model_1300.pt:

e68abcd97722111e219462e88599bcba.mp4

Checklist

  • I have run the pre-commit checks with ./isaaclab.sh --format
  • I have made corresponding changes to the documentation
    • isaaclab changelog & extension.toml
    • isaaclab_rl changelog & extension.toml
    • isaaclab_tasks changelog & extension.toml
    • environments.rst
    • ??? More
  • My changes generate no new warnings
  • I have added tests that prove my fix is effective or that my feature works
  • I have updated the changelog and the corresponding version in the extension's config/extension.toml file
  • I have added my name to the CONTRIBUTORS.md or my name already exists there

@KingsleyLiu-NV force-pushed the fea/navigation-obstacle-camera branch from ac9d8b0 to 6f78579 on March 6, 2025 09:37
base_lin_vel = ObsTerm(func=mdp.base_lin_vel)
projected_gravity = ObsTerm(func=mdp.projected_gravity)
pose_command = ObsTerm(func=mdp.generated_commands, params={"command_name": "pose_command"})
camera_data = ObsTerm(func=mdp.image_flatten, params={"sensor_cfg": SceneEntityCfg("tiled_camera")})
@Toni-SM (Contributor) commented Mar 6, 2025

This implementation flattens and concatenates all the observation components (proprio and image) into one vector (a flattened observation_space), leaving the definition of the image shape used in the model to the agent configuration image_input_shape, right?

Agent configuration must be agnostic to any environment information except that specified by the environment spaces (observation_space, state_space and action_space). Currently, this is only possible with the direct workflow using composite spaces.

@KingsleyLiu-NV (Author) replied:

Yes, that understanding is correct. I use a check to ensure that image_input_shape is consistent with the tiled camera configuration: train.py#L132-L140
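The consistency check referenced here might look like the following sketch; the function and parameter names are assumptions for illustration, not the actual code in train.py.

```python
def check_image_input_shape(agent_image_shape, camera_channels, camera_height, camera_width):
    """Raise if the agent's configured image shape disagrees with the camera sensor.

    Hypothetical helper (names assumed): compares the agent configuration's
    image_input_shape against the (C, H, W) output of the tiled camera.
    """
    expected = (camera_channels, camera_height, camera_width)
    if tuple(agent_image_shape) != expected:
        raise ValueError(
            f"Agent image_input_shape {tuple(agent_image_shape)} does not match "
            f"camera output shape {expected}."
        )
```

Failing fast here avoids a silent mismatch between the flattened observation vector and the shape the conv network un-flattens it into.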

from rsl_rl.modules.actor_critic import get_activation


class ResidualBlock(nn.Module):
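The quoted snippet shows only the class header. A minimal residual block of the kind typically used in such conv encoders might look like the sketch below; this is an assumed structure, not the PR's actual code.

```python
import torch
import torch.nn as nn

class ResidualBlock(nn.Module):
    """Minimal residual block sketch: two 3x3 convolutions with a skip connection.

    Channel count and activation choice are assumptions; padding=1 keeps the
    spatial dimensions unchanged so the skip connection can be a plain addition.
    """
    def __init__(self, channels: int):
        super().__init__()
        self.conv1 = nn.Conv2d(channels, channels, kernel_size=3, padding=1)
        self.conv2 = nn.Conv2d(channels, channels, kernel_size=3, padding=1)
        self.activation = nn.ReLU()

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        out = self.activation(self.conv1(x))
        out = self.conv2(out)
        # skip connection: input and output shapes match by construction
        return self.activation(out + x)
```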
A Contributor commented:
Since this is RSL-RL library related, I would recommend making an MR there for this actor critic structure :)

@KingsleyLiu-NV (Author) commented Mar 7, 2025
Yes, I can do that. Then I suppose this PR can only be merged after RSL-RL accepts my changes.

@Mayankm96 (Contributor) left a comment

A lot of the modifications in this MR should be handled by RSL-RL library and not IsaacLab. It would be good to keep the two separated to keep IsaacLab mainly as simulation infrastructure.

Would it be possible to make an MR to RSL-RL instead? We can review it there and deal with this usecase accordingly. It shouldn't be hard for us to support a dictionary of observations and avoid an ugly hack here to deal with it.

@KingsleyLiu-NV (Author) replied:

A lot of the modifications in this MR should be handled by RSL-RL library and not IsaacLab. It would be good to keep the two separated to keep IsaacLab mainly as simulation infrastructure.

Would it be possible to make an MR to RSL-RL instead? We can review it there and deal with this usecase accordingly. It shouldn't be hard for us to support a dictionary of observations and avoid an ugly hack here to deal with it.

@Mayankm96 thanks for the suggestions. Basically, only actor_critic_conv2d.py and on_policy_runner_conv2d.py should go to the RSL-RL library, and I can file a PR there.

Regarding "support a dictionary of observations and avoid an ugly hack here to deal with it": can you elaborate on what can be improved in this PR after the conv2d-related code goes to the RSL-RL library?


Toni-SM commented Mar 7, 2025

Regarding support a dictionary of observations and avoid an ugly hack here to deal with it, can you elaborate what can be improved in this PR after conv2d-related code goes to RSL-RL library?

As I commented previously, agent configuration must be agnostic to any environment information except that specified by the environment spaces (observation_space, state_space and action_space); currently, this is only possible with the direct workflow using composite spaces (Tuple or Dict).

So, the current task implementation is a hacky, hard-coded solution particular to one RL library; it doesn't follow the convention for expressing highly structured space specifications, and it is not generalizable or reusable for other RL libraries. Since only the Direct workflow currently supports heterogeneous observations, the task implementation should be migrated to that workflow.


Mayankm96 commented Mar 8, 2025

Asking the user here to migrate their environment to the direct workflow isn't really the solution. We need to properly support heterogeneous observation/action spaces in the manager-based environments. This is in the release plan for IsaacLab 2.2. We should probably prioritize it more, as it is doable.

@KingsleyLiu-NV An intermediate solution is to define a separate observation group for the images, similar to how we define the group called 'policy'. If this group is called 'images', you can access it from the RSL-RL side through the dictionary extras["observations"]["images"]. This is how we handle asymmetric actor-critic over there. More information here: https://github.com/leggedrobotics/rsl_rl/blob/main/rsl_rl/runners/on_policy_runner.py#L32-L41
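The suggested separate observation group might be configured as in the sketch below, following the usual manager-based config pattern. The import paths, the mdp functions, and the group split are assumptions for illustration (the ObsTerm definitions echo the snippet quoted earlier in this thread); this fragment is not runnable outside an IsaacLab installation.

```python
# Sketch (assumed names and import paths) of a separate 'images' observation
# group, so image data arrives via extras["observations"]["images"] in RSL-RL.
from isaaclab.managers import ObservationGroupCfg as ObsGroup
from isaaclab.managers import ObservationTermCfg as ObsTerm
from isaaclab.managers import SceneEntityCfg
from isaaclab.utils import configclass

@configclass
class ObservationsCfg:
    @configclass
    class PolicyCfg(ObsGroup):
        # proprioceptive terms stay in the standard 'policy' group
        base_lin_vel = ObsTerm(func=mdp.base_lin_vel)
        projected_gravity = ObsTerm(func=mdp.projected_gravity)
        pose_command = ObsTerm(func=mdp.generated_commands, params={"command_name": "pose_command"})

    @configclass
    class ImagesCfg(ObsGroup):
        # image terms move into their own group
        camera_data = ObsTerm(func=mdp.image_flatten, params={"sensor_cfg": SceneEntityCfg("tiled_camera")})

    policy: PolicyCfg = PolicyCfg()
    images: ImagesCfg = ImagesCfg()
```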

@kellyguo11 for visibility.


Toni-SM commented Mar 8, 2025

This is in the release plan for 2.2 of IsaacLab. We should probably prioritize it more as it is something doable.

For sure, adding the ability to work with different space specifications to the manager-based workflow is something to be prioritized.

However, while that happens, we should point users to the appropriate workflows for creating well-formed tasks according to the agreed interface (which allows the project to be used with any RL library), and stop proposing or encouraging hard-coded solutions (which, although not as "ugly" as the previous one, are still hacky) tied to specific implementations; such solutions do not promote code reuse and compromise the robustness, maintainability, and flexibility of the project's codebase.

Therefore, for the time being, the way forward should be the Direct workflow with a composite observation_space.


KingsleyLiu-NV commented Mar 10, 2025

Hi @Mayankm96, I get your point about using extras["observations"]. However, I think this still requires a new branch in the exporter: https://github.com/isaac-sim/IsaacLab/blob/main/source/isaaclab_rl/isaaclab_rl/rsl_rl/exporter.py#L123-L151. If this kind of modification is acceptable, I will make the changes accordingly.
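One way such an exporter branch could handle dict observations is to wrap the policy so the export sees a fixed flattening order. This is a hypothetical sketch, not the actual exporter code; the class name, key handling, and policy interface are all assumptions.

```python
import torch
import torch.nn as nn

class DictObsPolicyWrapper(nn.Module):
    """Hypothetical export wrapper: flattens a dict of observation tensors
    into the single vector a flat policy expects, in a fixed key order.
    """
    def __init__(self, policy: nn.Module, obs_keys: list):
        super().__init__()
        self.policy = policy
        self.obs_keys = obs_keys  # fixed ordering baked into the exported graph

    def forward(self, obs: dict) -> torch.Tensor:
        # flatten each component past the batch dim, then concatenate
        flat = torch.cat([obs[k].flatten(start_dim=1) for k in self.obs_keys], dim=1)
        return self.policy(flat)

# usage sketch: a toy linear policy over proprio (13) + rgb (3*64*64) inputs
policy = nn.Linear(13 + 3 * 64 * 64, 4)
wrapper = DictObsPolicyWrapper(policy, ["proprio", "rgb"])
```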
