Fix floating-base fail check replay divergence by ZidongChen25 · Pull Request #49 · NeuracoreAI/bigym

ZidongChen25 · 2026-06-13T03:32:47Z

Summary

Fix a replay divergence in floating-base environments where the same action sequence could succeed with env.step(action, fast=True) but fail with env.step(action, fast=False).

Root cause

BiGymEnv._fail() previously checked whether the robot moved too far from the origin by reading the pelvis Cartesian position through self._robot.pelvis.get_position(). For the pelvis body this accesses MuJoCo's derived xpos field through dm_control.

When the physics state is dirty (modified since derived quantities were last computed), reading a derived MuJoCo field through dm_control can trigger an implicit physics.forward(). That extra forward pass does not advance simulation time, but it can recompute derived state, contacts, and constraint data. In contact-heavy tasks such as StackBlocks, triggering it on every fast=False step can perturb the subsequent physical trajectory.
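The mechanism can be illustrated with a toy model. This is a sketch only, not dm_control or MuJoCo code: `ToyPhysics`, its `_cache` field, and `run` are hypothetical names, and the history-dependent cache is hardcoded purely to make the effect visible. It makes no claim about which MuJoCo quantity is responsible for the real divergence.

```python
class ToyPhysics:
    """Toy model of lazily recomputed derived state (not dm_control)."""

    def __init__(self):
        self.qpos = 0.0        # primary state
        self.time = 0.0
        self._cache = 0.0      # hypothetical value carried between forward passes
        self._xpos = 0.0       # derived quantity
        self._dirty = False
        self.forward_calls = 0

    def forward(self):
        # Recompute derived state; the result depends on the previous cache.
        self._cache = 0.5 * (self._cache + self.qpos + 1.0)
        self._xpos = self.qpos
        self._dirty = False
        self.forward_calls += 1

    @property
    def xpos(self):
        # Reading a derived field on dirty state triggers an implicit forward pass.
        if self._dirty:
            self.forward()
        return self._xpos

    def step(self, dt=0.1):
        self.forward()
        self.qpos += dt * self._cache
        self.time += dt
        self._dirty = True     # derived state is now stale


def run(read_derived, n_steps=5):
    physics = ToyPhysics()
    for _ in range(n_steps):
        physics.step()
        if read_derived:
            _ = physics.xpos   # looks like a harmless read, but has side effects
    return physics


without_read = run(read_derived=False)
with_read = run(read_derived=True)
print(without_read.time == with_read.time)   # simulation time is unaffected
print(without_read.qpos, with_read.qpos)     # trajectories differ
```

In this toy, both runs end at the same simulation time, but the run that reads `xpos` after every step performs twice as many forward passes and ends in a different state.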

Change

For floating-base robots, _fail() now reconstructs the pelvis XYZ position from the floating-base joint qpos values instead of reading body xpos. For axes that are not actuated, it falls back to the static MJCF body pos. Non-floating robots keep the previous pelvis.get_position() behavior.

This preserves the original fail check semantics while avoiding a derived-field read that can force an extra forward pass.
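A minimal sketch of the reconstruction described above, assuming the per-axis joint values have already been read from qpos. The helper names, the `None` convention for axes without a joint, the Euclidean distance test, and the `max_distance` value are all illustrative assumptions, not the code merged in this PR.

```python
import math


def pelvis_xyz_from_qpos(static_pos, axis_qpos):
    """Build an XYZ position without touching derived fields.

    static_pos: three floats, the static MJCF body pos.
    axis_qpos:  three entries, each the qpos value of the floating-base
                joint for that axis, or None if the axis has no joint.
    """
    return tuple(
        static if joint is None else joint
        for static, joint in zip(static_pos, axis_qpos)
    )


def too_far_from_origin(position, max_distance):
    """Hypothetical fail test: Euclidean distance from the origin."""
    return math.sqrt(sum(c * c for c in position)) > max_distance


# Example: the base translates in X and Y only, so Z falls back to the static pos.
position = pelvis_xyz_from_qpos((0.0, 0.0, 1.0), (0.3, -0.2, None))
print(position)
print(too_far_from_origin(position, max_distance=5.0))
```

Because the inputs are primary state (qpos) and static model data, computing the position this way does not require derived quantities to be up to date.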

Validation

python -m py_compile bigym/bigym_env.py
Validated on a StackBlocks replay that previously diverged: after this change, the fast=False replay reaches success at frame 1344, matching the successful replay path.
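One way to localize a divergence like this is to record a state vector per frame from each replay (for example a copy of qpos) and find the first frame where they differ. The helper below is a hypothetical sketch for that comparison and is not part of this PR; the recording step itself is left out.

```python
import math


def first_divergence(traj_a, traj_b, abs_tol=1e-9):
    """Return the index of the first frame where two trajectories differ.

    Each trajectory is a sequence of per-frame state vectors (sequences of
    floats). Returns None if the trajectories match within abs_tol.
    """
    for index, (state_a, state_b) in enumerate(zip(traj_a, traj_b)):
        if len(state_a) != len(state_b):
            return index
        for x, y in zip(state_a, state_b):
            if not math.isclose(x, y, rel_tol=0.0, abs_tol=abs_tol):
                return index
    if len(traj_a) != len(traj_b):
        # One replay ended early; they diverge where the shorter one stops.
        return min(len(traj_a), len(traj_b))
    return None


fast_replay = [[0.0, 0.0], [0.1, 0.0], [0.2, 0.1]]
slow_replay = [[0.0, 0.0], [0.1, 0.0], [0.2, 0.3]]
print(first_divergence(fast_replay, slow_replay))
```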

stepjam

LGTM! Thank you!

Fix floating-base fail check replay divergence

d1c2197

ZidongChen25 mentioned this pull request Jun 13, 2026

Fix floating-base fail check replay divergence ZidongChen25/bigym#1

Closed

stepjam approved these changes Jun 13, 2026

stepjam merged commit 8a7a83c into NeuracoreAI:master Jun 13, 2026
1 check passed

stepjam merged 1 commit into NeuracoreAI:master from ZidongChen25:fix/floating-base-fail-check
