**Describe the bug** Isn't it meaningless if we divide the whole number into whole numbers?  Isn't it right to change it like this? m_BlueAgentGroup.AddGroupReward(1 - **(float)m_ResetTimer** / MaxEnvironmentSteps); Thank you.