Closed
Description
Hello,
I trained an agent to reach a target. I give to the agent instant reward when he get closer to the target.
Now I added to the agent the ability to shoot "slow" bullets, and I want to give reward when one or more bullets hit the target.
So, there are two issue:
- How can I assign reward to an agent if the event occurs between bullet and target?
- How can agent know this reward is related to past decision? (The decision to shoot)
For the first issue, I think to instantiate bullets as childs of the agent, in order to call the parent function giveReward when the event is triggered.
For the second issue, I read something about recurrent neural network, but I don't know if it is the right way. Another possibility is to observation vector some information about the shooted bullet, in order to understand which bullets hitted?
Thanks in advance for the discussion.