You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Agents with the same goal already help each other at determining the reward from one point to the goal, with a very high value of alpha = 0.5.
But all vehicles use all roads... So maybe it is possible for all vehicles to arrive at some sort of consensus on the individual rewards of each action for all agents, since that reward does not depend on the individual goals of each individual.
This would likely make learning faster, but would not improve result quality.
The text was updated successfully, but these errors were encountered:
Agents with the same goal already help each other at determining the reward from one point to the goal, with a very high value of alpha = 0.5.
But all vehicles use all roads... So maybe it is possible for all vehicles to arrive at some sort of consensus on the individual rewards of each action for all agents, since that reward does not depend on the individual goals of each individual.
This would likely make learning faster, but would not improve result quality.
The text was updated successfully, but these errors were encountered: