You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi there, nice work.
However I think there is a look-ahead bias.
Every timestep, you get state and this state includes the current closeprice.
Then with step method you calculate profit as:
reward is computed from the current state, thus there is no lookahead bias.
Put it simply,
(e.g.) one decides to sell the stock based on past¤t price and
if one does sell, then one would calculate the earning based on current price.
Hi there, nice work.
However I think there is a look-ahead bias.
Every timestep, you get state and this state includes the current closeprice.
Then with step method you calculate profit as:
In this case you are using the same information that you already used to predict the next action.
What do you think about it?
The text was updated successfully, but these errors were encountered: