I think there is a look-ahead bias #5

mg64ve · 2019-08-12T06:53:54Z

Hi there, nice work.
However, I think there is a look-ahead bias.
At every timestep you get the state, and this state includes the current close price.
Then, in the step method, you calculate the profit as:

self.exit_price = self.closingPrice
self.reward += ((self.entry_price - self.exit_price)/self.exit_price + 1)*(1-self.fee)**2 - 1 # calculate reward

In this case you are using the same information that you already used to predict the next action.
What do you think about it?
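
For reference, the reward expression quoted above can be isolated as a small pure function, which makes it easier to see which price enters where. This is only a sketch: the name `short_trade_reward` and the sample prices and fee are made up for illustration and are not from the repository. Because the expression uses entry minus exit, it appears to describe closing a short position.

```python
def short_trade_reward(entry_price: float, exit_price: float, fee: float) -> float:
    """Reward expression from the quoted snippet, written as a pure function.

    Hypothetical helper for illustration; not part of the repository.
    """
    # (entry - exit) / exit + 1 simplifies to entry / exit
    gross = (entry_price - exit_price) / exit_price + 1
    # the fee factor is squared: once for the entry, once for the exit
    return gross * (1 - fee) ** 2 - 1


# Assumed sample values: entered at 100, exiting at the current close of 95.
reward = short_trade_reward(entry_price=100.0, exit_price=95.0, fee=0.001)
print(reward)
```

The question in this issue is about the `exit_price` argument: it is the same close price that is already part of the state the agent acted on.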

miroblog · 2019-12-31T20:39:10Z

state_n <- updateState()
action_n <- network(state_n)
reward_n <- compute_reward(action_n, state_n)

state_n_plus_1 <- updateState()
action_n_plus_1 <- network(state_n_plus_1)
reward_n_plus_1 <- compute_reward(action_n_plus_1, state_n_plus_1)

The reward is computed from the current state, so there is no look-ahead bias.
Put simply, one decides to sell the stock based on past and current prices,
and if one does sell, one calculates the earnings from the current price.
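
The two readings in this thread can be put side by side in a short sketch. Everything here is hypothetical: the helper names, the toy price series, and the zero fee are assumptions for illustration, and neither function is taken from the repository. `reward_same_bar` follows the ordering described above (decide on bar n, settle at bar n's close), while `reward_next_bar` settles at the following bar's close, a variant one could use to check whether the choice of fill price changes the results.

```python
from typing import Sequence


def short_reward(entry: float, exit_: float, fee: float) -> float:
    # Same expression as in the issue: entry / exit, fee applied twice.
    return (entry / exit_) * (1 - fee) ** 2 - 1


def reward_same_bar(closes: Sequence[float], entry_idx: int,
                    decide_idx: int, fee: float) -> float:
    # Exit is settled at the close of the bar the decision was made on.
    return short_reward(closes[entry_idx], closes[decide_idx], fee)


def reward_next_bar(closes: Sequence[float], entry_idx: int,
                    decide_idx: int, fee: float) -> float:
    # Exit is settled at the close of the bar after the decision.
    return short_reward(closes[entry_idx], closes[decide_idx + 1], fee)


closes = [100.0, 98.0, 95.0, 97.0]  # toy series, assumed values
same = reward_same_bar(closes, entry_idx=0, decide_idx=2, fee=0.0)
nxt = reward_next_bar(closes, entry_idx=0, decide_idx=2, fee=0.0)
print(same, nxt)
```

On this toy series the two rewards differ because the price moves between the decision bar and the next one; whether that difference matters in practice depends on how orders would actually be filled.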
