Partition Testing

Background
Suppose I make an algorithm and I want to test how well it performs. One approach would be to simply run the algorithm in a single pass over all of my data; this is the default for platforms like quantopian or backtrader. Another approach would be to first partition the data set into n sub-data sets, then perform the backtest on each of those n sub-data sets.
Benefits of Partitioning
It allows for a tighter confidence interval around performance. In general, we aren't trying to find a strategy that will perform incredibly well over a 5-year period. While that would be nice, there just isn't enough data to form a solid confidence interval around how the algorithm would perform when deployed (you probably have only one relevant 5-year sample). There is therefore a trade-off between long-term testing and test confidence: the longer each test, the fewer sub-tests we can partition our data into, and the less able we are to form a confidence interval around our mean performance (or to analyze the distribution of sub-test outcomes at all).
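To make the confidence-interval point concrete, here is a minimal sketch, assuming each sub-test reduces to a single performance number. The name partition_returns and the values in it are hypothetical, not output from any real backtest:

```julia
using Statistics

# Hypothetical per-partition returns, one number per sub-test.
partition_returns = [0.021, -0.004, 0.013, 0.009, 0.017, 0.002]

n  = length(partition_returns)
m  = mean(partition_returns)
se = std(partition_returns) / sqrt(n)   # standard error of the mean

# Normal-approximation 95% interval; with this few partitions a
# t-quantile would be the more careful choice.
ci = (m - 1.96se, m + 1.96se)

println("mean = $m, 95% CI = $ci")
```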
Speed. Since all of the sub-tests are run independently of each other (and each would presumably take some time), we could run them in parallel. For example, if you have 8 CPU cores, you may be able to run the tests in just over 1/8 the time it would take to run a full test. That would be nice!
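As a rough illustration of that speedup, a sketch using Julia's built-in threading; run_subtest is a dummy stand-in for a single-partition backtest, and Julia must be started with multiple threads (e.g. julia -t 8) for the parallelism to help:

```julia
# Dummy stand-in for one sub-test; a real backtest would go here.
run_subtest(partition) = (sleep(0.1); partition)

partitions = 1:8
results = Vector{Any}(undef, length(partitions))

# Sub-tests are independent of each other, so threading over them is safe.
Threads.@threads for i in eachindex(partitions)
    results[i] = run_subtest(partitions[i])
end
```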
Easy integration with train-validation-test splitting. In the process of making this feature, we would also create generic code for partitioning a data set by DateTime. Once this is in place, we can seamlessly use the partitioning functionality to make a train-validation-test partitioning of our data set.
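A sketch of the generic DateTime partitioning code this benefit refers to; split_range is a hypothetical helper name, not something that exists yet. Given proportions that sum to 1, it returns contiguous (start, stop) sub-ranges, so [0.7, 0.15, 0.15] yields a train-validation-test split:

```julia
using Dates

# Split [start, stop) into contiguous sub-ranges by proportion.
function split_range(start::DateTime, stop::DateTime, proportions)
    @assert isapprox(sum(proportions), 1.0)
    total_ms = Dates.value(stop - start)           # full span in milliseconds
    cuts = [start + Millisecond(round(Int, total_ms * p))
            for p in cumsum(proportions)]
    bounds = [start; cuts]
    return [(bounds[i], bounds[i + 1]) for i in 1:length(proportions)]
end

# 70% train, 15% validation, 15% test over five years of data.
split_range(DateTime(2015, 1, 1), DateTime(2020, 1, 1), [0.7, 0.15, 0.15])
```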
Interface
Options
Option 1
Since this is an entirely new type of test, it may be best to expose it through a new run method. The first version of run operates over a single continuous block of time, so it might be nice to overload that function with additional methods. The new methods could have one (or many!) of the following signatures (sketched as stubs after the list below):
run(::StrategyOptions, [(...Dates.DateTime args)]) [cutoffs between partitions]
run(::StrategyOptions, ::Integer) [number of partitions]
run(::StrategyOptions, ::Number[0,100], ::Number[0,100]) [proportion of dataset to be used for train and validation, respectively]
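For concreteness, the three proposed signatures might look like the following method stubs, relying on Julia's multiple dispatch. The bodies are placeholders, and StrategyOptions is assumed to be the package's existing options type:

```julia
using Dates

# Explicit cutoffs between partitions.
function run(options::StrategyOptions, cutoffs::Vector{DateTime})
    # derive sub-ranges from the cutoffs, then run each sub-test
end

# A number of equally sized partitions.
function run(options::StrategyOptions, npartitions::Integer)
    # derive npartitions equal DateTime ranges, then run each sub-test
end

# Train/validation proportions in percent; test gets the remainder.
function run(options::StrategyOptions, train::Real, validation::Real)
    @assert 0 <= train <= 100 && 0 <= validation <= 100 && train + validation <= 100
    # split the data set accordingly, then run each sub-test
end
```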
Option 2
Add various arguments to StrategyOptions that achieve the desired sampling. These arguments may include downsample, train, validation, partitionproportions, etc.
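For contrast, Option 2 might look roughly like this; the field names and the keyword constructor are purely illustrative, since the actual StrategyOptions fields are not defined here:

```julia
# Illustrative only: partitioning controlled by extra StrategyOptions fields.
options = StrategyOptions(;
    # ...existing fields elided...
    npartitions = 8,    # split the run into 8 sub-tests
    train       = 70,   # percent of the data used for training
    validation  = 15,   # percent for validation; the remainder is test
)
```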
Preferences
I would prefer to see Option 1, for the following reasons:
Separation of concerns. If we have a working version of run(::StrategyOptions) (the original backtest runner), then we can add new run methods without needing to re-test run(::StrategyOptions).
Argument parsing & extensibility. If we add optional partitioning arguments, then we'd need logic inside our original run(::StrategyOptions) function to account for what type of backtest we're running. That can lead to a lot of spaghetti code if we later add more partitioning features.
Implementation (assuming we go with Interface Option 1)
Mid-Level Steps (for any type of partitioning function)
Calculate the start and end time for all of the partitions (or potentially receive this as input).
Re-generate datareaders so that the number of datareader copies is the same as the number of partitions.
Adjust StrategyOptions as necessary.
Invoke run(::StrategyOptions) for each of the StrategyOptions objects.
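Pulling those four steps together, a control-flow sketch under heavy assumptions: equal_partitions, regenerate_datareader, and with_range are hypothetical helpers, and the starttime/endtime fields are guesses at how StrategyOptions stores its date range.

```julia
# Sketch of the mid-level steps for the npartitions variant.
function run(options::StrategyOptions, npartitions::Integer)
    # 1. Calculate the start and end time for every partition.
    ranges = equal_partitions(options.starttime, options.endtime, npartitions)

    # 2./3. One datareader copy and one adjusted StrategyOptions per partition.
    suboptions = map(ranges) do (substart, substop)
        reader = regenerate_datareader(options, substart, substop)
        with_range(options, substart, substop, reader)
    end

    # 4. Invoke the original single-block runner on each sub-options object.
    return [run(subopt) for subopt in suboptions]
end
```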
Questions and Concerns
If we re-generate all of the datareaders, will there be issues with holding the data in memory with DataReaders.InMemoryDataReader? If so, how should we adjust our approach as we use more data or create more partitions?