
[QUESTION] Issue with Evaluating Decision Transformer Using Evaluators in d3rlpy #406

Open

XiudingCai opened this issue Jul 21, 2024 · 1 comment

Labels: enhancement (New feature or request)

XiudingCai commented Jul 21, 2024

I have encountered an issue while trying to evaluate the performance of the Decision Transformer (DT) using the d3rlpy library. Unlike other methods such as CQL, it seems that DT does not support passing evaluators like:

evaluators={
    'action_diff': d3rlpy.metrics.ContinuousActionDiffEvaluator(test_episodes),
}
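
For reference, this is how the same dictionary is normally passed with a Q-learning algorithm such as CQL; the Pendulum dataset and the naive episode split below are only placeholders:

import d3rlpy

# Placeholder data: any offline dataset and held-out episodes would do.
dataset, env = d3rlpy.datasets.get_pendulum()
test_episodes = dataset.episodes[:10]

cql = d3rlpy.algos.CQLConfig().create()
cql.fit(
    dataset,
    n_steps=10000,
    evaluators={
        'action_diff': d3rlpy.metrics.ContinuousActionDiffEvaluator(test_episodes),
    },
)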

This limitation is problematic, especially in settings where an environment is not available for evaluation. It hinders the ability to compare the performance of DT with other methods under these conditions.

Is there a workaround or a recommended approach for this situation?

Thanks so much! :>

XiudingCai added the enhancement (New feature or request) label on Jul 21, 2024
takuseno (Owner) commented

@XiudingCai Hi, sorry for the late response. This is a tricky issue because Q-learning and Decision Transformer are completely different algorithms, so it's difficult for them to share the same evaluator interface. One possible workaround is to use the callback option of the fit method, which lets you run arbitrary logic at every training step:

callback: Optional[Callable[[Self, int, int], None]] = None,
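
For example, the callback could compute an action-difference style metric over held-out episodes while DT trains. The sketch below is only one possible way to do this and makes several assumptions not stated in this thread (a Pendulum dataset, a naive episode split, target_return=0, a mean-squared action difference, and evaluation every 1,000 steps); it is not part of d3rlpy's evaluator API.

import numpy as np
import d3rlpy

# Illustrative setup only; substitute your own dataset and held-out episodes.
dataset, env = d3rlpy.datasets.get_pendulum()
test_episodes = dataset.episodes[:10]

dt = d3rlpy.algos.DecisionTransformerConfig().create()

def action_diff_callback(algo, epoch, total_step):
    # The callback fires on every training step, so gate the evaluation.
    if total_step % 1000 != 0:
        return
    # Wrap DT as a stateful actor so it can emit one action per step,
    # analogous to predict() on Q-learning algorithms.
    actor = algo.as_stateful_wrapper(target_return=0)
    diffs = []
    for episode in test_episodes:
        actor.reset()
        reward = 0.0
        for t, (observation, action) in enumerate(
            zip(episode.observations, episode.actions)
        ):
            predicted = actor.predict(observation, reward)
            diffs.append(float(np.mean((predicted - action) ** 2)))
            # Feed back the reward the dataset recorded for this step.
            reward = float(episode.rewards[t])
    print(f"step={total_step} action_diff={np.mean(diffs):.4f}")

dt.fit(
    dataset,
    n_steps=100000,
    n_steps_per_epoch=1000,
    callback=action_diff_callback,
)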
