I am reading the Decision Transformer paper and experimenting with the Hugging Face implementation's API.
The paper describes three kinds of model inputs: states, actions, and returns-to-go. The Hugging Face implementation, however, also accepts a fourth input, the immediate reward (`rewards`), which is never used inside the forward function.
To check this, I passed random values for `rewards`, and the model's output did not change.
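Here is a minimal sketch of the experiment I ran. It builds a small randomly initialized `DecisionTransformerModel`, calls it twice with identical inputs except for the `rewards` tensor, and compares the outputs; the dimensions and config values below are arbitrary choices for illustration, not anything prescribed by the library.

```python
import torch
from transformers import DecisionTransformerConfig, DecisionTransformerModel

# Small model for testing; state_dim/act_dim/hidden_size are arbitrary.
config = DecisionTransformerConfig(state_dim=3, act_dim=2, hidden_size=64, max_ep_len=100)
model = DecisionTransformerModel(config)
model.eval()  # disable dropout so the two forward passes are deterministic

batch, seq = 1, 5
states = torch.randn(batch, seq, config.state_dim)
actions = torch.randn(batch, seq, config.act_dim)
returns_to_go = torch.randn(batch, seq, 1)
timesteps = torch.arange(seq).unsqueeze(0)        # (batch, seq), long
attention_mask = torch.ones(batch, seq)

with torch.no_grad():
    # Same inputs twice, differing only in the `rewards` argument.
    out_zero = model(states=states, actions=actions,
                     rewards=torch.zeros(batch, seq, 1),
                     returns_to_go=returns_to_go, timesteps=timesteps,
                     attention_mask=attention_mask)
    out_rand = model(states=states, actions=actions,
                     rewards=torch.randn(batch, seq, 1),
                     returns_to_go=returns_to_go, timesteps=timesteps,
                     attention_mask=attention_mask)

# If `rewards` is truly unused, the predictions are identical.
print(torch.equal(out_zero.action_preds, out_rand.action_preds))
```

In my runs the comparison printed `True` for every output head, which is consistent with `rewards` never entering the computation graph.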
Since `rewards` is unused in the forward pass, I believe the argument is redundant and potentially confusing, and I suggest removing it from the code.
I am new to reinforcement learning, so my analysis may be mistaken; I would appreciate any corrections or feedback.