Skip to content

Commit

Permalink
📖 Clarification max len in Reward documentation (#2740)
Browse files Browse the repository at this point in the history
* Nit fix about max_lenth argument.

* copy to docstring

* typo

* consistency

---------

Co-authored-by: Quentin Gallouédec <[email protected]>
Co-authored-by: Quentin Gallouédec <[email protected]>
  • Loading branch information
3 people authored Feb 4, 2025
1 parent 32f8fa8 commit 338fbd5
Showing 1 changed file with 4 additions and 4 deletions.
8 changes: 4 additions & 4 deletions trl/trainer/reward_config.py
Original file line number Diff line number Diff line change
Expand Up @@ -29,8 +29,8 @@ class RewardConfig(TrainingArguments):
Parameters:
max_length (`int` or `None`, *optional*, defaults to `1024`):
Maximum length of the sequences (prompt + completion) in the batch. This argument is required if you want
to use the default data collator.
Maximum length of the sequences (prompt + completion) in the batch, filters out entries that exceed the
limit. This argument is required if you want to use the default data collator.
disable_dropout (`bool`, *optional*, defaults to `True`):
Whether to disable dropout in the model.
dataset_num_proc (`int`, *optional*, defaults to `None`):
Expand All @@ -46,8 +46,8 @@ class RewardConfig(TrainingArguments):
max_length: Optional[int] = field(
default=1024,
metadata={
"help": "Maximum length of the sequences (prompt + completion) in the batch. This argument is required if "
"you want to use the default data collator."
"help": "Maximum length of the sequences (prompt + completion) in the batch, filters out entries that "
"exceed the limit. This argument is required if you want to use the default data collator."
},
)
disable_dropout: bool = field(
Expand Down

0 comments on commit 338fbd5

Please sign in to comment.