Skip to content

Actions: huggingface/trl

All workflows

Actions

Loading...
Loading

Showing runs from all workflows
20,357 workflow runs
20,357 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

feat: Add vLLM dtype configuration for GRPO trainer
Build PR Documentation #6353: Pull request #2738 opened by joey00072
February 2, 2025 12:05 Action required joey00072:grpo_vllm
February 2, 2025 12:05 Action required
feat: Add vLLM dtype configuration for GRPO trainer
Tests #7200: Pull request #2738 opened by joey00072
February 2, 2025 12:05 Action required joey00072:grpo_vllm
February 2, 2025 12:05 Action required
docs: Fix typos in alias descriptions (#2729)
Build documentation #1097: Commit 6e088d1 pushed by qgallouedec
February 2, 2025 10:59 3m 19s main
February 2, 2025 10:59 3m 19s
docs: Fix typos in alias descriptions (#2729)
Secret Leaks #2315: Commit 6e088d1 pushed by qgallouedec
February 2, 2025 10:59 22s main
February 2, 2025 10:59 22s
docs: Fix typos in alias descriptions (#2729)
Tests #7199: Commit 6e088d1 pushed by qgallouedec
February 2, 2025 10:59 25m 13s main
February 2, 2025 10:59 25m 13s
pages build and deployment
pages-build-deployment #1107: by qgallouedec
February 2, 2025 10:59 36s main
February 2, 2025 10:59 36s
feat(GRPOTrainer): reward_func return None to skip
Hugging Face Issue Labeler #83: Issue #2737 opened by ctjlewis
February 2, 2025 08:25 35s
February 2, 2025 08:25 35s
PLZ make padding_free for DataCollatorForChatML.
Hugging Face Issue Labeler #82: Issue #2736 opened by YooSungHyun
February 2, 2025 05:44 36s
February 2, 2025 05:44 36s
SFTvsRL SFT Memorizes, RL Generalizes
Hugging Face Issue Labeler #81: Issue #2735 opened by NickyDark1
February 2, 2025 03:56 22s
February 2, 2025 03:56 22s
GRPO Trainer supports VLMs
Hugging Face Issue Labeler #80: Issue #2734 opened by sunildkumar
February 2, 2025 02:59 27s
February 2, 2025 02:59 27s
DPOTrainer Loss
Hugging Face Issue Labeler #79: Issue #2733 opened by jeromeku
February 2, 2025 02:39 23s
February 2, 2025 02:39 23s
GKD Example why do not use labels?
Hugging Face Issue Labeler #78: Issue #2732 opened by YooSungHyun
February 2, 2025 02:39 42s
February 2, 2025 02:39 42s
Build Docker images (scheduled)
Build Docker images (scheduled) #390: Scheduled
February 2, 2025 01:29 10m 38s main
February 2, 2025 01:29 10m 38s
Latest TRL code = significantly worse rewards for GRPO training
Hugging Face Issue Labeler #77: Issue #2731 opened by abacaj
February 2, 2025 01:18 24s
February 2, 2025 01:18 24s
Tests latest TRL release with dev dependencies
Tests latest TRL release with dev dependencies #60: Scheduled
February 2, 2025 00:19 20m 11s main
February 2, 2025 00:19 20m 11s
Cleanup Cache
Cleanup Cache #679: Scheduled
February 2, 2025 00:04 15s main
February 2, 2025 00:04 15s
Dynamically load LoRA weights when using vLLM
Tests #7198: Pull request #2730 opened by tgaddair
February 1, 2025 23:38 Action required tgaddair:fix-peft-vllm-grpo-lora
February 1, 2025 23:38 Action required
Dynamically load LoRA weights when using vLLM
Build PR Documentation #6352: Pull request #2730 opened by tgaddair
February 1, 2025 23:38 Action required tgaddair:fix-peft-vllm-grpo-lora
February 1, 2025 23:38 Action required
Upload PR Documentation
Upload PR Documentation #4635: completed by mirceapricop
February 1, 2025 21:05 24s
February 1, 2025 21:05 24s
GRPO: Set max_model_len when initializing vLLM instance
Build PR Documentation #6351: Pull request #2728 synchronize by mirceapricop
February 1, 2025 18:58 3m 31s mirceapricop:patch-1
February 1, 2025 18:58 3m 31s
GRPO: Set max_model_len when initializing vLLM instance
Tests #7197: Pull request #2728 synchronize by mirceapricop
February 1, 2025 18:58 27m 4s mirceapricop:patch-1
February 1, 2025 18:58 27m 4s
GRPO: Set max_model_len when initializing vLLM instance
Build PR Documentation #6349: Pull request #2728 opened by mirceapricop
February 1, 2025 16:01 Action required mirceapricop:patch-1
February 1, 2025 16:01 Action required
GRPO: Set max_model_len when initializing vLLM instance
Tests #7196: Pull request #2728 opened by mirceapricop
February 1, 2025 16:01 Action required mirceapricop:patch-1
February 1, 2025 16:01 Action required
fix: Fix typo in filename in ultrafeedback-prompt.py (#2716)
Build documentation #1096: Commit a325a0e pushed by qgallouedec
February 1, 2025 13:53 3m 20s main
February 1, 2025 13:53 3m 20s
fix: Fix typo in filename in ultrafeedback-prompt.py (#2716)
Slow tests (on push) #481: Commit a325a0e pushed by qgallouedec
February 1, 2025 13:53 17m 41s main
February 1, 2025 13:53 17m 41s