Skip to content

Actions: huggingface/trl

Build PR Documentation

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
16 workflow run results
16 workflow run results

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

remove spurious optimize_cuda_cache deprecation warning on init
Build PR Documentation #1798: Pull request #1045 opened by ChanderG
November 30, 2023 15:04 3m 39s ChanderG:fix-1044-warning
November 30, 2023 15:04 3m 39s
[DPO] Refactor eval logging of dpo trainer
Build PR Documentation #1797: Pull request #954 synchronize by lvwerra
November 30, 2023 10:30 4m 2s mnoukhov:dpo-eval
November 30, 2023 10:30 4m 2s
[DPO] cDPO loss
Build PR Documentation #1796: Pull request #1035 synchronize by kashif
November 30, 2023 10:25 3m 55s kashif:cDPO
November 30, 2023 10:25 3m 55s
spelling is hard
Build PR Documentation #1795: Pull request #1043 opened by grahamannett
November 30, 2023 02:58 3m 36s grahamannett:patch-1
November 30, 2023 02:58 3m 36s
Update utils.py
Build PR Documentation #1794: Pull request #1012 synchronize by ZihanWang314
November 29, 2023 14:49 3m 44s ZihanWang314:patch-1
November 29, 2023 14:49 3m 44s
Update utils.py
Build PR Documentation #1793: Pull request #1012 synchronize by ZihanWang314
November 28, 2023 18:36 3m 34s ZihanWang314:patch-1
November 28, 2023 18:36 3m 34s
Fixes reward and text gathering in distributed training
Build PR Documentation #1792: Pull request #850 synchronize by vwxyzjn
November 27, 2023 15:26 3m 33s fix-reward-gather
November 27, 2023 15:26 3m 33s
Fixes reward and text gathering in distributed training
Build PR Documentation #1791: Pull request #850 synchronize by vwxyzjn
November 27, 2023 15:17 3m 5s fix-reward-gather
November 27, 2023 15:17 3m 5s
Fixes reward and text gathering in distributed training
Build PR Documentation #1790: Pull request #850 synchronize by vwxyzjn
November 27, 2023 14:55 3m 40s fix-reward-gather
November 27, 2023 14:55 3m 40s
[DPO] cDPO loss
Build PR Documentation #1789: Pull request #1035 synchronize by kashif
November 26, 2023 10:27 3m 38s kashif:cDPO
November 26, 2023 10:27 3m 38s
[DPO] cDPO loss
Build PR Documentation #1788: Pull request #1035 opened by kashif
November 26, 2023 10:26 55s kashif:cDPO
November 26, 2023 10:26 55s
[SFT Trainer] precompute packed iterable into a dataset
Build PR Documentation #1787: Pull request #979 synchronize by lvwerra
November 24, 2023 17:05 3m 34s precompute-packing
November 24, 2023 17:05 3m 34s
[SFT Trainer] precompute packed iterable into a dataset
Build PR Documentation #1786: Pull request #979 synchronize by lvwerra
November 24, 2023 16:16 3m 34s precompute-packing
November 24, 2023 16:16 3m 34s
[DPO] use ref model logprobs if it exists in the data
Build PR Documentation #1785: Pull request #885 synchronize by kashif
November 24, 2023 15:36 3m 45s kashif:reference-logprobs
November 24, 2023 15:36 3m 45s
[DPO] use ref model logprobs if it exists in the data
Build PR Documentation #1784: Pull request #885 synchronize by kashif
November 24, 2023 15:00 3m 27s kashif:reference-logprobs
November 24, 2023 15:00 3m 27s