Skip to content

Actions: AdvancedPhotonSource/generic_trainer

Actions

Python application

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
80 workflow runs
80 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Allow providing separated training/validation datasets
Python application #30: Commit 9cfaef2 pushed by mdw771
April 4, 2024 20:17 2m 5s main
April 4, 2024 20:17 2m 5s
Freeze backbone when using pretrained models with only encoder loaded
Python application #29: Commit eada0d9 pushed by mdw771
April 4, 2024 19:30 4m 46s main
April 4, 2024 19:30 4m 46s
Move write_training_info after validation
Python application #28: Commit 11bdb43 pushed by mdw771
April 4, 2024 15:04 2m 13s main
April 4, 2024 15:04 2m 13s
Set epoch for distributed sampler
Python application #27: Commit 5414fc4 pushed by mdw771
April 3, 2024 14:41 11m 9s main
April 3, 2024 14:41 11m 9s
Fix bug in Pretrainer process_data_loader_yield
Python application #26: Commit b9d9cb5 pushed by mdw771
March 30, 2024 17:13 2m 15s main
March 30, 2024 17:13 2m 15s
Go back to use DistributedSampler for multi-node mode after fixing bug
Python application #25: Commit 882c77b pushed by mdw771
March 29, 2024 21:44 3m 12s main
March 29, 2024 21:44 3m 12s
Remove import message_logger from modules. Import this module in appl…
Python application #24: Commit 31a3208 pushed by mdw771
March 28, 2024 20:12 2m 0s main
March 28, 2024 20:12 2m 0s
Basic PyTorch Lightning trainer
Python application #23: Commit 005a66e pushed by mdw771
March 22, 2024 20:04 12m 8s main
March 22, 2024 20:04 12m 8s
HuggingFace Accelerate trainer load_model choose what method to use b…
Python application #22: Commit 3c72ce0 pushed by mdw771
March 22, 2024 17:09 2m 0s main
March 22, 2024 17:09 2m 0s
Multinode pretrained encoder loading
Python application #21: Commit 1e2ffa1 pushed by mdw771
March 22, 2024 17:03 1m 57s main
March 22, 2024 17:03 1m 57s
HuggingFace Accelerate pretrainer and load_model behavior
Python application #20: Commit 754c650 pushed by mdw771
March 14, 2024 22:19 1m 54s main
March 14, 2024 22:19 1m 54s
Make base class more generic to reduce overriden methods in subclasses
Python application #19: Commit 78282b5 pushed by mdw771
March 14, 2024 19:35 2m 0s main
March 14, 2024 19:35 2m 0s
HuggingFace Accelerate checkpointing
Python application #18: Commit 129dddc pushed by mdw771
March 13, 2024 21:52 2m 0s main
March 13, 2024 21:52 2m 0s
move_to_device wrapper; multirank gatekeeper
Python application #17: Commit 16ac05b pushed by mdw771
March 13, 2024 15:05 2m 43s main
March 13, 2024 15:05 2m 43s
Fixed data processor bug for multinode mode
Python application #16: Commit 18a9f6c pushed by mdw771
March 12, 2024 18:43 1m 59s main
March 12, 2024 18:43 1m 59s
Subclass with simple HuggingFace Accelerate integration
Python application #15: Commit eb31dc1 pushed by mdw771
March 11, 2024 21:28 1m 59s main
March 11, 2024 21:28 1m 59s
Sync loss tracker for Pretrainer
Python application #14: Commit 4c2387f pushed by mdw771
March 7, 2024 20:42 2m 2s main
March 7, 2024 20:42 2m 2s
Fix model save and load for multi-node
Python application #13: Commit 9fe42c4 pushed by mdw771
March 7, 2024 02:49 2m 30s main
March 7, 2024 02:49 2m 30s
Set log levels back to info
Python application #12: Commit 023f981 pushed by mdw771
March 6, 2024 22:36 2m 51s main
March 6, 2024 22:36 2m 51s
Sync LossTracker data across all ranks in multi-node mode
Python application #11: Commit 1fad1c7 pushed by mdw771
March 6, 2024 21:10 2m 3s main
March 6, 2024 21:10 2m 3s
Removed barriers that caused hanging on Polaris
Python application #10: Commit 25e9a44 pushed by mdw771
March 6, 2024 16:40 2m 7s main
March 6, 2024 16:40 2m 7s
Change Python version in pyproject.toml
Python application #9: Commit d8e71af pushed by mdw771
March 5, 2024 16:05 2m 4s main
March 5, 2024 16:05 2m 4s
Stop using DistributedSampler for DDP
Python application #8: Commit 5bf7a55 pushed by mdw771
March 5, 2024 15:46 1m 57s main
March 5, 2024 15:46 1m 57s
Add readme
Python application #7: Commit 30e6b9a pushed by mdw771
March 4, 2024 21:10 2m 20s main
March 4, 2024 21:10 2m 20s
Polaris examples
Python application #6: Commit e4c535b pushed by mdw771
March 4, 2024 21:09 2m 56s main
March 4, 2024 21:09 2m 56s