We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
1 parent 78e254c commit ba1cb92Copy full SHA for ba1cb92
recipes/configs/qwen2/0.5B_full.yaml
@@ -60,7 +60,7 @@ gradient_accumulation_steps: 16
60
device: cuda
61
62
# Memory management
63
-enable_activation_checkpointing: True
+enable_activation_checkpointing: False
64
65
# Reduced precision
66
dtype: bf16
recipes/configs/qwen2/1.5B_full.yaml
@@ -60,7 +60,7 @@ gradient_accumulation_steps: 1
0 commit comments