Skip to content

Commit e99b890

Browse files
felipemello1Felipe Mello
andauthored
Activation offloading for fullfinetuning + fix tied embedding (#1847)
Co-authored-by: Felipe Mello <[email protected]>
1 parent a1bcb97 commit e99b890

File tree

89 files changed

+384
-103
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

89 files changed

+384
-103
lines changed

recipes/configs/code_llama2/7B_full_low_memory.yaml

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -69,6 +69,7 @@ device: cuda
6969

7070
# Memory management
7171
enable_activation_checkpointing: True
72+
enable_activation_offloading: True # True reduces memory
7273
dtype: bf16
7374

7475
# Logging

recipes/configs/code_llama2/7B_lora_single_device.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -77,7 +77,7 @@ device: cuda
7777

7878
# Memory management
7979
enable_activation_checkpointing: True
80-
enable_activation_offloading: False
80+
enable_activation_offloading: False # True reduces memory
8181
dtype: bf16
8282

8383
# Logging

recipes/configs/code_llama2/7B_qlora_single_device.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -76,7 +76,7 @@ device: cuda
7676

7777
# Memory management
7878
enable_activation_checkpointing: True
79-
enable_activation_offloading: False
79+
enable_activation_offloading: False # True reduces memory
8080
dtype: bf16
8181

8282
# Logging

recipes/configs/dev/8B_full_experimental.yaml

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -65,6 +65,7 @@ device: cuda
6565

6666
# Memory management
6767
enable_activation_checkpointing: False
68+
enable_activation_offloading: False # True reduces memory
6869
ac_mode: 'selective' # ['selective', 'full']
6970
ac_option: 2 # [int] = ac every positive int layer
7071
memory_efficient_fsdp_wrap: False

recipes/configs/gemma/2B_full.yaml

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -62,6 +62,7 @@ device: cuda
6262

6363
# Memory management
6464
enable_activation_checkpointing: True
65+
enable_activation_offloading: False # True reduces memory
6566

6667
# Reduced precision
6768
dtype: bf16

recipes/configs/gemma/2B_lora.yaml

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -74,6 +74,7 @@ device: cuda
7474

7575
# Memory management
7676
enable_activation_checkpointing: True
77+
enable_activation_offloading: False # True reduces memory
7778

7879
# Reduced precision
7980
dtype: bf16

recipes/configs/gemma/2B_lora_single_device.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -73,7 +73,7 @@ device: cuda
7373

7474
# Memory management
7575
enable_activation_checkpointing: True
76-
enable_activation_offloading: False
76+
enable_activation_offloading: False # True reduces memory
7777

7878
# Reduced precision
7979
dtype: bf16

recipes/configs/gemma/2B_qlora_single_device.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -73,7 +73,7 @@ device: cuda
7373

7474
# Memory management
7575
enable_activation_checkpointing: True
76-
enable_activation_offloading: False
76+
enable_activation_offloading: False # True reduces memory
7777

7878
# Reduced precision
7979
dtype: bf16

recipes/configs/gemma/7B_full.yaml

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -64,6 +64,7 @@ device: cuda
6464

6565
# Memory management
6666
enable_activation_checkpointing: True
67+
enable_activation_offloading: False # True reduces memory
6768

6869
# Reduced precision
6970
dtype: bf16

recipes/configs/gemma/7B_lora.yaml

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -76,6 +76,7 @@ device: cuda
7676

7777
# Memory management
7878
enable_activation_checkpointing: True
79+
enable_activation_offloading: False # True reduces memory
7980

8081
# Reduced precision
8182
dtype: bf16

0 commit comments

Comments
 (0)