Skip to content

Commit 1eb4ad7

Browse files
committed
Nit
1 parent c53fbd4 commit 1eb4ad7

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -56,7 +56,7 @@ You can also run e.g. ``tune ls lora_finetune_single_device`` for a full list of
5656
Example: ``tune run knowledge_distillation_distributed --config qwen2/1.5B_to_0.5B_KD_lora_distributed`` <br />
5757
You can also run e.g. ``tune ls knowledge_distillation_distributed`` for a full list of available configs.
5858

59-
#### Reinforcement Learning + Reinforcement Learning from Human Feedback (RLHF)
59+
#### Reinforcement Learning / Reinforcement Learning from Human Feedback (RLHF)
6060

6161
| Method | Type of Weight Update | 1 Device | >1 Device | >1 Node |
6262
|------------------------------|-----------------------|:--------:|:---------:|:-------:|

0 commit comments

Comments
 (0)