Update lora_quantization_layers.py #10876

Open
tugang-baidu wants to merge 1 commit into develop

Conversation

tugang-baidu

Fix parallel QLoRA, following paddlenlp/quantization/quantization_linear.py

The original lora_quantization_layers.py lives in paddlenlp/peft/lora.

In class QuantizationLoRABaseLinear:
In method __init__, insert the following code between 'self.bias = layer.bias' and 'self.lora_config = lora_config':

        self.state = 0
        if self.weight_quantize_algo in ["a8w8linear", "a8w4linear", "fp8linear"]:
            self.act_scale = self.create_parameter(
                shape=[1],
                dtype=self._dtype,
                is_bias=False,
                default_initializer=nn.initializer.Constant(value=0.0),
            )
            self.act_scale.is_distributed = False
            self.act_scale.stop_gradient = True
            self.group = get_act_scale_group(is_row=True)
        else:
            raise NotImplementedError(
                f"Not supported weight_quantize_algo {self.weight_quantize_algo}"
            )
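For context, a minimal sketch of where this block lands in the patched __init__; the import path for get_act_scale_group is an assumption based on the quantization_linear.py file this PR references, and the comments describe the apparent intent rather than confirmed behavior:

```python
import paddle.nn as nn

# Assumed import path, based on the reference file named in this PR.
from paddlenlp.quantization.quantization_linear import get_act_scale_group


class QuantizationLoRABaseLinear(nn.Layer):
    def __init__(self, layer, lora_config):
        ...  # existing setup, ending with:
        self.bias = layer.bias

        # Inserted block: per-layer activation-quantization state.
        self.state = 0  # forward-pass counter, handed to quant_weight_linear
        if self.weight_quantize_algo in ["a8w8linear", "a8w4linear", "fp8linear"]:
            # Running activation scale: a single scalar that is neither
            # sharded across ranks nor updated by the optimizer.
            self.act_scale = self.create_parameter(
                shape=[1],
                dtype=self._dtype,
                is_bias=False,
                default_initializer=nn.initializer.Constant(value=0.0),
            )
            self.act_scale.is_distributed = False
            self.act_scale.stop_gradient = True
            # Communication group for keeping act_scale consistent
            # across tensor-parallel ranks.
            self.group = get_act_scale_group(is_row=True)
        else:
            raise NotImplementedError(
                f"Not supported weight_quantize_algo {self.weight_quantize_algo}"
            )

        self.lora_config = lora_config
        ...  # rest of the existing __init__
```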

In method forward, add 'act_state=(self.state, self.training, self.act_scale, self.group)' to the argument list of the 'output = quant_weight_linear(...)' call, and insert the following before 'return output':

        if self.training:
            self.state += 1
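And a matching sketch of the patched forward; quant_weight_linear's other arguments are elided, and the import path is again an assumption based on the reference file:

```python
# Assumed import path, based on the reference file named in this PR.
from paddlenlp.quantization.quantization_linear import quant_weight_linear


def forward(self, x):  # method of QuantizationLoRABaseLinear
    output = quant_weight_linear(
        x,
        ...,  # existing weight/bias/scale arguments, unchanged by this PR
        act_state=(self.state, self.training, self.act_scale, self.group),
    )
    ...  # existing LoRA low-rank path, unchanged
    if self.training:
        self.state += 1  # count one more training-mode forward pass
    return output
```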

However, after this change I found that the loss starts converging from a different initial value depending on the launch mode (I deleted all checkpoints before starting each new run):

  1. No parallelism: 7.68577909
  2. Tensor parallelism with paddle.distributed.launch: 14.6875057
  3. paddle.distributed.launch: 7.88057899
  4. Pipeline parallelism with paddle.distributed.launch: 13.96667099


paddle-bot commented Jul 21, 2025

Thanks for your contribution!


CLAassistant commented Jul 21, 2025

CLA assistant check: all committers have signed the CLA.
