Skip to content

Commit 734ab81

Browse files
authored
Distiller and onnx converter and pqmha (#40)
* initial onnx converter,layerwise and model distillation for torch, mha * gelu, leakyrelu activations , layernorm for Torch * dynamic data quantization option
1 parent 0e856a0 commit 734ab81

23 files changed

Lines changed: 5505 additions & 44 deletions

src/pquant/configs/config_ap.yaml

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -13,6 +13,7 @@ quantization_parameters:
1313
default_data_keep_negatives: 0.
1414
default_data_integer_bits: 0.
1515
default_data_fractional_bits: 7.
16+
dynamic_data_quantization: false
1617
granularity: "per_tensor"
1718
quantize_input: true
1819
quantize_output: false

src/pquant/configs/config_autosparse.yaml

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -16,6 +16,7 @@ quantization_parameters:
1616
default_data_keep_negatives: 0.
1717
default_data_integer_bits: 0.
1818
default_data_fractional_bits: 7.
19+
dynamic_data_quantization: false
1920
granularity: "per_tensor"
2021
quantize_input: true
2122
quantize_output: false

src/pquant/configs/config_cs.yaml

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -12,6 +12,7 @@ quantization_parameters:
1212
default_data_keep_negatives: 0.
1313
default_data_integer_bits: 0.
1414
default_data_fractional_bits: 7.
15+
dynamic_data_quantization: false
1516
granularity: "per_tensor"
1617
quantize_input: true
1718
quantize_output: false

src/pquant/configs/config_dst.yaml

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -14,6 +14,7 @@ quantization_parameters:
1414
default_data_keep_negatives: 0.
1515
default_data_integer_bits: 0.
1616
default_data_fractional_bits: 7.
17+
dynamic_data_quantization: false
1718
granularity: "per_tensor"
1819
quantize_input: true
1920
quantize_output: false

src/pquant/configs/config_fitcompress.yaml

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -11,6 +11,7 @@ quantization_parameters:
1111
default_data_keep_negatives: 0.
1212
default_data_integer_bits: 0.
1313
default_data_fractional_bits: 7.
14+
dynamic_data_quantization: false
1415
granularity: "per_tensor"
1516
quantize_input: true
1617
quantize_output: false

src/pquant/configs/config_mdmm.yaml

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -24,6 +24,7 @@ quantization_parameters:
2424
default_data_keep_negatives: 0.
2525
default_data_integer_bits: 0.
2626
default_data_fractional_bits: 7.
27+
dynamic_data_quantization: false
2728
granularity: "per_tensor"
2829
quantize_input: true
2930
quantize_output: false

src/pquant/configs/config_pdp.yaml

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -14,6 +14,7 @@ quantization_parameters:
1414
default_data_keep_negatives: 0.
1515
default_data_integer_bits: 0.
1616
default_data_fractional_bits: 7.
17+
dynamic_data_quantization: false
1718
granularity: "per_tensor"
1819
quantize_input: true
1920
quantize_output: false

src/pquant/configs/config_wanda.yaml

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -16,6 +16,7 @@ quantization_parameters:
1616
default_data_keep_negatives: 0.
1717
default_data_integer_bits: 0.
1818
default_data_fractional_bits: 7.
19+
dynamic_data_quantization: false
1920
granularity: "per_tensor"
2021
quantize_input: true
2122
quantize_output: false

src/pquant/configs/finetuning.yaml

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -13,6 +13,7 @@ quantization_parameters:
1313
integer_bits: 4
1414
fractional_bits: 6
1515
use_high_granularity_quantization: false
16+
dynamic_data_quantization: false
1617
granularity: "per_tensor"
1718
use_real_tanh: false
1819
use_symmetric_quantization: false

src/pquant/core/keras/activations.py

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -68,6 +68,7 @@ def __init__(
6868
self.hgq_gamma = config.quantization_parameters.hgq_gamma
6969
self.hgq_heterogeneous = config.quantization_parameters.hgq_heterogeneous
7070
self.use_fitcompress = config.fitcompress_parameters.enable_fitcompress
71+
self.dynamic_data = config.quantization_parameters.dynamic_data_quantization
7172

7273
self.post_fitcompress_calibration = False
7374
self.saved_inputs = []
@@ -89,6 +90,7 @@ def build(self, input_shape):
8990
is_heterogeneous=self.use_hgq,
9091
hgq_gamma=self.hgq_gamma,
9192
place="datalane",
93+
dynamic_data=self.dynamic_data,
9294
)
9395
if self.quantize_output:
9496
self.output_quantizer = Quantizer(
@@ -101,6 +103,7 @@ def build(self, input_shape):
101103
is_heterogeneous=self.use_hgq,
102104
hgq_gamma=self.hgq_gamma,
103105
place="datalane",
106+
dynamic_data=self.dynamic_data,
104107
)
105108

106109
if self.use_multiplier:

0 commit comments

Comments
 (0)