FairQuanti: Enhancing Fairness in Deep Neural Network Quantization via Neuron Role Contribution




Publisher
Association for Computing Machinery
Copyright
Copyright © 2025 Copyright held by the owner/author(s). Publication rights licensed to ACM.
ISSN
2471-2566
eISSN
2471-2574
DOI
10.1145/3744560

Abstract

The increasing complexity of deep neural networks (DNNs) poses significant resource challenges for edge devices, prompting the development of compression technologies like model quantization. However, while improving model efficiency, quantization can introduce or perpetuate the original model’s bias. Existing debiasing methods for quantized models often incur additional costs. To address this issue, we propose FairQuanti, a novel quantization approach that leverages neuron role contribution to achieve fairness. By distinguishing between biased and normal neurons, FairQuanti employs mixed precision quantization to mitigate model bias during the quantization process. FairQuanti has four key differences from previous studies: (1) Neuron Roles - It formally defines biased and normal neuron roles, establishing a framework for feasible model quantization and bias mitigation; (2) Effectiveness - It introduces a fair quantization strategy that discriminatively quantizes neuron roles, balancing model accuracy and fairness through Bayesian optimization; (3) Generality - It applies to both structured and unstructured data across various quantization bit levels; (4) Robustness - It demonstrates resilience against adaptive attacks. Extensive experiments on five datasets (three structured and two unstructured) using five different models validate FairQuanti’s superior performance against eight baseline methods. Specifically, fairness metrics such as demographic parity (DP) improve by approximately 1.03 times, and the demographic parity ratio (DPR) improves by approximately 1.51 times compared to the baselines, with an average accuracy loss of less than 7.5% at 8-bit quantization. FairQuanti presents a promising solution for deploying fair and efficient deep models on resource-constrained devices and holds potential for application in large language models to reduce size and computational demands while minimizing bias. 
Our source code is available at https://github.com/Caozq2/FairQuanti.
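To make the abstract's two key ingredients concrete, here is a minimal illustrative sketch in Python: simple definitions of the demographic parity difference (DP) and demographic parity ratio (DPR) for two groups, and a toy mixed-precision scheme that quantizes "normal" neurons aggressively while keeping "biased" neurons at higher precision. The `biased_mask`, bit widths, and uniform quantizer below are assumptions for illustration only; the paper's actual neuron-role criterion and Bayesian-optimized bit allocation are not reproduced here.

```python
import numpy as np

def demographic_parity(y_pred, group):
    """DP difference and ratio between two demographic groups.

    y_pred : binary predictions (0/1); group : binary group membership (0/1).
    DP difference = |P(y=1 | g=0) - P(y=1 | g=1)|;
    DPR = min(rate) / max(rate), where 1.0 means perfect parity.
    """
    r0 = y_pred[group == 0].mean()
    r1 = y_pred[group == 1].mean()
    dp = abs(r0 - r1)
    dpr = min(r0, r1) / max(r0, r1) if max(r0, r1) > 0 else 1.0
    return dp, dpr

def quantize(w, bits):
    """Uniform symmetric quantization of a weight vector to `bits` bits."""
    scale = np.abs(w).max() / (2 ** (bits - 1) - 1)
    return np.round(w / scale) * scale

def mixed_precision(weights, biased_mask, low_bits=8, high_bits=16):
    """Quantize each neuron's weight row; neurons flagged as biased keep
    the higher precision, normal neurons get the aggressive low bit width."""
    out = np.empty_like(weights)
    for i, w in enumerate(weights):
        out[i] = quantize(w, high_bits if biased_mask[i] else low_bits)
    return out
```

For example, if one group receives only positive predictions and the other only negative ones, DP is 1.0 and DPR is 0.0, the worst case both metrics can report; quantizing biased neurons at higher precision is one way to keep such gaps from widening during compression.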

Journal

ACM Transactions on Privacy and Security (TOPS), Association for Computing Machinery

Published: Aug 23, 2025

Keywords: Deep neural network
