Bias-Aware Low-Rank Adaptation: Mitigating Catastrophic Inheritance of Large Language Models

Chang, Yupeng; Chang, Yi; Wu, Yuan

Computer Science > Computation and Language

arXiv:2408.04556v1 (cs)

[Submitted on 8 Aug 2024 (this version), latest version 21 Feb 2025 (v4)]

Title:Bias-Aware Low-Rank Adaptation: Mitigating Catastrophic Inheritance of Large Language Models

Authors:Yupeng Chang, Yi Chang, Yuan Wu

View PDF HTML (experimental)

Abstract:Large language models (LLMs) have exhibited remarkable proficiency across a diverse array of natural language processing (NLP) tasks. However, adapting LLMs to downstream applications typically necessitates computationally intensive and memory-demanding fine-tuning procedures. To mitigate these burdens, parameter-efficient fine-tuning (PEFT) techniques have emerged as a promising approach to tailor LLMs with minimal computational overhead. While PEFT methods offer substantial advantages, they do not fully address the pervasive issue of bias propagation from pre-training data. In this work, we introduce Bias-Aware Low-Rank Adaptation (BA-LoRA), a novel PEFT method designed to counteract bias inheritance. BA-LoRA incorporates three distinct regularization terms: (1) consistency regularizer, (2) diversity regularizer, and (3) singular vector decomposition regularizer. These regularizers collectively aim to improve the generative models' consistency, diversity, and generalization capabilities during the fine-tuning process. Through extensive experiments on a variety of natural language understanding (NLU) and natural language generation (NLG) tasks, employing prominent LLMs such as LLaMA, Mistral, and Gemma, we demonstrate that BA-LoRA surpasses the performance of LoRA and its state-of-the-art variants. Moreover, our method effectively mitigates the deleterious effects of pre-training bias, leading to more reliable and robust model outputs. The code is available at this https URL.

Comments:	Work in progress
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2408.04556 [cs.CL]
	(or arXiv:2408.04556v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2408.04556

Submission history

From: Yuan Wu [view email]
[v1] Thu, 8 Aug 2024 16:13:26 UTC (135 KB)
[v2] Mon, 14 Oct 2024 14:27:04 UTC (1,022 KB)
[v3] Sun, 8 Dec 2024 16:10:25 UTC (1,040 KB)
[v4] Fri, 21 Feb 2025 04:43:43 UTC (1,022 KB)

Computer Science > Computation and Language

Title:Bias-Aware Low-Rank Adaptation: Mitigating Catastrophic Inheritance of Large Language Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Bias-Aware Low-Rank Adaptation: Mitigating Catastrophic Inheritance of Large Language Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators