Neural Networks Remember More: The Power of Parameter Isolation and Combination

Biqing Zeng* (Corresponding Author), Zehan Li, Aladdin Ayesh

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingPublished conference contribution

Abstract

Pre-trained language models(PLMs) suffer from catastrophic forgetting in continual learning, as sequential task training overwrites previously learned representations. The model’s ability to remain old tasks is referred to as stability, while its adaptability to new tasks is called plasticity. Therefore, the key to addressing this challenge requires balancing model plasticity with stability. To address this issue, in this paper, we propose a novel method to achieve a balance between model stability and plasticity, thereby mitigating catastrophic forgetting. More specific, our proposed approach leverages parameter isolation and subsequent combination strategy. Initially, in training stage, the model adapts on each downstream task via parameter isolation method to prevent potential inference among different tasks. We then combine all trained parameters which containing acquired knowledge by model merging method and finally apply to the backbone model. Empirical evaluations on continual language learning benchmarks substantiate the effectiveness of our approach, revealing a marked enhancement over existing state-of-the-art approaches.

Original languageEnglish
Title of host publicationAdvanced Intelligent Computing Technology and Applications - 21st International Conference, ICIC 2025, Proceedings
EditorsDe-Shuang Huang, Chuanlei Zhang, Qinhu Zhang, Yijie Pan
PublisherSpringer Science and Business Media Deutschland GmbH
Pages83-93
Number of pages11
ISBN (Electronic)978-981-96-9911-7
ISBN (Print)9789819699100
DOIs
Publication statusPublished - 25 Jul 2025
Event21st International Conference on Intelligent Computing, ICIC 2025 - Ningbo, China
Duration: 26 Jul 202529 Jul 2025

Publication series

NameCommunications in Computer and Information Science
Volume2564 CCIS
ISSN (Print)1865-0929
ISSN (Electronic)1865-0937

Conference

Conference21st International Conference on Intelligent Computing, ICIC 2025
Country/TerritoryChina
CityNingbo
Period26/07/2529/07/25

Keywords

  • Catastrophic Forgetting
  • Continual Learning
  • Model Merging
  • Parameter-Efficient Fine-Tuning

Fingerprint

Dive into the research topics of 'Neural Networks Remember More: The Power of Parameter Isolation and Combination'. Together they form a unique fingerprint.

Cite this