Adapting Pre-trained Neural Networks for Dynamic Inference: from Residual Weight Generation to Slimmable Fine-tuning

The rigid computational requirements of traditional neural networks often conflict with the need for hardware-agnostic deployment across a broad range of diverse real-world systems. Frameworks like Slimmable Neural Networks offer a dynamic alternative within a single model but traditionally require training from scratch, failing to leverage the presence of existing pre-trained weights. This work investigates adapting pre-trained models for runtime flexibility. We start from Neural Metamorphosis, a generative approach that uses hypernetworks to produce weights for various sub-network configurations. To address scalability issues in generating weights for larger architectures, we incorporate ideas from classical pruning and reframed the task as a residual problem. Instead of generating weights from scratch, the hypernetwork produces an additive change to be applied to the pre-trained weights, making the optimization landscape more tractable. Building on the insights of this first part, we transition to a strategy of direct fine-tuning. Specifically, we explore the possibility of adapting static, pre-trained neural networks into dynamic ones by applying the training principles of Slimmable Networks. This progression highlights the challenges of scaling weight generation and evaluates the viability of converting standard pre-trained models into dynamic ones without the necessity of full retraining.

Adapting Pre-trained Neural Networks for Dynamic Inference: from Residual Weight Generation to Slimmable Fine-tuning

MORIELLO, PIETRO

2024/2025

Abstract

The rigid computational requirements of traditional neural networks often conflict with the need for hardware-agnostic deployment across a broad range of diverse real-world systems. Frameworks like Slimmable Neural Networks offer a dynamic alternative within a single model but traditionally require training from scratch, failing to leverage the presence of existing pre-trained weights. This work investigates adapting pre-trained models for runtime flexibility. We start from Neural Metamorphosis, a generative approach that uses hypernetworks to produce weights for various sub-network configurations. To address scalability issues in generating weights for larger architectures, we incorporate ideas from classical pruning and reframed the task as a residual problem. Instead of generating weights from scratch, the hypernetwork produces an additive change to be applied to the pre-trained weights, making the optimization landscape more tractable. Building on the insights of this first part, we transition to a strategy of direct fine-tuning. Specifically, we explore the possibility of adapting static, pre-trained neural networks into dynamic ones by applying the training principles of Slimmable Networks. This progression highlights the challenges of scaling weight generation and evaluates the viability of converting standard pre-trained models into dynamic ones without the necessity of full retraining.

Scheda breve

Scheda completa

Scheda completa (DC)

	Facoltà/Dipartimento
	
				Dipartimento di Ingegneria "Enzo Ferrari"
			
	Corso di studio
	
				Artificial intelligence engineering
			
	Anno Accademico
	
				2024
			
	Parola chiave
	
				Dynamic Networks
Hypernetworks
Efficient Inference
Slimmable Networks
Model Adaptation
			
	Relatore
	
				CALDERARA, SIMONE
			
	Controrelatore
	
				BUZZEGA, PIETRO
PORRELLO, ANGELO
			
	Appare nelle tipologie:
	
				Lauree Magistrali

File in questo prodotto:

File	Dimensione	Formato
Moriello.Pietro.pdf accesso aperto Dimensione 1.4 MB Formato Adobe PDF Visualizza/Apri	1.4 MB	Adobe PDF	Visualizza/Apri

I documenti in UNITESI sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14251/5405