Jaisidh Singh

Hypernetworks can generate INRs & model functional similarity

TLDR

Hypernetworks can be interpreted as (1) conditional generators of implicit neural representations (INRs), and (2) quantifiers of functional similarity between networks.
I've worked on hypernetworks (neural networks that parameterize other neural networks) for a little while, which has led to a workshop paper at ICLR 2025 and a main conference paper at EMNLP 2025. While working on these papers, I've had the time to think about hypernetworks in a couple of fascinating ways, described below.

Hypernetworks as generators of INRs

Implicit neural representations (INRs) are a clever and rather under-appreciated class of representations. In contrast to the popular approach of predicting representations, INRs are representations themselves: given f(x)=y, if we train a neural network gθ to predict y from x, the parameters θ of this network implicitly represent the function f(·).

Hence, when learning several functions f1, …, fn, we can instead predict the weights (i.e., the INRs) for each fi using a generating function H(·), with H(i)=θi such that gθi(x)=fi(x). From the deep learning perspective, this generating function H(·) is called a hypernetwork.
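To make this concrete, here's a minimal PyTorch sketch of a hypernetwork that maps a function index i to the parameters θi of a tiny INR. All layer sizes and the one-hot conditioning are illustrative choices, not from any particular paper:

```python
import torch
import torch.nn as nn

class Hypernetwork(nn.Module):
    """Maps a conditioning code (here, a one-hot function index i)
    to the parameters theta_i of a small INR g_theta."""
    def __init__(self, cond_dim: int, inr_hidden: int = 32):
        super().__init__()
        # Shapes of the target INR g_theta: R^1 -> R^1, one hidden layer.
        self.shapes = [
            ("w1", (inr_hidden, 1)), ("b1", (inr_hidden,)),
            ("w2", (1, inr_hidden)), ("b2", (1,)),
        ]
        n_params = sum(torch.Size(s).numel() for _, s in self.shapes)
        self.net = nn.Sequential(
            nn.Linear(cond_dim, 64), nn.ReLU(), nn.Linear(64, n_params)
        )

    def forward(self, cond: torch.Tensor) -> dict:
        # Predict one flat parameter vector, then carve it into tensors.
        flat = self.net(cond)
        params, offset = {}, 0
        for name, shape in self.shapes:
            n = torch.Size(shape).numel()
            params[name] = flat[offset:offset + n].view(shape)
            offset += n
        return params

def inr_forward(params: dict, x: torch.Tensor) -> torch.Tensor:
    """Run the generated INR g_theta_i on inputs x of shape [N, 1]."""
    h = torch.relu(x @ params["w1"].T + params["b1"])
    return h @ params["w2"].T + params["b2"]

# Generate the INR for "function index" i = 1, encoded as a one-hot vector.
H = Hypernetwork(cond_dim=4)
theta_i = H(torch.eye(4)[0])                 # parameters theta_1
y = inr_forward(theta_i, torch.randn(8, 1))
print(y.shape)                               # torch.Size([8, 1])
```

Note that gradients flow through the predicted parameters back into H, so training the INR's outputs trains the hypernetwork.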

This ability to conditionally generate INRs is what makes hypernetworks strongly applicable in physics-informed machine learning. In particular, we can see how the process is just another formulation of partial differential equations (PDEs), which denote time-varying functions of space νt(x) in differential form.

Hypernetworks can very well predict spatial functions given a timestep: H(t)=θt such that gθt(x)=νt(x). Since they conditionally generate an INR corresponding to the function at timestep t, they can be used to forecast PDEs. Indeed, they've been used to do so in a very interesting NeurIPS paper that I've presented in a tutorial here.
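A minimal PyTorch sketch of this time-conditioned setup, where H takes a scalar t and emits the flattened weights of a tiny spatial INR (layer sizes and the tanh activation are arbitrary choices):

```python
import torch
import torch.nn as nn

# Hypernetwork H: scalar timestep t -> flattened weights of a tiny
# INR g_theta_t: R^1 -> R^1 with one hidden layer of width `hidden`.
hidden = 16
n_params = (hidden * 1 + hidden) + (1 * hidden + 1)   # w1, b1, w2, b2
H = nn.Sequential(nn.Linear(1, 32), nn.Tanh(), nn.Linear(32, n_params))

def nu_t(t: float, x: torch.Tensor) -> torch.Tensor:
    """Evaluate the generated spatial function nu_t at points x: [N, 1]."""
    p = H(torch.tensor([[t]])).squeeze(0)
    w1 = p[:hidden].view(hidden, 1)
    b1 = p[hidden:2 * hidden]
    w2 = p[2 * hidden:3 * hidden].view(1, hidden)
    b2 = p[3 * hidden:]
    h = torch.tanh(x @ w1.T + b1)
    return h @ w2.T + b2

x = torch.linspace(0, 1, 10).unsqueeze(1)   # spatial grid
out = nu_t(0.5, x)                          # the field at t = 0.5
print(out.shape)                            # torch.Size([10, 1])
```

Training would fit nu_t(t, x) to observed (or PDE-constrained) field values over many (t, x) pairs; at forecast time, you simply query H at unseen timesteps.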

Hypernetworks as quantifiers of functional similarity

Let's say that we wish to know the functional similarity between two encoders fA, fB: ℝ^m → ℝ^d. The most straightforward way to do this would be to collect a stack of encodings for N inputs from each encoder and use a similarity function like Centered Kernel Alignment (CKA) on these features: s1 = CKA(OA, OB), where OA, OB ∈ ℝ^{N×d} are the stacked encodings for the N inputs and s1 ∈ [0,1] is a score denoting the functional similarity between the two models.
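For reference, here is a minimal NumPy sketch of linear CKA on two stacks of encodings (the dimensions and random data are purely illustrative):

```python
import numpy as np

def linear_cka(X: np.ndarray, Y: np.ndarray) -> float:
    """Linear CKA between feature stacks X: [N, d_A] and Y: [N, d_B].
    Returns a similarity score in [0, 1]."""
    # Center each feature dimension over the N samples.
    X = X - X.mean(axis=0)
    Y = Y - Y.mean(axis=0)
    # HSIC-based formulation for the linear kernel.
    num = np.linalg.norm(X.T @ Y, "fro") ** 2
    den = np.linalg.norm(X.T @ X, "fro") * np.linalg.norm(Y.T @ Y, "fro")
    return float(num / den)

rng = np.random.default_rng(0)
O_A = rng.normal(size=(100, 16))            # encodings from f_A
O_B = O_A @ rng.normal(size=(16, 16))       # a linear map of O_A
O_C = rng.normal(size=(100, 16))            # unrelated encodings

print(linear_cka(O_A, O_A))                 # identical features -> 1
print(linear_cka(O_A, O_B), linear_cka(O_A, O_C))
```

Identical features score 1, and unrelated random features score close to 0, which is what makes CKA a convenient baseline for s1.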

However, let's think of another way to quantify the functional similarity between fA and fB: specifically, by using a hypernetwork H that predicts linear classifiers WA, WB ∈ ℝ^{d×k} on top of the two encoders fA, fB. Here, k is the number of classes.

Writing the composite function gj(·)=Wj fj(·), j∈{A,B} to classify the inputs, the scheme that lets the hypernetwork depict functional similarity between fA and fB is: train H using only the classification loss of gA, while tracking the loss of gB under the classifier weights H predicts for it.

In other words, if training the hypernetwork using only network A lowers the loss of network B as well, then networks A and B can be called functionally similar.

This is because the hypernetwork is able to parameterise both of them well from a common state achieved using only network A.
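Here's a toy PyTorch sketch of this idea, not the exact scheme from either paper: fB is constructed as a rotation of fA, so the two encoders are functionally similar by design; the hypernetwork (a shared MLP over per-encoder embeddings, an assumption of mine) is trained using only network A's loss, while network B's loss is tracked as the similarity probe:

```python
import torch
import torch.nn as nn

torch.manual_seed(0)
d, k, m, N = 8, 3, 5, 64

# Two frozen "encoders"; f_B is an orthogonal rotation of f_A, so they
# are functionally equivalent up to a linear map (a toy construction).
f_A, f_B = nn.Linear(m, d), nn.Linear(m, d)
with torch.no_grad():
    Q, _ = torch.linalg.qr(torch.randn(d, d))
    f_B.weight.copy_(Q @ f_A.weight)
    f_B.bias.copy_(Q @ f_A.bias)
f_A.requires_grad_(False)
f_B.requires_grad_(False)

# Hypernetwork H: a learned embedding per encoder -> flat classifier W_j.
emb = nn.Embedding(2, 16)
H = nn.Sequential(nn.Linear(16, 64), nn.ReLU(), nn.Linear(64, d * k))

def loss_for(j: int, x: torch.Tensor, y: torch.Tensor) -> torch.Tensor:
    """Cross-entropy of g_j(x) = W_j f_j(x) with the predicted W_j."""
    W = H(emb(torch.tensor(j))).view(k, d)
    logits = (f_A if j == 0 else f_B)(x) @ W.T
    return nn.functional.cross_entropy(logits, y)

x = torch.randn(N, m)
y = torch.randint(0, k, (N,))
opt = torch.optim.Adam(list(H.parameters()) + list(emb.parameters()), lr=1e-2)

before_A, before_B = loss_for(0, x, y).item(), loss_for(1, x, y).item()
for _ in range(200):                  # train on network A's loss ONLY
    opt.zero_grad()
    loss_for(0, x, y).backward()
    opt.step()
after_A, after_B = loss_for(0, x, y).item(), loss_for(1, x, y).item()
print(f"A: {before_A:.3f} -> {after_A:.3f}, B: {before_B:.3f} -> {after_B:.3f}")
```

The drop (or lack of drop) in B's loss is the similarity signal: H was never shown B's gradients, so any improvement on B must come from the shared hypernetwork state shaped by A alone.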


The above interpretations of hypernetworks are avenues that I think are rather under-utilised in the current literature, and can offer creative and potentially powerful ways of modulating neural networks. I'd be happy to know what you think of the above, or about parameter prediction in general. Hit me up for a chat on X/Twitter if you like.