On the Random Subset Sum Problem and Neural Networks

Emanuele Natale jointly with A. Da Cunha & L. Viennot

23 March 2023

Better Bound for SLTH

(Assume $x$ and the $w'_i$ are positive.)

$y= \sum_{i} w_i w'_i x$

Pensia et al. (NeurIPS 2020): find a combination of random weights close to $w$.

An alternative appears in Orseau et al. (NeurIPS 2020).

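[Figure: input $x$ connected to output $y$ through $n$ intermediate nodes, with first-layer weights $w'_1,\dots,w'_n$ and second-layer weights $w_1,\dots,w_n$.]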
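To make the idea of "finding a combination of random weights close to $w$" concrete, here is a minimal sketch. It assumes the products $w_i w'_i$ play the role of the random samples and searches all keep/prune masks by brute force, which is only feasible for small $n$. The names `best_mask` and `target_w`, the choice $n=12$ and the sampling ranges are hypothetical and chosen for illustration; this is not the construction used in the cited papers.

```python
import itertools
import random

def best_mask(target, w_out, w_in):
    """Brute-force the set of kept paths whose products w_i * w'_i best sum to `target`."""
    products = [a * b for a, b in zip(w_out, w_in)]
    best, best_err = (), abs(target)  # empty mask: every path is pruned
    for r in range(1, len(products) + 1):
        for subset in itertools.combinations(range(len(products)), r):
            err = abs(target - sum(products[i] for i in subset))
            if err < best_err:
                best, best_err = subset, err
    return best, best_err

rng = random.Random(0)
n = 12                                               # hypothetical width
w_in = [rng.uniform(0.0, 1.0) for _ in range(n)]     # w'_i, positive as assumed on the slide
w_out = [rng.uniform(-1.0, 1.0) for _ in range(n)]   # w_i
target_w = 0.37                                      # hypothetical target weight

kept, err = best_mask(target_w, w_out, w_in)
x = 2.0                                              # a positive input
y = sum(w_out[i] * w_in[i] * x for i in kept)        # output of the pruned network
print("kept paths:", kept)
print("weight error:", err, "output error:", abs(y - target_w * x))
```

The output error equals the weight error scaled by $x$, which is why approximating $w$ suffices to approximate $y$.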

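RSSP (Random Subset Sum Problem). Given i.i.d. random variables $X_1,...,X_n$, with prob. $1-\epsilon$, for each $z\in [-1,1]$ find a subset $S\subseteq\{1,...,n\}$ such that $|z-\sum_{i\in S} X_i |\leq \epsilon.$

Lueker '98. For suitable distributions (e.g. uniform on $[-1,1]$), a solution exists with prob. $1-\epsilon$ if $n\geq C\log \frac 1{\epsilon}$ for a suitable constant $C$, i.e. $n=O(\log \frac 1{\epsilon})$ samples suffice.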
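The statement can be probed empirically. The sketch below assumes uniform samples on $[-1,1]$ and only checks a finite grid of targets $z$, so it illustrates the result rather than verifying it. The helper names `subset_sums` and `worst_gap` and the grid size are hypothetical.

```python
import bisect
import random

def subset_sums(xs):
    """All 2^n subset sums of xs, sorted (the empty subset contributes 0)."""
    sums = [0.0]
    for x in xs:
        sums += [s + x for s in sums]
    return sorted(sums)

def worst_gap(xs, lo=-1.0, hi=1.0, grid=2001):
    """Largest distance from a grid point z in [lo, hi] to its nearest subset sum."""
    sums = subset_sums(xs)
    worst = 0.0
    for k in range(grid):
        z = lo + (hi - lo) * k / (grid - 1)
        j = bisect.bisect_left(sums, z)
        candidates = sums[max(j - 1, 0): j + 1]   # neighbours of z in the sorted list
        worst = max(worst, min(abs(z - s) for s in candidates))
    return worst

rng = random.Random(0)
xs = [rng.uniform(-1.0, 1.0) for _ in range(16)]
for n in (4, 8, 12, 16):
    print(n, worst_gap(xs[:n]))   # the gap can only shrink as samples are added
```

Since the subset sums of a prefix are contained in those of the full sample, the printed gap is non-increasing in $n$.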

RSS - Proof Idea 1/2

If $n\geq C\log \frac 1{\epsilon}$ for a suitable constant $C$ (so $n=O(\log \frac 1{\epsilon})$ samples suffice), given $X_1,...,X_n$ i.i.d. uniform on $[-1,1]$, with prob. $1-\epsilon$, for each $z\in [-\frac 12, \frac 12]$ there is $S\subseteq\{1,...,n\}$ such that $|z-\sum_{i\in S} X_i |\leq \epsilon.$

Let $f_t(z)=\mathbf 1(z\in (-\frac 12, \frac 12),\exists S\subseteq\{1,...,t\}: |z-\sum_{i\in S} X_i |\leq \epsilon)$

then $f_t(z)=f_{t-1}(z)+(1-f_{t-1}(z))f_{t-1}(z-X_t)$ .

Observation: if we can approximate any $z\in (a,b)$ and we add $X'$ to the sample, then we can approximate any $z\in (a,b) \cup (a+X',b+X')$.
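The observation and the recurrence for $f_t$ can be sketched with interval arithmetic: the covered set is kept as a list of disjoint intervals inside $(-\frac 12,\frac 12)$, and each new sample adds a shifted, clipped copy of it. The helpers `merge`, `add_sample` and `measure`, the value of `eps` and the number of samples are hypothetical choices for illustration, and samples are assumed uniform on $[-1,1]$.

```python
import random

LO, HI = -0.5, 0.5

def merge(intervals):
    """Union of intervals, returned as a sorted list of disjoint intervals."""
    out = []
    for a, b in sorted(intervals):
        if out and a <= out[-1][1]:
            out[-1] = (out[-1][0], max(out[-1][1], b))
        else:
            out.append((a, b))
    return out

def add_sample(covered, x):
    """New coverage = old coverage, plus the old coverage shifted by x and clipped to (LO, HI)."""
    shifted = [(max(a + x, LO), min(b + x, HI)) for a, b in covered]
    shifted = [(a, b) for a, b in shifted if a < b]
    return merge(covered + shifted)

def measure(covered):
    """Total length of the covered set, i.e. v_t."""
    return sum(b - a for a, b in covered)

eps = 0.01
rng = random.Random(1)
covered = [(-eps, eps)]            # the empty subset approximates every z within eps of 0
history = [measure(covered)]
for t in range(1, 31):
    covered = add_sample(covered, rng.uniform(-1.0, 1.0))
    history.append(measure(covered))
print([round(v, 3) for v in history])
```

The printed sequence is the covered fraction of $(-\frac 12,\frac 12)$ after each sample; it never decreases and is bounded by 1.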

RSS - Proof Idea 2/2

Let $v_t=\int_{-\frac 12}^{\frac 12}f_t(z)dz$. For $z\in(-\frac 12, \frac 12)$, $f_t(z)=f_{t-1}(z)+(1-f_{t-1}(z))f_{t-1}(z-X_t)$, and $X_t\sim\text{Uniform}(-1,1)$ has density $\frac 12$ on $[-1,1]$, so

$\mathbb E[v_t\,|\,X_{t-1},...,X_1]=\int_{-\frac 12}^{\frac 12}f_{t-1}(z)dz+\mathbb E[\int_{-\frac 12}^{\frac 12}(1-f_{t-1}(z))f_{t-1}(z-X_t)dz\,|\,X_{t-1},...,X_1]$

$=v_{t-1}+\frac 12 \int_{-1}^{1}[\int_{-\frac 12}^{\frac 12}(1-f_{t-1}(z))f_{t-1}(z-x)dz]dx$

$=v_{t-1}+\frac 12 \int_{-\frac 12}^{\frac 12}(1-f_{t-1}(z))[\int_{-1}^{1}f_{t-1}(z-x)dx]dz$ (exchanging the integrals)

$=v_{t-1}+\frac 12 \int_{-\frac 12}^{\frac 12}(1-f_{t-1}(z))[\int_{z-1}^{z+1}f_{t-1}(s)ds]dz$ (substituting $s = z-x$)

$=v_{t-1}+\frac 12 \int_{-\frac 12}^{\frac 12}(1-f_{t-1}(z))[\int_{-\frac 12}^{\frac 12}f_{t-1}(s)ds]dz$ (since $f_{t-1}$ vanishes outside $(-\frac 12,\frac 12)\subseteq(z-1,z+1)$)

$=v_{t-1}+\frac 12 (1-v_{t-1})v_{t-1}.$

Reference: "Revisiting the Random Subset Sum problem", https://hal.science/hal-03654720/
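The identity $\mathbb E[v_t\,|\,X_{t-1},...,X_1]=v_{t-1}+\frac 12 (1-v_{t-1})v_{t-1}$ can be checked numerically for a fixed covered set. The sketch below assumes $X_t$ is uniform on $[-1,1]$, picks an arbitrary set for $f_{t-1}$, and averages the new coverage over a fine midpoint grid of values of $x$. The covered set, the grid size `N` and the helper names are hypothetical.

```python
LO, HI = -0.5, 0.5

def merge(intervals):
    """Union of intervals, returned as a sorted list of disjoint intervals."""
    out = []
    for a, b in sorted(intervals):
        if out and a <= out[-1][1]:
            out[-1] = (out[-1][0], max(out[-1][1], b))
        else:
            out.append((a, b))
    return out

def add_sample(covered, x):
    """Coverage after adding sample x: old set, plus its shift by x clipped to (LO, HI)."""
    shifted = [(max(a + x, LO), min(b + x, HI)) for a, b in covered]
    shifted = [(a, b) for a, b in shifted if a < b]
    return merge(covered + shifted)

def measure(covered):
    return sum(b - a for a, b in covered)

covered = [(-0.3, -0.1), (0.05, 0.2)]   # an arbitrary f_{t-1}, chosen for illustration
v_prev = measure(covered)

# Midpoint rule for (1/2) * integral over x in [-1, 1] of the new coverage.
N = 20000
h = 2.0 / N
avg = sum(measure(add_sample(covered, -1.0 + (k + 0.5) * h)) for k in range(N)) / N

predicted = v_prev + 0.5 * (1.0 - v_prev) * v_prev
print(avg, predicted)
```

The two printed numbers should agree up to the discretization error of the grid over $x$.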

SLTH for Convolutional Neural Networks

Theorem (da Cunha et al., ICLR 2022).

Given $\epsilon,\delta>0$ , any CNN with $k$ parameters and $\ell$ layers, and kernels with $\ell_1$ norm at most 1, can be approximated within error $\epsilon$ by pruning a random CNN with $O\bigl(k\log \frac{k\ell}{\min\{\epsilon,\delta\}}\bigr)$ parameters and $2\ell$ layers with probability at least $1-\delta$ .

Proof Idea 1/2

For any $K\in [-1,1]^{d\times d\times c\times1}$ with $\|K\|_{1}\leq1$ and $X\in [0,1]^{D\times D\times c}$, we want to approximate $K*X$ with a pruned version of $V*\sigma(U*X)$, where $\sigma$ is the ReLU and $U$ and $V$ are tensors with i.i.d. $\text{Uniform}(-1,1)$ entries.

Let $U$ be $d\times d \times c\times n$ and $V$ be $1\times 1 \times n \times 1$ .

Proof Idea 2/2

$$\begin{aligned}
\left(V*\left(U*X\right)\right)_{r,s,1} &=\sum_{t=1}^{n}V_{1,1,t,1}\cdot\left(U*X\right)_{r,s,t}\\
&=\sum_{t=1}^{n}V_{1,1,t,1}\cdot\left(\sum_{i,j\in\left[d\right],k\in\left[c\right]}U_{i,j,k,t}\cdot X_{r-i+1,s-j+1,k}\right)\\
&=\sum_{t=1}^{n}\sum_{i,j\in\left[d\right],k\in\left[c\right]}\left(V_{1,1,t,1}\cdot U_{i,j,k,t}\right)\cdot X_{r-i+1,s-j+1,k}\\
&=\sum_{i,j\in\left[d\right],k\in\left[c\right]}\left(\sum_{t=1}^{n}V_{1,1,t,1}\cdot U_{i,j,k,t}\right)\cdot X_{r-i+1,s-j+1,k}\\
&=\sum_{i,j\in\left[d\right],k\in\left[c\right]}L_{i,j,k,1}\cdot X_{r-i+1,s-j+1,k}
\end{aligned}$$

where $L_{i,j,k,1}=\sum_{t=1}^{n}V_{1,1,t,1}\cdot U_{i,j,k,t}$
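The identity $V*(U*X)=L*X$ can be checked numerically on small tensors. The sketch below is plain Python with nested lists, uses 0-based indices, evaluates the convolution only at positions where the kernel fits inside a square input, and picks hypothetical sizes $d=3$, $c=2$, $n=5$, $D=6$. The helpers `conv` and `rand_tensor` are illustrative names.

```python
import random

def conv(K, X):
    """(K*X)[r][s][t] = sum over i, j, k of K[i][j][k][t] * X[r-i][s-j][k] (valid positions only)."""
    d, c, n = len(K), len(K[0][0]), len(K[0][0][0])
    D = len(X)
    return [[[sum(K[i][j][k][t] * X[r - i][s - j][k]
                  for i in range(d) for j in range(d) for k in range(c))
              for t in range(n)]
             for s in range(d - 1, D)]
            for r in range(d - 1, D)]

def rand_tensor(rng, shape, lo, hi):
    """Nested list of the given shape with i.i.d. Uniform(lo, hi) entries."""
    if len(shape) == 1:
        return [rng.uniform(lo, hi) for _ in range(shape[0])]
    return [rand_tensor(rng, shape[1:], lo, hi) for _ in range(shape[0])]

rng = random.Random(0)
d, c, n, D = 3, 2, 5, 6                           # hypothetical small sizes
U = rand_tensor(rng, (d, d, c, n), -1.0, 1.0)     # d x d x c x n
V = rand_tensor(rng, (1, 1, n, 1), -1.0, 1.0)     # 1 x 1 x n x 1
X = rand_tensor(rng, (D, D, c), 0.0, 1.0)

# L[i][j][k][0] = sum over t of V[0][0][t][0] * U[i][j][k][t]
L = [[[[sum(V[0][0][t][0] * U[i][j][k][t] for t in range(n))]
       for k in range(c)] for j in range(d)] for i in range(d)]

two_layer = conv(V, conv(U, X))   # V * (U * X)
one_layer = conv(L, X)            # L * X
max_diff = max(abs(two_layer[r][s][0] - one_layer[r][s][0])
               for r in range(len(one_layer)) for s in range(len(one_layer)))
print(max_diff)
```

The difference is only floating-point rounding, reflecting that the composition of the two linear convolutions collapses to a single kernel $L$ whose entries are sums of products of random weights.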

Prune the negative entries of $U$ so that every entry of $U*X$ is nonnegative (recall $X\in [0,1]^{D\times D\times c}$) and hence $\sigma(U*X)=U*X$.
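A small numerical check of the pruning step, assuming $\sigma$ is the ReLU and the input entries lie in $[0,1]$. Zeroing the negative entries of $U$ leaves only nonnegative terms in every sum, so the activation has no effect. Tensor sizes and helper names are hypothetical, and the convolution is evaluated at valid positions only.

```python
import random

def conv(K, X):
    """(K*X)[r][s][t] = sum over i, j, k of K[i][j][k][t] * X[r-i][s-j][k] (valid positions only)."""
    d, c, n = len(K), len(K[0][0]), len(K[0][0][0])
    D = len(X)
    return [[[sum(K[i][j][k][t] * X[r - i][s - j][k]
                  for i in range(d) for j in range(d) for k in range(c))
              for t in range(n)]
             for s in range(d - 1, D)]
            for r in range(d - 1, D)]

def relu(T):
    """Entrywise ReLU of a 3-dimensional nested list."""
    return [[[max(v, 0.0) for v in row] for row in plane] for plane in T]

rng = random.Random(0)
d, c, n, D = 3, 2, 5, 6   # hypothetical small sizes
U = [[[[rng.uniform(-1.0, 1.0) for _ in range(n)]
       for _ in range(c)] for _ in range(d)] for _ in range(d)]
X = [[[rng.uniform(0.0, 1.0) for _ in range(c)]
      for _ in range(D)] for _ in range(D)]

# Pruning = setting the negative entries of U to zero.
U_pruned = [[[[u if u > 0.0 else 0.0 for u in U[i][j][k]]
              for k in range(c)] for j in range(d)] for i in range(d)]

pre = conv(U_pruned, X)
post = relu(pre)
print("ReLU is the identity after pruning:", post == pre)
```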


Deep Learning on the Edge

Roadmap

Dense ANNs

Compressing ANN

Neural Network Pruning

The Lottery Ticket Hypothesis

Roadmap

The Strong LTH

Formalizing the SLTH

Proving the SLTH

Malach et al.'s Idea

Better Bound for SLTH

RSS - Proof Idea 1/2

RSS - Proof Idea 2/2

Roadmap

Convolutional Neural Network

2D Discrete Convolution

SLTH for Convolutional Neural Networks

Proof Idea 1/2

Proof Idea 2/2

Roadmap

Reducing Energy: the Hardware Way

Letting Physics Do The Math

Resistive Crossbar Device

"Résistance équivalente modulable à partir de résistances imprécises" (Tunable equivalent resistance from imprecise resistors)

RSS in Practice

Conclusions

Thank you

GSSI 2023
