SPARLING: Learning Latent Representations with Extremely Sparse Activations

02/03/2023
by Kavi Gupta, et al.

Real-world processes often contain intermediate state that can be modeled as an extremely sparse tensor. We introduce Sparling, a new kind of informational bottleneck that explicitly models this state by enforcing extreme activation sparsity. We additionally demonstrate that this technique can be used to learn the true intermediate representation with no additional supervision (i.e., from only end-to-end labeled examples), and thus improve the interpretability of the resulting models. On our DigitCircle domain, we are able to get an intermediate state prediction accuracy of 98.84
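To give a concrete feel for what "enforcing extreme activation sparsity" can mean, here is a minimal sketch in plain Python. It is an illustration only, not the method from the paper: the function name `sparsify`, the `density` parameter, and the rule of keeping the largest values are all assumptions made for this example. The abstract does not specify how the sparsity is enforced.

```python
def sparsify(activations, density):
    """Zero out all but roughly the top `density` fraction of activations.

    Hypothetical helper for illustration; not taken from the paper.
    Returns the sparsified list and the threshold that was applied.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    if not 0.0 < density <= 1.0:
        raise ValueError("density must be in (0, 1]")

    # Number of entries allowed to stay (always at least one).
    keep = max(1, round(density * len(activations)))

    # The threshold is the keep-th largest activation.
    threshold = sorted(activations, reverse=True)[keep - 1]

    # Entries below the threshold are zeroed. Ties at the threshold
    # are all kept, so slightly more than `keep` entries may survive.
    sparse = [a if a >= threshold else 0.0 for a in activations]
    return sparse, threshold


if __name__ == "__main__":
    acts = [0.1, 2.5, 0.3, 0.05, 1.7, 0.2, 0.0, 0.4, 0.15, 0.25]
    sparse, threshold = sparsify(acts, density=0.2)
    print("threshold:", threshold)
    print("sparse activations:", sparse)
```

With a very small `density`, almost every entry of the intermediate tensor is forced to zero, which is the sense in which such a layer acts as a bottleneck: only a handful of positions can carry information forward.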
