The well-known Gumbel-Max Trick for sampling elements from a categorical...
More and more evidence has shown that strengthening layer interactions c...
Time-series is ubiquitous across applications, such as transportation,
f...
The LSTM network was proposed to overcome the difficulty in learning
lon...
The well-known Gumbel-Max Trick for sampling elements from a categorical...
Autoregressive networks can achieve promising performance in many sequen...