Pre-trained large language models (PLMs) underlie most new developments ...
Cross-lingual summarization consists of generating a summary in one lang...
The availability of large, high-quality datasets has been one of the mai...
We propose an adaptation of the curriculum training framework, applicabl...
Transformer is a popularly used neural network architecture, especially ...
Semantic parsing over multiple knowledge bases enables a parser to explo...
Converting an n-dimensional vector to a probability distribution over n
...
The goal behind Domain Adaptation (DA) is to leverage the labeled exampl...
Existing Natural Language Generation (NLG) systems are weak AI systems a...
Dirichlet Process(DP) is a Bayesian non-parametric prior for infinite mi...