Existing full text datasets of U.S. public domain newspapers do not reco...
Large language models have introduced exciting new opportunities and
cha...
The volume of scientific output is creating an urgent need for automated...
With the advent of large language models, methods for abstractive
summar...
Abstractive summarization systems today produce fluent and relevant outp...
Classifying the core textual components of a scientific paper-title, aut...
Recent advances in document image analysis (DIA) have been primarily dri...
Adobe's Portable Document Format (PDF) is a popular way of distributing
...
In layout object detection problems, the ground-truth datasets are
const...
Deep learning-based approaches for automatic document layout analysis an...
We present an algorithm to generate diverse foreground objects and compo...