b'Xuezhe Ma'

research

∙ 06/12/2023

RECAP: Retrieval-Enhanced Context-Aware Prefix Encoder for Personalized Dialogue Response Generation

Endowing chatbots with a consistent persona is essential to an engaging ...

0 Shuai Liu, et al. ∙

research

∙ 05/22/2023

Look-back Decoding for Open-Ended Text Generation

Given a prefix (context), open-ended generation aims to decode texts tha...

0 Nan Xu, et al. ∙

research

∙ 05/18/2023

LIMA: Less Is More for Alignment

Large language models are trained in two stages: (1) unsupervised pretra...

0 Chunting Zhou, et al. ∙

research

∙ 12/16/2022

On Human Visual Contrast Sensitivity and Machine Vision Robustness: A Comparative Study

It is well established in neuroscience that color vision plays an essent...

0 Ming-Chang Chiu, et al. ∙

research

∙ 12/16/2022

Better May Not Be Fairer: Can Data Augmentation Mitigate Subgroup Degradation?

It is no secret that deep learning models exhibit undesirable behaviors ...

0 Ming-Chang Chiu, et al. ∙

research

∙ 10/19/2022

Improving Stability of Fine-Tuning Pretrained Language Models via Component-Wise Gradient Norm Clipping

Fine-tuning over large pretrained language models (PLMs) has established...

0 Chenghao Yang, et al. ∙

research

∙ 09/21/2022

Mega: Moving Average Equipped Gated Attention

The design choices in the Transformer attention mechanism, including wea...

2 Xuezhe Ma, et al. ∙

research

∙ 05/25/2022

Know Where You're Going: Meta-Learning for Parameter-Efficient Fine-tuning

A recent family of techniques, dubbed as lightweight fine-tuning methods...

0 Mozhdeh Gheini, et al. ∙

research

∙ 04/29/2022

Prompt Consistency for Zero-Shot Task Generalization

One of the most impressive results of recent NLP history is the ability ...

0 Chunting Zhou, et al. ∙

research

∙ 02/18/2022

Learning Representations Robust to Group Shifts and Adversarial Examples

Despite the high performance achieved by deep neural networks on various...

0 Ming-Chang Chiu, et al. ∙

research

∙ 10/08/2021

Towards a Unified View of Parameter-Efficient Transfer Learning

Fine-tuning large pre-trained language models on downstream tasks has be...

0 Junxian He, et al. ∙

research

∙ 06/14/2021

Examining and Combating Spurious Features under Distribution Shift

A central goal of machine learning is to learn robust representations th...

0 Chunting Zhou, et al. ∙

research

∙ 06/03/2021

Luna: Linear Unified Nested Attention

The quadratic computational and memory complexities of the Transformer's...

31 Xuezhe Ma, et al. ∙

research

∙ 06/02/2021

COM2SENSE: A Commonsense Reasoning Benchmark with Complementary Sentences

Commonsense reasoning is intuitive for humans but has been a long-term c...

9 Shikhar Singh, et al. ∙

research

∙ 02/03/2021

DiSCoL: Toward Engaging Dialogue Systems through Conversational Line Guided Response Generation

Having engaging and informative conversations with users is the utmost g...

10 Sarik Ghazarian, et al. ∙

research

∙ 09/28/2020

Apollo: An Adaptive Parameter-wise Diagonal Quasi-Newton Method for Nonconvex Stochastic Optimization

In this paper, we introduce Apollo, a quasi-Newton method for nonconvex ...

0 Xuezhe Ma, et al. ∙

research

∙ 09/20/2019

Cross-lingual Dependency Parsing with Unlabeled Auxiliary Languages

Cross-lingual transfer learning has become an important weapon to battle...

0 Wasi Uddin Ahmad, et al. ∙

research

∙ 09/05/2019

FlowSeq: Non-Autoregressive Conditional Sequence Generation with Generative Flow

Most sequence-to-sequence (seq2seq) models are autoregressive; they gene...

0 Xuezhe Ma, et al. ∙

research

∙ 08/30/2019

Handling Syntactic Divergence in Low-resource Machine Translation

Despite impressive empirical successes of neural machine translation (NM...

0 Chunting Zhou, et al. ∙

research

∙ 05/29/2019

Choosing Transfer Languages for Cross-Lingual Learning

Cross-lingual transfer, where a high-resource transfer language is used ...

0 Yu-Hsiang Lin, et al. ∙

research

∙ 04/04/2019

Density Matching for Bilingual Word Embedding

Recent approaches to cross-lingual word embedding have generally been ba...

0 Chunting Zhou, et al. ∙

research

∙ 02/12/2019

MaCow: Masked Convolutional Generative Flow

Flow-based generative models, conceptually attractive due to tractabilit...

2 Xuezhe Ma, et al. ∙

research

∙ 01/06/2019

MAE: Mutual Posterior-Divergence Regularization for Variational AutoEncoders

Variational Autoencoder (VAE), a simple and effective deep generative mo...

6 Xuezhe Ma, et al. ∙

research

∙ 11/01/2018

Near or Far, Wide Range Zero-Shot Cross-Lingual Dependency Parsing

Cross-lingual transfer is the major means toleverage knowledge from high...

0 Wasi Uddin Ahmad, et al. ∙

research

∙ 09/04/2018

Texar: A Modularized, Versatile, and Extensible Toolkit for Text Generation

We introduce Texar, an open-source toolkit aiming to support the broad s...

0 Zhiting Hu, et al. ∙

research

∙ 05/03/2018

Stack-Pointer Networks for Dependency Parsing

We introduce a novel architecture for dependency parsing: stack-pointer ...

0 Xuezhe Ma, et al. ∙

research

∙ 05/19/2017

Softmax Q-Distribution Estimation for Structured Prediction: A Theoretical Interpretation for RAML

Reward augmented maximum likelihood (RAML), a simple and effective learn...

0 Xuezhe Ma, et al. ∙

research

∙ 04/19/2017

An Interpretable Knowledge Transfer Model for Knowledge Base Completion

Knowledge bases are important resources for a variety of natural languag...

0 Qizhe Xie, et al. ∙

research

∙ 01/04/2017

Neural Probabilistic Model for Non-projective MST Parsing

In this paper, we propose a probabilistic parsing model, which defines a...

0 Xuezhe Ma, et al. ∙

research

∙ 09/26/2016

Dropout with Expectation-linear Regularization

Dropout, a simple and effective way to train deep neural networks, has l...

0 Xuezhe Ma, et al. ∙

research

∙ 03/21/2016

Harnessing Deep Neural Networks with Logic Rules

Combining deep neural networks with structured logic rules is desirable ...

0 Zhiting Hu, et al. ∙

research

∙ 03/15/2016

Unsupervised Ranking Model for Entity Coreference Resolution

Coreference resolution is one of the first stages in deep language under...

0 Xuezhe Ma, et al. ∙

research

∙ 03/04/2016

End-to-end Sequence Labeling via Bi-directional LSTM-CNNs-CRF

State-of-the-art sequence labeling systems traditionally require large a...

0 Xuezhe Ma, et al. ∙

research

∙ 02/14/2015

Probabilistic Models for High-Order Projective Dependency Parsing

This paper presents generalized probabilistic models for high-order proj...

0 Xuezhe Ma, et al. ∙

Xuezhe Ma

Featured Co-authors

Sign in with Google

Consider DeepAI Pro