Publications

2023

Macaw-LLM: Multi-Modal Language Modeling with Image, Audio, Video, and Text Integration

Chenyang Lyu, Minghao Wu, Longyue Wang†, Xinting Huang, Bingshuai Liu, Zefeng Du, Shuming Shi, Zhaopeng Tu
arXiv 2023 [pdf] [code]

TextBind: Multi-turn Interleaved Multimodal Instruction-following

Huayang Li, Siheng Li, Deng Cai, Longyue Wang, Lemao Liu, Taro Watanabe, Yujiu Yang, Shuming Shi
arXiv 2023 [pdf] [code]

On the Cultural Gap in Text-to-Image Generation

Bingshuai Liu, Longyue Wang*, Chenyang Lyu, Yong Zhang, Jinsong Su, Shuming Shi, Zhaopeng Tu
arXiv 2023 [pdf] [code] [slides]

New Trends in Machine Translation using Large Language Models

Chenyang Lyu, Jitao Xu, Longyue Wang*
ACL 2023 [pdf]

On Extrapolation of Long-Text Translation with Large Language Models

Zefeng Du, Wenxiang Jiao, Longyue Wang*, Chenyang Lyu, Jianhui Pang, Leyang Cui, Kaiqiang Song, Derek F Wong, Shuming Shi, Zhaopeng Tu
ACL 2023 [pdf]

A Survey on Zero Pronoun Translation

Longyue Wang, Siyou Liu*, Mingzhou Xu, Linfeng Song, Shuming Shi, and Zhaopeng Tu
ACL 2023 [pdf]

Disco-Bench: A Discourse-Aware Evaluation Benchmark for Language Modelling

Longyue Wang, Donghuai Liu, Deng Cai, Dian Yu, Haiyun Jiang, Yan Wang, Leyang Cui, Shuming Shi, Zhaopeng Tu
arXiv 2023 [pdf]

Revisiting Non-Autoregressive Translation at Scale

Zhihao Wang, Longyue Wang*, Jinsong Su, Junfeng Yao, and Zhaopeng Tu
ACL 2023 (Findings) [pdf]

Effidit: An Assistant for Improving Writing Efficiency

Shuming Shi, Enbo Zhao, Wei Bi, Deng Cai, Leyang Cui, Xinting Huang, Haiyun Jiang, Duyu Tang, Kaiqiang Song, Longyue Wang, Chengyan Huang, Guoping Huang, Yan Wang, Piji Li
ACL 2023 (Demo) [pdf] [demo]

New Trends in Machine Translation using Large Language Models: Case Examples with ChatGPT

Chenyang Lyu, Jitao Xu, Longyue Wang
arXiv 2023 [pdf] [blog]

Document-Level Machine Translation with Large Language Models

Longyue Wang*, Chenyang Lyu*, Tianbo Ji*, Zhirui Zhang*, Dian Yu, Shuming Shi, Zhaopeng Tu
arXiv 2023 [pdf] [data]

Siren's Song in the AI Ocean: A Survey on Hallucination in Large Language Models

Yue Zhang, Yafu Li, Leyang Cui, Deng Cai, Lemao Liu, Tingchen Fu, Xinting Huang, Enbo Zhao, Yu Zhang, Yulong Chen, Longyue Wang, Anh Tuan Luu, Wei Bi, Freda Shi, Shuming Shi
arXiv 2023 [pdf]

Large Language Models Meet Harry Potter: A Bilingual Dataset for Aligning Dialogue Agents with Characters

Nuo Chen, Yan Wang, Haiyun Jiang, Deng Cai, Yuhan Li, Ziyang Chen, Longyue Wang, Jia Li
EMNLP 2023 [pdf]

A Benchmark for Text Expansion: Datasets, Metrics, and Baselines

Yi Chen, Haiyun Jiang, Wei Bi, Rui Wang, Longyue Wang, Shuming Shi, Ruifeng Xu
arXiv 2023 [pdf]

Deepfake Text Detection in the Wild

Yafu Li, Qintong Li, Leyang Cui, Wei Bi, Longyue Wang, Linyi Yang, Shuming Shi, Yue Zhang
arXiv 2023 [pdf]

Graph-Based Video-Language Learning with Multi-Grained Audio-Visual Alignment

Chenyang Lyu, Wenxi Li, Tianbo Ji, Longyue Wang, Liting Zhou, Cathal Gurrin, Linyi Yang, Yi Yu, Yvette Graham, Jennifer Foster
MM 2023 [pdf]

TaleCrafter: Interactive Story Visualization with Multiple Characters

Yuan Gong, Youxin Pang, Xiaodong Cun, Menghan Xia, Yingqing He, Haoxin Chen, Longyue Wang, Yong Zhang, Xintao Wang, Ying Shan, Yujiu Yang
arXiv 2023 [pdf]

A Benchmark Dataset and Evaluation Methodology for Chinese Zero Pronoun Translation

Mingzhou Xu*, Longyue Wang*, Siyou Liu, Derek F. Wong, Shuming Shi, Zhaopeng Tu
Language Resources and Evaluation [pdf]

Search-Engine-augmented Dialogue Response Generation with Cheaply Supervised Query Production

Ante Wang, Qi Liu, Haitao Mi, Longyue Wang, Zhaopeng Tu, Jinsong Su, Dong Yu, Linfeng Song
Artificial Intelligence [pdf]

Towards A Unified Training for Levenshtein Transformer

Kangjie Zheng, Longyue Wang, Zhihao Wang, Binqi Chen, Ming Zhang, Zhaopeng Tu
ICASSP 2023 [pdf]

How Does Pretraining Improve Discourse-Aware Translation?

Zhihong Huang, Longyue Wang*, Siyou Liu, Derek F. Wong
INTERSPEECH 2023 [pdf]

Prompt-Learning for Cross-Lingual Relation Extraction

Chiaming Hsu, Changtong Zan, Liang Ding, Longyue Wang, Xiaoting Wang, Weifeng Liu, Fu Lin, Wenbin Hu
IJCNN 2023 [pdf]

2022

GuoFeng: A Benchmark for Zero Pronoun Recovery and Translation

Longyue Wang, Mingzhou Xu*, Derek F. Wong, Hongye Liu, Linfeng Song, Lidia S. Chao, Shuming Shi and Zhaopeng Tu
EMNLP 2022 [pdf] [code]

ngram-OAXE: Phrase-Based Order-Agnostic Cross Entropy for Non-Autoregressive Machine Translation

Cunxiao Du, Zhaopeng Tu, Longyue Wang, Jing Jiang
COLING 2022 [pdf]

Recurrent Graph Encoder for Syntax-Aware Neural Machine Translation

Liang Ding, Longyue Wang* and Siyou Liu*
International Journal of Machine Learning and Cybernetics [pdf]

Redistributing Low-Frequency Words: Making the Most of Monolingual Data in Non-Autoregressive Translation

Liang Ding, Longyue Wang*, Shuming Shi, Dacheng Tao, Zhaopeng Tu
ACL 2022 [pdf]

Learning to Refine Source Representations for Neural Machine Translation

Xinwei Geng, Longyue Wang, Xing Wang, Mingtao Yang, Xiaocheng Feng, Bing Qin, Zhaopeng Tu
International Journal of Machine Learning and Cybernetics [pdf]

2021

Context-Aware Self-Attention Networks for Natural Language Processing

Baosong Yang, Longyue Wang*, Derek F. Wong, Shuming Shi, Zhaopeng Tu
Neurocomputing [pdf] [code] [bibtex]

Recent Advances in Dialogue Machine Translation

Siyou Liu, Yuqi Sun, Longyue Wang†
Information [pdf] [bibtex]

Rejuvenating Low-Frequency Words: Making the Most of Parallel Data in Non-Autoregressive Translation

Liang Ding, Longyue Wang*, Xuebo Liu, Derek F. Wong, Dacheng Tao, Zhaopeng Tu
ACL 2021 [pdf] [bibtex]

On the Copying Behaviors of Pre-Training for Neural Machine Translation

Xuebo Liu, Longyue Wang, Derek F. Wong, Liang Ding, Lidia S. Chao, Shuming Shi, Zhaopeng Tu
ACL 2021 [pdf] [bibtex]

Progressive Multi-Granularity Training for Non-Autoregressive Translation

Liang Ding, Longyue Wang*, Xuebo Liu, Derek F. Wong, Dacheng Tao, Zhaopeng Tu
ACL 2021 [pdf] [bibtex]

Understanding and Improving Lexical Choice in Non-Autoregressive Translation

Liang Ding, Longyue Wang, Xuebo Liu, Derek F. Wong, Dacheng Tao, Zhaopeng Tu
ICLR 2021 [pdf] [bibtex]

Understanding and Improving Encoder Layer Fusion in Sequence-to-Sequence Learning

Xuebo Liu, Longyue Wang, Derek F. Wong, Liang Ding, Lidia S. Chao, Zhaopeng Tu
ICLR 2021 [pdf] [bibtex]

Tencent Translation System for The WMT21 News Translation Task

Longyue Wang, Mu Li, Fangxu Liu, Shuming Shi, Zhaopeng Tu, Xing Wang, Shuangzhi Wu, Jiali Zeng, Wen Zhang
WMT 2021 [pdf] [bibtex]

On the Complementarity between Pre-Training and Back-Translation for Neural Machine Translation

Xuebo Liu, Longyue Wang, Derek F. Wong, Liang Ding, Lidia S. Chao, Shuming Shi, Zhaopeng Tu
WMT 2021 [pdf] [bibtex]

2020

Context-Aware Cross-Attention for Non-Autoregressive Translation

Liang Ding, Longyue Wang, Di Wu, Dacheng Tao, Zhaopeng Tu
COLING 2020 [pdf] [bibtex]

On the Sub-layer Functionalities of Transformer Decoder

Yilin Yang, Longyue Wang, Shuming Shi, Prasad Tadepalli, Stefan Lee, Zhaopeng Tu
EMNLP 2020 [pdf] [bibtex]

On the Sparsity of Neural Machine Translation Models

Yong Wang, Longyue Wang, Victor O.K. Li, Zhaopeng Tu
EMNLP 2020 [pdf] [bibtex]

How Does Selective Mechanism Improve Self-Attention Networks?

Xinwei Geng, Longyue Wang, Xing Wang, Bing Qin, Ting Liu, and Zhaopeng Tu
ACL 2020 [pdf] [bibtex]

Self-Attention with Cross-Lingual Position Representation

Liang Ding, Longyue Wang, and Dacheng Tao
ACL 2020 [pdf] [bibtex]

Go from the General to the Particular: Multi-Domain Translation with Domain Transformation Networks

Yong Wang, Longyue Wang, Shuming Shi, Victor Li, and Zhaopeng Tu
AAAI 2020 [pdf] [bibtex]

Tencent Neural Machine Translation Systems for The WMT20 News Translation Task

Shuangzhi Wu, Xing Wang, Longyue Wang, Fangxu Liu, Jun Xie, Zhaopeng Tu, Shuming Shi, Mu Li
WMT 2020 [pdf] [bibtex]

Tencent AI Lab Machine Translation Systems for WMT20 Chat Translation Task

Longyue Wang, Zhaopeng Tu, Xing Wang, Li Ding, Liang Ding, Shuming Shi
WMT 2020 [pdf] [bibtex]

2019

One Model to Learn Both: Zero Pronoun Prediction and Translation

Longyue Wang, Zhaopeng Tu, Xing Wang, Shuming Shi
EMNLP 2019 [pdf] [bibtex]

Towards Understanding Neural Machine Translation with Word Importance

Shilin He, Zhaopeng Tu, Xing Wang, Longyue Wang, Michael R. Lyu, Shuming Shi
EMNLP 2019 [pdf] [bibtex]

Assessing the Ability of Self-Attention Networks to Learn Word Order

Baosong Yang, Longyue Wang, Derek Wong, Lidia S. Chao, Zhaopeng Tu
ACL 2019 [pdf] [bibtex]

Exploiting Sentential Context for Neural Machine Translation

Xing Wang, Zhaopeng Tu, Longyue Wang, Shuming Shi
ACL 2019 [pdf] [bibtex]

Convolutional Self-Attention Networks

Baosong Yang, Longyue Wang, Derek Wong, Lidia S. Chao, Zhaopeng Tu
NAACL 2019 [pdf]

Modeling Recurrence for Transformer

Jie Hao, Xing Wang, Baosong Yang, Longyue Wang, Jinfeng Zhang, Zhaopeng Tu
NAACL 2019 [pdf]

Dynamic Layer Aggregation for Neural Machine Translation with Routing-by-Agreement

Zi-Yi Dou, Zhaopeng Tu, Xing Wang, Longyue Wang, Shuming Shi, and Tong Zhang
AAAI 2019 [pdf]

2018

Learning to Jointly Translate and Predict Dropped Pronouns with a Shared Reconstruction Mechanism

Longyue Wang, Zhaopeng Tu, Andy Way, and Qun Liu
EMNLP 2018 [pdf]

Translating Pro-Drop Languages with Reconstruction Models

Longyue Wang, Zhaopeng Tu, Shuming Shi, Tong Zhang, Yvette Graham, Qun Liu
AAAI 2018 [pdf] [bibtex] [poster]

Chinese–Portuguese Machine Translation: A Study on Building Parallel Corpora from Comparable Texts

Siyou Liu, Longyue Wang, Chao-Hong Liu
LREC 2018 [pdf] [bibtex] [slides]

2017

Semantics-Enhanced Task-Oriented Dialogue Translation: A Case Study on Hotel Booking

Longyue Wang, Jinhua Du, Liangyou Li, Zhaopeng Tu, Andy Way, Qun Liu
IJCNLP 2017 [pdf] [bibtex] [demo]

Exploiting Cross-Sentence Context for Neural Machine Translation

Longyue Wang, Zhaopeng Tu, Andy Way, Qun Liu
EMNLP 2017 [pdf] [bibtex] [slides]

Automatic Construction of Parallel Dialogue Corpora with Rich Information

Xiaojun Zhang, Longyue Wang, Alberto Poncelas, Qun Liu
Chinese Language Resources (Springer) [pdf] [bibtex]

2016

A Novel and Robust Approach for Pro-Drop Language Translation

Longyue Wang, Zhaopeng Tu, Xiaojun Zhang, Siyou Liu, Hang Li, Andy Way and Qun Liu
Machine Translation (Springer) [pdf] [bibtex]

A Novel Approach for Dropped Pronoun Translation

Longyue Wang, Zhaopeng Tu, Xiaojun Zhang, Hang Li, Andy Way and Qun Liu
NAACL-HLT 2016 [pdf] [bibtex] [slides]

The Automatic Construction of Discourse Corpus for Dialogue Translation

Longyue Wang, Xiaojun Zhang, Zhaopeng Tu, Andy Way, Qun Liu
LREC 2016 [pdf] [bibtex] [slides]

Dropped Pronoun Generation for Dialogue Machine Translation

Longyue Wang, Xiaojun Zhang, Zhaopeng Tu, Hang Li, Qun Liu
ICASSP 2016 [pdf] [bibtex] [poster]

2015

Linguistically-augmented perplexity-based data selection for language models

Antonio Toral, Pavel Pecina, Longyue Wang, Josef van Genabith
Computer Speech and Language (Elsevier) [pdf] [bibtex]

The DCU Discourse Parser for Connective, Argument Identification and Explicit Sense Classification

Longyue Wang, Chris Hokamp, Tsuyoshi Okita, Xiaojun Zhang, Qun Liu
CoNLL 2015 [pdf] [bibtex] [poster]

The DCU Discourse Parser: A Sense Classification Task

Tsuyoshi Okita, Longyue Wang, Qun Liu
CoNLL 2015 [pdf] [bibtex] [poster]

2014

Thesis: Domain Adaptation for Statistical Machine Translation

Longyue Wang
University of Macau Library [pdf] [link] [slides] [video] [bibtex]

A Systematic Comparison of Data Selection Criteria for SMT Domain Adaptation

Longyue Wang, Derek F. Wong, Lidia S. Chao, Yi Lu, Junwen Xing
The Scientific World Journal [pdf] [bibtex]

Data Selection via Semi-Supervised Recursive Autoencoders for SMT Domain Adaptation

Yi Lu, Derek F. Wong, Lidia S. Chao, Longyue Wang
CWMT 2014 (Springer LNCS) [pdf] [bibtex]

Effective Hypotheses Reordering Model in Machine Translation

Yiming Wang, Longyue Wang, Derek F. Wong, Lidia S. Chao
CWMT 2014 (Springer LNCS) [pdf] [bibtex]

Combining Domain Adaptation Approaches for Medical Text Translation

Longyue Wang, Yi Lu, Derek F. Wong, Lidia S. Chao, Yiming Wang, Francisco Oliveira
WMT 2014 [pdf] [bibtex]

Domain Adaptation for Medical Text Translation using Web Resources

Yi Lu, Longyue Wang, Derek F. Wong, Lidia S. Chao, Yiming Wang, Francisco Oliveira
WMT 2014 [pdf] [bibtex]

Factored Statistical Machine Translation for Grammatical Error Correction

Yiming Wang, Longyue Wang, Xiaodong Zeng, Derek F. Wong, Lidia S. Chao, Yi Lu
CoNLL 2014 [pdf] [bibtex]

UM-Corpus: A Large English-Chinese Parallel Corpus for Statistical Machine Translation

Liang Tian, Derek F. Wong, Lidia S. Chao, Paulo Quaresma, Francisco Oliveira, Yi Lu, Shuo Li, Yiming Wang, Longyue Wang
LREC 2014 [pdf] [bibtex]

2013

Edit Distance: A New Data Selection Criterion for Domain Adaptation in SMT

Longyue Wang, Derek F. Wong, Lidia S. Chao, Junwen Xing, Yi Lu, Isabel Trancoso
RANLP 2013 [pdf] [bibtex]

iCPE: A Hybrid Data Selection Model for SMT Domain Adaptation

Longyue Wang, Derek F. Wong, Lidia S. Chao, Yi Lu, Junwen Xing
CCL 2013 [pdf] [bibtex]

UM-Checker: A Hybrid System for English Grammatical Error Correction

Junwen Xing, Longyue Wang, Derek F. Wong, Lidia S. Chao, Xiaodong Zeng
CoNLL 2013 [pdf] [bibtex]

An Experimental Platform for Cross-Language Document Retrieval

Longyue Wang, Derek F. Wong, Lidia S. Chao
ICASE 2013 [pdf] [bibtex]

2012

TQDL: Integrated Models for Cross-Language Document Retrieval

Longyue Wang, Derek F. Wong, Lidia S. Chao
International Journal of Computational Linguistics and Chinese Language Processing [pdf] [bibtex]

An Improvement in Cross-Language Document Retrieval Based on Statistical Models

Longyue Wang, Derek F. Wong, Lidia S. Chao
ROCLING 2012 [pdf] [bibtex]

CRFs-Based Chinese Word Segmentation for Micro-Blog with Small-Scale Data

Longyue Wang, Derek F. Wong, Lidia S. Chao, Junwen Xing
CLP 2012 [pdf] [bibtex]

A Joint Chinese Named Entity Recognition and Disambiguation System

Longyue Wang, Shuo Li, Derek F. Wong, Lidia S. Chao
CLP 2012 [pdf] [bibtex]
