
Frontiers of Information Technology & Electronic Engineering, 2018, Volume 19, Issue 5. doi: 10.1631/FITEE.1601865

Cross-lingual implicit discourse relation recognition with co-training

School of Software, Xiamen University, Xiamen 361005, China; State Key Laboratory for Novel Software Technology, Nanjing University, Nanjing 210023, China; Institute of Software, Chinese Academy of Sciences, Beijing 100190, China; Provincial Key Laboratory for Computer Information Processing Technology, Soochow University, Suzhou 215006, China.

Available online: 2018-07-20


Abstract

The lack of labeled corpora hinders research on implicit discourse relation recognition (DRR) for Chinese, whereas discourse corpora are available in other languages, such as English. In this paper, we propose a cross-lingual implicit DRR framework that exploits an available English corpus for the Chinese DRR task. We use machine translation to generate Chinese instances from a labeled English discourse corpus, so that each instance has two independent views: a Chinese view and an English view. We then train one classifier on each view in a co-training fashion, which exploits unlabeled Chinese data to improve implicit DRR for Chinese. Experimental results demonstrate the effectiveness of our method.
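The co-training loop summarized in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the nearest-centroid classifier, the margin-based confidence score, the parameters `rounds` and `per_round`, the toy 2-D feature vectors, and the labels `rel_a` and `rel_b` are all hypothetical stand-ins. The sketch also assumes that every unlabeled instance already has both an English and a Chinese view, which the abstract does not specify.

```python
import math
from collections import defaultdict


class CentroidClassifier:
    """Toy nearest-centroid model; a stand-in for any real classifier."""

    def fit(self, X, y):
        sums, counts = {}, defaultdict(int)
        for x, label in zip(X, y):
            acc = sums.setdefault(label, [0.0] * len(x))
            for i, v in enumerate(x):
                acc[i] += v
            counts[label] += 1
        self.centroids = {c: [v / counts[c] for v in s] for c, s in sums.items()}
        return self

    def predict_with_margin(self, x):
        """Return (label, margin): margin is the distance gap between the
        two nearest centroids, used here as a crude confidence score."""
        dists = sorted((math.dist(x, c), label) for label, c in self.centroids.items())
        margin = dists[1][0] - dists[0][0] if len(dists) > 1 else 0.0
        return dists[0][1], margin


def train_pair(labeled):
    """Train one classifier per view from (view_en, view_zh, label) triples."""
    ys = [y for _, _, y in labeled]
    clf_en = CentroidClassifier().fit([en for en, _, _ in labeled], ys)
    clf_zh = CentroidClassifier().fit([zh for _, zh, _ in labeled], ys)
    return clf_en, clf_zh


def co_train(labeled, unlabeled, rounds=5, per_round=1):
    """Each round, every view labels its most confident unlabeled instances,
    and those instances join the shared training set for both views."""
    labeled, pool = list(labeled), list(unlabeled)
    for _ in range(rounds):
        if not pool:
            break
        picked = {}
        for view, clf in enumerate(train_pair(labeled)):
            scored = [(clf.predict_with_margin(inst[view]), i) for i, inst in enumerate(pool)]
            scored.sort(key=lambda t: t[0][1], reverse=True)
            for (label, _), i in scored[:per_round]:
                # If both views pick the same instance, the English label wins.
                picked.setdefault(i, label)
        labeled += [(pool[i][0], pool[i][1], label) for i, label in picked.items()]
        pool = [inst for i, inst in enumerate(pool) if i not in picked]
    return train_pair(labeled)


# Hypothetical toy data: (English view, Chinese view, label).
labeled = [([0.0, 0.0], [0.0, 0.0], "rel_a"), ([1.0, 1.0], [1.0, 1.0], "rel_b")]
unlabeled = [([0.1, 0.2], [0.0, 0.1]), ([0.9, 0.8], [1.0, 0.9]),
             ([0.3, 0.1], [0.2, 0.2]), ([0.8, 1.0], [0.7, 0.9])]
clf_en, clf_zh = co_train(labeled, unlabeled)
```

In this sketch the two classifiers share one growing labeled set, so a confident prediction from one view becomes training data for the other. A real system would replace the toy classifier and the margin heuristic with models and confidence estimates suited to discourse relation features.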
