NLP-TEA-2 Shared Task on Chinese Grammatical Error Diagnosis

Event Notification Type: 
Call for Participation
Friday, 31 July 2015
Country: 
China
Contact Email: 
City: 
Beijing
Contact: 
Lung-Hao Lee
Submission Deadline: 
Wednesday, 15 April 2015

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

The 2nd Workshop on Natural Language Processing
Techniques for Educational Applications (NLP-TEA-2)
with Shared Task on Chinese Grammatical Error Diagnosis (CGED)

July 31, 2015 at Beijing, China
in conjunction with ACL-IJCNLP 2015

NLP-TEA-2: https://sites.google.com/site/nlptea2cged/
CGED: https://sites.google.com/site/nlptea2cged/shared-task
ACL-IJCNLP 2015: http://acl2015.org/

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Shared Task Description

The goal of this shared task is developing the computer-assisted tools to diagnose several kinds of grammatical errors, i.e., redundant word, missing word, word disorder, and word selection. The input sentence contains one of defined error types. The developed tool should indicate which kind of error type is embedded in the given sentence and its occurred positions. If the input contains no grammatical errors, the tool should return: sid, correct. The output format should be a quadruple of sid, start_off, end_off, error_type , if the input contains a grammatical error. In this output format, sid means the unique sentence identifier, start_off and end_off represent the positions of starting and ending character where a grammatical error occurs, in which each character or punctuation occupies 1 for counting positions, and error_type should be one of defined errors. Examples are shown as follows:
Example 1:
Input: (sid=B2-0880) 他是我的以前的室友
output: B2-0880, 4, 4, Redundant
(Notes: "的" is a redundant character in this input sentence)
Example 2:
Input: (sid=A2-0017) 那電影是機器人的故事
Output: A2-0017, 2, 2, Missing
(Notes: There is a missing word between "那" and "電影")
Example 3:
Input: (sid=A2-0017) 那部電影是機器人的故事
Output: A2-0017, correct
Example 4:
Input: (sid=B1-1193) 吳先生是修理腳踏車的拿手
Output: B1-1193, 11, 12, Selection
(Notes: "拿手" is wrong word. One of correct words should be "好手")
Example 5:
Input: (sid=B2-2292) 所以我不會讓失望她
Output: B2-2292, 7, 9, Disorder
(Notes: "失望她" is a word ordering error. The correct order should be "她失望")

Important Dates
Registration Open: January 5, 2015
Release of Training Data: March 2, 2015
Registration Close: April 15, 2015
Dry Run (Format Validation) Data Released: May 1, 2015
Release of Test Data: May 11, 2015
Testing Results Submission Due: May 13, 2015
Release of Evaluation Results: May 15, 2015
Technical Report Submission Due: June 1, 2015
Report Reviews Returned: June 11, 2015
Camera-Ready Due: June 21, 2015
Workshop Date: July 31, 2015

Workshop Co-organizers
Hsin-Hsi Chen (National Taiwan University)
Yuen-Hsien Tseng (National Taiwan Normal University)
Yuji Matsumoto (Nara Institute of Science and Technology)
Lung Hsiang Wong (Nanyang Technological University)

Shared Task Co-organizers
Lung-Hao Lee (National Taiwan University)
Liang-Chih Yu (Yuan Ze University)
Li-Ping Chang (National Taiwan Normal University)