Submission and Evaluation Metrics
Submission
Results from one system comprise a run. Name a run file in the following format:
RITEVAL-(Team Name)-(Lang)-(Subtask Name)-(Run Number).txt
- Team Name = the short ID (Group ID) used in NTCIR registration
- Lang = CS, CT
- Subtask Name = FV, SVBC, SVMC
- Run Number = 01, 02, ... 05
Example:
RITEVAL-NTOUA-CT-SVBC-01.txt
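The naming rule above can be checked mechanically before submission. The following is a minimal sketch in Python, not an official tool: the function names are hypothetical, and the pattern assumes that a Group ID consists only of letters and digits, which the guidelines do not state.

```python
import re

LANGS = ("CS", "CT")
SUBTASKS = ("FV", "SVBC", "SVMC")

# Assumption: the Group ID is alphanumeric. Run numbers are limited to 01..05.
RUN_NAME = re.compile(
    r"RITEVAL-(?P<team>[A-Za-z0-9]+)-(?P<lang>CS|CT)"
    r"-(?P<subtask>FV|SVBC|SVMC)-(?P<run>0[1-5])\.txt"
)


def run_filename(team, lang, subtask, run_number):
    """Build a run file name from its four components."""
    if lang not in LANGS:
        raise ValueError(f"unknown language: {lang}")
    if subtask not in SUBTASKS:
        raise ValueError(f"unknown subtask: {subtask}")
    if not 1 <= run_number <= 5:
        raise ValueError("run number must be between 1 and 5")
    return f"RITEVAL-{team}-{lang}-{subtask}-{run_number:02d}.txt"


def is_valid_run_filename(name):
    """Return True if the whole name matches the run file pattern."""
    return RUN_NAME.fullmatch(name) is not None
```

For instance, run_filename("NTOUA", "CT", "SVBC", 1) reproduces the example name given above.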
Run Format
Each line contains the result for one t2 in the following format:
t2_ID [SPACE] Label [SPACE] Confidence
where Confidence is the system's confidence in the predicted label.
In the Chinese FV subtask, each t2 must be tagged with one of three labels (E, C, U; cf. Task Definition).
In the Chinese SV-BC subtask, each t2 must be tagged with one of two labels (Y, N; cf. Task Definition).
In the Chinese SV-MC subtask, each t2 must be tagged with one of four labels (F, B, C, I; cf. Task Definition).
Example of an FV run:
1 E 0.852
2 U 0.994
3 E 0.789
4 C 1.000
:
SV runs look like FV runs except that their label sets are different.
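A run file can be sanity-checked line by line before it is sent. The sketch below is an illustration under stated assumptions, not the organizers' validator: parse_run_line and LABELS are hypothetical names, fields are split on any whitespace, and the confidence is only required to parse as a number because the guidelines do not state a range.

```python
# Label sets per subtask, as listed in the run format description.
LABELS = {
    "FV": {"E", "C", "U"},
    "SVBC": {"Y", "N"},
    "SVMC": {"F", "B", "C", "I"},
}


def parse_run_line(line, subtask):
    """Split one run line into (t2_id, label, confidence).

    Raises ValueError if the line does not have three fields, the label is
    not allowed for the subtask, or the confidence is not a number.
    """
    parts = line.split()
    if len(parts) != 3:
        raise ValueError(f"expected 3 fields, got {len(parts)}: {line!r}")
    t2_id, label, confidence_text = parts
    if label not in LABELS[subtask]:
        raise ValueError(f"label {label!r} is not valid for {subtask}")
    confidence = float(confidence_text)  # raises ValueError if not numeric
    return t2_id, label, confidence
```

Running every line of a run file through such a check catches a label from the wrong subtask (for example Y in an FV run) before submission.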
System Description
Please also create a text file (one for each run) that briefly describes the system,
including its approach, the information or features used, resources (any language resource, the web, etc.), and tools (NLP tools, etc.).
Name the system description file in the following format:
RITEVAL-(Team Name)-(Lang)-(Subtask Name)-(Run Number)-sysdesc.txt
- Team Name = the short ID (Group ID) used in NTCIR registration
- Lang = CS, CT
- Subtask Name = FV, SVBC, SVMC
- Run Number = 01, 02, ... 05
Example:
RITEVAL-NTOUA-CT-SVBC-01-sysdesc.txt
An example of a system description file is as follows:
1. Approach
[ ]rule:
[x]statistics: SVM
[ ]hybrid
2. Feature/Information
[ ]Overlapping
[x]Alignment
[ ]Transformation
[x]Char/Word Overlapping
[x]Syntactic Information
[ ]Predicate-Argument Relationship
[x]Named Entity
[ ]Entity/Event
[x]Temporal/Numeric Information
[ ]Entailment
[ ]Modality
[ ]Polarity
[x]Synonym/Antonym
[x]Hypernym/Hyponym
[ ]Meronym/Holonym
[ ]Entity/Event Relationship
[ ]Entailment Rule
3. Resources: WordNet, Wikipedia
4. Tools: word segmentation, syntactic parser
Run Submission
Archive all files into a single zip file and send it as an email attachment to rite-val-organizers@nii.ac.jp by the end of the formal run period (2014/8/7 23:59, extended).
Specify in the email subject line which subtasks and languages you are participating in.
We will reply with an acknowledgment within 24 hours of submission. Contact us if you do not receive it.
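Packaging the run files and their system description files into one archive can be scripted. The following is a minimal sketch using Python's standard zipfile module. The function name pack_submission and the flat archive layout are assumptions; the guidelines do not prescribe a name or an internal structure for the archive.

```python
import os
import zipfile


def pack_submission(paths, archive_path):
    """Write the given files into one zip archive, stored flat by base name."""
    with zipfile.ZipFile(archive_path, "w", compression=zipfile.ZIP_DEFLATED) as archive:
        for path in paths:
            # arcname drops the directory part, so the archive holds only file names.
            archive.write(path, arcname=os.path.basename(path))
    return archive_path
```

A call such as pack_submission(["RITEVAL-NTOUA-CT-SVBC-01.txt", "RITEVAL-NTOUA-CT-SVBC-01-sysdesc.txt"], "NTOUA-submission.zip") would produce the attachment; the archive name here is only an illustration.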
Evaluation Metrics
Macro-F (the macro-average of the per-label F-measures over all labels)
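To make the metric concrete, the sketch below computes a macro-averaged F-measure from gold and predicted labels. It is an illustration, not the official scorer. It assumes the F-measure is the balanced F1 (harmonic mean of precision and recall), that a label's precision, recall, or F counts as 0 when its denominator is 0, and that the label set is passed in explicitly; none of these details are specified above.

```python
def macro_f(gold, predicted, labels):
    """Average the per-label F1 scores over the given label set."""
    if len(gold) != len(predicted):
        raise ValueError("gold and predicted must have the same length")
    f_scores = []
    for label in labels:
        pairs = list(zip(gold, predicted))
        tp = sum(1 for g, p in pairs if g == label and p == label)
        fp = sum(1 for g, p in pairs if g != label and p == label)
        fn = sum(1 for g, p in pairs if g == label and p != label)
        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        if precision + recall:
            f_scores.append(2 * precision * recall / (precision + recall))
        else:
            f_scores.append(0.0)
    return sum(f_scores) / len(f_scores)
```

Because every label contributes equally to the average, a system that ignores a rare label is penalized more heavily than it would be under plain accuracy.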