第18行: |
第18行: |
| In general objective, the ACE program is motivated by and addresses the same issues as the MUC program that preceded it. The ACE program, however, defines the research objectives in terms of the target objects (i.e., the entities, the relations, and the events) rather than in terms of the words in the text. For example, the so-called "named entity" task, as defined in MUC, is to identify those words (on the page) that are names of entities. In ACE, on the other hand, the corresponding task is to identify the entity so named. This is a different task, one that is more abstract and that involves inference more explicitly in producing an answer. In a real sense, the task is to detect things that "aren't there". | | In general objective, the ACE program is motivated by and addresses the same issues as the MUC program that preceded it. The ACE program, however, defines the research objectives in terms of the target objects (i.e., the entities, the relations, and the events) rather than in terms of the words in the text. For example, the so-called "named entity" task, as defined in MUC, is to identify those words (on the page) that are names of entities. In ACE, on the other hand, the corresponding task is to identify the entity so named. This is a different task, one that is more abstract and that involves inference more explicitly in producing an answer. In a real sense, the task is to detect things that "aren't there". |
| | | |
− | 总的来说,ACE 程序的动机和解决的问题与之前的 MUC 程序相同。然而,ACE 程序以目标对象(即实体、关系和事件)来定义研究目标,而不是以文本中的词语来定义。例如,在 MUC 中定义的所谓“命名实体”任务就是识别(页面上的)实体名称的单词。另一方面,在 ACE 中,相应的任务是标识如此命名的实体。这是一个不同的任务,一个更抽象的任务,在产生答案时涉及更明确的推理。在真正意义上,任务是检测“不存在”的东西。 | + | 总的来说,ACE 程序的动机和解决的问题与之前的 MUC 程序相同。然而,ACE 程序以目标对象(即实体、关系和事件)来定义研究目标,而不是以文本中的词语来定义。例如,在 MUC 中定义的所谓“命名实体”任务就是识别(页面上的)实体名称的单词。而在 ACE 中,相应的任务是标识如此命名的实体。后一个任务更加抽象,在产生答案时涉及更明确的推理。从某种意义上说,AUC检测的实体“并不在那里”。 |
| | | |
| While the ACE program is directed toward extraction of information from [[Sound|audio]] and [[image]] sources in addition to pure text, the research effort is restricted to information extraction from text. The actual [[transduction (machine learning)|transduction]] of audio and image data into text is not part of the ACE research effort, although the processing of [[Speech recognition | ASR]] and [[Optical character recognition | OCR]] output from such transducers is. | | While the ACE program is directed toward extraction of information from [[Sound|audio]] and [[image]] sources in addition to pure text, the research effort is restricted to information extraction from text. The actual [[transduction (machine learning)|transduction]] of audio and image data into text is not part of the ACE research effort, although the processing of [[Speech recognition | ASR]] and [[Optical character recognition | OCR]] output from such transducers is. |
第24行: |
第24行: |
| While the ACE program is directed toward extraction of information from audio and image sources in addition to pure text, the research effort is restricted to information extraction from text. The actual transduction of audio and image data into text is not part of the ACE research effort, although the processing of ASR and OCR output from such transducers is. | | While the ACE program is directed toward extraction of information from audio and image sources in addition to pure text, the research effort is restricted to information extraction from text. The actual transduction of audio and image data into text is not part of the ACE research effort, although the processing of ASR and OCR output from such transducers is. |
| | | |
− | 虽然 ACE 项目的目标是从音频和图像资源中提取除纯文本以外的信息,但研究工作仅限于从文本中提取信息抽取。实际将音频和图像数据转换成文本并不是 ACE 研究工作的一部分,尽管这些转换器输出的 ASR 和 OCR 信号的处理是 ACE 研究工作的一部分。 | + | 虽然 ACE 项目的目标是从音频和图像资源中提取除纯文本以外的信息,但研究工作仅限于从文本中提取信息抽取。实际将音频和图像数据转换成文本并不是 ACE 研究工作的一部分,不过这些转换器输出的 ASR 和 OCR 的输出是 ACE 研究工作的一部分。 |
| | | |
| The effort involves: | | The effort involves: |
第38行: |
第38行: |
| 这项工作包括: | | 这项工作包括: |
| * 详细定义研究任务, | | * 详细定义研究任务, |
− | * 收集和注释培训、开发和评估所需的数据, | + | * 收集和注释数据集,包括训练集测试集等, |
| * 通过评估工具和研究工作坊支持研究。 | | * 通过评估工具和研究工作坊支持研究。 |
| | | |
第52行: |
第52行: |
| # events mentioned in the text, such as: interaction, movement, transfer, creation and destruction. | | # events mentioned in the text, such as: interaction, movement, transfer, creation and destruction. |
| | | |
− | 给定一个自然语言的文本,ACE 的挑战是检测: 文本中提到的 # 实体,如: 个人、组织、地点、设施、武器、车辆和地理政治实体。# 实体之间的关系,例如: a 人是 b 公司的经理。关系类型包括: 角色、部分、位置、接近和社会。# 文本中提到的事件,比如: 互动,运动,转移,创造和毁灭。 | + | 给定一个自然语言的文本,ACE 的挑战是检测: 文本中提到的 |
| + | |
| + | <nowiki>#</nowiki> 实体,如: 个人、组织、地点、设施、武器、车辆和地理政治实体。 |
| + | |
| + | <nowiki>#</nowiki> 实体之间的关系,例如: a 人是 b 公司的经理。关系类型包括: 角色、部分、位置、接近和社会。 |
| + | |
| + | <nowiki>#</nowiki> 文本中提到的事件,比如: 互动,运动,转移,创造和毁灭。 |
| | | |
| The program relates to [[English language|English]], [[Arabic language|Arabic]] and [[Chinese language|Chinese]] texts. | | The program relates to [[English language|English]], [[Arabic language|Arabic]] and [[Chinese language|Chinese]] texts. |