千言数据集:知识对话评测
大赛名称 千言数据集:知识对话评测
详情链接 https://aistudio.baidu.com/aistudio/competition/detail/432/0/introduction
大赛简介

知识对话是指对话系统利用外部知识信息,使聊天内容更加丰富、准确,这对提升用户体验是非常重要的,近年来受到学术界和工业界的广泛关注。
Knowledge grounded dialogue means that the dialogue system uses external knowledge information to make the conversation more engaging and factually correct. This is very important for improving user engagement and has gained a lot of attention from both academia and industry in recent years.

为了解决静态知识的丰富性、时效性和个性化问题,我们提出了一个全新的知识对话任务——搜索信息增强的对话(SINC)。对话系统在对话的过程中动态地搜索外部知识信息,并将搜索知识用于回复生成中。为此我们建设了外部知识搜索API,可以根据给定query和用户地理位置实时搜索各类通用知识、动态知识和个性化知识,同时我们利用这个API人工建设了用于该任务研究的对话数据集DuSinc。
To address the lack of richness, timeliness, and personalization of static knowledge, we propose a novel knowledge grounded dialogue task called Search INformation augmented Conversation (SINC). The dialogue system dynamically searches for external knowledge information in the process of conversation and uses the searched knowledge in response generation. To this end, we have built an external knowledge search API, which can search various general knowledge, dynamic knowledge, and personalized knowledge in real-time according to the given query and user geographic location. At the same time, we use this API to manually build a dialogue dataset named DuSinc for this task research.

本次竞赛中,我们主要从以下两个子任务评测系统的知识对话能力:1)Query生成任务:给定多轮对话历史,生成用于查询搜索引擎的Query;2)回复生成任务:给定文本知识与多轮对话历史,生成合适的对话回复。
In this challenge, we mainly evaluate the knowledge grounded dialogue ability of the system from the following two subtasks: 1) Query Generation Task: given dialogue history, generate a query for querying the search engine; 2) Response Generation Task: given text knowledge and dialogue history to generate appropriate dialogue responses.

举办方 百度
参赛方式

(1)公平竞技: 参赛者禁止在比赛中抄袭他人作品、交换答案、使用多个小号,经发现将取消比赛成绩并严肃处理;
(2)组织声明: 组委会保留对比赛规则、赛事安排进行调整和修改的权利、比赛作弊行为的判定权利和处置权利、收回或拒绝授予影响组织及公平性的参赛团队奖项的权利;
(3)基线模型: 基线模型供参赛选手参考,可以选择在其基础上改进。参赛选手不能直接提交基线模型结果;如果提交文件与基线模型结果高度相似,则将取消比赛成绩;
(4)作品产权: 参赛作品(包含但不限于算法、模型等)知识产权归参赛选手所有,组委会有权将参赛作品、作品相关、参赛团队信息用于宣传品、相关出版物、指定及授权媒体发布、官方网站浏览及下载、展览(含巡展)等活动项目,大赛组织单位享有优先合作权利。

注:信息来源于赛事平台,侵删