- Patent title: Entity level data augmentation in chatbots for robust named entity recognition
- Application number: US17345288
- Filing date: 2021-06-11
- Publication number: US11804219B2
- Publication date: 2023-10-31
- Inventors: Srinivasa Phani Kumar Gadde, Yuanxu Wu, Aashna Devang Kanuga, Elias Luqman Jalaluddin, Vishal Vishnoi, Mark Edward Johnson
- Applicant: Oracle International Corporation
- Applicant address: US CA Redwood Shores
- Assignee: Oracle International Corporation
- Current assignee: Oracle International Corporation
- Current assignee address: US CA Redwood Shores
- Agency: Kilpatrick Townsend & Stockton LLP
- Main classification: G10L15/197
- IPC classification: G10L15/197; G10L15/06; G10L15/26; G06F40/186; G06F40/295; G06F40/30; G06F40/35; G06N20/00; H04L51/02; H04L51/52; G06N3/044; G06N3/045
Abstract:
Techniques for data augmentation for training chatbot systems in natural language processing. In one particular aspect, a method is provided that includes generating a list of values to cover for an entity, selecting utterances from a set of data that have context for the entity, converting the utterances into templates, where each template comprises a slot that maps to the list of values for the entity, selecting a template from the templates, selecting a value from the list of values based on the mapping between the slot within the selected template and the list of values for the entity, and creating an artificial utterance based on the selected template and the selected value, where creating the artificial utterance comprises inserting the selected value into the slot of the selected template that maps to the list of values for the entity.
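
The steps named in the abstract can be illustrated with a minimal sketch. This is not the patented implementation: the function names (`to_template`, `create_artificial_utterance`, `augment`), the `{ENTITY}` slot syntax, the uniform random selection, and the sample data are all assumptions made for illustration only.

```python
import random
import re

# Hypothetical slot syntax: an entity name in braces, e.g. "{CITY}".
SLOT_PATTERN = re.compile(r"\{(\w+)\}")


def to_template(utterance, mention, entity):
    """Convert an utterance into a template by replacing the first
    occurrence of the entity mention with a slot named after the entity."""
    return utterance.replace(mention, "{" + entity + "}", 1)


def create_artificial_utterance(template, entity_values, rng):
    """Insert a selected value into each slot of the template.
    The slot name maps to the list of values for that entity."""
    def fill(match):
        return rng.choice(entity_values[match.group(1)])
    return SLOT_PATTERN.sub(fill, template)


def augment(labeled_utterances, entity_values, count, seed=0):
    """Generate `count` artificial utterances.

    labeled_utterances: (utterance, mention, entity) triples that give
    context for an entity. Uniform random selection is an assumption;
    the abstract does not specify a selection strategy.
    """
    rng = random.Random(seed)
    templates = [
        to_template(text, mention, entity)
        for text, mention, entity in labeled_utterances
        if entity in entity_values
    ]
    return [
        create_artificial_utterance(rng.choice(templates), entity_values, rng)
        for _ in range(count)
    ]


if __name__ == "__main__":
    # Illustrative data only; not taken from the patent.
    values = {"CITY": ["Paris", "Tokyo", "Lima"]}
    data = [
        ("Book a flight to Boston tomorrow", "Boston", "CITY"),
        ("Is it raining in Boston?", "Boston", "CITY"),
    ]
    for line in augment(data, values, count=4):
        print(line)
```

Keeping the slot name identical to the entity name makes the slot-to-value-list mapping a plain dictionary lookup, which is one simple way to realize the mapping the abstract describes.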