Invention Application
WO2017044409A1 SYSTEM AND METHOD OF ANNOTATING UTTERANCES BASED ON TAGS ASSIGNED BY UNMANAGED CROWDS
Status: Under examination (published)
- Patent Title: SYSTEM AND METHOD OF ANNOTATING UTTERANCES BASED ON TAGS ASSIGNED BY UNMANAGED CROWDS
- Patent Title (Chinese, translated): System and method of extracting utterances based on tags assigned by different roles
- Application No.: PCT/US2016/050373
- Application Date: 2016-09-06
- Publication No.: WO2017044409A1
- Publication Date: 2017-03-16
- Inventor: ROTHWELL, Spencer John; BRAGA, Daniela; ELSHENAWY, Ahmad Khamis; CARTER, Stephen Steele
- Applicant: VOICEBOX TECHNOLOGIES CORPORATION
- Applicant Address: 11980 NE 24th Street Suite 100 Bellevue, WA 98005 US
- Assignee: VOICEBOX TECHNOLOGIES CORPORATION
- Current Assignee: VOICEBOX TECHNOLOGIES CORPORATION
- Current Assignee Address: 11980 NE 24th Street Suite 100 Bellevue, WA 98005 US
- Agency: KOO, Hean L. et al.
- Priority: US 62/215,116 (2015-09-07)
- Main IPC: G10L15/00
- IPC: G10L15/00
Abstract:
A system and method of tagging utterances with Named Entity Recognition ("NER") labels using unmanaged crowds is provided. The system may generate various annotation jobs in which a user, among a crowd, is asked to tag which parts of an utterance, if any, relate to various entities associated with a domain. For a given domain that is associated with a number of entities that exceeds a threshold N value, multiple batches of jobs (each batch having jobs that have a limited number of entities for tagging) may be used to tag a given utterance from that domain. This reduces the cognitive load imposed on a user, and prevents the user from having to tag more than N entities. As such, a domain with a large number of entities may be tagged efficiently by crowd participants without overloading each crowd participant with too many entities to tag.
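The batching idea in the abstract can be illustrated with a minimal sketch. It assumes a domain is represented as a plain list of entity names and that each job pairs one utterance with at most N entities; the names `AnnotationJob`, `batch_entities`, and `build_jobs` are hypothetical and do not come from the application, which may organize batches and jobs differently.

```python
from dataclasses import dataclass
from typing import List, Sequence, Tuple


@dataclass(frozen=True)
class AnnotationJob:
    """One crowd task: tag the listed entities in a single utterance (hypothetical type)."""
    utterance: str
    entities: Tuple[str, ...]


def batch_entities(entities: Sequence[str], max_per_job: int) -> List[List[str]]:
    """Split a domain's entities into consecutive groups of at most max_per_job."""
    if max_per_job < 1:
        raise ValueError("max_per_job must be at least 1")
    return [
        list(entities[i:i + max_per_job])
        for i in range(0, len(entities), max_per_job)
    ]


def build_jobs(utterance: str, entities: Sequence[str], max_per_job: int) -> List[AnnotationJob]:
    """Create one job per entity group so no worker is asked to tag more than max_per_job entities."""
    return [
        AnnotationJob(utterance, tuple(group))
        for group in batch_entities(entities, max_per_job)
    ]


if __name__ == "__main__":
    # Illustrative domain and utterance, not taken from the application.
    movie_entities = ["title", "actor", "director", "genre", "year"]
    for job in build_jobs("show me comedies with Bill Murray from 1993", movie_entities, 2):
        print(job.entities)
```

With a threshold of 2, the five example entities are spread over three jobs for the same utterance. A domain whose entity count does not exceed the threshold yields a single job.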