Weighting features for an intent classification system

Invention Grant

US10977445B2 Weighting features for an intent classification system 有权

Please log in to see more content

Patent Title: Weighting features for an intent classification system
Application No.: US16265618

Application Date: 2019-02-01
Publication No.: US10977445B2

Publication Date: 2021-04-13
Inventor: Yang Yu , Ladislav Kunc , Haoyu Wang , Ming Tan , Saloni Potdar
Applicant: International Business Machines Corporation
Applicant Address: US NY Armonk
Assignee: International Business Machines Corporation
Current Assignee: International Business Machines Corporation
Current Assignee Address: US NY Armonk
Agent Tyrus S. Cartwright
Main IPC: G10L15/22
IPC: G10L15/22 ; G06F40/30 ; G06N20/00

Weighting features for an intent classification system

Abstract:

A computer-implemented method includes obtaining a training data set including a plurality of training examples. The method includes generating, for each training example, multiple feature vectors corresponding, respectively, to multiple feature types. The method includes applying weighting factors to feature vectors corresponding to a subset of the feature types. The weighting factors are determined based on one or more of: a number of training examples, a number of classes associated with the training data set, an average number of training examples per class, a language of the training data set, a vocabulary size of the training data set, or a commonality of the vocabulary with a public corpus. The method includes concatenating the feature vectors of a particular training example to form an input vector and providing the input vector as training data to a machine-learning intent classification model to train the model to determine intent based on text input.

Public/Granted literature

US20200250270A1 WEIGHTING FEATURES FOR AN INTENT CLASSIFICATION SYSTEM Public/Granted day:2020-08-06

Information query

Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L15/00	语音识别（G10L17/00优先）
G10L15/22	.在语音识别过程中（例如在人机对话过程中）使用的程序