Invention Grant
- Patent Title: Method and apparatus for labeling data
-
Application No.: US16866798Application Date: 2020-05-05
-
Publication No.: US11386463B2Publication Date: 2022-07-12
- Inventor: Sanjeev Misra , Appavu Siva Prakasam , Ann Eileen Skudlark , Siva Kolachina , Nisha Shahul Hameed , Prashanth Boddhireddy , Lien Tran , Jenq-Chyuan Wang
- Applicant: AT&T Intellectual Property I, L.P.
- Applicant Address: US GA Atlanta
- Assignee: AT&T Intellectual Property I, L.P.
- Current Assignee: AT&T Intellectual Property I, L.P.
- Current Assignee Address: US GA Atlanta
- Agency: Guntin & Gust, PLC
- Agent Mark Wilinski
- Main IPC: G06Q30/02
- IPC: G06Q30/02 ; G06F16/35 ; G06F16/93 ; G06F40/295 ; G06N5/04 ; G06Q10/10 ; G06N20/00

Abstract:
Aspects of the subject disclosure may include, for example, determining classes from a corpus based on topic modeling, data clustering and unsupervised learning. Labels are determined for each of the classes and trained models are generated for each of the classes by assignment of a plurality of textual documents to labels based on a highest number of matches. A raw textual document can be tokenized and stop words removed. A corresponding one of the trained models can be selected according to a class that is applicable to subject matter of the raw textual document. The processed document can be assigned to a target label based on a highest number of matches of words. Other embodiments are disclosed.
Public/Granted literature
- US20210182912A1 METHOD AND APPARATUS FOR LABELING DATA Public/Granted day:2021-06-17
Information query