Invention Grant
- Patent Title: Automatic extraction of named entities from texts
- Patent Title (中): 从文本自动提取命名实体
-
Application No.: US14508419Application Date: 2014-10-07
-
Publication No.: US09588960B2Publication Date: 2017-03-07
- Inventor: Ilya Nekhay
- Applicant: ABBYY InfoPoisk LLC
- Applicant Address: RU
- Assignee: ABBYY InfoPoisk LLC
- Current Assignee: ABBYY InfoPoisk LLC
- Current Assignee Address: RU
- Agent Veronica Weinstein
- Priority: RU2014101126 20140115
- Main IPC: G06F17/27
- IPC: G06F17/27

Abstract:
Disclosed are systems, computer-readable mediums, and methods for extracting named entities from an untagged corpus of texts. Generating a set of attributes for each of the tokens based at least on a deep semantic-syntactic analysis. The set of attributes include lexical, syntactic, and semantic attributes. Selecting a subset of the attributes for each of the tokens. Retrieving classifier attributes and categories based on a trained model, wherein the classifier attributes are related to one or more categories. Comparing the subset of the attributes for each of the tokens with the classifier attributes. Classifying each of tokens to at least one of the categories based on the comparing. Generating tagged text based on the categorized tokens.
Public/Granted literature
- US20150199333A1 AUTOMATIC EXTRACTION OF NAMED ENTITIES FROM TEXTS Public/Granted day:2015-07-16
Information query