-
公开(公告)号:US20190243893A1
公开(公告)日:2019-08-08
申请号:US16270508
申请日:2019-02-07
Applicant: LIFE TECHNOLOGIES CORPORATION
Inventor: Aron WAHL , Matthew MURRAY , Austin BEER , Emily WENGERT , Heiko WAECHTER , Matt LOOMIS , Namit JOSHI
IPC: G06F17/27 , G06F16/31 , G06F16/332 , G06Q30/06
CPC classification number: G06F17/271 , G06F16/313 , G06F16/3329 , G06F17/2775 , G06F17/278 , G06Q30/0631
Abstract: Methods, systems, and computer readable media concern natural language processing and searching for identifying biological products in an electronic document. The method includes extracting, from the electronic document, a candidate text phrase representing a potential biological product reference in the electronic document and parsing the candidate text phrase into a syntactic structure including one or more terms. The method includes tagging each of the one or more terms in the syntactic structure with a vocabulary tag. The vocabulary tag represents a technical meaning of a term in the potential biological product reference. The method includes calculating a total score for the candidate text phrase based on relative tag scores associated with each vocabulary tag for the one or more terms. The method includes classifying the candidate text phrase as a biological product reference and includes searching a database for one or more product entries based on the biological product references.