- 专利标题: SEMANTIC SEARCH AND RESPONSE
-
申请号: US17478945申请日: 2021-09-19
-
公开(公告)号: US20220121694A1公开(公告)日: 2022-04-21
- 发明人: Mark Rosno , Patrick Deziel , Christopher Korzenowski , Rebecca Bilbro , Kelsey L. Bruso , Robert Malek
- 申请人: Mark Rosno , Patrick Deziel , Christopher Korzenowski , Rebecca Bilbro , Kelsey L. Bruso , Robert Malek
- 申请人地址: US MN Eagan; US MN Eagan; US MN Eagan; US MN Eagan; US MN Eagan; US MN Eagan
- 专利权人: Mark Rosno,Patrick Deziel,Christopher Korzenowski,Rebecca Bilbro,Kelsey L. Bruso,Robert Malek
- 当前专利权人: Mark Rosno,Patrick Deziel,Christopher Korzenowski,Rebecca Bilbro,Kelsey L. Bruso,Robert Malek
- 当前专利权人地址: US MN Eagan; US MN Eagan; US MN Eagan; US MN Eagan; US MN Eagan; US MN Eagan
- 主分类号: G06F16/332
- IPC分类号: G06F16/332 ; G06F40/30 ; G06F40/205
摘要:
An approach to information retrieval is contemplated for facilitating semantic search and response over a large domain of technical documents is disclosed. First, the grammar and morphology of the statements and instructions expressed in the technical documents is used to filter training data to extract the text that is most information-rich, that is the text that contains domain-specific jargon, in context. This training data is then vectorized and fed as input to an SBERT neural network model that learns an embedding of related words and terms in the text, i.e. the relationship between a given set of words contained in a user's query and the instructions from the technical documentation text most likely to assist in the user's operations. There are two parsing tasks. The first is to select a minimal sample of sentences from the document corpus that capture the domain-specific terminology (jargon). The result is set of sentences used to train BERT and SBERT. The second parsing task to create a set of action-trigger phrases from the document corpus. The trigger potentially matches a user query and the action is the related task.
信息查询