-
公开(公告)号:US12032613B2
公开(公告)日:2024-07-09
申请号:US17373294
申请日:2021-07-12
Applicant: BASF SE
Inventor: Henning Schwabe , Arunav Mishra , Juergen Mueller , Michael Schuhmacher
IPC: G06F16/33 , G06F16/31 , G06F40/211 , G06F40/30 , G06N5/04
CPC classification number: G06F16/3344 , G06F16/316 , G06F40/211 , G06F40/30 , G06N5/04
Abstract: In order to facilitate a search and identification of documents, an information retrieval system is provided for performing a search on a corpus of data objects. The information retrieval system comprises a device and a database. The database is configured to store at least one syntactic search index data structure and at least one semantic search index data structure. The syntactic search index data structure is configured to index and store in the database a plurality of terms from the corpus of data objects along with syntactic annotations indicating syntactic information. The at least one semantic search index data structure is configured to index and store in the database the plurality of terms from the corpus of data objects along with semantic annotations indicating semantic information. The device comprises an input unit, a processing unit, and an output unit. The input unit is configured to receive a syntactic query and a semantic query. The processing unit is configured to match the syntactic query against the syntactic search index data structure to obtain a first set of data objects, each of which has a set of terms that are syntactically related to the syntactic query. The processing is configured to match the semantic query against The at least one semantic search index data structure to obtain second set of the data objects, each of which has a set of terms that are semantically related to the semantic query, wherein the second set of data objects is a sub-set of the first set of the data objects. The output unit is configured to output information of the second set of data objects.
-
公开(公告)号:US20220019608A1
公开(公告)日:2022-01-20
申请号:US17373294
申请日:2021-07-12
Applicant: BASF SE
Inventor: Henning Schwabe , Arunav Mishra , Juergen Mueller , Michael Schuhmacher
IPC: G06F16/33 , G06F40/30 , G06F40/211 , G06F16/31 , G06N5/04
Abstract: In order to facilitate a search and identification of documents, an information retrieval system is provided for performing a search on a corpus of data objects. The information retrieval system comprises a device and a database. The database is configured to store at least one syntactic search index data structure and at least one semantic search index data structure. The syntactic search index data structure is configured to index and store in the database a plurality of terms from the corpus of data objects along with syntactic annotations indicating syntactic information. The at least one semantic search index data structure is configured to index and store in the database the plurality of terms from the corpus of data objects along with semantic annotations indicating semantic information. The device comprises an input unit, a processing unit, and an output unit. The input unit is configured to receive a syntactic query and a semantic query. The processing unit is configured to match the syntactic query against the syntactic search index data structure to obtain a first set of data objects, each of which has a set of terms that are syntactically related to the syntactic query. The processing is configured to match the semantic query against The at least one semantic search index data structure to obtain second set of the data objects, each of which has a set of terms that are semantically related to the semantic query, wherein the second set of data objects is a sub-set of the first set of the data objects. The output unit is configured to output information of the second set of data objects.
-