-
Publication No.: US20230078094A1
Publication Date: 2023-03-16
Application No.: US17943746
Filing Date: 2022-09-13
Applicant: SEMICONDUCTOR ENERGY LABORATORY CO., LTD.
Inventor: Yoshitaka DOZEN, Kunitaka YAMAMOTO
IPC: G06F16/35, G06F16/383, G06V30/413, G06V30/418
Abstract: A search system capable of searching for images that represent a similar concept is provided. The search system includes an input unit, a text extraction unit, a tag obtaining unit, and a tag similarity calculation unit. When image data to which an image label is assigned and document data including the image label are supplied to the input unit, the text extraction unit extracts tag-obtaining-purpose text data from the document data on the basis of the image label. The tag obtaining unit obtains a tag including at least some of the words in the tag-obtaining-purpose text data. The tag similarity calculation unit calculates the similarity between tags. This makes it possible to find images that represent a similar concept even when the feature values of the images themselves differ greatly.
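The flow described in this abstract (label-based text extraction, tag obtaining, tag similarity) can be sketched roughly as follows. This is only an illustration under stated assumptions: the abstract does not say how text is selected, how tag words are chosen, or which similarity measure is used, so the sentence-level extraction, the stopword list, the Jaccard measure, and all function names here are hypothetical.

```python
import re

# Assumed stopword list; the abstract does not specify how tag words are chosen.
STOPWORDS = {"a", "an", "the", "of", "is", "in", "and", "to", "with", "shows"}


def extract_text_for_label(document: str, label: str) -> str:
    """Return the sentences of `document` that mention the image label."""
    sentences = re.split(r"(?<=[.!?])\s+", document.strip())
    # The word boundary keeps "Figure 1" from matching "Figure 10".
    pattern = re.compile(re.escape(label) + r"\b")
    return " ".join(s for s in sentences if pattern.search(s))


def obtain_tags(text: str, label: str) -> set[str]:
    """Build a tag (a set of words) from the extracted text."""
    label_words = set(re.findall(r"[a-z]+", label.lower()))
    words = re.findall(r"[a-z]+", text.lower())
    return {w for w in words if w not in STOPWORDS and w not in label_words}


def tag_similarity(tag_a: set[str], tag_b: set[str]) -> float:
    """Jaccard similarity between two tags (an assumed measure)."""
    if not tag_a and not tag_b:
        return 0.0
    return len(tag_a & tag_b) / len(tag_a | tag_b)


if __name__ == "__main__":
    doc = (
        "Figure 1 shows a transistor with an oxide semiconductor layer. "
        "Figure 2 shows a display with an oxide semiconductor transistor."
    )
    tag_1 = obtain_tags(extract_text_for_label(doc, "Figure 1"), "Figure 1")
    tag_2 = obtain_tags(extract_text_for_label(doc, "Figure 2"), "Figure 2")
    print(tag_similarity(tag_1, tag_2))
```

Because similarity is computed on tags taken from the surrounding text rather than on pixel features, two images that look different can still score as similar when their descriptions share vocabulary.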
-
Publication No.: US20220350827A1
Publication Date: 2022-11-03
Application No.: US17763793
Filing Date: 2020-09-22
Applicant: SEMICONDUCTOR ENERGY LABORATORY CO., LTD.
Inventor: Kunitaka YAMAMOTO, Kazuki HIGASHI, Yoshitaka DOZEN
IPC: G06F16/33, G06F40/279
Abstract: Natural-language query text can be input to search a plurality of documents, and the portions highly relevant to the input text are presented to a reader. A document data processing system is provided that includes: a document readout unit that reads out a plurality of subject documents; a document division unit that divides each of the subject documents into a plurality of blocks; a first distributed representation acquisition unit that acquires a distributed representation of a word in each of the blocks; a first distributed representation retention unit that stores the distributed representations acquired by the first distributed representation acquisition unit on a subject-document-by-subject-document basis and on a block-by-block basis; a query text readout unit that reads out query text; a second distributed representation acquisition unit that extracts a word included in the query text and acquires a distributed representation of the word; a second distributed representation retention unit that stores the distributed representation acquired by the second distributed representation acquisition unit; and a similarity calculation unit that compares the distributed representation of the word included in the query text with the distributed representation of the word included in each of the blocks and calculates the similarity of each of the blocks.
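The block-by-block comparison described in this abstract can be sketched as below. This is a minimal illustration, not the patented method: the toy three-dimensional word vectors, the paragraph-based block division, and the scoring rule (for each query word, take its best cosine match in the block, then average) are all assumptions, and every name is hypothetical.

```python
import math
import re

# Hypothetical toy word vectors standing in for learned distributed representations.
EMBEDDINGS = {
    "transistor": (1.0, 0.0, 0.0),
    "semiconductor": (0.9, 0.1, 0.0),
    "display": (0.0, 1.0, 0.0),
    "panel": (0.1, 0.9, 0.0),
    "battery": (0.0, 0.0, 1.0),
}


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def split_into_blocks(document: str) -> list[str]:
    """Divide a document into blocks; blank-line paragraphs are assumed here."""
    return [b.strip() for b in document.split("\n\n") if b.strip()]


def word_vectors(text: str) -> list[tuple]:
    """Look up the vector of every known word in the text."""
    return [EMBEDDINGS[w] for w in re.findall(r"[a-z]+", text.lower()) if w in EMBEDDINGS]


def block_similarity(query_vecs, block_vecs) -> float:
    """Average, over query words, of the best cosine match within the block."""
    if not query_vecs or not block_vecs:
        return 0.0
    best = [max(cosine(q, b) for b in block_vecs) for q in query_vecs]
    return sum(best) / len(best)


def rank_blocks(documents: dict[str, str], query: str):
    """Return (similarity, document id, block index) tuples, best first."""
    query_vecs = word_vectors(query)
    scored = []
    for doc_id, text in documents.items():
        for index, block in enumerate(split_into_blocks(text)):
            scored.append((block_similarity(query_vecs, word_vectors(block)), doc_id, index))
    return sorted(scored, reverse=True)
```

Keeping the block vectors per document and per block, as the retention units in the abstract do, means only the query side has to be recomputed for each new search.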
-
Publication No.: US20240386737A1
Publication Date: 2024-11-21
Application No.: US18682496
Filing Date: 2022-08-17
Applicant: Semiconductor Energy Laboratory Co., Ltd.
Inventor: Yoshitaka DOZEN, Kunitaka YAMAMOTO
IPC: G06V30/413, G06V30/418
Abstract: A document classification system that enables highly accurate document classification is provided. The document classification system includes an input unit, a storage unit, a processing unit, and an output unit. The input unit has a function of receiving document data and reference document data. The storage unit has a function of storing a classification model. The processing unit has a function of creating first classification data to third classification data from the document data and the reference document data. A word contained in the document data and not contained in the reference document data belongs to the first classification data. A word contained in the document data and contained in the reference document data belongs to the second classification data. A word not contained in the document data and contained in the reference document data belongs to the third classification data. The processing unit has a function of creating document comparison data from the first classification data to the third classification data and determining a category of the reference document data using the classification model. The output unit has a function of outputting the category.
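The three-way word split described in this abstract maps directly onto set operations. The sketch below shows that partition plus one possible way to turn it into document comparison data; the proportion-based feature vector and all names are assumptions, and the classification model itself is not reproduced because the abstract does not describe it.

```python
import re


def word_set(text: str) -> set[str]:
    """Lowercased set of alphabetic words in the text (assumed tokenization)."""
    return set(re.findall(r"[a-z]+", text.lower()))


def classify_words(document: str, reference: str):
    """Split words into the first, second, and third classification data."""
    doc_words, ref_words = word_set(document), word_set(reference)
    first = doc_words - ref_words    # in the document only
    second = doc_words & ref_words   # in both
    third = ref_words - doc_words    # in the reference document only
    return first, second, third


def comparison_features(first, second, third):
    """Hypothetical comparison data: the share of words in each class."""
    total = len(first) + len(second) + len(third)
    if total == 0:
        return (0.0, 0.0, 0.0)
    return (len(first) / total, len(second) / total, len(third) / total)


if __name__ == "__main__":
    parts = classify_words(
        "oxide semiconductor transistor",
        "oxide semiconductor display panel",
    )
    print(parts)
    print(comparison_features(*parts))
```

A feature vector of this kind could then be passed to whatever classification model is stored, which would return the category of the reference document.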
-
Publication No.: US20220164381A1
Publication Date: 2022-05-26
Application No.: US17439684
Filing Date: 2020-03-17
Applicant: SEMICONDUCTOR ENERGY LABORATORY CO., LTD.
Inventor: Kengo AKIMOTO, Shigeru TAMAKI, Kunitaka YAMAMOTO, Isamu SHIGEMORI
IPC: G06F16/58, G06F16/53, G06N3/04, G06F16/583, G06F40/268
Abstract: An image retrieval system with high retrieval accuracy is provided. The image retrieval system includes a database and a processing portion. The database has a function of storing a plurality of pieces of database image data, and a database tag is linked to each of the plurality of pieces of database image data. The processing portion has a function of obtaining database image feature value data representing a feature value of the database image data for each piece of the database image data. The processing portion has a function of obtaining query image feature value data representing a feature value of the query image data. The processing portion has a function of calculating first similarity of the database image data to the query image data for each piece of the database image data. The processing portion has a function of obtaining a query tag linked to the query image data using some of the database tags.
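The retrieval steps in this abstract (feature values per database image, a first similarity to the query image, and a query tag derived from database tags) can be sketched as follows. The sketch assumes the feature vectors are already computed, uses cosine similarity, and picks the query tag by majority vote over the most similar images; none of these choices is stated in the abstract, and the database contents and function names are hypothetical.

```python
import math
from collections import Counter

# Hypothetical database: precomputed feature vectors with linked tags.
DATABASE = [
    {"id": "img1", "feature": (1.0, 0.0), "tags": ["transistor", "circuit"]},
    {"id": "img2", "feature": (0.9, 0.2), "tags": ["transistor", "layout"]},
    {"id": "img3", "feature": (0.0, 1.0), "tags": ["display"]},
]


def cosine(u, v):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def first_similarity(query_feature, database):
    """Similarity of every database image to the query, best first."""
    scored = [(cosine(query_feature, entry["feature"]), entry["id"]) for entry in database]
    return sorted(scored, reverse=True)


def query_tags(query_feature, database, top_k=2, n_tags=1):
    """Assign query tags by voting over the tags of the top_k similar images."""
    ranked = sorted(
        database,
        key=lambda entry: cosine(query_feature, entry["feature"]),
        reverse=True,
    )[:top_k]
    counts = Counter(tag for entry in ranked for tag in entry["tags"])
    return [tag for tag, _ in counts.most_common(n_tags)]


if __name__ == "__main__":
    query = (1.0, 0.1)
    print(first_similarity(query, DATABASE))
    print(query_tags(query, DATABASE))
```

Once the query image has tags of its own, a further tag-based comparison could refine the feature-based ranking, which is one plausible reading of why the abstract calls the feature-based score the first similarity.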