Multi-modal image search
    2.
    发明授权

    公开(公告)号:US12093310B2

    公开(公告)日:2024-09-17

    申请号:US16492062

    申请日:2018-03-07

    CPC classification number: G06F16/5866 G06F16/53

    Abstract: The present invention relates to methods for searching for two-dimensional or three-dimensional objects. More particularly, the present invention relates to searching for two-dimensional or three-dimensional objects in a collection by using a multi-modal query of image and/or tag data. Aspects and/or embodiments seek to provide a method of searching for digital objects using any combination of images, three-dimensional shapes and text by embedding the vector representations for these multiple modes in the same space. Aspects and/or embodiments can be easily extensible to any other type of modality, making it more general.

    Document search device, document search program, and document search method

    公开(公告)号:US12086189B2

    公开(公告)日:2024-09-10

    申请号:US18339544

    申请日:2023-06-22

    CPC classification number: G06F16/90344 G06F16/5866 G06F16/907 G06F16/93

    Abstract: A document search device includes a processor, and a memory storing program instructions that cause the processor to search for an input keyword in a document database in which document information including text data extracted by using a character recognition process from document image data generated by imaging a paper document is stored, select a similar keyword in accordance with a degree of similarity to the input keyword from a group of wildcard strings generated from the input keyword and search for the similar keyword in the document database, the degree of similarity being determined by comparing each character of the input keyword with a corresponding character of a wildcard string in the group of wildcard strings, and output a search result obtained by searching for the input keyword in the document database and a search result obtained by searching for the similar keyword in the document database.

    Interactive Content Feedback System
    5.
    发明公开

    公开(公告)号:US20240296184A1

    公开(公告)日:2024-09-05

    申请号:US18217598

    申请日:2023-07-02

    CPC classification number: G06F16/686 G06F16/5866 G06F16/9038 G06F16/907

    Abstract: This invention is directed to a tool that enables content creators to collect and analyze feedback on their content during production and live performances. During playback of content, users are enabled to provide detailed feedback and comments via various feedback interfaces on user devices. Users may indicate that they like and dislike certain aspects of the content, such as musical instruments featured in a song, at specific points in time. Feedback is timestamped, transformed into values, and aggregated for review and analysis. Using machine learning techniques, the present invention can identify trends in audience preferences and generate recommendations for tailoring content and content delivery. An interactive display enables the content creator to efficiently manipulate and make sense of collected feedback. With robust security features, the interactive content feedback system described herein may integrate with content streaming platforms as well as operate as an independent application.

    REMOTE MONITORING APPARATUS, REMOTE MONITORING METHOD, COMPUTER PROGRAM AND RECORDING MEDIUM

    公开(公告)号:US20240290103A1

    公开(公告)日:2024-08-29

    申请号:US18659140

    申请日:2024-05-09

    Inventor: Yuji TAHARA

    CPC classification number: G06V20/52 G06F16/5866 G06V40/10 H04N7/18

    Abstract: A remote monitoring apparatus includes a distribution unit that (i) distributes to a user terminal a second image to which position information indicating a different position from that indicated by first position information, which is position information added to a first image, is added and to which time information indicating a same time as that indicated by first time information, which is time information added to the first image, is added when receiving from the user terminal a first switching instruction indicating image switching of the first image in one direction, and (ii) distributes to the user terminal a third image to which position information indicating a same position as that indicated by the first position information is added and to which time information indicating a different time from that indicated by the first time information is added, when receiving from the user terminal a second switching instruction indicating image switching of the first image in another direction that crosses the one direction.

    DISPLAY APPARATUS AND METHOD FOR PERSON RECOGNITION AND PRESENTATION

    公开(公告)号:US20240283994A1

    公开(公告)日:2024-08-22

    申请号:US18650530

    申请日:2024-04-30

    CPC classification number: H04N21/4104 G06F16/5866 G06V40/172

    Abstract: Provided are a display apparatus and a person recognition and presentation method. The display apparatus includes a display and a controller that is in communication with the display. The controller is configured to: associated information of a display interface of the display and generate a scenario image for recognition in response to a user command; obtain facial feature information for recognition in the scenario image; obtain similar facial feature information when a matching confidence level of pre-stored facial feature information in a database with the facial feature information for recognition does not exceed a preset confidence level; obtain average-person recognition data; generate a sharing control uniquely matching with the facial feature information for recognition; and control the display to present the average-person recognition data and the sharing control on a current display interface.

    Method and system for facilitating keyword-based searching in images

    公开(公告)号:US12045280B2

    公开(公告)日:2024-07-23

    申请号:US17861144

    申请日:2022-07-08

    Abstract: Technologies are generally described for a system to extract description of reference numerals in images and facilitate keyword-based search in images. In various examples, the system may include one or more databases, a computer readable memory, and one or more processors. The system may be configured to extract one or more reference numerals from an image, and identify and extract corresponding description of the one or more reference numerals from a description document corresponding to the image. The system may be further configured to extract text from the images, and store the images in a database with the extracted data, i.e., text, reference numerals, and corresponding descriptions. The system may be further configured to receive an input query intending to search images related to a search logic of the input query, search a database to identify an image including either of text or a reference numeral having a corresponding description that corresponds to the search logic of the input query, and render the identified image via a display device executing the output interface.

Patent Agency Ranking