Finding similar persons in images

    公开(公告)号:US11915520B2

    公开(公告)日:2024-02-27

    申请号:US17902349

    申请日:2022-09-02

    Applicant: Adobe Inc.

    CPC classification number: G06V40/172 G06F18/00 G06F18/29 G06V30/194 G06V40/10

    Abstract: Embodiments are disclosed for finding similar persons in images. In particular, in one or more embodiments, the disclosed systems and methods comprise receiving an image query, the image query including an input image that includes a representation of a person, generating a first cropped image including a representation of the person's face and a second cropped image including a representation of the person's body, generating an image embedding for the input image by combining a face embedding corresponding to the first cropped image and a body embedding corresponding to the second cropped image, and querying an image repository in embedding space by comparing the image embedding to a plurality of image embeddings associated with a plurality of images in the image repository to obtain one or more images based on similarity to the input image in the embedding space.

    Model compression using cycle generative adversarial network knowledge distillation

    公开(公告)号:US11886542B2

    公开(公告)日:2024-01-30

    申请号:US17325877

    申请日:2021-05-20

    Applicant: Apple Inc.

    CPC classification number: G06F18/2148 G06F18/22 G06N3/045 G06V30/194

    Abstract: Systems and processes for prediction using generative adversarial network and distillation technology are provided. For example, an input is received at a first portion of a language model. A first output distribution is obtained, based on the input, from the first portion of the language model. Using a first training model, the language model is adjusted based on the first output distribution. The first output distribution is received at a second portion of the language model. A first representation of the input is obtained, based on the first output distribution, from the second portion of the language model. The language model is adjusted, using a second training model, based on the first representation of the input. Using the adjusted language model, an output is provided based on a received user input.

    INFORMATION PROCESSING APPARATUS, CONTROL METHOD, AND NON-TRANSITORY STORAGE MEDIUM

    公开(公告)号:US20230386243A1

    公开(公告)日:2023-11-30

    申请号:US18232215

    申请日:2023-08-09

    Inventor: Hiroo Ikeda

    CPC classification number: G06V40/10 G06V30/194

    Abstract: An information processing apparatus (2000) includes a recognizer (2020). An image (10) is input to the recognizer (2020). The recognizer (2020) outputs, for a crowd included in the input image (10), a label (30) describing a type of the crowd and structure information (40) describing a structure of the crowd. The structure information (40) indicates a location and a direction of an object included in the crowd. The information processing apparatus (2000) acquires training data (50) which includes a training image (52), a training label (54), and training structure information (56). The information processing apparatus (2000) performs training of the recognizer (2020) using the label (30) and the structure information (40), which are acquired by inputting the training image (52) with respect to the recognizer (2020, and the training label (54) and the training structure information (56).

Patent Agency Ranking