Systems and methods of image processing based on gaze detection

    公开(公告)号:US11798204B2

    公开(公告)日:2023-10-24

    申请号:US17685278

    申请日:2022-03-02

    CPC classification number: G06T11/00 G06F3/013 G06V40/174 G06V40/18

    Abstract: Imaging systems and techniques are described. An imaging system receives image data representing at least a portion (e.g., a face) of a first user as captured by a first image sensor. The imaging system identifies that a gaze of the first user as represented in the image data is directed toward a displayed representation of at least a portion (e.g., a face) of a second user. The imaging system identifies an arrangement of representations of users for output. The imaging system generates modified image data based on the gaze and the arrangement at least in part by modifying the image data to modify at least the portion of the first user in the image data to be visually directed toward a direction corresponding to the second user based on the gaze and the arrangement. The imaging system outputs the modified image data arranged according to the arrangement.

    Task agnostic open-set prototypes for few-shot open-set recognition

    公开(公告)号:US12019641B2

    公开(公告)日:2024-06-25

    申请号:US18153899

    申请日:2023-01-12

    CPC classification number: G06F16/2462 G06F16/285

    Abstract: Systems and techniques are provided for processing one or more data samples. For example, a neural network classifier can be trained to perform few-shot open-set recognition (FSOSR) based on a task-agnostic open-set prototype. A process can include determining one or more prototype representations for each class included in a plurality of support samples. A task-agnostic open-set prototype representation can be determined, in a same learned metric space as the one or more prototype representations. One or more distance metrics can be determined for each query sample of one or more query samples, based on the one or more prototype representations and the task-agnostic open-set prototype representation. Based on the one or more distance metrics, each query sample can be classified into one of classes associated with the one or more prototype representations or an open-set class associated with the task-agnostic open-set prototype representation.

    Method and apparatus for activating speech recognition

    公开(公告)号:US11205433B2

    公开(公告)日:2021-12-21

    申请号:US16547263

    申请日:2019-08-21

    Abstract: A device to process an audio signal representing input sound includes a user voice verifier configured to generate a first indication based on whether the audio signal represents a user's voice. The device includes a speaking target detector configured to generate a second indication based on whether the audio signal represents at least one of a command or a question. The device includes an activation signal unit configured to selectively generate an activation signal based on the first indication and the second indication. The device also includes an automatic speech recognition engine configured to be activated, responsive to the activation signal, to process the audio signal.

Patent Agency Ranking