ATTENTION TRACKING TO AUGMENT FOCUS TRANSITIONS

    Publication No.: US20230086766A1

    Publication date: 2023-03-23

    Application No.: US17933631

    Filing date: 2022-09-20

    Applicant: Google LLC

    Abstract: Systems and methods are related to tracking an attention of a user with respect to content presented on a virtual screen, detecting a defocus event associated with a first region of the content, and determining a next focus event associated with a second region of the content. The determination can be based at least in part on the defocus event and on the tracked attention of the user. The systems and methods can include generating, based on the determined next focus event, a marker for differentiating the second region of the content from a remainder of the content, and in response to detecting a refocus event associated with the virtual screen, triggering execution of the marker associated with the second region of the content.
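
    The flow in this abstract (track attention, detect a defocus event, determine the next focus region, then trigger a marker on refocus) can be sketched roughly as below. This is an illustrative sketch only, not the patented method: the `AttentionTracker` class, its method names, and the "next region in reading order" heuristic are all assumptions made for the example.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class AttentionTracker:
    """Hypothetical tracker; names and heuristic are assumptions, not from the patent."""
    regions: List[str]                                 # content regions in reading order
    history: List[str] = field(default_factory=list)   # regions the user attended to
    marker: Optional[str] = None                       # region to highlight on refocus

    def observe(self, region: str) -> None:
        """Record that the user's attention is currently on `region`."""
        self.history.append(region)

    def on_defocus(self) -> Optional[str]:
        """Defocus event: choose the next focus region and prepare a marker for it."""
        if not self.history:
            return None
        idx = self.regions.index(self.history[-1])
        # Assumed heuristic: the next region in reading order, clamped at the end.
        self.marker = self.regions[min(idx + 1, len(self.regions) - 1)]
        return self.marker

    def on_refocus(self) -> Optional[str]:
        """Refocus event: trigger the pending marker (returned as a string) and clear it."""
        marker, self.marker = self.marker, None
        return f"highlight:{marker}" if marker else None


tracker = AttentionTracker(regions=["para-1", "para-2", "para-3"])
tracker.observe("para-1")
tracker.observe("para-2")
tracker.on_defocus()          # user looks away while reading para-2
print(tracker.on_refocus())   # marker for the predicted next region
```

    A real system would replace the reading-order heuristic with a prediction based on the tracked gaze data, and the marker would be a visual cue rather than a string.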

    AVATAR ANIMATION IN VIRTUAL CONFERENCING

    Publication No.: US20230051409A1

    Publication date: 2023-02-16

    Application No.: US17444890

    Filing date: 2021-08-11

    Applicant: Google LLC

    Abstract: According to a general aspect, a method can include receiving a photo of a virtual conference participant, and a depth map based on the photo, and generating a plurality of synthesized images based on the photo. The plurality of synthesized images can have respective simulated gaze directions of the virtual conference participant. The method can also include receiving, during a virtual conference, an indication of a current gaze direction of the virtual conference participant. The method can further include animating, in a display of the virtual conference, an avatar corresponding with the virtual conference participant. The avatar can be based on the photo. Animating the avatar can be based on the photo, the depth map and at least one synthesized image of the plurality of synthesized images, the at least one synthesized image corresponding with the current gaze direction.

    Visual Programming Platform Featuring Machine Learning for Automated Code Development

    Publication No.: US20250094137A1

    Publication date: 2025-03-20

    Application No.: US18468025

    Filing date: 2023-09-15

    Applicant: Google LLC

    Abstract: A visual programming platform can leverage a machine learning-based coding system to generate an initial set of programming-language code for further graphical editing by a human user. As an example, the visual programming platform can obtain a natural language description of a task to be performed by a computational pipeline. The visual programming platform can process the natural language description of the task with a machine learning coding system that includes one or more machine-learned models to generate, as an output of the machine learning coding system, a set of pseudocode that describes performance of the task. The platform can process the set of pseudocode that describes performance of the task with a compiler to generate a set of programming-language code that defines the computational pipeline for performing the task. The visual programming platform can generate a graphical visualization of the computational pipeline defined by the set of programming-language code.

    GAZE-MEDIATED AUGMENTED REALITY INTERACTION WITH SOURCES OF SOUND IN AN ENVIRONMENT

    Publication No.: US20250054246A1

    Publication date: 2025-02-13

    Application No.: US18707075

    Filing date: 2022-10-14

    Applicant: Google LLC

    Abstract: A user can interact with sounds and speech in an environment using an augmented reality device. The augmented reality device can be configured to identify objects in the environment and display, beside each object, messages related to sounds produced by that object. For example, the messages may include sound statistics, transcripts of speech, and/or sound detection events. The disclosed approach enables a user to interact with these messages using a gaze and a gesture.

    Nonlinear Peri-Codec Optimization For Image And Video Coding

    Publication No.: US20250045968A1

    Publication date: 2025-02-06

    Application No.: US18570562

    Filing date: 2021-06-16

    Applicant: Google LLC

    Abstract: Nonlinear peri-codec optimization for image and video coding includes obtaining a source image including pixel values expressed in a first defined image sample space, generating a neuralized image representing the source image, the neuralized image including pixel values that are expressed as neural latent space values, encoding the input image wherein the neural latent space values are used as pixel values in a second defined image sample space and the input image is in an operative image format of the encoder, such that a decoder decodes the encoded image to obtain a reconstructed image in the second defined image sample space, wherein the reconstructed image is a reconstructed neuralized image including reconstructed neural latent space values, such that a deneuralized reconstructed image corresponding to the source image is obtained by a nonlinear post-codec image processor in the first defined image sample space.

    CONTEXT-AIDED IDENTIFICATION

    Publication No.: US20230136553A1

    Publication date: 2023-05-04

    Application No.: US18050329

    Filing date: 2022-10-27

    Applicant: GOOGLE LLC

    Abstract: Smart devices can be configured to collect and share various forms of context data about where a user is located (e.g., location), what a user will be doing (e.g., schedule), and what a user is currently doing (e.g., activity). This context data may be combined with fingerprint data (e.g., biometrics) to help identify the fingerprint data. For example, a location of a user may help associate speech detected at that location with the user. These associations may be stored in a mapping database that can be updated over time to reduce ambiguities in identification. The mappings in the database may be used to train a machine learning model to recognize fingerprints as identities, which may be useful in applications such as speaker identification.
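
    One way to picture the mapping database described in this abstract is as a running tally of which users the context data placed near each fingerprint. The sketch below is a simplified illustration under that assumption; the `MappingDatabase` class, its methods, and the vote-counting rule are hypothetical and are not taken from the patent.

```python
from collections import Counter, defaultdict
from typing import Dict, Iterable, Optional


class MappingDatabase:
    """Hypothetical fingerprint-to-identity store; a sketch, not the patented design."""

    def __init__(self) -> None:
        # fingerprint id -> how often each candidate user co-occurred with it
        self._votes: Dict[str, Counter] = defaultdict(Counter)

    def add_observation(self, fingerprint_id: str, candidate_users: Iterable[str]) -> None:
        """Record users whom context data (location, schedule, activity) placed nearby."""
        for user in candidate_users:
            self._votes[fingerprint_id][user] += 1

    def identify(self, fingerprint_id: str) -> Optional[str]:
        """Return the best-supported user, or None while the mapping is still ambiguous."""
        votes = self._votes.get(fingerprint_id)
        if not votes:
            return None
        ranked = votes.most_common(2)
        if len(ranked) == 2 and ranked[0][1] == ranked[1][1]:
            return None  # tie between the top candidates: not enough context yet
        return ranked[0][0]


db = MappingDatabase()
db.add_observation("voice-1", ["alice", "bob"])    # both were scheduled in the room
print(db.identify("voice-1"))                      # ambiguous after one observation
db.add_observation("voice-1", ["alice", "carol"])  # a later meeting narrows it down
print(db.identify("voice-1"))
```

    Repeated observations in different contexts shrink the set of plausible identities, which is the sense in which the database can "reduce ambiguities" over time. The resolved pairs could then serve as training labels for a recognition model.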

    RESPONSE TO SOUNDS IN AN ENVIRONMENT BASED ON CORRELATED AUDIO AND USER EVENTS

    Publication No.: US20230132041A1

    Publication date: 2023-04-27

    Application No.: US18047494

    Filing date: 2022-10-18

    Applicant: GOOGLE LLC

    Abstract: The disclosed systems and methods correlate user behaviors with audio processing to achieve more accurate conclusions about sounds in a user's environment. These conclusions may, in turn, be used to adjust the way a device, such as AR glasses, operates or responds to the sounds. For example, audio events determined from processing speech can be correlated with behavior events determined by sensing a user to improve a speech-to-text transcript of the speech by separating, or otherwise altering, the text in the transcript by speaker.
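
    The transcript example in this abstract can be illustrated by overlapping two timelines: speech segments from audio processing and intervals during which sensors indicate the wearer is talking. The sketch below is a minimal illustration under assumptions of our own: the `SpeechSegment` type, the `label_by_speaker` function, the 50% overlap threshold, and the two-way "user"/"other" labeling are hypothetical, and the sensed intervals are assumed not to overlap one another.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class SpeechSegment:
    start: float  # seconds
    end: float    # seconds
    text: str     # speech-to-text output for this segment


def overlap(a_start: float, a_end: float, b_start: float, b_end: float) -> float:
    """Length in seconds of the intersection of two time intervals (0 if disjoint)."""
    return max(0.0, min(a_end, b_end) - max(a_start, b_start))


def label_by_speaker(
    segments: List[SpeechSegment],
    user_talking: List[Tuple[float, float]],
    min_fraction: float = 0.5,
) -> List[Tuple[str, str]]:
    """Attribute each segment to the wearer if sensed talking covers enough of it."""
    labeled = []
    for seg in segments:
        duration = seg.end - seg.start
        covered = sum(overlap(seg.start, seg.end, s, e) for s, e in user_talking)
        is_user = duration > 0 and covered / duration >= min_fraction
        labeled.append(("user" if is_user else "other", seg.text))
    return labeled


segments = [SpeechSegment(0.0, 2.0, "Hi there"), SpeechSegment(2.5, 4.0, "Hello")]
behavior = [(0.1, 1.9)]  # interval in which a sensor suggests the wearer was speaking
for speaker, text in label_by_speaker(segments, behavior):
    print(f"{speaker}: {text}")
```

    The threshold trades off missed attributions against false ones. A fuller system would likely handle more than two speakers and use behavior cues beyond a single talking signal.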
