Self-supervised audio representation learning for mobile devices

    Publication number: US12165663B2

    Publication date: 2024-12-10

    Application number: US17986477

    Filing date: 2022-11-14

    Applicant: Google LLC

    Abstract: Systems and methods for training a machine-learned model are provided. A method can include obtaining an unlabeled audio signal, sampling the unlabeled audio signal to select one or more sampled slices, inputting the one or more sampled slices into a machine-learned model, receiving, as an output of the machine-learned model, one or more determined characteristics associated with the audio signal, determining a loss function for the machine-learned model based at least in part on a difference between the one or more determined characteristics and one or more corresponding ground truth characteristics of the audio signal, and training the machine-learned model from end to end based at least in part on the loss function. The one or more determined characteristics can include one or more reconstructed portions of the audio signal temporally adjacent to the one or more sampled slices or an estimated distance between two sampled slices.
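The "estimated distance between two sampled slices" objective can be illustrated with a minimal Python sketch. It assumes a squared-error loss and a gap normalized to [0, 1]; the abstract specifies neither. The names `sample_slice_pair` and `squared_error` are hypothetical, and the model itself is replaced by a fixed placeholder prediction.

```python
import random


def sample_slice_pair(signal, slice_len, rng):
    """Pick two slices from an unlabeled signal and return them with their gap.

    The gap between the two start offsets acts as a free "ground truth"
    label: it comes from the sampling step, not from human annotation.
    """
    max_start = len(signal) - slice_len
    i = rng.randint(0, max_start)
    j = rng.randint(0, max_start)
    gap = abs(i - j) / max_start  # assumed normalization to [0, 1]
    return signal[i:i + slice_len], signal[j:j + slice_len], gap


def squared_error(predicted_gap, true_gap):
    """Assumed loss: squared difference between predicted and true gap."""
    return (predicted_gap - true_gap) ** 2


rng = random.Random(0)
signal = [float(n % 7) for n in range(1000)]  # stand-in for audio samples
slice_a, slice_b, true_gap = sample_slice_pair(signal, 100, rng)

# A real model would map (slice_a, slice_b) to a predicted gap; 0.5 is a
# placeholder standing in for that output.
loss = squared_error(0.5, true_gap)
```

In an actual training loop the loss would be backpropagated through the whole model, which is what the abstract means by training "from end to end".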

Information processing device and method, and program

    Publication number: US11790925B2

    Publication date: 2023-10-17

    Application number: US17255191

    Filing date: 2019-06-20

    CPC classification number: G10L19/035 G10L21/00

    Abstract: The present technology relates to an information processing device, a method, and a program that can reduce the code amount.
    The information processing device includes: an acquisition unit that acquires space information regarding a position and a size of a child space within a parent space and position information in the child space indicating a position of an object within the child space, the child space being included in the parent space, and the object being included in the child space; and a calculation unit that calculates position information in the parent space indicating a position of the object within the parent space on the basis of the space information and the position information in the child space. The present technology can be applied to a signal processing device.
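The calculation described above, mapping a position inside the child space to a position in the parent space, can be sketched as follows. This is only an illustration under assumptions the abstract does not state: the child space is an axis-aligned box, `origin` is its corner expressed in parent coordinates, and the in-child position is normalized to [0, 1] per axis. `ChildSpace` and `to_parent` are hypothetical names.

```python
from dataclasses import dataclass


@dataclass
class ChildSpace:
    """Space information: where the child space sits in the parent, and how big it is."""
    origin: tuple  # corner of the child space, in parent coordinates (assumed)
    size: tuple    # extent of the child space along each axis


def to_parent(space, local):
    """Convert a normalized in-child position to parent-space coordinates."""
    return tuple(o + s * p for o, s, p in zip(space.origin, space.size, local))


space = ChildSpace(origin=(10.0, 0.0, -5.0), size=(4.0, 2.0, 8.0))
position = to_parent(space, (0.5, 0.25, 1.0))
print(position)  # (12.0, 0.5, 3.0)
```

One plausible reading of the code-amount reduction is that a position quantized within the smaller child space needs fewer bits than the same position quantized over the whole parent space at equal resolution.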
