Object oriented image editing
    1.
    发明授权

    公开(公告)号:US10198820B1

    公开(公告)日:2019-02-05

    申请号:US15420652

    申请日:2017-01-31

    Applicant: Google LLC

    Abstract: Implementations generally relate to object based image editing. In some implementations, a method includes segmenting an image into object data by identifying one or more object classifications in the image and storing at least one locator for one or more regions of the image corresponding to each instance of the object classification. The method further includes receiving a selection of a representative portion of the segmented image from a user, and matching the representative portion with the object data to determine at least one matched object classification associated with the representative portion. The method further includes presenting the user with one or more of the matched object classifications for the user to instruct one or more edit operations to be applied to at least one object represented by the matched object classification.

    Multi-Channel Voice Activity Detection
    2.
    发明公开

    公开(公告)号:US20240013772A1

    公开(公告)日:2024-01-11

    申请号:US18471627

    申请日:2023-09-21

    Applicant: Google LLC

    CPC classification number: G10L15/02 H04R3/005

    Abstract: A method for multi-channel voice activity detection includes receiving a sequence of input frames characterizing streaming multi-channel audio captured by an array of microphones. Each channel of the streaming multi-channel audio includes respective audio features captured by a separate dedicated microphone. The method also includes determining, using a location fingerprint model, a location fingerprint indicating a location of a source of the multi-channel audio relative to the user device based on the respective audio features of each channel of the multi-channel audio. The method also includes generating an output from an application-specific classifier. The first score indicates a likelihood that the multi-channel audio corresponds to a particular audio type that the particular application is configured to process. The method also includes determining whether to accept or reject the multi-channel audio for processing by the particular application based on the first score generated as output from the application-specific classifier.

    Multi-channel voice activity detection

    公开(公告)号:US12154547B2

    公开(公告)日:2024-11-26

    申请号:US18471627

    申请日:2023-09-21

    Applicant: Google LLC

    Abstract: A method for multi-channel voice activity detection includes receiving a sequence of input frames characterizing streaming multi-channel audio captured by an array of microphones. Each channel of the streaming multi-channel audio includes respective audio features captured by a separate dedicated microphone. The method also includes determining, using a location fingerprint model, a location fingerprint indicating a location of a source of the multi-channel audio relative to the user device based on the respective audio features of each channel of the multi-channel audio. The method also includes generating an output from an application-specific classifier. The first score indicates a likelihood that the multi-channel audio corresponds to a particular audio type that the particular application is configured to process. The method also includes determining whether to accept or reject the multi-channel audio for processing by the particular application based on the first score generated as output from the application-specific classifier.

    Multi channel voice activity detection

    公开(公告)号:US11380302B2

    公开(公告)日:2022-07-05

    申请号:US17077679

    申请日:2020-10-22

    Applicant: Google LLC

    Abstract: A method for multi-channel voice activity detection includes receiving a sequence of input frames characterizing streaming multi-channel audio captured by an array of microphones. Each channel of the streaming multi-channel audio includes respective audio features captured by a separate dedicated microphone. The method also includes determining, using a location fingerprint model, a location fingerprint indicating a location of a source of the multi-channel audio relative to the user device based on the respective audio features of each channel of the multi-channel audio. The method also includes generating an output from an application-specific classifier. The first score indicates a likelihood that the multi-channel audio corresponds to a particular audio type that the particular application is configured to process. The method also includes determining whether to accept or reject the multi-channel audio for processing by the particular application based on the first score generated as output from the application-specific classifier.

    Multi channel voice activity detection

    公开(公告)号:US11790888B2

    公开(公告)日:2023-10-17

    申请号:US17806198

    申请日:2022-06-09

    Applicant: Google LLC

    CPC classification number: G10L15/02 H04R3/005

    Abstract: A method for multi-channel voice activity detection includes receiving a sequence of input frames characterizing streaming multi-channel audio captured by an array of microphones. Each channel of the streaming multi-channel audio includes respective audio features captured by a separate dedicated microphone. The method also includes determining, using a location fingerprint model, a location fingerprint indicating a location of a source of the multi-channel audio relative to the user device based on the respective audio features of each channel of the multi-channel audio. The method also includes generating an output from an application-specific classifier. The first score indicates a likelihood that the multi-channel audio corresponds to a particular audio type that the particular application is configured to process. The method also includes determining whether to accept or reject the multi-channel audio for processing by the particular application based on the first score generated as output from the application-specific classifier.

    Multi Channel Voice Activity Detection

    公开(公告)号:US20220310060A1

    公开(公告)日:2022-09-29

    申请号:US17806198

    申请日:2022-06-09

    Applicant: Google LLC

    Abstract: A method for multi-channel voice activity detection includes receiving a sequence of input frames characterizing streaming multi-channel audio captured by an array of microphones. Each channel of the streaming multi-channel audio includes respective audio features captured by a separate dedicated microphone. The method also includes determining, using a location fingerprint model, a location fingerprint indicating a location of a source of the multi-channel audio relative to the user device based on the respective audio features of each channel of the multi-channel audio. The method also includes generating an output from an application-specific classifier. The first score indicates a likelihood that the multi-channel audio corresponds to a particular audio type that the particular application is configured to process. The method also includes determining whether to accept or reject the multi-channel audio for processing by the particular application based on the first score generated as output from the application-specific classifier.

    Multi Channel Voice Activity Detection

    公开(公告)号:US20220130375A1

    公开(公告)日:2022-04-28

    申请号:US17077679

    申请日:2020-10-22

    Applicant: Google LLC

    Abstract: A method for multi-channel voice activity detection includes receiving a sequence of input frames characterizing streaming multi-channel audio captured by an array of microphones. Each channel of the streaming multi-channel audio includes respective audio features captured by a separate dedicated microphone. The method also includes determining, using a location fingerprint model, a location fingerprint indicating a location of a source of the multi-channel audio relative to the user device based on the respective audio features of each channel of the multi-channel audio. The method also includes generating an output from an application-specific classifier. The first score indicates a likelihood that the multi-channel audio corresponds to a particular audio type that the particular application is configured to process. The method also includes determining whether to accept or reject the multi-channel audio for processing by the particular application based on the first score generated as output from the application-specific classifier.

Patent Agency Ranking