Media arbitration
    31.
    Granted invention patent

    Publication No.: US11838582B1

    Publication date: 2023-12-05

    Application No.: US18064921

    Filing date: 2022-12-12

    Applicant: Google LLC

    CPC classification number: H04N21/44209 H04N21/25891 H04W4/029

    Abstract: A method using media arbitration includes, while a first assistant-enabled device is performing a first long-standing operation, determining that the first assistant-enabled device satisfies a co-presence condition with a second assistant-enabled device, and determining that the second assistant-enabled device is performing a second long-standing operation that conflicts with the first long-standing operation performed by the first assistant-enabled device. Based on determining that the first long-standing operation and the second long-standing operation conflict, the method also includes executing an operation arbitration routine to identify one or more compromise operations for at least one of the first assistant-enabled device or the second assistant-enabled device to perform, and instructing the first assistant-enabled device or the second assistant-enabled device to perform a selected compromise operation among the identified compromise operations.
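
The arbitration flow in this abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the `Operation` and `Compromise` types, the "both operations are audible" conflict test, the candidate actions, and the cost values are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Operation:
    device: str    # hypothetical device identifier
    kind: str      # e.g. "music", "video", "timer"
    audible: bool  # assumed conflict criterion: does it produce sound?


@dataclass(frozen=True)
class Compromise:
    device: str
    action: str
    cost: int      # hypothetical disruption score; lower is better


def conflicts(a: Operation, b: Operation) -> bool:
    # Assumption: two long-standing operations conflict when both are audible.
    return a.audible and b.audible


def arbitrate(first: Operation, second: Operation, co_present: bool):
    """Return the selected compromise, or None when no arbitration is needed."""
    if not (co_present and conflicts(first, second)):
        return None
    # Identify candidate compromise operations (illustrative values only).
    candidates = [
        Compromise(first.device, "lower_volume", 1),
        Compromise(second.device, "lower_volume", 2),
        Compromise(second.device, "pause", 3),
    ]
    # Select the least disruptive candidate.
    return min(candidates, key=lambda c: c.cost)
```

A real routine would presumably weigh user preferences and device context when scoring candidates; the fixed costs here only make the selection step concrete.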

    ADAPTING HOTWORD RECOGNITION BASED ON PERSONALIZED NEGATIVES

    Publication No.: US20230386468A1

    Publication date: 2023-11-30

    Application No.: US18446420

    Filing date: 2023-08-08

    Applicant: Google LLC

    Abstract: A method for adapting hotword recognition includes receiving audio data characterizing a hotword event detected by a first stage hotword detector in streaming audio captured by a user device. The method also includes processing, using a second stage hotword detector, the audio data to determine whether a hotword is detected by the second stage hotword detector in a first segment of the audio data. When the hotword is not detected by the second stage hotword detector, the method includes classifying the first segment of the audio data as containing a negative hotword that caused a false detection of the hotword event in the streaming audio by the first stage hotword detector. Based on the first segment of the audio data classified as containing the negative hotword, the method includes updating the first stage hotword detector to prevent triggering the hotword event in subsequent audio data that contains the negative hotword.
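
The two-stage cascade with personalized negatives can be sketched as follows. This is a toy model under stated assumptions: audio segments are represented as text strings, the first stage is a loose prefix match standing in for a small on-device model, the second stage is an exact match standing in for a stricter model, and "updating" the first stage means adding the segment to a block list. All names are hypothetical.

```python
class FirstStageDetector:
    """Permissive detector; stands in for a lightweight always-on model."""

    def __init__(self, hotword: str):
        self.hotword = hotword
        self.negatives = set()  # learned negative hotwords

    def detect(self, segment: str) -> bool:
        if segment in self.negatives:
            return False  # suppressed by an earlier update
        # Assumption: a loose prefix match models a permissive first stage.
        return segment.startswith(self.hotword[:4])

    def add_negative(self, segment: str) -> None:
        self.negatives.add(segment)


def second_stage_detect(segment: str, hotword: str) -> bool:
    # Assumption: an exact match models the stricter second stage.
    return segment == hotword


def process(segment: str, first: FirstStageDetector) -> str:
    if not first.detect(segment):
        return "ignored"
    if second_stage_detect(segment, first.hotword):
        return "hotword"
    # The first stage fired but the second stage disagreed: classify the
    # segment as a negative hotword and update the first stage.
    first.add_negative(segment)
    return "negative"
```

For example, with the hotword `"hey google"`, the segment `"hey goober"` triggers the first stage once, is classified as a negative, and is ignored on every later occurrence.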

    Correcting speech misrecognition of spoken utterances

    Publication No.: US11823664B2

    Publication date: 2023-11-21

    Application No.: US17982834

    Filing date: 2022-11-08

    Applicant: GOOGLE LLC

    CPC classification number: G10L15/08 G06F3/16 G10L13/02 G10L2015/088

    Abstract: Implementations can receive audio data corresponding to a spoken utterance of a user, process the audio data to generate a plurality of speech hypotheses, determine an action to be performed by an automated assistant based on the speech hypotheses, and cause the computing device to render an indication of the action. In response to the computing device rendering the indication, implementations can receive additional audio data corresponding to an additional spoken utterance of the user, process the additional audio data to determine that a portion of the spoken utterance is similar to an additional portion of the additional spoken utterance, supplant the action with an alternate action, and cause the automated assistant to initiate performance of the alternate action. Some implementations can determine whether to render the indication of the action based on a confidence level associated with the action.
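
A rough sketch of the correction step, assuming speech hypotheses are plain transcripts and that "similar" means sharing more words with the follow-up utterance than the originally chosen hypothesis does. The function names and the word-overlap measure are illustrative assumptions, not the method claimed in the patent.

```python
def word_overlap(a: str, b: str) -> int:
    # Number of distinct words the two transcripts share.
    return len(set(a.lower().split()) & set(b.lower().split()))


def correct_action(hypotheses, chosen, follow_up):
    """Return the hypothesis to act on after hearing a follow-up utterance.

    hypotheses: candidate transcripts of the original utterance
    chosen:     the hypothesis whose action was indicated to the user
    follow_up:  transcript of the additional spoken utterance
    """
    alternates = [h for h in hypotheses if h != chosen]
    if not alternates:
        return chosen
    best = max(alternates, key=lambda h: word_overlap(h, follow_up))
    # Supplant the original action only when an alternate matches the
    # follow-up better than the chosen hypothesis does.
    if word_overlap(best, follow_up) > word_overlap(chosen, follow_up):
        return best
    return chosen
```

With hypotheses `["call ron", "call don", "call dawn"]`, chosen hypothesis `"call ron"`, and follow-up `"no call don"`, the sketch switches to `"call don"`; an unrelated follow-up such as `"thanks"` leaves the original choice in place.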

    SYSTEMS AND METHODS FOR LIVE MEDIA CONTENT MATCHING

    Publication No.: US20230345061A1

    Publication date: 2023-10-26

    Application No.: US18208570

    Filing date: 2023-06-12

    Applicant: Google LLC

    Inventor: Matthew Sharifi

    Abstract: Systems and methods for matching live media content are disclosed. At a server, obtaining first media content from a client device, wherein the first media content corresponds to a portion of media content being played on the client device, and the first media content is associated with a predefined expiration time; obtaining second media content from one or more content feeds, wherein the second media content also corresponds to a portion of the media content being played on the client device; in accordance with a determination that the second media content corresponds to a portion of the media content that has been played on the client device: before the predefined expiration time, obtaining third media content corresponding to the media content being played on the client device, from the one or more content feeds; and comparing the first media content with the third media content.

    Hotphrase Triggering Based On A Sequence Of Detections

    Publication No.: US20230298588A1

    Publication date: 2023-09-21

    Application No.: US18323725

    Filing date: 2023-05-25

    Applicant: Google LLC

    Abstract: A method includes receiving audio data corresponding to an utterance spoken by a user and captured by a user device. The utterance includes a command for a digital assistant to perform an operation. The method also includes determining, using a hotphrase detector configured to detect each trigger word in a set of trigger words associated with a hotphrase, whether any of the trigger words in the set of trigger words are detected in the audio data during a corresponding fixed-duration time window. The method also includes identifying, in the audio corresponding to the utterance, the hotphrase when each other trigger word in the set of trigger words was also detected in the audio data. The method also includes triggering an automated speech recognizer to perform speech recognition on the audio data when the hotphrase is identified in the audio data corresponding to the utterance.
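
One possible reading of the sequence-of-detections idea is sketched below. It assumes the detector emits `(word, time_in_seconds)` pairs in time order, that trigger words must occur in a fixed order, and that each trigger word must follow the previous one within a fixed-duration window. The function name and the restart-on-expiry behavior are assumptions for illustration.

```python
def hotphrase_detected(detections, trigger_words, window_s):
    """Return True when all trigger words are detected in order, each
    within `window_s` seconds of the previous detection."""
    idx = 0        # index of the next trigger word expected
    last_t = None  # time of the most recent matched trigger word
    for word, t in detections:
        if idx > 0 and t - last_t > window_s:
            idx = 0  # window expired: start the sequence over
        if word == trigger_words[idx]:
            idx, last_t = idx + 1, t
            if idx == len(trigger_words):
                return True  # full hotphrase seen: trigger the recognizer
    return False
```

For the trigger words `["turn", "volume", "up"]` and a one-second window, detections at 0.0 s, 0.4 s, and 0.9 s identify the hotphrase, whereas a final detection at 3.0 s falls outside the window and does not.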
