VOICE SHORTCUT DETECTION WITH SPEAKER VERIFICATION

    公开(公告)号:US20240363122A1

    公开(公告)日:2024-10-31

    申请号:US18765108

    申请日:2024-07-05

    申请人: GOOGLE LLC

    摘要: Techniques disclosed herein are directed towards streaming keyphrase detection which can be customized to detect one or more particular keyphrases, without requiring retraining of any model(s) for those particular keyphrase(s). Many implementations include processing audio data using a speaker separation model to generate separated audio data which isolates an utterance spoken by a human speaker from one or more additional sounds not spoken by the human speaker, and processing the separated audio data using a text independent speaker identification model to determine whether a verified and/or registered user spoke a spoken utterance captured in the audio data. Various implementations include processing the audio data and/or the separated audio data using an automatic speech recognition model to generate a text representation of the utterance. Additionally or alternatively, the text representation of the utterance can be processed to determine whether at least a portion of the text representation of the utterance captures a particular keyphrase. When the system determines the registered and/or verified user spoke the utterance and the system determines the text representation of the utterance captures the particular keyphrase, the system can cause a computing device to perform one or more actions corresponding to the particular keyphrase.

    AUDIO SIGNAL PROCESSING METHOD AND APPARATUS, DEVICE AND STORAGE MEDIUM

    公开(公告)号:US20240355335A1

    公开(公告)日:2024-10-24

    申请号:US18685019

    申请日:2022-11-08

    摘要: The present disclosure relates to an audio signal processing method and apparatus, a device and a storage medium. The present disclosure performs a segmenting processing on an audio signal to obtain multiple audio segments, performs a clustering processing on the multiple audio segments according to feature information of each audio segment in the multiple audio segments to obtain one or more first sets, determines a first cluster center of each first set according to the feature information of the audio segment included in each first set, and performs a clustering processing on the multiple audio segments according to the first cluster center of each first set to obtain one or more second sets, where audio segments in a same second set corresponding to a same role label. In this way, an accuracy of an unsupervised role separation based on a single channel speech is improved.

    Service authentication through a voice assistant

    公开(公告)号:US12063214B2

    公开(公告)日:2024-08-13

    申请号:US16799867

    申请日:2020-02-25

    申请人: VMware LLC

    摘要: Disclosed are various approaches for authenticating a user through a voice assistant device and creating an association between the device and a user account. The request is associated with a network or federated service. The user can use a client device, such as a smartphone, to initiate an authentication flow. A passphrase is provided to the client device can captured by the client device and a voice assistant device. Audio captured by the client device and voice assistant device can be sent to an assistant connection service. The passphrase and an audio signature calculated from the audio can be validated. An association between the user account and the voice assistant device can then be created.

    COMMUNICATIONS AND CONTENT PLATFORM
    10.
    发明公开

    公开(公告)号:US20240249727A1

    公开(公告)日:2024-07-25

    申请号:US18605696

    申请日:2024-03-14

    摘要: A system and method that overcomes technological hurdles related to litigation-related management is disclosed. The technological hurdles were overcome with industry-transformative innovations in in-person, hybrid, and remote legal proceedings; court reporting; testimony management; trial preparation; and utilization of video evidence, to name several. These innovations resulted in many advantages, such as could-based testimony management, scalable digital transformation, dramatic savings in litigation costs, and fast turn-around on certified transcripts, to name several.