Voice-history Based Speech Biasing
    1.
    发明公开

    公开(公告)号:US20240194188A1

    公开(公告)日:2024-06-13

    申请号:US18063118

    申请日:2022-12-08

    Applicant: Google LLC

    CPC classification number: G10L15/07 G10L15/30

    Abstract: A method of using voice query history to improve speech recognition includes receiving audio data corresponding to a current query spoken by a user and processing the audio data to generate a lattice of candidate hypotheses. The method also includes obtaining voice query history data associated with the user that includes n-grams extracted from transcriptions of previous queries spoken by the user, and generating, using a biasing context model configured to receive the voice query history data, a biasing context vector. The biasing context vector indicates a likelihood that each n-gram from the n-grams extracted from the transcriptions of the previous queries spoken by the user will appear in the current query. The method also includes augmenting the lattice of candidate hypotheses based on the biasing context vector and determining a transcription for the current query based on the augmented lattice of candidate hypotheses.

    Training Speech Recognizers Based On Biased Transcriptions

    公开(公告)号:US20240257799A1

    公开(公告)日:2024-08-01

    申请号:US18161608

    申请日:2023-01-30

    Applicant: Google LLC

    Abstract: A method includes receiving a biased transcription for a voice command spoken by a user and captured by a user device, the biased transcription biased to include a biasing phrase from a set of biasing phrases specific to the user. The method also includes instructing an application executing on the user device to perform an action specified by the biased transcription for the voice command, and receiving one or more user behavior signals responsive to the application performing the action specified by the biased transcription. The method further includes generating, as output from a confidence model, a confidence score of the biased transcription based on the one or more user behavior signals input to the confidence model and, based on the confidence score output from the confidence model, training a speech recognizer on the biased transcription.

Patent Agency Ranking