On-The-Fly Feeding of Personalized or Domain-Specific Submodels

    公开(公告)号:US20220414542A1

    公开(公告)日:2022-12-29

    申请号:US17851712

    申请日:2022-06-28

    Applicant: Google LLC

    Abstract: The present disclosure relates generally to machine learning. More particularly, the present disclosure relates to on-the-fly feeding of personalized, domain-specific, context-specific, and/or task-specific submodels as input to an existing base model which has already been loaded into a memory (e.g., loaded into an existing session associated with execution of a machine learning library).

    Synthesized Data Augmentation Using Voice Conversion and Speech Recognition Models

    公开(公告)号:US20220068257A1

    公开(公告)日:2022-03-03

    申请号:US17008278

    申请日:2020-08-31

    Applicant: Google LLC

    Abstract: A method for training a speech conversion model personalized for a target speaker with atypical speech includes obtaining a plurality of transcriptions in a set of spoken training utterances and obtaining a plurality of unspoken training text utterances. Each spoken training utterance is spoken by a target speaker associated with atypical speech and includes a corresponding transcription paired with a corresponding non-synthetic speech representation. The method also includes adapting, using the set of spoken training utterances, a text-to-speech (TTS) model to synthesize speech in a voice of the target speaker and that captures the atypical speech. For each unspoken training text utterance, the method also includes generating, as output from the adapted TTS model, a synthetic speech representation that includes the voice of the target speaker and that captures the atypical speech. The method also includes training the speech conversion model based on the synthetic speech representations.

    LANGUAGE MODELS USING DOMAIN-SPECIFIC MODEL COMPONENTS

    公开(公告)号:US20210020170A1

    公开(公告)日:2021-01-21

    申请号:US17060347

    申请日:2020-10-01

    Applicant: Google LLC

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for language models using domain-specific model components. In some implementations, context data for an utterance is obtained. A domain-specific model component is selected from among multiple domain-specific model components of a language model based on the non-linguistic context of the utterance. A score for a candidate transcription for the utterance is generated using the selected domain-specific model component and a baseline model component of the language model that is domain-independent. A transcription for the utterance is determined using the score the transcription is provided as output of an automated speech recognition system.

Patent Agency Ranking