Invention Grant
- Patent Title: Augmented training data for end-to-end models
- Application No.: US17124341
- Application Date: 2020-12-16
- Publication No.: US11862144B2
- Publication Date: 2024-01-02
- Inventor: Rui Zhao, Jinyu Li, Yifan Gong
- Applicant: Microsoft Technology Licensing, LLC
- Applicant Address: US WA Redmond
- Assignee: Microsoft Technology Licensing, LLC
- Current Assignee: Microsoft Technology Licensing, LLC
- Current Assignee Address: US WA Redmond
- Agency: Alleman Hall Creasman & Tuttle LLP
- Main IPC: G10L15/06
- IPC: G10L15/06; G10L13/07; G10L15/19; G10L15/26

Abstract:
A computer system is provided that includes a processor configured to store a set of audio training data that includes a plurality of audio segments and metadata indicating a word or phrase associated with each audio segment. For a target training statement of a set of structured text data, the processor is configured to generate a concatenated audio signal that matches a word content of a target training statement by comparing the words or phrases of a plurality of text segments of the target training statement to respective words or phrases of audio segments of the stored set of audio training data, selecting a plurality of audio segments from the set of audio training data based on a match in the words or phrases between the plurality of text segments of the target training statement and the selected plurality of audio segments, and concatenating the selected plurality of audio segments.
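The compare, select, and concatenate steps described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. The names `build_index` and `concatenate_for_statement`, the greedy longest-phrase matching, the random choice among candidate segments, and the representation of audio as plain lists of samples are all assumptions made for illustration.

```python
import random
from typing import Dict, List, Optional, Sequence, Tuple

AudioSegment = List[float]  # assumption: audio held as a plain list of samples


def build_index(
    segments: Sequence[Tuple[str, AudioSegment]],
) -> Dict[str, List[AudioSegment]]:
    """Map each word or phrase (the metadata) to its stored audio segments."""
    index: Dict[str, List[AudioSegment]] = {}
    for phrase, samples in segments:
        index.setdefault(phrase.lower(), []).append(samples)
    return index


def concatenate_for_statement(
    statement: str,
    index: Dict[str, List[AudioSegment]],
    rng: Optional[random.Random] = None,
    max_phrase_len: int = 3,
) -> AudioSegment:
    """Build one audio signal whose word content matches `statement`."""
    if rng is None:
        rng = random.Random(0)
    words = statement.lower().split()
    signal: AudioSegment = []
    i = 0
    while i < len(words):
        # Assumption: prefer the longest stored phrase starting at word i.
        for n in range(min(max_phrase_len, len(words) - i), 0, -1):
            phrase = " ".join(words[i:i + n])
            candidates = index.get(phrase)
            if candidates:
                # Assumption: pick any matching segment at random.
                signal.extend(rng.choice(candidates))
                i += n
                break
        else:
            raise KeyError(f"no stored audio segment for {words[i]!r}")
    return signal


# Hypothetical toy data: (word or phrase, samples) pairs.
stored = [
    ("turn on", [0.1, 0.2]),
    ("the", [0.3]),
    ("lights", [0.4, 0.5]),
    ("turn", [9.0]),
]
audio = concatenate_for_statement("Turn on the lights", build_index(stored))
```

In this sketch the phrase "turn on" is matched as a single segment ahead of the single word "turn", and a statement containing a word with no stored segment raises `KeyError` rather than producing a partial signal.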
Public/Granted literature
- US20220189461A1, AUGMENTED TRAINING DATA FOR END-TO-END MODELS, Public/Granted day: 2022-06-16