Invention Publication
- Patent Title: METHOD AND SYSTEM FOR UNSUPERVISED DISCOVERY OF UNIGRAMS IN SPEECH RECOGNITION SYSTEMS
-
Application No.: US17520816Application Date: 2021-11-08
-
Publication No.: US20230144379A1Publication Date: 2023-05-11
- Inventor: LEV HAIKIN , ARNON MAZZA , EYAL ORBACH , AVRAHAM FAIZAKOF
- Applicant: GENESYS CLOUD SERVICES, INC.
- Applicant Address: US CA Daly City
- Assignee: GENESYS CLOUD SERVICES, INC.
- Current Assignee: GENESYS CLOUD SERVICES, INC.
- Current Assignee Address: US CA Daly City
- Main IPC: G10L15/197
- IPC: G10L15/197 ; G10L15/06 ; G10L15/22 ; G10L15/10 ; G06N20/00

Abstract:
A system and method of automatically discovering unigrams in a speech data element may include receiving a language model that includes a plurality of n-grams, where each n-gram includes one or more unigrams; applying an acoustic machine-learning (ML) model on one or more speech data elements to obtain a character distribution function; applying a greedy decoder on the character distribution function, to predict an initial corpus of unigrams; filtering out one or more unigrams of the initial corpus to obtain a corpus of candidate unigrams, where the candidate unigrams are not included in the language model; analyzing the one or more first speech data elements, to extract at least one n-gram that comprises a candidate unigram; and updating the language model to include the extracted at least one n-gram.
Public/Granted literature
- US11984116B2 Method and system for unsupervised discovery of unigrams in speech recognition systems Public/Granted day:2024-05-14
Information query