Invention Publication
- Patent Title: METHOD AND SYSTEM FOR AUTOMATIC SPEECH RECOGNITION (ASR) USING MULTI-TASK LEARNED (MTL) EMBEDDINGS
- Application No.: US18448628
- Application Date: 2023-08-11
- Publication No.: US20240071373A1
- Publication Date: 2024-02-29
- Inventor: ASHISH PANDA, SUNIL KUMAR KOPPARAPU, ADITYA RAIKAR, MEETKUMAR HEMAKSHU SONI
- Applicant: Tata Consultancy Services Limited
- Applicant Address: IN Mumbai
- Assignee: Tata Consultancy Services Limited
- Current Assignee: Tata Consultancy Services Limited
- Current Assignee Address: IN Mumbai
- Priority: IN 2221048968 2022.08.26
- Main IPC: G10L15/16
- IPC: G10L15/16

Abstract:
State-of-the-art Acoustic Models (AM), which are trained using data from one environment, may fail to adapt to another environment, which restricts their application. The disclosure herein generally relates to speech signal processing and, more particularly, to a method and system for Automatic Speech Recognition (ASR) using Multi-Task Learned (MTL) embeddings. In this approach, MTL embeddings are extracted from an MTL neural network that has been trained using feature vectors from a plurality of speech files. The MTL embeddings are then used to generate an acoustic model, which may then be used, along with the feature vectors and the MTL embeddings, for Automatic Speech Recognition.
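
The data flow described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the abstract does not name the tasks, the network layout, or the layer the embeddings come from. The sketch assumes a single shared hidden layer whose activations serve as the MTL embedding, two hypothetical task heads, randomly initialised weights standing in for a trained network, and simple concatenation of feature vectors with embeddings as the acoustic model input. All names and dimensions (`FEAT_DIM`, `EMB_DIM`, `mtl_forward`, `acoustic_model_input`) are illustrative.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes; the abstract does not specify any of these.
FEAT_DIM = 40   # per-frame feature vector size
EMB_DIM = 16    # size of the shared (embedding) layer
N_TASK_A = 10   # classes for a first, unnamed task
N_TASK_B = 3    # classes for a second, unnamed task

# Random weights stand in for a trained MTL network (assumption).
W_shared = rng.standard_normal((FEAT_DIM, EMB_DIM)) * 0.1
W_task_a = rng.standard_normal((EMB_DIM, N_TASK_A)) * 0.1
W_task_b = rng.standard_normal((EMB_DIM, N_TASK_B)) * 0.1


def softmax(z):
    """Row-wise softmax with max subtraction for numerical stability."""
    z = z - z.max(axis=1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)


def mtl_forward(features):
    """Return the shared-layer embedding and both task posteriors."""
    embedding = np.tanh(features @ W_shared)
    probs_a = softmax(embedding @ W_task_a)
    probs_b = softmax(embedding @ W_task_b)
    return embedding, probs_a, probs_b


def acoustic_model_input(features):
    """Concatenate each feature vector with its MTL embedding."""
    embedding, _, _ = mtl_forward(features)
    return np.concatenate([features, embedding], axis=1)


# Five synthetic frames in place of features from real speech files.
frames = rng.standard_normal((5, FEAT_DIM))
am_input = acoustic_model_input(frames)
print(am_input.shape)  # (5, 56)
```

In this sketch the task heads matter only during training of the MTL network; at recognition time only the shared-layer output is kept and appended to the features.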