-
Publication number: US20240071373A1
Publication date: 2024-02-29
Application number: US18448628
Application date: 2023-08-11
Applicant: Tata Consultancy Services Limited
Inventor: Ashish Panda, Sunil Kumar Kopparapu, Aditya Raikar, Meetkumar Hemakshu Soni
IPC: G10L15/16
CPC classification number: G10L15/16
Abstract: State-of-the-art acoustic models (AMs) that are trained using data from one environment may fail to adapt to another environment, which restricts their application. The disclosure herein relates generally to speech signal processing and, more particularly, to a method and system for Automatic Speech Recognition (ASR) using Multi-Task Learned (MTL) embeddings. In this approach, MTL embeddings are extracted from an MTL neural network that has been trained using feature vectors from a plurality of speech files. The MTL embeddings are then used to generate an acoustic model, which may then be used, along with the feature vectors and the MTL embeddings, for Automatic Speech Recognition.
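The data flow in this abstract can be illustrated with a minimal sketch. It is not the patented method: it assumes, purely for illustration, that the embedding is the activation of a layer shared by the MTL network's tasks and that the acoustic model consumes each feature vector concatenated with its embedding. The weights are random rather than trained, and all names and sizes (`FEAT_DIM`, `EMBED_DIM`, `extract_mtl_embedding`, `acoustic_model_input`) are hypothetical.

```python
import math
import random

random.seed(0)

FEAT_DIM = 4   # hypothetical size of one speech feature vector
EMBED_DIM = 3  # hypothetical size of the MTL embedding


def random_matrix(rows, cols):
    """Small random weights standing in for a trained shared layer."""
    return [[random.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]


def affine(vec, weights):
    """Multiply a row vector by a (len(vec) x n) weight matrix."""
    return [sum(v * w for v, w in zip(vec, col)) for col in zip(*weights)]


def extract_mtl_embedding(frame, shared_w):
    """Assumption: the embedding is the tanh activation of the shared layer."""
    return [math.tanh(x) for x in affine(frame, shared_w)]


def acoustic_model_input(frame, embedding):
    """Assumption: the acoustic model sees the features and embedding concatenated."""
    return frame + embedding


shared_w = random_matrix(FEAT_DIM, EMBED_DIM)
frame = [0.5, -0.2, 0.1, 0.9]  # one made-up feature vector
embedding = extract_mtl_embedding(frame, shared_w)
am_input = acoustic_model_input(frame, embedding)
print(len(am_input))
```

In a real system the shared layer would be trained jointly on several tasks before embeddings are extracted; the sketch only shows where the embedding comes from and how it could accompany the original features.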
-
Publication number: US20170270952A1
Publication date: 2017-09-21
Application number: US15444759
Application date: 2017-02-28
Applicant: Tata Consultancy Services Limited
Inventor: Ashish Panda, Sunil Kumar Kopparapu
IPC: G10L25/84, G10L21/0232
CPC classification number: G10L15/20, G10L15/02, G10L17/02, G10L17/20, G10L21/0208
Abstract: A method and system are provided for estimating clean speech parameters from noisy speech parameters. The method comprises acquiring speech signals, estimating noise from the acquired speech signals, computing speech features from the acquired speech signals, estimating model parameters from the computed speech features, and estimating clean parameters from the estimated noise and the estimated model parameters.
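The sequence of steps in this abstract can be sketched as follows. The abstract does not say how each estimate is computed, so this sketch swaps in simple stand-ins and should not be read as the patented method: per-frame power as the speech feature, the average power of the leading frames (assumed to contain no speech) as the noise estimate, the mean feature as the single model parameter, and power-domain subtraction for the clean parameter. All function names and the toy signal are hypothetical.

```python
def frame_power(signal, frame_len):
    """Split a signal into non-overlapping frames and return mean power per frame."""
    n_frames = len(signal) // frame_len
    return [
        sum(s * s for s in signal[i * frame_len:(i + 1) * frame_len]) / frame_len
        for i in range(n_frames)
    ]


def estimate_noise_power(powers, n_noise_frames):
    """Assumption: the first n_noise_frames frames contain noise only."""
    lead = powers[:n_noise_frames]
    return sum(lead) / len(lead)


def estimate_model_mean(powers):
    """Stand-in model parameter: the mean of the noisy features."""
    return sum(powers) / len(powers)


def estimate_clean_mean(noisy_mean, noise_power, floor=1e-8):
    """Subtract the noise estimate in the power domain, floored to stay positive."""
    return max(noisy_mean - noise_power, floor)


# Toy signal: eight low-level samples followed by eight louder samples.
signal = [0.1, -0.1] * 4 + [1.0, -1.0] * 4
powers = frame_power(signal, frame_len=4)
noise_power = estimate_noise_power(powers, n_noise_frames=2)
noisy_mean = estimate_model_mean(powers)
clean_mean = estimate_clean_mean(noisy_mean, noise_power)
print(noise_power, noisy_mean, clean_mean)
```

The floor in `estimate_clean_mean` keeps the result positive when the noise estimate exceeds the noisy parameter, a common precaution with subtraction-based compensation.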
-