Patent search ap:("AT&T Intellectual Property I, L.P.") AND inv:"Ilya Dan MELAMED"

1.

Invention application
MULTI-CHANNEL SPEECH RECOGNITION (under examination, published)
Publication (announcement) number: US20150149162A1

Publication (announcement) date: 2015-05-28

Application number: US14087885

Application date: 2013-11-22

Applicant: AT&T Intellectual Property I, L.P.

Inventor: Ilya Dan MELAMED, Andrej LJOLJE

IPC: G10L15/00

CPC classification number: G10L15/07, G10L15/20, G10L15/28, G10L2015/227

Abstract: Disclosed herein are systems, methods, and computer-readable storage devices for performing per-channel automatic speech recognition. An example system configured to practice the method combines a first audio signal of a first speaker in a communication session and a second audio signal from a second speaker in the communication session as a first audio channel and a second audio channel. The system can recognize speech in the first audio channel of the recording using a first model associated with the first speaker, and recognize speech in the second audio channel of the recording using a second model associated with the second speaker, wherein the first model is different from the second model. The system can generate recognized speech as an output from the communication session. The system can identify the models based on identifiers of the speakers, such as a telephone number, an IP address, a customer number, or account number.
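The flow described in the abstract can be illustrated with a minimal Python sketch. This is only an illustration under stated assumptions, not the implementation claimed in the application: the names `SpeakerModel`, `combine_channels`, `split_channels`, `select_model`, and `recognize_session` are hypothetical, the identifiers in the usage lines are made up, and the recognizers are stand-in functions rather than real acoustic or language models.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Sequence, Tuple

# Hypothetical recognizer signature: audio samples in, recognized text out.
Recognizer = Callable[[Sequence[float]], str]


@dataclass(frozen=True)
class SpeakerModel:
    """A per-speaker recognition model (stand-in for a real ASR model)."""
    name: str
    recognize: Recognizer


def _pad(samples: Sequence[float], length: int) -> List[float]:
    """Pad a signal with silence so both channels have equal length."""
    return list(samples) + [0.0] * (length - len(samples))


def combine_channels(first: Sequence[float],
                     second: Sequence[float]) -> List[Tuple[float, float]]:
    """Store two speakers' signals as the two channels of one recording."""
    length = max(len(first), len(second))
    return list(zip(_pad(first, length), _pad(second, length)))


def split_channels(recording: Sequence[Tuple[float, float]]
                   ) -> Tuple[List[float], List[float]]:
    """Recover the per-speaker channels from the two-channel recording."""
    return [f[0] for f in recording], [f[1] for f in recording]


def select_model(identifier: str,
                 registry: Dict[str, SpeakerModel],
                 default: SpeakerModel) -> SpeakerModel:
    """Pick a model by speaker identifier (phone number, account number, ...)."""
    return registry.get(identifier, default)


def recognize_session(recording: Sequence[Tuple[float, float]],
                      speaker_ids: Tuple[str, str],
                      registry: Dict[str, SpeakerModel],
                      default: SpeakerModel) -> List[str]:
    """Recognize each channel with the model associated with its speaker."""
    channels = split_channels(recording)
    return [select_model(sid, registry, default).recognize(channel)
            for sid, channel in zip(speaker_ids, channels)]


# Usage with stand-in recognizers and made-up identifiers.
agent_model = SpeakerModel("agent", lambda s: f"agent heard {len(s)} samples")
customer_model = SpeakerModel("customer", lambda s: f"customer heard {len(s)} samples")
generic_model = SpeakerModel("generic", lambda s: f"generic heard {len(s)} samples")

registry = {"+1-555-0100": agent_model, "account-42": customer_model}
recording = combine_channels([0.1, 0.2, 0.3], [0.5])
transcripts = recognize_session(recording, ("+1-555-0100", "account-42"),
                                registry, generic_model)
```

Keeping each speaker on a separate channel is what allows a different model per speaker; the fallback to a generic model for unknown identifiers is an assumption of this sketch and is not stated in the abstract.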
