Patent search cpc:"G10L15/20"

1.

Invention application
VOICE ISOLATION SYSTEM [Under examination, published]

Publication number: WO2019136475A1

Publication date: 2019-07-11

Application number: PCT/US2019/012767

Application date: 2019-01-08

Applicant: AVNERA CORPORATION

Inventor: WURTZ, David , WURTZ, Michael , KUMAR, Amit , DOOLITTLE, Colin

IPC: G10L21/0208 , H04M9/08 , H04R3/00 , G10K11/175 , H04R5/033 , G10L21/0216 , G10L25/78

CPC classification number: G10L21/0216 , G10K11/17827 , G10K11/17833 , G10K2210/108 , G10L15/20 , G10L21/0208 , G10L25/78 , G10L2021/02082 , G10L2021/02165 , G10L2021/02166 , H04M9/082 , H04R1/1083 , H04R3/005 , H04R5/033

Abstract: The disclosure includes a voice isolation system comprising an acoustic echo-cancelation subsystem configured to receive a plurality of input signals, subtract an interference component from the input signals, and provide a plurality of output signals. The system also includes an adaptive beamformer subsystem configured to receive the plurality of output signals from the acoustic echo-cancelation subsystem and compute a signal-to-noise ratio (SNR) enhanced signal based on the received output signals. The system also includes a residual noise suppressor subsystem configured to attenuate at least one portion of the SNR enhanced signal received from the adaptive beamformer subsystem based on the at least one portion having an SNR below a predetermined SNR threshold. The system also includes an automatic gain control subsystem configured to process a signal outputted from the residual noise suppressor subsystem and transmit a resulting signal as an output signal.
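
Illustrative sketch (not part of the patent record): the last two stages named in the abstract, SNR-thresholded attenuation followed by gain control, might look roughly like the Python below. The echo-cancelation and beamformer stages are omitted. The function names, the per-frame SNR estimate from a known noise power, the fixed attenuation factor, and the peak-based gain rule are all assumptions made for illustration, not the applicant's method.

```python
import math


def suppress_low_snr(frames, noise_power, snr_threshold_db=6.0, attenuation=0.1):
    """Attenuate every frame whose estimated SNR is below the threshold.

    frames: list of frames, each a non-empty list of float samples.
    noise_power: assumed known noise power (hypothetical input).
    """
    out = []
    for frame in frames:
        power = sum(x * x for x in frame) / len(frame)
        snr_db = 10.0 * math.log10(max(power, 1e-12) / noise_power)
        gain = 1.0 if snr_db >= snr_threshold_db else attenuation
        out.append([gain * x for x in frame])
    return out


def automatic_gain_control(frames, target_peak=0.5):
    """Scale the whole signal so its largest magnitude equals target_peak."""
    peak = max((abs(x) for frame in frames for x in frame), default=0.0)
    if peak == 0.0:
        return frames  # silent input: nothing to scale
    scale = target_peak / peak
    return [[scale * x for x in frame] for frame in frames]
```

In this sketch a loud frame passes through unchanged while a frame below the threshold is scaled by the attenuation factor before the gain stage normalizes the overall level.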

2.

Invention application
SYNTACTIC RE-RANKING OF POTENTIAL TRANSCRIPTIONS DURING AUTOMATIC SPEECH RECOGNITION [Under examination, published]

Publication number: WO2018057427A1

Publication date: 2018-03-29

Application number: PCT/US2017/051823

Application date: 2017-09-15

Applicant: INTEL CORPORATION

Inventor: PEREG, Oren , WASSERBLAT, Moshe , MAMOU, Jonathan , ASSAYAG, Michel

IPC: G10L17/12 , G10L17/22 , G10L15/22 , G10L15/28 , G10L15/18

CPC classification number: G10L15/197 , G06F17/274 , G10L15/02 , G10L15/1822 , G10L15/19 , G10L15/20 , G10L15/22

Abstract: A system and method for syntactic re-ranking of possible transcriptions generated by automatic speech recognition are disclosed. A computer system accesses acoustic data for a recorded spoken language and generates a plurality of potential transcriptions for the acoustic data. The computer system scores the plurality of potential transcriptions to create an initial likelihood score for the plurality of potential transcriptions. For a particular potential transcription in the plurality of transcriptions, the computer system generates a syntactic likelihood score. The computer system creates an adjusted score for the particular potential transcription by combining the initial likelihood score and the syntactic likelihood score for the particular potential transcription.

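
Illustrative sketch (not part of the patent record): the re-ranking step in the abstract combines an initial likelihood score with a syntactic likelihood score. The abstract does not say how the two are combined, so the linear interpolation, the `weight` parameter, and the lookup-table scorer below are assumptions chosen only to make the idea concrete.

```python
def rerank(candidates, syntactic_score, weight=0.5):
    """Re-rank candidate transcriptions by an adjusted score.

    candidates: list of (text, initial_likelihood) pairs.
    syntactic_score: callable mapping text to a syntactic likelihood.
    weight: assumed interpolation weight given to the syntactic score.
    """
    adjusted = []
    for text, initial in candidates:
        score = (1.0 - weight) * initial + weight * syntactic_score(text)
        adjusted.append((text, score))
    # Highest adjusted score first.
    return sorted(adjusted, key=lambda pair: pair[1], reverse=True)


# Hypothetical syntactic scores standing in for a real parser-based model.
TOY_SYNTAX_SCORES = {"recognize speech": 0.9, "wreck a nice beach": 0.4}


def toy_syntactic_score(text):
    return TOY_SYNTAX_SCORES.get(text, 0.0)
```

With candidates `[("wreck a nice beach", 0.62), ("recognize speech", 0.60)]`, the acoustically favored first candidate drops to second place once the toy syntactic score is mixed in.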

3.

Invention application
SPEAKER RECOGNITION USING ADAPTIVE THRESHOLDING [Under examination, published]

Publication number: WO2017172113A1

Publication date: 2017-10-05

Application number: PCT/US2017/018716

Application date: 2017-02-21

Applicant: INTEL CORPORATION

Inventor: BISWAL, Narayan , CILINGIR, Gokcen

IPC: G10L17/12 , G10L17/20 , G10L21/02

CPC classification number: G10L17/20 , G10L15/20 , G10L15/265 , G10L17/02 , G10L17/04 , G10L17/06 , G10L17/22 , G10L25/78 , G10L25/81

Abstract: Techniques related to speaker recognition are discussed. Such techniques may include determining an adaptive speaker recognition threshold based on a speech-to-noise ratio and a noise type label corresponding to received audio, and performing speaker recognition based on the adaptive speaker recognition threshold and a speaker recognition score corresponding to the received audio.

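
Illustrative sketch (not part of the patent record): one simple way to make a recognition threshold depend on both a speech-to-noise ratio and a noise type label is a lookup table. Every label, SNR boundary, and threshold value below is hypothetical, as is the choice of a table rather than a learned mapping.

```python
# Hypothetical table: for each noise label, rows of (minimum SNR in dB, threshold),
# ordered from the highest SNR bucket to the lowest.
THRESHOLDS = {
    "babble": [(20.0, 0.60), (10.0, 0.50), (float("-inf"), 0.40)],
    "car": [(20.0, 0.55), (10.0, 0.48), (float("-inf"), 0.42)],
}
DEFAULT_THRESHOLD = 0.50  # used for unknown noise labels


def adaptive_threshold(snr_db, noise_label):
    """Return the threshold for the first SNR bucket that snr_db falls into."""
    for min_snr, threshold in THRESHOLDS.get(noise_label, []):
        if snr_db >= min_snr:
            return threshold
    return DEFAULT_THRESHOLD


def accept_speaker(score, snr_db, noise_label):
    """Accept the claimed speaker if the score meets the adaptive threshold."""
    return score >= adaptive_threshold(snr_db, noise_label)
```

Under these made-up values, the same score of 0.52 is accepted in babble noise at 12 dB but rejected at 25 dB, where the table demands a higher score.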

4.

Invention application
TELEKOMMUNIKATIONSGERÄT, TELEKOMMUNIKATIONSSYSTEM, VERFAHREN ZUM BETRIEB EINES TELEKOMMUNIKATIONSGERÄTS UND COMPUTERPROGRAMM [Under examination, published]
Title translation: Telecommunication device, telecommunication system, method for operating a telecommunication device, and computer program

Publication number: WO2017148949A1

Publication date: 2017-09-08

Application number: PCT/EP2017/054651

Application date: 2017-02-28

Applicant: FRAUNHOFER-GESELLSCHAFT ZUR FÖRDERUNG DER ANGEWANDTEN FORSCHUNG E.V.

Inventor: APPELL, Jens Ekkehart , RENNIES-HOCHMUTH, Jan

IPC: H04M1/19 , H04M1/68

CPC classification number: H04M1/68 , G10L15/08 , G10L15/20 , G10L2015/088

Abstract: Ein Telekommunikationsgerät umfasst eine Audiosignalübertragungseinrichtung, die ausgelegt ist, um ein Audiosignal zu empfangen und zu einem weiteren Telekommunikationsgerät zu übertragen. Das Telekommunikationsgerät umfasst ferner eine Signalisierungseinrichtung, die ausgelegt ist, um eine Signalisierung auszugeben, wenn zu besorgen ist, dass das Audiosignal akustisch für Dritte verständlich ist oder eine Störung für Dritte darstellt. Ein weiteres Telekommunikationsgerät umfasst eine Audiosignalempfangseinrichtung, die ausgelegt ist, um ein Audiosignal von einem weiteren Telekommunikationsgerät zu empfangen und akustisch auszugeben sowie eine Signalisierungseinrichtung, die ausgelegt ist, um eine Signalisierung auszugeben, wenn zu besorgen ist, dass das ausgegebene Audiosignal akustisch für Dritte verständlich ist oder eine Störung darstellt. Die Telekommunikationsgeräte können in einem System zusammengeschaltet werden. Entsprechende Betriebsverfahren sowie ein Computerprogramm werden ebenfalls beschrieben.

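Abstract translation: A telecommunication device comprises an audio signal transmission unit configured to receive an audio signal and transmit it to a further telecommunication device. The telecommunication device further comprises a signaling unit configured to output a signal when there is reason to fear that the audio signal is acoustically intelligible to third parties or constitutes a disturbance for third parties. A further telecommunication device comprises an audio signal receiving unit configured to receive an audio signal from a further telecommunication device and output it acoustically, as well as a signaling unit configured to output a signal when there is reason to fear that the output audio signal is acoustically intelligible to third parties or constitutes a disturbance. The telecommunication devices can be interconnected in a system. Corresponding operating methods and a computer program are also described.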

5.

Invention application
AMBIENT AWARENESS IN VIRTUAL REALITY [Under examination, published]

Publication number: WO2017112277A1

Publication date: 2017-06-29

Application number: PCT/US2016/063460

Application date: 2016-11-23

Applicant: INTEL CORPORATION

Inventor: BEGUM, Shamim , WHITNEY, Kofi C.

IPC: H04N5/445 , H04R3/00 , G10L21/10 , G02B27/01

CPC classification number: A63F13/217 , A63F13/215 , A63F13/25 , A63F13/5255 , A63F13/65 , A63F2300/8082 , G06T19/006 , G10L15/20 , G10L17/00 , G10L25/51 , G10L25/72

Abstract: Systems, apparatus and methods may provide for audio processing of received user audio input from a microphone that may optionally be a tissue conducting microphone. Audio processing may be further conducted on received ambient audio from one or more additional microphones. A translator may translate the ambient audio into content to be output to a user. In an embodiment, ambient audio is translated into visual content to be displayed on a virtual reality device.


6.

Invention application
AUTOMATIC TUNING OF SPEECH RECOGNITION PARAMETERS [Under examination, published]

Publication number: WO2017111634A1

Publication date: 2017-06-29

Application number: PCT/PL2015/050074

Application date: 2015-12-22

Applicant: INTEL CORPORATION

Inventor: CHLEBEK, Piotr , KURYLO, Lukasz , BORWANSKI, Michal , MAZIEWSKI, Przemyslaw , KOSTYK, Roksana , BURNY, Tomasz K. , DUZINKIEWICZ, Karol J. , BURACZEWSKA, Sylwia

IPC: G10L15/20 , G10L21/00

CPC classification number: G10L15/20 , G10L21/00

Abstract: System and techniques for automatic tuning of speech recognition parameters are described herein. A clean audio segment and a dirty audio segment may be obtained. In an iterative fashion, optimized preprocessing parameters may be obtained by, at an iteration, selecting a set of parameters, preprocessing the clean audio segment with the set of parameters to produce a first result, preprocessing the dirty audio segment with the set of parameters to produce a second result, and scoring a portion of the first result with a corresponding portion of the second result using clean-diff. When an optimization threshold is reached, the iterative process exits and provides the set of parameters from the last iteration.

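
Illustrative sketch (not part of the patent record): the iterative loop in the abstract can be pictured as below. The abstract does not define clean-diff or the preprocessing step, so the mean absolute difference used as `clean_diff`, the single-parameter `noise_gate` preprocessor, and the candidate list are hypothetical stand-ins. The sketch also scores whole segments rather than selected portions.

```python
def noise_gate(samples, gate):
    """Hypothetical preprocessor: zero out samples with magnitude below gate."""
    return [x if abs(x) >= gate else 0.0 for x in samples]


def clean_diff(first, second):
    """Assumed scoring: mean absolute difference between the two results."""
    return sum(abs(a - b) for a, b in zip(first, second)) / len(first)


def tune(clean, dirty, candidate_gates, optimization_threshold):
    """Try each candidate parameter until the score reaches the threshold.

    Returns the parameter and score from the last iteration executed.
    """
    gate, score = None, float("inf")
    for gate in candidate_gates:
        first = noise_gate(clean, gate)    # first result: preprocessed clean audio
        second = noise_gate(dirty, gate)   # second result: preprocessed dirty audio
        score = clean_diff(first, second)
        if score <= optimization_threshold:
            break                          # optimization threshold reached
    return gate, score
```

A difference-only score like this one is trivially minimized by a gate that silences everything, so a practical scoring function would need to guard against that case.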

7.

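Invention application
音声認識装置、音声強調装置、音声認識方法、音声強調方法およびナビゲーションシステム [Under examination, published]
Title translation: Speech recognition device, speech enhancement device, speech recognition method, speech enhancement method, and navigation system

Publication number: WO2017094121A1

Publication date: 2017-06-08

Application number: PCT/JP2015/083768

Application date: 2015-12-01

Applicant: 三菱電機株式会社

Inventor: 太刀岡　勇気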

IPC: G10L15/20 , G10L21/0216

CPC classification number: G10L15/20 , G10L21/0216

Abstract: 入力された騒音音声データに対して、それぞれ異なる手法の騒音抑圧処理を行う複数の騒音抑圧部（３）と、騒音信号が抑圧された音声データの音声認識を行う音声認識部（４）と、入力された騒音音声データの音響特徴量から、騒音音声データを複数の騒音抑圧部（３）によりそれぞれ騒音抑圧処理を行った場合に得られる音声認識率を予測する予測部と（２）、予測した音声認識率に基づいて、複数の騒音抑圧部から騒音音声データに対して騒音抑圧処理を行う騒音抑圧部（３）を選択する抑圧手法選択部（２）とを備える。

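Abstract translation: The device comprises: a plurality of noise suppression units (3) that each apply a different noise suppression method to input noisy speech data; a speech recognition unit (4) that performs speech recognition on the speech data in which the noise signal has been suppressed; a prediction unit (2) that predicts, from acoustic features of the input noisy speech data, the speech recognition rate that would be obtained if each of the plurality of noise suppression units (3) performed noise suppression on the noisy speech data; and a suppression method selection unit (2) that selects, based on the predicted speech recognition rates, the noise suppression unit (3) that is to perform noise suppression on the noisy speech data from among the plurality of noise suppression units.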
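
Illustrative sketch (not part of the patent record): the selection logic in the abstract predicts a recognition rate for each noise suppression method from acoustic features and picks the method with the best prediction. The method names, the single `snr_db` feature, and the linear predictors below are hypothetical placeholders for whatever trained predictor a real system would use.

```python
def select_suppressor(features, predictors):
    """Pick the suppression method with the highest predicted recognition rate.

    features: dict of acoustic features of the noisy input.
    predictors: dict mapping method name to a callable(features) -> rate.
    """
    predictions = {name: predict(features) for name, predict in predictors.items()}
    best = max(predictions, key=predictions.get)
    return best, predictions


# Hypothetical predictors; a real system would learn these from data.
TOY_PREDICTORS = {
    "spectral_subtraction": lambda f: 0.5 + 0.02 * f["snr_db"],
    "wiener_filter": lambda f: 0.7 + 0.005 * f["snr_db"],
}
```

With these toy predictors, a 5 dB input selects `wiener_filter` and a 20 dB input selects `spectral_subtraction`, so only the chosen suppressor needs to run before recognition.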

8.

Invention application
REMOTE SENSOR VOICE RECOGNITION [Under examination, published]

Publication number: WO2017039575A1

Publication date: 2017-03-09

Application number: PCT/US2015/047376

Application date: 2015-08-28

Applicant: HEWLETT-PACKARD DEVELOPMENT COMPANY, L.P.

Inventor: HANES, David H.

IPC: G10L15/28

CPC classification number: G10L15/28 , G10L15/20 , G10L21/0208

Abstract: Examples described herein include systems, methods, and devices for transmitting a media signal to a remote sensor, receiving a sound signal from the remote sensor, and monitoring the sound signal and the media signal to recognize voice commands.


9.

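Invention application
情報処理装置、制御方法、およびプログラム [Under examination, published]
Title translation: Information processing device, control method, and program

Publication number: WO2016157658A1

Publication date: 2016-10-06

Application number: PCT/JP2015/086098

Application date: 2015-12-24

Applicant: ソニー株式会社

Inventor: 大村　淳己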

IPC: G06F3/16 , G10L13/00 , G10L13/02

CPC classification number: G06F3/16 , G10L13/00 , G10L13/02 , G10L15/20 , G10L25/48 , H04M19/04

Abstract: 　現在の周辺環境に応じて適切な応答出力方法を決定することで、音声認識システムの利便性を向上することが可能な情報処理装置、制御方法、およびプログラムを提供する。　ユーザの発話に対する応答を生成し、現在の周辺環境に応じて応答出力方法を決定し、前記決定された応答出力方法で前記生成された応答を出力するよう制御する。

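Abstract translation: Provided are an information processing device, a control method, and a program capable of improving the convenience of a speech recognition system by determining an appropriate response output method according to the current surrounding environment. Control is performed so that a response to a user's utterance is generated, a response output method is determined according to the current surrounding environment, and the generated response is output using the determined response output method.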

10.

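Invention application
音声調整装置 [Under examination, published]
Title translation: Speech adjustment device

Publication number: WO2016067644A1

Publication date: 2016-05-06

Application number: PCT/JP2015/055093

Application date: 2015-02-23

Applicant: シャープ株式会社

Inventor: 中村　圭介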

IPC: G10L15/20 , G10L21/034 , G10L25/84

CPC classification number: G10L15/20 , G10L21/034 , G10L25/84

Abstract: 　音声信号の入力ゲインや出力ゲインを使用環境に合わせて適切に調整し、音声認識率を向上させた音声調整装置を提供する。音声調整装置（２０）は、音声信号が音声強度閾値よりも小さい無音状態が継続する無音時間または音声信号が音声強度閾値よりも大きい有音状態が継続する有音時間と、予め設定した時間閾値と比較して音声信号を調整する音声調整部（４０）とを備える。

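Abstract translation: Provided is a speech adjustment device that improves the speech recognition rate by appropriately adjusting the input gain and output gain of a speech signal to suit the usage environment. The speech adjustment device (20) comprises a speech adjustment unit (40) that adjusts the speech signal by comparing, against a preset time threshold, either a silent time during which a silent state continues, in which the speech signal is below a speech intensity threshold, or a voiced time during which a voiced state continues, in which the speech signal is above the speech intensity threshold.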
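
Illustrative sketch (not part of the patent record): the comparison of silent and voiced durations against a time threshold could drive a gain update as below. The abstract does not state which way the gain moves, so raising the gain after prolonged silence and lowering it after prolonged voicing are assumptions, as are the frame-based time unit, the step size, and the function name.

```python
def adjust_gain(levels, gain, intensity_threshold, time_threshold, step=0.1):
    """Update a gain from per-frame signal levels.

    levels: per-frame signal intensities before gain is applied.
    time_threshold: run length, in frames, that triggers an adjustment.
    """
    silent_run = 0
    voiced_run = 0
    for level in levels:
        if level * gain < intensity_threshold:
            silent_run += 1   # silent state continues
            voiced_run = 0
        else:
            voiced_run += 1   # voiced state continues
            silent_run = 0
        if silent_run > time_threshold:
            gain += step                    # assumed: speech may be too quiet
            silent_run = 0
        elif voiced_run > time_threshold:
            gain = max(0.0, gain - step)    # assumed: noise may be read as speech
            voiced_run = 0
    return gain
```

For example, four consecutive quiet frames with a time threshold of three frames raise the gain by one step, and four consecutive loud frames lower it by one step.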
