Patent search: ap:("GM Global Technology Operations LLC") AND inv:"Gaurav Talwar" (results page 1)

1.

Patent application

AUTOMATED DEEP LEARNING BASED ON CUSTOMER DRIVEN NOISE DIAGNOSTIC ASSIST

Legal status: Active

Publication number: US20220406106A1

Publication date: 2022-12-22

Application number: US17304280

Filing date: 2021-06-17

Applicant: GM GLOBAL TECHNOLOGY OPERATIONS LLC

Inventors: Gaurav Talwar, Kenneth Ray Booker, William L. Villaire, Jeffery J. Milton, Mathew Anthony Clifford Keith Jones

IPC classification: G07C5/08, G06K9/62

Abstract: Methods and apparatus are provided for diagnosing a vehicle. In one embodiment, a method includes: initiating, by a processor, a recording of a noise by at least one microphone based on user selection data from a user of the vehicle; receiving, by the processor, audio signal data based on the recording; generating, by the processor, vector data based on the audio signal data; processing, by the processor, the vector data with at least one trained machine learning model to determine a classification of the noise; predicting, by the processor, an action to be taken based on the classification; and storing, by the processor, the audio signal data, the classification, and the action in a datastore.

2.

Granted patent

Vehicle control systems and methods for multi-intent queries input by voice

Legal status: Active

Publication number: US10339927B2

Publication date: 2019-07-02

Application number: US15434506

Filing date: 2017-02-16

Applicant: GM Global Technology Operations LLC

Inventors: Gaurav Talwar, Xu Fang Zhao

IPC classification: G10L15/00, G10L15/26, G10L21/00, G10L25/00, G10L21/06, G10L15/22, G10L15/18, G06F3/16

Abstract: An infotainment system of a vehicle includes: a primary intent module configured to determine a primary intent included in voice input using automated speech recognition (ASR); and an execution module configured to, via a first hardware output device of the vehicle, execute the primary intent. A secondary intent module is configured to: based on the primary intent, determine a first domain of the primary intent; based on the first domain of the primary intent, determine a second domain; and based on the voice input and the second domain, determine a secondary intent included in the voice input using ASR. A display control module is configured to display a request for user input indicative of whether to execute the secondary intent. The execution module is further configured to, via a second hardware output device of the vehicle, execute the secondary intent in response to user input to execute the secondary intent.

3.

Granted patent

Persistent training and pronunciation improvements through radio broadcast

Legal status: Active

Publication number: US10304454B2

Publication date: 2019-05-28

Application number: US15707315

Filing date: 2017-09-18

Applicant: GM GLOBAL TECHNOLOGY OPERATIONS LLC

Inventors: Gaurav Talwar, Kenneth R. Booker, Xu Fang Zhao

IPC classification: G10L15/00, G10L15/22, G10L15/26, B60R16/037

Abstract: A processor receives a broadcast in a vehicle, selects audio data from the broadcast, processes the audio data selected from the broadcast, determines a phonetic pattern of the selected audio data based on the processing, selects additional instances of audio data from the broadcast that resemble the selected audio data, processes the additional instances of audio data from the broadcast, determines phonetic patterns of the additional instances of audio data, and selects a plurality of phonetic patterns from the phonetic pattern of the selected audio data and the phonetic patterns of the additional instances of audio data. A transmitter transmits the plurality of phonetic patterns to a server to determine an optimal pronunciation of the selected audio data based on a statistical analysis of the plurality of phonetic patterns and to add the optimal pronunciation of the selected audio data to a database used to recognize speech in the vehicle.

4.

Patent application

NEURAL NETWORK FOR USE IN SPEECH RECOGNITION ARBITRATION

Legal status: Under examination (published)

Publication number: US20190147855A1

Publication date: 2019-05-16

Application number: US15811022

Filing date: 2017-11-13

Applicant: GM GLOBAL TECHNOLOGY OPERATIONS LLC

Inventors: Xu Fang Zhao, Gaurav Talwar

IPC classification: G10L15/16, G10L15/06, G10L15/14, G10L15/22, G06N3/08

Abstract: A system and method of performing speech arbitration at a client device that includes a neural network speech arbitration application, wherein the neural network speech arbitration application is configured to implement a neural network speech arbitration process, and wherein the method includes: receiving speech signals at a client device; generating and/or obtaining a set of inputs to be used in a speech arbitration neural network process, wherein the speech arbitration neural network process uses a neural network model that is tailored to speech arbitration and that can be used to determine whether and/or to what extent speech recognition processing of the received speech signals should be carried out at the client device; and receiving a speech arbitration output that indicates whether and/or to what extent the speech recognition processing of the received speech signals is to be carried out at the client device or at a remote server.

5.

Patent application

PREFERRED EMOJI IDENTIFICATION AND GENERATION

Legal status: Under examination (published)

Publication number: US20180074661A1

Publication date: 2018-03-15

Application number: US15265522

Filing date: 2016-09-14

Applicant: GM Global Technology Operations LLC

Inventors: Xu Fang Zhao, Gaurav Talwar

IPC classification: G06F3/0482, G06F3/0481, G06F3/16, G06F3/01, G06F3/0488, G10L15/24, G10L15/08, G06F17/30

CPC classification: G06F3/0482, G06F3/012, G06F3/017, G06F3/04817, G06F3/04883, G06F3/167, G06F16/51, G06F2203/011, G10L13/08, G10L15/08, G10L15/18, G10L15/24, G10L15/26, G10L2013/083, H04L51/08

Abstract: A system and method of identifying and generating preferred emojis includes: detecting at a wireless device a plurality of selected emojis; determining the frequency with which each emoji is selected; identifying a defined number of emojis from the plurality of selected emojis based on the frequency with which each emoji is selected; and creating a frequently-used emoji library for the identified emojis.
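
The frequency-based selection described in the abstract above can be illustrated with a minimal Python sketch. It is not the patented implementation: the function name `build_frequent_emoji_library`, the `library_size` parameter, and the shortcode strings standing in for emojis are all assumptions made for illustration.

```python
from collections import Counter


def build_frequent_emoji_library(selected_emojis, library_size=3):
    """Return the `library_size` most frequently selected emojis.

    `selected_emojis` is a hypothetical history of emojis picked on the
    device; `library_size` plays the role of the "defined number".
    """
    counts = Counter(selected_emojis)  # how often each emoji was selected
    # most_common() sorts by descending count; ties keep first-seen order.
    return [emoji for emoji, _ in counts.most_common(library_size)]


# Hypothetical selection history, using shortcode names for readability.
history = ["smile", "thumbs_up", "smile", "car", "thumbs_up", "smile", "music"]
library = build_frequent_emoji_library(history, library_size=2)
print(library)
```

A plain counter is enough here because the abstract only asks for selection frequency; a real system might also weight recent selections more heavily, which this sketch does not attempt.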

6.

Granted patent

Speech recognition using a database and dynamic gate commands

Legal status: Active

Publication number: US09530414B2

Publication date: 2016-12-27

Application number: US14686042

Filing date: 2015-04-14

Applicant: GM Global Technology Operations LLC

Inventors: Xufang Zhao, Gaurav Talwar

IPC classification: G10L21/00, G10L15/00, G10L15/22, B60W50/08

CPC classification: G10L15/22, B60W50/08, B60W2540/02, G10L15/08, G10L2015/088, G10L2015/223

Abstract: A system and method of controlling an automatic speech recognition (ASR) system includes: receiving speech at the ASR system from a vehicle occupant that includes a command to control a vehicle function; identifying a gate command from the speech; associating the identified gate command with the command to control the vehicle function; storing the associated gate command and vehicle command in a database; receiving additional speech at the ASR system from the vehicle occupant; detecting the gate command in the additional speech; and accessing the stored gate command and vehicle command from the database.

7.

Patent application

ADJUSTING AUDIO SAMPLING USED WITH WIDEBAND AUDIO

Legal status: Under examination (published)

Publication number: US20160268987A1

Publication date: 2016-09-15

Application number: US14643632

Filing date: 2015-03-10

Applicant: GM GLOBAL TECHNOLOGY OPERATIONS LLC

Inventors: Gaurav Talwar, Xufang Zhao, MD Foezur Rahman Chowdhury, Eli Tzirkel-Hancock

IPC classification: H03G3/00, G06F3/16

CPC classification: G06F3/165, G06F3/162, G10L15/28, G10L19/022, G10L21/04

Abstract: A system and method of adjusting digital audio sampling used with wideband audio includes: performing audio sampling on an analog audio signal at an initial sampling rate and an initial bit rate over a wideband audio frequency range; generating a digital audio signal based on the audio sampling; detecting a qualitative error rate between the analog audio signal and the digital audio signal; and decreasing the initial sampling rate, the initial bit rate, or both for sampling subsequent analog audio when the qualitative error rate is below a threshold.
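
The threshold rule in the abstract above (lower the sampling rate, the bit rate, or both when the measured error is below a threshold) might be sketched as follows. This is a simplified illustration under stated assumptions: the `SamplingConfig` type, the `adjust_sampling` function, and the halving `factor` are hypothetical, and the sketch always lowers both values, which is only one of the options the abstract allows.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SamplingConfig:
    sample_rate_hz: int
    bit_rate_bps: int


def adjust_sampling(config, error_rate, threshold, factor=0.5):
    """Return the configuration to use for subsequent audio.

    If the measured error rate is below the threshold, quality has headroom,
    so both rates are scaled down by `factor` (an assumed step size).
    Otherwise the current configuration is kept unchanged.
    """
    if error_rate < threshold:
        return SamplingConfig(
            sample_rate_hz=int(config.sample_rate_hz * factor),
            bit_rate_bps=int(config.bit_rate_bps * factor),
        )
    return config


initial = SamplingConfig(sample_rate_hz=48000, bit_rate_bps=768000)
lowered = adjust_sampling(initial, error_rate=0.01, threshold=0.05)
unchanged = adjust_sampling(initial, error_rate=0.10, threshold=0.05)
print(lowered, unchanged)
```

How the error rate between the analog and digital signals is measured is left out entirely; the sketch takes it as an input.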

8.

Patent application

SELECTIVE NOISE SUPPRESSION DURING AUTOMATIC SPEECH RECOGNITION

Legal status: Active

Publication number: US20160118042A1

Publication date: 2016-04-28

Application number: US14520974

Filing date: 2014-10-22

Applicant: GM Global Technology Operations LLC

Inventors: Gaurav Talwar, Xufang Zhao, Robert D. Sims, III, Md Foezur Rahman Chowdhury

IPC classification: G10L15/22, G10L21/0208

CPC classification: G10L21/0208, G10L15/20, G10L25/93

Abstract: An automatic speech recognition engine and a method of using the engine are described. The method pertains to front-end processing of an audio signal and includes the steps of: identifying a plurality of voiced-frames of the audio signal; determining that one or more of the plurality of voiced-frames have a signal-to-noise ratio (SNR) value greater than a first predetermined threshold; and based on the determination, bypassing noise suppression for the one or more of the plurality of voiced-frames.
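
The bypass step in the abstract above can be sketched in a few lines of Python. The sketch assumes the voiced frames and their SNR values have already been computed upstream; `suppress_noise` is a placeholder attenuation rather than a real suppressor, and the names `front_end` and `threshold_db` and the 15 dB default are hypothetical.

```python
def suppress_noise(frame):
    """Placeholder suppressor (assumption): attenuate every sample by half."""
    return [0.5 * sample for sample in frame]


def front_end(voiced_frames, snr_db, threshold_db=15.0):
    """Apply noise suppression only to voiced frames with low SNR.

    Frames whose SNR exceeds `threshold_db` are already clean enough, so
    they bypass suppression and are passed through unmodified.
    """
    processed = []
    for frame, snr in zip(voiced_frames, snr_db):
        if snr > threshold_db:
            processed.append(list(frame))  # bypass: copy the frame unchanged
        else:
            processed.append(suppress_noise(frame))
    return processed


frames = [[1.0, 2.0], [4.0, 6.0]]
snrs = [20.0, 5.0]  # first frame is clean, second is noisy
result = front_end(frames, snrs)
print(result)
```

Skipping suppression on clean frames avoids distorting speech that does not need cleaning, which appears to be the motivation behind the selective approach.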

9.

Granted patent

Methods and apparatus for processing multiple audio streams at a vehicle onboard computer system

Legal status: Active

Publication number: US09286030B2

Publication date: 2016-03-15

Application number: US14058060

Filing date: 2013-10-18

Applicant: GM GLOBAL TECHNOLOGY OPERATIONS LLC

Inventors: John L. Holdren, Xufang Zhao, Gaurav Talwar

IPC classification: H04B1/00, G06F3/16, B60R11/02, G10L15/22, H04R1/02

CPC classification: G06F3/167, B60R11/0247, G06F3/165, G10L15/22, G10L2015/223, H04R1/02, H04R2410/00, H04R2499/13

Abstract: A method for processing a plurality of audio streams at a computer system onboard a vehicle is provided. The method receives the plurality of audio streams from a plurality of locations within a vehicle; prioritizes each of the plurality of audio streams to obtain a prioritization result; and completes a task associated with each of the plurality of audio streams, according to the prioritization result.
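
One way to picture the receive, prioritize, and complete sequence in the abstract above is a priority queue keyed on where each stream originated. The abstract does not say how priority is assigned, so the seat-based ranking in `SEAT_PRIORITY`, the function name `process_streams`, and the sample tasks are assumptions for illustration only.

```python
import heapq

# Assumed ranking: lower number means the stream is handled earlier.
SEAT_PRIORITY = {"driver": 0, "front_passenger": 1, "rear": 2}


def process_streams(streams):
    """Complete the task of each audio stream in priority order.

    `streams` is a list of (location, task) pairs. The arrival index is
    used as a tie-breaker so equal-priority streams keep arrival order.
    """
    heap = [
        (SEAT_PRIORITY[location], index, location, task)
        for index, (location, task) in enumerate(streams)
    ]
    heapq.heapify(heap)

    completed = []
    while heap:
        _, _, location, task = heapq.heappop(heap)
        completed.append((location, task))  # stand-in for running the task
    return completed


streams = [
    ("rear", "play music"),
    ("driver", "navigate home"),
    ("front_passenger", "call a contact"),
]
order = process_streams(streams)
print(order)
```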

10.

Granted patent

Speech recognition with a plurality of microphones

Legal status: Active

Publication number: US09269352B2

Publication date: 2016-02-23

Application number: US13893088

Filing date: 2013-05-13

Applicant: GM GLOBAL TECHNOLOGY OPERATIONS LLC

Inventors: Gaurav Talwar, Xufang Zhao

IPC classification: G10L15/08, H04R3/00, G10L15/00

CPC classification: G10L15/083, G10L15/005, H04R3/005, H04R2420/07, H04R2430/03, H04R2499/13

Abstract: At least first and second microphones with different frequency responses form part of a speech recognition system. The microphones are coupled to a processor that is configured to recognize a spoken word based on the microphone signals. The processor classifies the spoken word, and weights the signals from the microphones based on the classification of the spoken word.
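
The classify-then-weight idea in the abstract above can be sketched as a weighted mix of two microphone signals. The class labels `class_a` and `class_b`, the weight table `CLASS_WEIGHTS`, and the function `combine_signals` are hypothetical; the abstract does not state which classes are used or how the weights are chosen.

```python
# Assumed weight table: (weight for microphone 1, weight for microphone 2).
CLASS_WEIGHTS = {
    "class_a": (0.25, 0.75),  # favour the second microphone
    "class_b": (0.75, 0.25),  # favour the first microphone
}


def combine_signals(signal_1, signal_2, word_class):
    """Mix two equally long microphone signals sample by sample.

    The weights depend on the classification of the spoken word, so the
    microphone better suited to that class contributes more to the result.
    """
    weight_1, weight_2 = CLASS_WEIGHTS[word_class]
    return [weight_1 * a + weight_2 * b for a, b in zip(signal_1, signal_2)]


mic_1 = [4.0, 8.0]
mic_2 = [0.0, 4.0]
mixed = combine_signals(mic_1, mic_2, "class_a")
print(mixed)
```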
