Vehicle control systems and methods for multi-intent queries input by voice

    公开(公告)号:US10339927B2

    公开(公告)日:2019-07-02

    申请号:US15434506

    申请日:2017-02-16

    摘要: An infotainment system of a vehicle includes: a primary intent module configured to determine a primary intent included in voice input using automated speech recognition (ASR); and an execution module configured to, via a first hardware output device of the vehicle, execute the primary intent. A secondary intent module is configured to: based on the primary intent, determine a first domain of the primary intent; based on the first domain of the primary intent, determine a second domain; and based on the voice input and the second domain, determine a secondary intent included in the voice input using ASR. A display control module is configured to display a request for user input indicative of whether to execute the secondary intent. The execution module is further configured to, via a second hardware output device of the vehicle, execute the secondary intent in response to user input to execute the secondary intent.

    Persistent training and pronunciation improvements through radio broadcast

    公开(公告)号:US10304454B2

    公开(公告)日:2019-05-28

    申请号:US15707315

    申请日:2017-09-18

    摘要: A processor receives a broadcast in a vehicle, select audio data from the broadcast, processes the audio data selected from the broadcast, determines a phonetic pattern of the selected audio data based on the processing, selects additional instances of audio data from the broadcast that resemble the selected audio data, processes the additional instances of audio data from the broadcast, determine phonetic patterns of the additional instances of audio data, and selects a plurality of phonetic patterns from the phonetic pattern of the selected audio data and the phonetic patterns of the additional instances of audio data. A transmitter transmits the plurality of phonetic patterns to a server to determine an optimal pronunciation of the selected audio data based on a statistical analysis of the plurality of phonetic patterns and to add the optimal pronunciation of the selected audio data to a database used to recognize speech in the vehicle.

    NEURAL NETWORK FOR USE IN SPEECH RECOGNITION ARBITRATION

    公开(公告)号:US20190147855A1

    公开(公告)日:2019-05-16

    申请号:US15811022

    申请日:2017-11-13

    摘要: A system and method of performing speech arbitration at a client device that includes a neural network speech arbitration application, wherein the neural network speech arbitration application is configured to implement a neural network speech arbitration process, and wherein the method includes: receiving speech signals at a client device; generating and/or obtaining a set of inputs to be used in a speech arbitration neural network process, wherein the speech arbitration neural network process uses a neural network model that is tailored to speech arbitration and that can be used to determine whether and/or to what extent speech recognition processing of the received speech signals should be carried out at the client device; and receiving a speech arbitration output that indicates whether and/or to what extent the speech recognition processing of the received speech signals is to be carried out at the client device or at the remote server.

    Speech recognition using a database and dynamic gate commands
    6.
    发明授权
    Speech recognition using a database and dynamic gate commands 有权
    使用数据库和动态门命令进行语音识别

    公开(公告)号:US09530414B2

    公开(公告)日:2016-12-27

    申请号:US14686042

    申请日:2015-04-14

    摘要: A system and method of controlling an automatic speech recognition (ASR) system includes: receiving speech at the ASR system from a vehicle occupant that includes a command to control a vehicle function; identifying a gate command from the speech; associating the identified gate command with the command to control the vehicle function; storing the associated gate command and vehicle command in a database; receiving additional speech at the ASR system from the vehicle occupant; detecting the gate command in the additional speech; and accessing the stored gate command and vehicle command from the database.

    摘要翻译: 一种控制自动语音识别(ASR)系统的系统和方法包括:从包括控制车辆功能的命令的车辆乘员在ASR系统接收语音; 从语音识别门命令; 将所识别的门命令与用于控制车辆功能的命令相关联; 将相关联的门命令和车辆命令存储在数据库中; 在车载乘客的ASR系统接收额外的演讲; 在附加语音中检测门命令; 以及从数据库访问存储的门命令和车辆命令。

    ADJUSTING AUDIO SAMPLING USED WITH WIDEBAND AUDIO
    7.
    发明申请
    ADJUSTING AUDIO SAMPLING USED WITH WIDEBAND AUDIO 审中-公开
    调整使用宽带音频的音频采样

    公开(公告)号:US20160268987A1

    公开(公告)日:2016-09-15

    申请号:US14643632

    申请日:2015-03-10

    IPC分类号: H03G3/00 G06F3/16

    摘要: A system and method of adjusting digital audio sampling used with wideband audio includes: performing audio sampling on an analog audio signal at an initial sampling rate and an initial bit rate over a wideband audio frequency range; generating a digital audio signal based on the audio sampling; detecting a qualitative error rate between the analog audio signal and the digital audio signal; and decreasing the initial sampling rate, the initial bit rate, or both for sampling subsequent analog audio when the qualitative error is below a threshold.

    摘要翻译: 调整与宽带音频一起使用的数字音频采样的系统和方法包括:以初始采样率对模拟音频信号进行音频采样,并在宽带音频频率范围内执行初始比特率; 基于音频采样生成数字音频信号; 检测模拟音频信号和数字音频信号之间的定性错误率; 并且当定性误差低于阈值时,降低初始采样率,初始比特率或两者用于对后续模拟音频进行采样。

    SELECTIVE NOISE SUPPRESSION DURING AUTOMATIC SPEECH RECOGNITION
    8.
    发明申请
    SELECTIVE NOISE SUPPRESSION DURING AUTOMATIC SPEECH RECOGNITION 有权
    自动语音识别期间的选择性噪声抑制

    公开(公告)号:US20160118042A1

    公开(公告)日:2016-04-28

    申请号:US14520974

    申请日:2014-10-22

    IPC分类号: G10L15/22 G10L21/0208

    摘要: An automatic speech recognition engine and a method of using the engine is described. The method pertains to front-end processing an audio signal and includes the steps of: identifying a plurality of voiced-frames of the audio signal; determining that one or more of the plurality of voiced-frames have a signal-to-noise (SNR) value greater than a first predetermined threshold; and based on the determination, bypassing noise suppression for the one or more of the plurality of voiced-frames.

    摘要翻译: 描述了自动语音识别引擎和使用该引擎的方法。 该方法涉及前端处理音频信号,并且包括以下步骤:识别音频信号的多个有声帧; 确定所述多个有声帧中的一个或多个具有大于第一预定阈值的信噪比(SNR)值; 并且基于所述确定,绕过所述多个有声帧中的一个或多个的边缘噪声抑制。

    Speech recognition with a plurality of microphones
    10.
    发明授权
    Speech recognition with a plurality of microphones 有权
    具有多个麦克风的语音识别

    公开(公告)号:US09269352B2

    公开(公告)日:2016-02-23

    申请号:US13893088

    申请日:2013-05-13

    IPC分类号: G10L15/08 H04R3/00 G10L15/00

    摘要: At least first and second microphones with different frequency responses form part of a speech recognition system. The microphones are coupled to a processor that is configured to recognize a spoken word based on the microphone signals. The processor classifies the spoken word, and weights the signals from the microphones based on the classification of the spoken word.

    摘要翻译: 具有不同频率响应的至少第一和第二麦克风构成语音识别系统的一部分。 麦克风被耦合到处理器,其被配置为基于麦克风信号来识别口语单词。 处理器对口语进行分类,并根据口语单词的分类对来自麦克风的信号加权。