SYSTEM AND METHOD FOR NETWORK BANDWIDTH MANAGEMENT FOR ADJUSTING AUDIO QUALITY
    1.
    发明申请
    SYSTEM AND METHOD FOR NETWORK BANDWIDTH MANAGEMENT FOR ADJUSTING AUDIO QUALITY 有权
    用于调整音频质量的网络带宽管理系统和方法

    公开(公告)号:US20150149159A1

    公开(公告)日:2015-05-28

    申请号:US14087814

    申请日:2013-11-22

    Abstract: Disclosed herein are systems, methods, and computer-readable storage devices for processing audio signals. An example system configured to practice the method receives audio at a device to be transmitted to a remote speech processing system. The system analyzes one of noise conditions, need for an enhanced speech quality, and network load to yield an analysis. Based on the analysis, the system determines to bypass user-defined options for enhancing audio for speech processing. Then, based on the analysis, the system can modify an audio transmission parameter used to transmit the audio from the device to the remote speech processing system. The audio transmission parameter can be one of an amount of coding, a chosen codec, an amount of coding, or a number of audio channels, for example.

    Abstract translation: 本文公开了用于处理音频信号的系统,方法和计算机可读存储装置。 配置为实施该方法的示例系统在要发送到远程语音处理系统的设备处接收音频。 该系统分析噪声条件之一,需要增强语音质量和网络负载以产生分析。 基于分析,系统决定绕过用户定义的选项,用于增强语音处理的音频。 然后,基于分析,系统可以修改用于将音频从设备传输到远程语音处理系统的音频传输参数。 例如,音频传输参数可以是编码量,选择的编解码器,编码量或音频信道数量之一。

    System and Method for Network Bandwidth Management for Adjusting Audio Quality

    公开(公告)号:US20200294523A1

    公开(公告)日:2020-09-17

    申请号:US16889654

    申请日:2020-06-01

    Abstract: Disclosed herein are systems, methods, and computer-readable storage devices for processing audio signals. An example system configured to practice the method receives audio at a device to be transmitted to a remote speech processing system. The system analyzes one of noise conditions, need for an enhanced speech quality, and network load to yield an analysis. Based on the analysis, the system determines to bypass user-defined options for enhancing audio for speech processing. Then, based on the analysis, the system can modify an audio transmission parameter used to transmit the audio from the device to the remote speech processing system. The audio transmission parameter can be one of an amount of coding, a chosen codec, or a number of audio channels, for example.

    SYSTEM AND METHOD FOR COMBINING FRAME AND SEGMENT LEVEL PROCESSING, VIA TEMPORAL POOLING, FOR PHONETIC CLASSIFICATION

    公开(公告)号:US20160063991A1

    公开(公告)日:2016-03-03

    申请号:US14936772

    申请日:2015-11-10

    CPC classification number: G10L15/02 G10L15/08 G10L15/16

    Abstract: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for combining frame and segment level processing, via temporal pooling, for phonetic classification. A frame processor unit receives an input and extracts the time-dependent features from the input. A plurality of pooling interface units generates a plurality of feature vectors based on pooling the time-dependent features and selecting a plurality of time-dependent features according to a plurality of selection strategies. Next, a plurality of segmental classification units generates scores for the feature vectors. Each segmental classification unit (SCU) can be dedicated to a specific pooling interface unit (PIU) to form a PIU-SCU combination. Multiple PIU-SCU combinations can be further combined to form an ensemble of combinations, and the ensemble can be diversified by varying the pooling operations used by the PIU-SCU combinations. Based on the scores, the plurality of segmental classification units selects a class label and returns a result.

    REAL - TIME EMOTION TRACKING SYSTEM
    4.
    发明申请
    REAL - TIME EMOTION TRACKING SYSTEM 有权
    实时感应跟踪系统

    公开(公告)号:US20140163960A1

    公开(公告)日:2014-06-12

    申请号:US13712288

    申请日:2012-12-12

    CPC classification number: G10L25/63 G10L17/04 G10L17/26 G10L25/48

    Abstract: Devices, systems, methods, media, and programs for detecting an emotional state change in an audio signal are provided. A plurality of segments of the audio signal is received, with the plurality of segments being sequential. Each segment of the plurality of segments is analyzed, and, for each segment, an emotional state and a confidence score of the emotional state are determined. The emotional state and the confidence score of each segment are sequentially analyzed, and a current emotional state of the audio signal is tracked throughout each of the plurality of segments. For each segment, it is determined whether the current emotional state of the audio signal changes to another emotional state based on the emotional state and the confidence score of the segment.

    Abstract translation: 提供了用于检测音频信号中的情绪状态改变的设备,系统,方法,媒体和程序。 接收音频信号的多个段,其中多个段是顺序的。 分析多个片段中的每个片段,并且针对每个片段,确定情感状态的情绪状态和置信评分。 顺序地分析每个片段的情绪状态和置信度得分,并且在多个片段中的每一个片段跟踪音频信号的当前情绪状态。 对于每个片段,基于片段的情绪状态和置信度分数确定音频信号的当前情绪状态是否改变到另一情感状态。

    REAL-TIME EMOTION TRACKING SYSTEM
    6.
    发明申请
    REAL-TIME EMOTION TRACKING SYSTEM 有权
    实时感应跟踪系统

    公开(公告)号:US20150235655A1

    公开(公告)日:2015-08-20

    申请号:US14703107

    申请日:2015-05-04

    CPC classification number: G10L25/63 G10L17/04 G10L17/26 G10L25/48

    Abstract: Devices, systems, methods, media, and programs for detecting an emotional state change in an audio signal are provided. A plurality of segments of the audio signal is received, with the plurality of segments being sequential. Each segment of the plurality of segments is analyzed, and, for each segment, an emotional state and a confidence score of the emotional state are determined. The emotional state and the confidence score of each segment are sequentially analyzed, and a current emotional state of the audio signal is tracked throughout each of the plurality of segments. For each segment, it is determined whether the current emotional state of the audio signal changes to another emotional state based on the emotional state and the confidence score of the segment.

    Abstract translation: 提供了用于检测音频信号中的情绪状态改变的设备,系统,方法,媒体和程序。 接收音频信号的多个段,其中多个段是顺序的。 分析多个片段中的每个片段,并且针对每个片段,确定情感状态的情绪状态和置信评分。 顺序地分析每个片段的情绪状态和置信度得分,并且在多个片段中的每一个片段跟踪音频信号的当前情绪状态。 对于每个片段,基于片段的情绪状态和置信度分数确定音频信号的当前情绪状态是否改变到另一情感状态。

    SYSTEM AND METHOD OF USING NEURAL TRANSFORMS OF ROBUST AUDIO FEATURES FOR SPEECH PROCESSING
    7.
    发明申请
    SYSTEM AND METHOD OF USING NEURAL TRANSFORMS OF ROBUST AUDIO FEATURES FOR SPEECH PROCESSING 有权
    用于语音处理的鲁棒音频特征的神经变换的系统和方法

    公开(公告)号:US20150100312A1

    公开(公告)日:2015-04-09

    申请号:US14046393

    申请日:2013-10-04

    Abstract: A system and method for processing speech includes receiving a first information stream associated with speech, the first information stream comprising micro-modulation features and receiving a second information stream associated with the speech, the second information stream comprising features. The method includes combining, via a non-linear multilayer perceptron, the first information stream and the second information stream to yield a third information stream. The system performs automatic speech recognition on the third information stream. The third information stream can also be used for training HMMs.

    Abstract translation: 用于处理语音的系统和方法包括:接收与语音相关联的第一信息流,所述第一信息流包括微调特征并接收与所述语音相关联的第二信息流,所述第二信息流包括特征。 该方法包括通过非线性多层感知器组合第一信息流和第二信息流以产生第三信息流。 该系统对第三信息流执行自动语音识别。 第三信息流也可用于训练HMM。

    AUGMENTED MULTI-TIER CLASSIFIER FOR MULTI-MODAL VOICE ACTIVITY DETECTION

    公开(公告)号:US20210074315A1

    公开(公告)日:2021-03-11

    申请号:US17101048

    申请日:2020-11-23

    Abstract: Voice activity in a media signal is detected in an augmented, multi-tier classifier architecture. For instance, a first voice activity indicator, detected in a first modality for a human subject, is received from a first classifier. Then, the system can receive, from a second classifier, a second voice activity indicator detected in a second modality for the human subject, wherein the first voice activity indicator and the second voice activity indicators are based on the human subject at a same time, and wherein the first modality and the second modality are different. The system then concatenates, via a third classifier, the first voice activity indicator and the second voice activity indicator with original features of the human subject, to yield a classifier output, and determine voice activity based on the classifier output.

    AUGMENTED MULTI-TIER CLASSIFIER FOR MULTI-MODAL VOICE ACTIVITY DETECTION

    公开(公告)号:US20180182415A1

    公开(公告)日:2018-06-28

    申请号:US15894245

    申请日:2018-02-12

    CPC classification number: G10L25/78 G06K9/00335 G10L15/24 G10L25/84

    Abstract: Disclosed herein are systems, methods, and computer-readable storage media for detecting voice activity in a media signal in an augmented, multi-tier classifier architecture. A system configured to practice the method can receive, from a first classifier, a first voice activity indicator detected in a first modality for a human subject. Then, the system can receive, from a second classifier, a second voice activity indicator detected in a second modality for the human subject, wherein the first voice activity indicator and the second voice activity indicators are based on the human subject at a same time, and wherein the first modality and the second modality are different. The system can concatenate, via a third classifier, the first voice activity indicator and the second voice activity indicator with original features of the human subject, to yield a classifier output, and determine voice activity based on the classifier output.

    SYSTEM AND METHOD FOR NETWORK BANDWIDTH MANAGEMENT FOR ADJUSTING AUDIO QUALITY

    公开(公告)号:US20170236527A1

    公开(公告)日:2017-08-17

    申请号:US15586432

    申请日:2017-05-04

    Abstract: Disclosed herein are systems, methods, and computer-readable storage devices for processing audio signals. An example system configured to practice the method receives audio at a device to be transmitted to a remote speech processing system. The system analyzes one of noise conditions, need for an enhanced speech quality, and network load to yield an analysis. Based on the analysis, the system determines to bypass user-defined options for enhancing audio for speech processing. Then, based on the analysis, the system can modify an audio transmission parameter used to transmit the audio from the device to the remote speech processing system. The audio transmission parameter can be one of an amount of coding, a chosen codec, an amount of coding, or a number of audio channels, for example.

Patent Agency Ranking