-
公开(公告)号:US11823707B2
公开(公告)日:2023-11-21
申请号:US17572002
申请日:2022-01-10
Applicant: Synaptics Incorporated
Inventor: Saeed Mosayyebpour Kaskari
IPC: G10L25/84 , G10L21/0216 , G10L21/0264 , G10L25/78 , G10L21/0208
CPC classification number: G10L25/84 , G10L21/0216 , G10L21/0264 , G10L2021/02082 , G10L2021/02166 , G10L2025/786
Abstract: An audio spotting system configured for various operating modes including a regular mode and sensitivity mode is described. An example cascade audio spotting system may include a high-power subsystem including a high-power trigger and a transfer module. This high-power trigger includes one or more detection models used to detect whether a target sound activity is included in the one or more audio streams. The one or more detection models are associated with a first set of hyperparameters when the cascade audio spotting system is in a regular mode, and the one or more detection models are associated with a second set of hyperparameters when the cascade audio spotting system is in a sensitivity mode. The transfer module provides at least one of one or more processed audio streams for further processing in response to the high-power trigger detecting the target sound activity in the one or more audio streams.
-
公开(公告)号:US11694710B2
公开(公告)日:2023-07-04
申请号:US17484208
申请日:2021-09-24
Applicant: Synaptics Incorporated
Inventor: Francesco Nesta , Saeed Mosayyebpour Kaskari
CPC classification number: G10L21/0364 , G10L15/22 , G10L25/60 , G10L25/84 , H04R1/406 , H04R3/005 , H04S3/008 , H04L65/60 , H04S2400/01
Abstract: Audio processing systems and methods include an audio sensor array configured to receive a multichannel audio input and generate a corresponding multichannel audio signal and target-speech detection logic and an automatic speech recognition engine or VoIP application. An audio processing device includes a target speech enhancement engine configured to analyze a multichannel audio input signal and generate a plurality of enhanced target streams, a multi-stream target-speech detection generator comprising a plurality of target-speech detector engines each configured to determine a probability of detecting a specific target-speech of interest in the stream, wherein the multi-stream target-speech detection generator is configured to determine a plurality of weights associated with the enhanced target streams, and a fusion subsystem configured to apply the plurality of weights to the enhanced target streams to generate an enhancement output signal.
-
公开(公告)号:US11373667B2
公开(公告)日:2022-06-28
申请号:US15957829
申请日:2018-04-19
Applicant: SYNAPTICS INCORPORATED
Inventor: Saeed Mosayyebpour Kaskari , Francesco Nesta , Trausti Thormundsson , Thomas Aaron Gulliver
IPC: H04B15/00 , G10L21/0232 , G10L21/038 , G10L21/0208 , G10L25/18 , G10L21/00
Abstract: Systems and methods for processing an audio signal include an audio input operable to receive an input signal comprising a time-domain, single-channel audio signal, a subband analysis block operable to transform the input signal to a frequency domain input signal comprising a plurality of k-spaced under-sampled subband signals, a reverberation reduction block operable to reduce reverberation effect, including late reverberation, in the plurality of k-spaced under-sampled subband signals, a noise reduction block operable to reduce background noise from the plurality of k-spaced under-sampled subband signals, and a subband synthesis block operable to transform the subband signals to the time-domain, thereby producing an enhanced output signal.
-
公开(公告)号:US10762427B2
公开(公告)日:2020-09-01
申请号:US15909930
申请日:2018-03-01
Applicant: SYNAPTICS INCORPORATED
Inventor: Saeed Mosayyebpour Kaskari , Trausti Thormundsson , Francesco Nesta
Abstract: Classification training systems and methods include a neural network for classification of input data, a training dataset providing segmented labeled training data, and a classification training module operable to train the neural network using the training data. A forward pass processing module is operable to generate neural network outputs for the training data using weights and bias for the neural network, and a backward pass processing module is operable to update the weights and biases in a backward pass, including obtaining Region of Target (ROT) information from the training data, generate a forward-backward masking based on the ROT information, the forward-backward masking placing at least one restriction on a neural network output path, compute modified forward and backward variables based on the neural network outputs and the forward-backward masking, and update the weights and biases.
-
公开(公告)号:US20210248470A1
公开(公告)日:2021-08-12
申请号:US17243519
申请日:2021-04-28
Applicant: SYNAPTICS INCORPORATED
Inventor: Saeed Mosayyebpour Kaskari
Abstract: A classification training system comprises a neural network configured to perform classification of input data, a training dataset including pre-segmented, labeled training samples, and a classification training module configured to train the neural network using the training dataset. The classification training module includes a forward pass processing module, and a backward pass processing module. The backward pass processing module is configured to determine whether a current frame is in a region of target (ROT), determine ROT information such as beginning and length of the ROT and update weights and biases using a cross-entropy cost function and a tunable many-or-one detection (MOOD) cost function, that comprises a tunable hyperparameter for tuning the classifier for a particular task. The backward pass module further computes a soft target value using ROT information and computes a signal output error using the soft target value and network output value.
-
公开(公告)号:US20190355373A1
公开(公告)日:2019-11-21
申请号:US16414677
申请日:2019-05-16
Applicant: SYNAPTICS INCORPORATED
Inventor: Francesco Nesta , Saeed Mosayyebpour Kaskari , Dror Givon
IPC: G10L21/028 , H04S7/00 , H04R1/40 , G10L25/84
Abstract: Audio processing systems and methods comprise an audio sensor array configured to receive a multichannel audio input and generate a corresponding multichannel audio signal and a target activity detector configured to identify audio target sources in the multichannel audio signal. The target activity detector includes a VAD, an instantaneous locations component configured to detect a location of a plurality of audio sources, a dominant locations component configured to selectively buffer a subset of the plurality of audio sources comprising dominant audio sources, a source tracker configured to track locations of the dominant audio sources over time, and a dominance selection component configured to select the dominant target sources for further audio processing. The instantaneous location component computes a discrete spatial map comprising the location of the plurality of audio sources, and the dominant location component selects N of the dominant sources from the discrete spatial map for source tracking.
-
公开(公告)号:US20190172480A1
公开(公告)日:2019-06-06
申请号:US15832709
申请日:2017-12-05
Applicant: Synaptics Incorporated
Inventor: Saeed Mosayyebpour Kaskari , Francesco Nesta
Abstract: An audio processing device or method includes an audio transducer operable to receive audio input and generate an audio signal based on the audio input. The audio processing device or method also includes an audio signal processor operable to extract local features from the audio signal, such as Power-Normalized Coefficients (PNCC) of the audio signal. The audio signal processor also is operable to extract global features from the audio signal, such as chroma features and harmonicity features. A neural network is provided to determine a probability that a target audio is present in the audio signal based on the local and global features. In particular, the neural network is trained to output a value indicating whether the target audio is present and locally dominant in the audio signal.
-
公开(公告)号:US20180253648A1
公开(公告)日:2018-09-06
申请号:US15909930
申请日:2018-03-01
Applicant: SYNAPTICS INCORPORATED
Inventor: Saeed Mosayyebpour Kaskari , Trausti Thormundsson , Francesco Nesta
CPC classification number: G06N3/084 , G06K9/6256 , G06K9/627 , G06N3/04 , G06N3/0445 , G06N7/005 , G10L15/063 , G10L15/16 , G10L2015/025
Abstract: Classification training systems and methods include a neural network for classification of input data, a training dataset providing segmented labeled training data, and a classification training module operable to train the neural network using the training data. A forward pass processing module is operable to generate neural network outputs for the training data using weights and bias for the neural network, and a backward pass processing module is operable to update the weights and biases in a backward pass, including obtaining Region of Target (ROT) information from the training data, generate a forward-backward masking based on the ROT information, the forward-backward masking placing at least one restriction on a neural network output path, compute modified forward and backward variables based on the neural network outputs and the forward-backward masking, and update the weights and biases.
-
公开(公告)号:US20180232632A1
公开(公告)日:2018-08-16
申请号:US15894872
申请日:2018-02-12
Applicant: SYNAPTICS INCORPORATED
Inventor: Saeed Mosayyebpour Kaskari , Trausti Thormundsson , Francesco Nesta
CPC classification number: G06N3/049 , G06N3/0445 , G06N3/084 , G10L15/063 , G10L15/16 , G10L15/22 , G10L2015/223
Abstract: A classification system and method for training a neural network includes receiving a stream of segmented, labeled training data having a sequence of frames, computing a stream of input features data for the sequence of frames, and generating neural network outputs for the sequence of frames in a forward pass through the training data and in accordance weights and biases. The weights and biases are updated in a backward pass through the training data, including determining Region of Target (ROT) information from the segmented, labeled training data, computing modified forward and backward variables based on the neural network outputs and the ROT information, deriving a signal error for each frame within the sequence of frames based on the modified forward and backward variables, and updating the weights and biases based on the derived signal error. An adaptive learning module is provided to improve a convergence rate of the neural network.
-
公开(公告)号:US12254860B2
公开(公告)日:2025-03-18
申请号:US18052374
申请日:2022-11-03
Applicant: Synaptics Incorporated
Inventor: Pei-Wen Hsieh , Saeed Mosayyebpour Kaskari , Hong Qiu , Chuan-Yau Chan
IPC: G10K11/178
Abstract: This disclosure provides methods, devices, and systems for active noise cancellation (ANC). The present implementations more specifically relate to the use of dynamic range compression (DRC) for ANC. In some aspects, an ANC system receives an input audio signal of a transient noise as measured by a microphone, performs DRC on the input audio signal to generate a compressed dynamic range audio signal, and performs ANC on the compressed dynamic range audio signal to generate a cancellation signal associated with the input audio signal. The cancellation signal is based on an adjusted gain of the input audio signal to prevent saturation or large spikes of the cancellation signal, which can cause undesirable audio during playback.
-
-
-
-
-
-
-
-
-