Patent search ap:("Synaptics Incorporated") AND inv:"Saeed Mosayyebpour Kaskari" Page 1

1.

发明授权
Sensitivity mode for an audio spotting system 有权

公开(公告)号：US11823707B2

公开(公告)日：2023-11-21

申请号：US17572002

申请日：2022-01-10

Applicant: Synaptics Incorporated

Inventor： Saeed Mosayyebpour Kaskari

IPC: G10L25/84 , G10L21/0216 , G10L21/0264 , G10L25/78 , G10L21/0208

CPC classification number: G10L25/84 , G10L21/0216 , G10L21/0264 , G10L2021/02082 , G10L2021/02166 , G10L2025/786

Abstract: An audio spotting system configured for various operating modes including a regular mode and sensitivity mode is described. An example cascade audio spotting system may include a high-power subsystem including a high-power trigger and a transfer module. This high-power trigger includes one or more detection models used to detect whether a target sound activity is included in the one or more audio streams. The one or more detection models are associated with a first set of hyperparameters when the cascade audio spotting system is in a regular mode, and the one or more detection models are associated with a second set of hyperparameters when the cascade audio spotting system is in a sensitivity mode. The transfer module provides at least one of one or more processed audio streams for further processing in response to the high-power trigger detecting the target sound activity in the one or more audio streams.

2.

发明授权
Multi-stream target-speech detection and channel fusion 有权

公开(公告)号：US11694710B2

公开(公告)日：2023-07-04

申请号：US17484208

申请日：2021-09-24

Applicant: Synaptics Incorporated

Inventor： Francesco Nesta , Saeed Mosayyebpour Kaskari

IPC: G10L21/0364 , G10L25/60 , G10L15/22 , G10L25/84 , H04R1/40 , H04R3/00 , H04S3/00 , H04L65/60

CPC classification number: G10L21/0364 , G10L15/22 , G10L25/60 , G10L25/84 , H04R1/406 , H04R3/005 , H04S3/008 , H04L65/60 , H04S2400/01

Abstract: Audio processing systems and methods include an audio sensor array configured to receive a multichannel audio input and generate a corresponding multichannel audio signal and target-speech detection logic and an automatic speech recognition engine or VoIP application. An audio processing device includes a target speech enhancement engine configured to analyze a multichannel audio input signal and generate a plurality of enhanced target streams, a multi-stream target-speech detection generator comprising a plurality of target-speech detector engines each configured to determine a probability of detecting a specific target-speech of interest in the stream, wherein the multi-stream target-speech detection generator is configured to determine a plurality of weights associated with the enhanced target streams, and a fusion subsystem configured to apply the plurality of weights to the enhanced target streams to generate an enhancement output signal.

3.

发明授权
Real-time single-channel speech enhancement in noisy and time-varying environments 有权

公开(公告)号：US11373667B2

公开(公告)日：2022-06-28

申请号：US15957829

申请日：2018-04-19

Applicant: SYNAPTICS INCORPORATED

Inventor： Saeed Mosayyebpour Kaskari , Francesco Nesta , Trausti Thormundsson , Thomas Aaron Gulliver

IPC: H04B15/00 , G10L21/0232 , G10L21/038 , G10L21/0208 , G10L25/18 , G10L21/00

Abstract: Systems and methods for processing an audio signal include an audio input operable to receive an input signal comprising a time-domain, single-channel audio signal, a subband analysis block operable to transform the input signal to a frequency domain input signal comprising a plurality of k-spaced under-sampled subband signals, a reverberation reduction block operable to reduce reverberation effect, including late reverberation, in the plurality of k-spaced under-sampled subband signals, a noise reduction block operable to reduce background noise from the plurality of k-spaced under-sampled subband signals, and a subband synthesis block operable to transform the subband signals to the time-domain, thereby producing an enhanced output signal.

4.

发明授权
Connectionist temporal classification using segmented labeled sequence data 有权

公开(公告)号：US10762427B2

公开(公告)日：2020-09-01

申请号：US15909930

申请日：2018-03-01

Applicant: SYNAPTICS INCORPORATED

Inventor： Saeed Mosayyebpour Kaskari , Trausti Thormundsson , Francesco Nesta

IPC: G10L15/00 , G06N3/08 , G06K9/62 , G06N3/04 , G10L15/06 , G10L15/16 , G10L15/02 , G06N7/00

Abstract: Classification training systems and methods include a neural network for classification of input data, a training dataset providing segmented labeled training data, and a classification training module operable to train the neural network using the training data. A forward pass processing module is operable to generate neural network outputs for the training data using weights and bias for the neural network, and a backward pass processing module is operable to update the weights and biases in a backward pass, including obtaining Region of Target (ROT) information from the training data, generate a forward-backward masking based on the ROT information, the forward-backward masking placing at least one restriction on a neural network output path, compute modified forward and backward variables based on the neural network outputs and the forward-backward masking, and update the weights and biases.

5.

发明申请
MANY OR ONE DETECTION CLASSIFICATION SYSTEMS AND METHODS 有权

公开(公告)号：US20210248470A1

公开(公告)日：2021-08-12

申请号：US17243519

申请日：2021-04-28

Applicant: SYNAPTICS INCORPORATED

Inventor： Saeed Mosayyebpour Kaskari

IPC: G06N3/08 , G10L25/51 , G10L25/30 , G10L15/06 , G10L15/16 , G10L15/22

Abstract: A classification training system comprises a neural network configured to perform classification of input data, a training dataset including pre-segmented, labeled training samples, and a classification training module configured to train the neural network using the training dataset. The classification training module includes a forward pass processing module, and a backward pass processing module. The backward pass processing module is configured to determine whether a current frame is in a region of target (ROT), determine ROT information such as beginning and length of the ROT and update weights and biases using a cross-entropy cost function and a tunable many-or-one detection (MOOD) cost function, that comprises a tunable hyperparameter for tuning the classifier for a particular task. The backward pass module further computes a soft target value using ROT information and computes a signal output error using the soft target value and network output value.

6.

发明申请
360-DEGREE MULTI-SOURCE LOCATION DETECTION, TRACKING AND ENHANCEMENT 审中-公开

公开(公告)号：US20190355373A1

公开(公告)日：2019-11-21

申请号：US16414677

申请日：2019-05-16

Applicant: SYNAPTICS INCORPORATED

Inventor： Francesco Nesta , Saeed Mosayyebpour Kaskari , Dror Givon

IPC: G10L21/028 , H04S7/00 , H04R1/40 , G10L25/84

Abstract: Audio processing systems and methods comprise an audio sensor array configured to receive a multichannel audio input and generate a corresponding multichannel audio signal and a target activity detector configured to identify audio target sources in the multichannel audio signal. The target activity detector includes a VAD, an instantaneous locations component configured to detect a location of a plurality of audio sources, a dominant locations component configured to selectively buffer a subset of the plurality of audio sources comprising dominant audio sources, a source tracker configured to track locations of the dominant audio sources over time, and a dominance selection component configured to select the dominant target sources for further audio processing. The instantaneous location component computes a discrete spatial map comprising the location of the plurality of audio sources, and the dominant location component selects N of the dominant sources from the discrete spatial map for source tracking.

7.

发明申请
VOICE ACTIVITY DETECTION SYSTEMS AND METHODS 审中-公开

公开(公告)号：US20190172480A1

公开(公告)日：2019-06-06

申请号：US15832709

申请日：2017-12-05

Applicant: Synaptics Incorporated

Inventor： Saeed Mosayyebpour Kaskari , Francesco Nesta

IPC: G10L25/78 , G10L15/02 , G10L15/16 , G10L25/18 , G10L25/21 , G10L15/22

Abstract: An audio processing device or method includes an audio transducer operable to receive audio input and generate an audio signal based on the audio input. The audio processing device or method also includes an audio signal processor operable to extract local features from the audio signal, such as Power-Normalized Coefficients (PNCC) of the audio signal. The audio signal processor also is operable to extract global features from the audio signal, such as chroma features and harmonicity features. A neural network is provided to determine a probability that a target audio is present in the audio signal based on the local and global features. In particular, the neural network is trained to output a value indicating whether the target audio is present and locally dominant in the audio signal.

8.

发明申请
CONNECTIONIST TEMPORAL CLASSIFICATION USING SEGMENTED LABELED SEQUENCE DATA 审中-公开

公开(公告)号：US20180253648A1

公开(公告)日：2018-09-06

申请号：US15909930

申请日：2018-03-01

Applicant: SYNAPTICS INCORPORATED

Inventor： Saeed Mosayyebpour Kaskari , Trausti Thormundsson , Francesco Nesta

IPC: G06N3/08 , G06N3/04 , G06K9/62

CPC classification number: G06N3/084 , G06K9/6256 , G06K9/627 , G06N3/04 , G06N3/0445 , G06N7/005 , G10L15/063 , G10L15/16 , G10L2015/025

Abstract: Classification training systems and methods include a neural network for classification of input data, a training dataset providing segmented labeled training data, and a classification training module operable to train the neural network using the training data. A forward pass processing module is operable to generate neural network outputs for the training data using weights and bias for the neural network, and a backward pass processing module is operable to update the weights and biases in a backward pass, including obtaining Region of Target (ROT) information from the training data, generate a forward-backward masking based on the ROT information, the forward-backward masking placing at least one restriction on a neural network output path, compute modified forward and backward variables based on the neural network outputs and the forward-backward masking, and update the weights and biases.

9.

发明申请
EFFICIENT CONNECTIONIST TEMPORAL CLASSIFICATION FOR BINARY CLASSIFICATION 审中-公开

公开(公告)号：US20180232632A1

公开(公告)日：2018-08-16

申请号：US15894872

申请日：2018-02-12

Applicant: SYNAPTICS INCORPORATED

Inventor： Saeed Mosayyebpour Kaskari , Trausti Thormundsson , Francesco Nesta

IPC: G06N3/04 , G06N3/08

CPC classification number: G06N3/049 , G06N3/0445 , G06N3/084 , G10L15/063 , G10L15/16 , G10L15/22 , G10L2015/223

Abstract: A classification system and method for training a neural network includes receiving a stream of segmented, labeled training data having a sequence of frames, computing a stream of input features data for the sequence of frames, and generating neural network outputs for the sequence of frames in a forward pass through the training data and in accordance weights and biases. The weights and biases are updated in a backward pass through the training data, including determining Region of Target (ROT) information from the segmented, labeled training data, computing modified forward and backward variables based on the neural network outputs and the ROT information, deriving a signal error for each frame within the sequence of frames based on the modified forward and backward variables, and updating the weights and biases based on the derived signal error. An adaptive learning module is provided to improve a convergence rate of the neural network.

10.

发明授权
Dynamic range compression combined with active noise cancellation to remove artifacts caused by transient noises 有权

公开(公告)号：US12254860B2

公开(公告)日：2025-03-18

申请号：US18052374

申请日：2022-11-03

Applicant: Synaptics Incorporated

Inventor： Pei-Wen Hsieh , Saeed Mosayyebpour Kaskari , Hong Qiu , Chuan-Yau Chan

IPC: G10K11/178

Abstract: This disclosure provides methods, devices, and systems for active noise cancellation (ANC). The present implementations more specifically relate to the use of dynamic range compression (DRC) for ANC. In some aspects, an ANC system receives an input audio signal of a transient noise as measured by a microphone, performs DRC on the input audio signal to generate a compressed dynamic range audio signal, and performs ANC on the compressed dynamic range audio signal to generate a cancellation signal associated with the input audio signal. The cancellation signal is based on an adjusted gain of the input audio signal to prevent saturation or large spikes of the cancellation signal, which can cause undesirable audio during playback.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification