-
Publication No.: US10037677B2
Publication date: 2018-07-31
Application No.: US15492105
Application date: 2017-04-20
Inventor: Xuan Zhong , William Yost , Michael Dorman , Julie Liss , Visar Berisha
CPC classification number: G08B21/182 , G09B5/00 , G09B19/04 , G10L25/78 , G10L2025/783
Abstract: Disclosed herein are speech therapeutic devices and methods. In one aspect, the speech therapeutic device includes audio input circuitry, signal processing circuitry, and stimulus circuitry. In certain embodiments, the audio input circuitry is configured to provide an input signal that is indicative of speech provided by a user and the signal processing circuitry is configured to utilize a reconfigurable rule that includes a condition, receive the input signal, process the input signal using the reconfigurable rule, and provide an alert signal responsive to attainment of the condition. The stimulus circuitry is configured to receive the alert signal and provide a stimulus to the user. The signal processing circuitry is additionally configured to (i) receive the reconfigurable rule from a communication network, and/or (ii) generate a record indicative of the alert signal, store the record in a memory, and send the record to a communication network.
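The rule-driven alert flow in this abstract can be illustrated with a minimal sketch. It is not the patented implementation: the `Rule` and `Device` classes, the decibel input, and the thresholds are all hypothetical names and values chosen only for illustration, and the network transfer of rules and records is reduced to plain method calls.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class Rule:
    """Hypothetical reconfigurable rule: a named condition on a speech level."""
    name: str
    condition: Callable[[float], bool]


@dataclass
class Device:
    """Hypothetical stand-in for the signal processing circuitry."""
    rule: Rule
    records: List[dict] = field(default_factory=list)

    def reconfigure(self, rule: Rule) -> None:
        # Stands in for receiving a new rule from a communication network.
        self.rule = rule

    def process(self, level_db: float) -> bool:
        # Returns True (the alert signal) when the rule's condition is attained.
        alert = self.rule.condition(level_db)
        if alert:
            # Keep a record of the alert in memory for later upload.
            self.records.append({"rule": self.rule.name, "level_db": level_db})
        return alert


# Assumed example: alert when speech is quieter than 60 dB, then swap the rule.
device = Device(Rule("too_quiet", lambda db: db < 60.0))
alerts = [device.process(db) for db in (65.0, 55.0, 70.0)]
device.reconfigure(Rule("too_loud", lambda db: db > 80.0))
alerts.append(device.process(85.0))
```

Keeping the condition as a replaceable object is what makes the rule "reconfigurable" in this sketch: the processing loop does not change when a new rule arrives.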
-
Publication No.: US10290200B2
Publication date: 2019-05-14
Application No.: US16035985
Application date: 2018-07-16
Inventor: Xuan Zhong , William Yost , Michael Dorman , Julie Liss , Visar Berisha
Abstract: Disclosed herein are speech therapeutic devices and methods. In one aspect, the speech therapeutic device includes audio input circuitry, signal processing circuitry, and stimulus circuitry. In certain embodiments, the audio input circuitry is configured to provide an input signal that is indicative of speech provided by a user and the signal processing circuitry is configured to utilize a reconfigurable rule that includes a condition, receive the input signal, process the input signal using the reconfigurable rule, and provide an alert signal responsive to attainment of the condition. The stimulus circuitry is configured to receive the alert signal and provide a stimulus to the user. The signal processing circuitry is additionally configured to (i) receive the reconfigurable rule from a communication network, and/or (ii) generate a record indicative of the alert signal, store the record in a memory, and send the record to a communication network.
-
Publication No.: US20180322763A1
Publication date: 2018-11-08
Application No.: US16035985
Application date: 2018-07-16
Inventor: Xuan Zhong , William Yost , Michael Dorman , Julie Liss , Visar Berisha
CPC classification number: G08B21/182 , G09B5/00 , G09B19/04 , G10L25/78 , G10L2025/783
Abstract: Disclosed herein are speech therapeutic devices and methods. In one aspect, the speech therapeutic device includes audio input circuitry, signal processing circuitry, and stimulus circuitry. In certain embodiments, the audio input circuitry is configured to provide an input signal that is indicative of speech provided by a user and the signal processing circuitry is configured to utilize a reconfigurable rule that includes a condition, receive the input signal, process the input signal using the reconfigurable rule, and provide an alert signal responsive to attainment of the condition. The stimulus circuitry is configured to receive the alert signal and provide a stimulus to the user. The signal processing circuitry is additionally configured to (i) receive the reconfigurable rule from a communication network, and/or (ii) generate a record indicative of the alert signal, store the record in a memory, and send the record to a communication network.
-
4.
Publication No.: US11978466B2
Publication date: 2024-05-07
Application No.: US17827438
Application date: 2022-05-27
Inventor: Jianwei Zhang , Suren Jayasuriya , Visar Berisha
IPC: G10L21/02 , G10L19/028 , G10L25/18 , G10L25/30
CPC classification number: G10L21/02 , G10L19/028 , G10L25/18 , G10L25/30
Abstract: Systems, methods, and apparatuses to restore degraded speech via a modified diffusion model are described. An exemplary system is specially configured to train a diffusion-based vocoder containing an upsampler, based on pairing original speech x and degraded speech mel-spectrum mT samples; train a deep convolutional neural network (CNN) upsampler based on a mean absolute error loss to match the estimated original speech x̂′ outputted by the diffusion-based vocoder by extracting the upsampler, generating a reference conditioner, and generating a weighted altered conditioner cTn′. The system further optimizes speech quality to invert non-linear transformation and estimate lost data by feeding the degraded mel-spectrum mT through the CNN upsampler and feeding the degraded mel-spectrum mT through the diffusion-based vocoder. The system then generates estimated original speech x̂′ based on the corresponding degraded speech mel-spectrum mT. Other related embodiments are described.
-
5.
Publication No.: US20220392471A1
Publication date: 2022-12-08
Application No.: US17827438
Application date: 2022-05-27
Inventor: Jianwei Zhang , Suren Jayasuriya , Visar Berisha
IPC: G10L19/028 , G10L25/30 , G10L25/18
Abstract: Systems, methods, and apparatuses to restore degraded speech via a modified diffusion model are described. An exemplary system is specially configured to train a diffusion-based vocoder containing an upsampler, based on pairing original speech x and degraded speech mel-spectrum mT samples; train a deep convolutional neural network (CNN) upsampler based on a mean absolute error loss to match the estimated original speech x̂′ outputted by the diffusion-based vocoder by extracting the upsampler, generating a reference conditioner, and generating a weighted altered conditioner ć′Tn. The system further optimizes speech quality to invert non-linear transformation and estimate lost data by feeding the degraded mel-spectrum mT through the CNN upsampler and feeding the degraded mel-spectrum mT through the diffusion-based vocoder. The system then generates estimated original speech x̂′ based on the corresponding degraded speech mel-spectrum mT. Other related embodiments are described.
-
6.
Publication No.: US10796715B1
Publication date: 2020-10-06
Application No.: US15693699
Application date: 2017-09-01
Inventor: Visar Berisha , Ming Tu , Alan Wisler , Julie Liss
IPC: G10L25/66 , A61B5/00 , G10L25/51 , G10L25/78 , G10L15/02 , G10L25/90 , G10L25/60 , G10L15/16 , G10L15/22
Abstract: Systems and methods use patient speech samples as inputs, use subjective multi-point ratings by speech-language pathologists of multiple perceptual dimensions of patient speech samples as further inputs, and extract laboratory-implemented features from the patient speech samples. A predictive software model learns the relationship between speech acoustics and the subjective ratings of such speech obtained from speech-language pathologists, and is configured to apply this information to evaluate new speech samples. Outputs may include objective evaluation of the plurality of perceptual dimensions for new speech samples and/or evaluation of disease onset, disease progression, or disease treatment efficacy for a condition involving dysarthria as a symptom, utilizing the new speech samples.
-
Publication No.: US12175998B2
Publication date: 2024-12-24
Application No.: US17292339
Application date: 2019-11-08
Applicant: Arizona Board of Regents on behalf of Arizona State University , Mayo Foundation for Medical Education and Research
Inventor: Visar Berisha , Jacob Peplinski , Todd Schwedt
Abstract: Speech analysis devices and methods for identifying migraine attacks are provided. Migraine sufferers can experience changes in speech patterns both during a migraine attack and in a pre-attack phase (e.g., a time period before the migraine attack can be recognized by the migraine sufferer). Embodiments identify or predict migraine attacks during the pre-attack phase and/or the attack phase (such as early stages of a migraine attack) by comparing speech features from one or more speech samples provided by a user against baseline data. The speech features are indicative and/or predictive of migraine onset, and can be personalized to a user and/or based on normative data.
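Comparing a new speech sample against baseline data can be sketched as a per-feature z-score check. This is only an illustration under stated assumptions: the abstract does not name the features or the comparison rule, so the feature names (`speaking_rate`, `pause_ratio`), the z-score test, and the threshold of 2.0 are all hypothetical.

```python
from statistics import mean, stdev
from typing import Dict, List


def baseline_z_scores(
    baseline: Dict[str, List[float]], sample: Dict[str, float]
) -> Dict[str, float]:
    """Z-score of each feature in `sample` relative to the user's own baseline."""
    scores = {}
    for name, history in baseline.items():
        mu, sigma = mean(history), stdev(history)
        # Guard against a constant baseline, where the z-score is undefined.
        scores[name] = (sample[name] - mu) / sigma if sigma > 0 else 0.0
    return scores


def flag_sample(scores: Dict[str, float], threshold: float = 2.0) -> bool:
    """Flag the sample if any feature deviates by at least `threshold` SDs."""
    return any(abs(z) >= threshold for z in scores.values())


# Hypothetical personalized baseline built from earlier recordings.
baseline = {
    "speaking_rate": [4.0, 4.2, 3.8, 4.1, 3.9],   # syllables per second (assumed)
    "pause_ratio": [0.20, 0.22, 0.18, 0.21, 0.19],
}
typical = {"speaking_rate": 4.05, "pause_ratio": 0.20}
unusual = {"speaking_rate": 3.0, "pause_ratio": 0.30}

typical_flag = flag_sample(baseline_z_scores(baseline, typical))
unusual_flag = flag_sample(baseline_z_scores(baseline, unusual))
```

The same functions would work with normative data in place of a personal baseline by passing population values as `baseline`.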
-
Publication No.: US20240180482A1
Publication date: 2024-06-06
Application No.: US18553335
Application date: 2022-03-31
Inventor: Gabriela Stegmann , Julie Liss , Visar Berisha , Shira Hahn
CPC classification number: A61B5/4803 , A61B5/4088 , A61B5/6898 , A61B5/7267 , G10L15/1815 , G10L25/66
Abstract: Disclosed herein are systems and methods for evaluating or analyzing cognitive function or impairment using speech analysis. In some implementations the evaluation of cognitive function comprises a predicted future cognitive function or change in cognitive function. In some implementations the cognitive function is evaluated using a panel of speech features such as a metric of semantic relevance, MATTR, and other relevant features. In another aspect, a machine learning predictive model for evaluating cognitive function based on speech, comprising: receiving input signal comprising speech audio for a plurality of subjects, to detect one or more metrics of speech identifying classifications corresponding to cognitive function and training a model using machine learning based on a training data set comprising the one or more metrics of speech and the classifications identified in the speech audio, thereby generating a machine learning predictive model configured to generate an evaluation of cognitive function based on speech.
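MATTR, one of the features named in this abstract, is the moving-average type-token ratio: the ratio of unique words to total words, averaged over every sliding window of a fixed length. The sketch below follows that general definition; the function name, the default window of 50 tokens, and the fallback for short texts are assumptions, not details taken from the application.

```python
from typing import List


def mattr(tokens: List[str], window: int = 50) -> float:
    """Moving-average type-token ratio over sliding windows of `window` tokens."""
    if not tokens:
        return 0.0
    if len(tokens) <= window:
        # Assumed convention: fall back to a plain type-token ratio.
        return len(set(tokens)) / len(tokens)
    ratios = [
        len(set(tokens[i:i + window])) / window
        for i in range(len(tokens) - window + 1)
    ]
    return sum(ratios) / len(ratios)


# Windows of 2 tokens: [the, the], [the, cat], [cat, the], [the, the].
score = mattr("the the cat the the".split(), window=2)
```

Averaging over fixed-size windows makes the score far less sensitive to transcript length than a plain type-token ratio, which falls as a text gets longer.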
-
Publication No.: US20240049981A1
Publication date: 2024-02-15
Application No.: US18266563
Application date: 2021-12-09
Inventor: Visar Berisha , Julie Liss , Shira Hahn , Gabriela Stegmann , Jeremy Shefner
CPC classification number: A61B5/08 , G10L25/66 , G10L15/22 , G10L15/02 , A61B5/4803 , A61B5/7267 , A61B5/6898 , G16H10/20 , G16H10/60 , G16H20/30 , A61B2562/0204
Abstract: Described are platforms, systems, media, and methods for maintaining a database of items associated with one or more skill requirements and a visit duration; maintaining a database of experts associated with one or more skill proficiencies, a location, and a schedule; receiving a request from a consumer for delivery by an expert of one or more items in the database to a consumer address; identifying experts in the database having skill proficiencies matching the skill requirements of the one or more items and available in a timeslot for the visit duration of the one or more items; presenting timeslots for which one or more experts are identified to the consumer and allowing the consumer to select a timeslot; and selecting an expert from among the identified experts in the selected timeslot based on shortest travel time; provided that utilization of the selected expert exceeds a predetermined utilization threshold.
-
Publication No.: US20170309154A1
Publication date: 2017-10-26
Application No.: US15492105
Application date: 2017-04-20
Inventor: Xuan Zhong , William Yost , Michael Dorman , Julie Liss , Visar Berisha
CPC classification number: G08B21/182 , G09B5/00 , G09B19/04 , G10L25/78 , G10L2025/783
Abstract: Disclosed herein are speech therapeutic devices and methods. In one aspect, the speech therapeutic device includes audio input circuitry, signal processing circuitry, and stimulus circuitry. In certain embodiments, the audio input circuitry is configured to provide an input signal that is indicative of speech provided by a user and the signal processing circuitry is configured to utilize a reconfigurable rule that includes a condition, receive the input signal, process the input signal using the reconfigurable rule, and provide an alert signal responsive to attainment of the condition. The stimulus circuitry is configured to receive the alert signal and provide a stimulus to the user. The signal processing circuitry is additionally configured to (i) receive the reconfigurable rule from a communication network, and/or (ii) generate a record indicative of the alert signal, store the record in a memory, and send the record to a communication network.