-
公开(公告)号:US12266350B1
公开(公告)日:2025-04-01
申请号:US17583812
申请日:2022-01-25
Applicant: Nvidia Corporation
Inventor: Siddha Ganju , Ruthie Lyle , Steven Dalton
Abstract: Systems and methods are directed toward evaluating auditory inputs against a range of tolerance to provide feedback regarding pronunciation. An auditory input may be evaluated using a trained machine learning system and evaluated for similarity against a target word. Similarity may be scored and then evaluated to determine whether the similarity falls within a range of tolerance, wherein the range of tolerance may be adjusted or modified for particular uses. A score within the range of tolerance is indicative of a word that has been pronounced such that it would be perceptible.
-
2.
公开(公告)号:US11605384B1
公开(公告)日:2023-03-14
申请号:US17390118
申请日:2021-07-30
Applicant: NVIDIA Corporation
Inventor: Steven Dalton , Siddha Ganju , Ruthie Lyle
Abstract: Systems and methods of presenting interrupting content during human speech are disclosed. The proposed systems offer improved duplex communications in conversational AI platforms. In some embodiments, the system receives speech data and evaluates the data using linguistic models. If the linguistic models detect indications of linguistic irregularities such as mispronunciation, a smart feedback assistant can determine that the system should interrupt the speaker in near-real-time and provide feedback regarding their pronunciation. In addition, conversational irregularities may also be detected, causing the smart feedback assistant to interrupt with presentation of moderating guidance. In some cases, emotion models may also be utilized to detect emotional states based on the speaker's voice in order to offer near-immediate feedback. Users can also customize the manner and occasions in which they are interrupted.
-