-
公开(公告)号:US20240221743A1
公开(公告)日:2024-07-04
申请号:US18557173
申请日:2021-07-27
Applicant: QUALCOMM Incorporated
Inventor: Jun WEI , Xiaoxia DONG , Qimeng PAN , Kwihyuk JIN , Tong TANG
CPC classification number: G10L15/22 , G10L25/63 , G10L25/84 , G10L2015/225 , G10L2015/228 , G10L2025/783
Abstract: Embodiments include methods of voice or speech recognition in varied environments and/or user emotional states executed by a processor of a computing device. The processor of a computing device may determine a voice or speech recognition threshold for voice or speech recognition based on information obtained from contextual information detected in an environment from which a received audio input was captured by the computing device and an emotional classification of a user's voice in the received audio input. The processor may determine a confidence score for one or more key words identified in the received audio input. The processor may then output results of a voice or speech recognition analysis of the received audio input in response to the determined confidence score exceeding the determined voice or speech recognition threshold.
-
2.
公开(公告)号:US20230041568A1
公开(公告)日:2023-02-09
申请号:US17759328
申请日:2020-04-02
Applicant: QUALCOMM Incorporated
Inventor: Qimeng PAN , Xiaoxia DONG , Jun WEI
IPC: H04W28/02
Abstract: Various embodiments include methods for preloading data onto a wireless device based on predicted network status coverage and predicted user data access behaviors. In an aspect, a server may receive trip information a wireless device, receive network coverage map information representing a different network coverage areas, predict a travel time for the wireless device to reach the network coverage area with reduced signal quality, predict a travel duration of the wireless device within that network coverage area, and transmit a notification to the wireless device to enable preloading appropriate data before entering the network coverage area. Further aspects include wireless devices configure to communicate with the server and preloading appropriate data before entering the network coverage area
-
公开(公告)号:US20250095640A1
公开(公告)日:2025-03-20
申请号:US18292297
申请日:2021-09-26
Applicant: QUALCOMM Incorporated
Inventor: Jun WEI , Xiaoxia DONG , Qimeng PAN
Abstract: Techniques described herein are directed to improving a user keyword detection model using user audio samples that have been falsely rejected. In some embodiments, user equipment (UE) may detect multiple attempts by a user at uttering a keyword. A true keyword that matches keyword models implemented by the UE may activate a desired function, such as initiating an assistant application, initiating a specific application, waking up from a lower power state, transitioning to a lower power state, toggling a power-saving mode, unlocking or locking the device, etc. Any true keywords uttered prior to detection of the true keyword but which have been falsely rejected may be sent to a server to train the keyword model and generate an updated keyword model. The updated keyword model may be received by the UE to replace the keyword model being used, allowing the UE to continually improve keyword detection accuracy.
-
公开(公告)号:US20230197085A1
公开(公告)日:2023-06-22
申请号:US17997243
申请日:2020-06-22
Applicant: QUALCOMM Incorporated
Inventor: Xiaoxia DONG , Jun WEI , Qimeng PAN
IPC: G10L17/20 , G10L25/51 , G10L21/0216 , G10L17/22
CPC classification number: G10L17/20 , G10L25/51 , G10L21/0216 , G10L17/22
Abstract: Embodiments include methods for voice/speech recognition in noisy environments executed by a processor of a computing device. In various embodiments, voice or speech recognition may be executed by a processor of a computing device, which may include determining a voice recognition model to use for voice and/or speech recognition based on a location where an audio input is received and performing voice and/or speech recognition on the audio input using the determined voice recognition model. Some embodiments my receive from a computing device, an audio input and location information associated with a location where the audio input was recorded. The received audio input may be used to generate a voice recognition model associated with the location where the audio input was recorded for use in voice and/or speech recognition. The generated voice recognition model associated with the location may be provided to the computing device.
-
-
-