-
Publication No.: US11568887B1
Publication Date: 2023-01-31
Application No.: US17234186
Filing Date: 2021-04-19
Abstract: Various examples are provided for surveillance of an audio stream. In one example, a method includes identifying presence or absence of a sound type of interest at a location during a time period; selecting the sound type from a library of sound type information to provide a collection of sound type information; incorporating the collection on a device proximate to the location; acquiring an audio stream from the location by the device to provide a locational audio stream; analyzing the locational audio stream to determine whether a sound type in the collection is present in the audio stream; and generating a notification to a user or computer if a sound type in the collection is present. The device can acquire and process the audio stream. In another example, a bulk sound type information library can be generated by identifying sound types of interest and including them based upon a confidence level.
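The select, analyze, and notify steps in this abstract can be sketched as follows. This is a minimal illustration, not the patented method: the `SoundType` signatures, the cosine-similarity matcher, and the 0.9 threshold are all assumptions made for the example, and each "frame" stands in for features already extracted from the locational audio stream.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SoundType:
    name: str
    # Hypothetical signature: a small band-energy profile standing in
    # for real sound type information.
    signature: tuple


def select_collection(library, names_of_interest):
    """Pick the sound types of interest out of the larger library."""
    return [s for s in library if s.name in names_of_interest]


def similarity(a, b):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = (sum(x * x for x in a) ** 0.5) * (sum(y * y for y in b) ** 0.5)
    return dot / norm if norm else 0.0


def analyze_stream(frames, collection, threshold=0.9):
    """Return (frame index, sound name) for every frame matching the collection."""
    notifications = []
    for index, frame in enumerate(frames):
        for sound in collection:
            if similarity(frame, sound.signature) >= threshold:
                notifications.append((index, sound.name))
    return notifications


# Assumed library contents, for illustration only.
LIBRARY = [
    SoundType("glass_break", (0.1, 0.2, 0.9)),
    SoundType("dog_bark", (0.8, 0.5, 0.1)),
    SoundType("alarm", (0.1, 0.9, 0.1)),
]

collection = select_collection(LIBRARY, {"glass_break", "alarm"})
frames = [(0.8, 0.5, 0.1), (0.1, 0.2, 0.9), (0.3, 0.3, 0.3)]
alerts = analyze_stream(frames, collection)
```

Because `dog_bark` is left out of the collection, the first frame produces no alert even though it matches a library entry; only the sound types selected for this location are reported.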
-
Publication No.: US11348475B2
Publication Date: 2022-05-31
Application No.: US15649382
Filing Date: 2017-07-13
Applicant: The Boeing Company
IPC Classification: G10L21/00, G10L25/00, G09B5/02, G05B19/042, G06T19/00, G09B19/24, G10L15/22, G06F16/242, H04L67/12
Abstract: A cognitive assistant that allows a maintainer to speak to an application using natural language is disclosed. The maintainer can quickly interact with an application hands-free without the need to use complex user interfaces or memorized voice commands. The assistant provides instructions to the maintainer using augmented reality audio and visual cues. The assistant will walk the maintainer through maintenance tasks and verify proper execution using IoT sensors. If, after a step is completed, the IoT sensor readings are not as expected, the maintainer is notified of how to resolve the situation.
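The step-by-step verification described here might look like the sketch below. The step list, sensor names, and message wording are hypothetical; the abstract does not say how sensor expectations are represented or how the maintainer is notified.

```python
# Hypothetical task definition: each step names the sensor that should confirm it.
STEPS = [
    {"instruction": "Close the access panel.", "sensor": "panel_closed", "expected": True},
    {"instruction": "Set line pressure to zero.", "sensor": "line_pressure_psi", "expected": 0},
]


def verify_step(step, readings):
    """Compare the sensor reading for one step against its expected value."""
    actual = readings.get(step["sensor"])
    if actual == step["expected"]:
        return True, "Verified: " + step["instruction"]
    return False, (f"Sensor '{step['sensor']}' reads {actual!r}, expected "
                   f"{step['expected']!r}. Please repeat: {step['instruction']}")


def run_task(steps, readings):
    """Walk through the steps in order and stop at the first failed check."""
    messages = []
    for step in steps:
        ok, message = verify_step(step, readings)
        messages.append(message)
        if not ok:
            break
    return messages


messages = run_task(STEPS, {"panel_closed": True, "line_pressure_psi": 35})
```

In a real assistant the messages would be rendered as audio or augmented reality cues rather than returned as strings.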
-
Publication No.: US11259103B2
Publication Date: 2022-02-22
Application No.: US16643214
Filing Date: 2018-06-29
Inventors: Hwa-Sung Kim, Dong Hyun Sohn, Ji-Eun Lee
Abstract: The present disclosure relates to a home appliance capable of being operated by speech of a user. The home appliance includes a main body forming an outer appearance, a microphone unit including at least one sensing portion disposed to face the front of the main body to detect speech of a user, and a speaker unit disposed to be spaced apart from the microphone unit by a predetermined distance.
-
Publication No.: US11250876B1
Publication Date: 2022-02-15
Application No.: US16707345
Filing Date: 2019-12-09
Abstract: A confidential sentiment analysis method includes receiving call data, storing the call data including interaction metadata, generating a speech-to-text transcript corresponding to words spoken by one or more callers, generating an anonymized transcript by anonymizing personally identifiable words, and generating a sentiment score by analyzing the anonymized transcript. A computing system includes a processor and a memory including computer-executable instructions that, when executed by the processor, cause the system to receive call data, store the call data, generate a speech-to-text transcript, generate an anonymized transcript by anonymizing personally identifiable words, and generate a sentiment score based on the anonymized transcript. A non-transitory computer-readable medium contains program instructions that, when executed, cause a computer system to receive call data, store the call data, generate a speech-to-text transcript, generate an anonymized transcript by anonymizing personally identifiable words, and generate a sentiment score based on the anonymized transcript.
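The anonymize-then-score order in this abstract can be illustrated with a small sketch. The regular expressions, placeholder tokens, and word lists are assumptions chosen for the example; the abstract does not specify how personally identifiable words are detected or how the sentiment score is computed.

```python
import re

# Hypothetical patterns; a real system would need a much broader PII detector.
PII_PATTERNS = [
    (re.compile(r"\b\d{3}-\d{2}-\d{4}\b"), "[SSN]"),
    (re.compile(r"\b\d{3}-\d{3}-\d{4}\b"), "[PHONE]"),
]

# Tiny illustrative lexicons, not a real sentiment model.
POSITIVE = {"great", "thanks", "helpful", "happy"}
NEGATIVE = {"angry", "terrible", "cancel", "unhappy"}


def anonymize(transcript):
    """Replace personally identifiable tokens with placeholders."""
    for pattern, placeholder in PII_PATTERNS:
        transcript = pattern.sub(placeholder, transcript)
    return transcript


def sentiment_score(transcript):
    """Score in [-1, 1]: (positive - negative) / (positive + negative)."""
    words = re.findall(r"[a-z']+", transcript.lower())
    pos = sum(w in POSITIVE for w in words)
    neg = sum(w in NEGATIVE for w in words)
    total = pos + neg
    return 0.0 if total == 0 else (pos - neg) / total


transcript = ("My number is 555-123-4567 and I am unhappy, "
              "but the agent was helpful. Thanks!")
anonymized = anonymize(transcript)
score = sentiment_score(anonymized)  # scoring only ever sees the anonymized text
```

Scoring the anonymized transcript rather than the raw one means the sentiment step never handles the caller's identifying details.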
-
Publication No.: US11250846B2
Publication Date: 2022-02-15
Application No.: US16228701
Filing Date: 2018-12-20
IPC Classification: G06F15/00, G10L25/00, G10L15/22, H04W4/02, H04L41/0866, G10L15/30, H04W64/00, G10L15/26
Abstract: A voice capturing device (e.g., smart phone, tablet, smart speaker) is used to capture voice commands and send them to a cloud-based voice recognition/processing engine, which converts them to text commands. The text commands are processed at an access point for a WiFi network. The voice commands may include search queries about particular wireless devices that are associated with the WiFi network. The access point may search the configuration and connectivity data for the WiFi network to determine which access point the wireless device is connected to and a location for that access point. The result of the search may be announced to the user via the voice capturing device. The voice-activated search may be used to find wireless devices that have been misplaced or for inventory management. The voice-activated commands may also include voice WiFi network configuration commands.
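As a rough sketch of the search step, the function below handles a text command that has already come back from the speech-to-text engine. The command grammar, device names, access point identifiers, and locations are all hypothetical; the abstract does not describe the data layout an access point uses.

```python
import re

# Hypothetical connectivity data: device name -> access point id.
CONNECTIONS = {"warehouse scanner 3": "ap-12", "lobby tablet": "ap-02"}
# Hypothetical configuration data: access point id -> physical location.
AP_LOCATIONS = {"ap-12": "Loading dock B", "ap-02": "Front lobby"}

# Assumed command grammar: "where is <device>" or "find <device>".
QUERY = re.compile(r"(?:where is|find) (?:the |my )?(.+?)\??$")


def handle_text_command(command):
    """Answer a device-location query from the converted text command."""
    match = QUERY.match(command.strip().lower())
    if not match:
        return "Sorry, I did not understand that command."
    device = match.group(1)
    ap = CONNECTIONS.get(device)
    if ap is None:
        return f"No device named {device} is connected."
    return f"{device} is connected to {ap}, located at {AP_LOCATIONS[ap]}."


answer = handle_text_command("Where is the warehouse scanner 3?")
```

The returned string is what would be announced back to the user through the voice capturing device.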
-
Publication No.: US11244685B2
Publication Date: 2022-02-08
Application No.: US16560756
Filing Date: 2019-09-04
Abstract: A network computer system for managing a network service (e.g., a transport service) can include a voice-assistant subsystem for generating dialogues and performing actions for service providers of the network service. The network computer system can receive, from a user device, a request for the network service. In response, the network computer system can identify a service provider and transmit an invitation to the provider device of the service provider. In response to the identification of the service provider for the request, the voice-assistant subsystem can trigger an audio voice prompt to be presented on the provider device and a listening period during which the provider device monitors for an audio input from the service provider. Based on the audio input captured by the provider device, the network computer system can determine an intent corresponding to whether the service provider accepts or declines the invitation.
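The accept-or-decline decision can be sketched with a simple keyword matcher applied to the transcribed audio input. The keyword sets, the 10-second listening period, and the `no_response` and `unclear` outcomes are assumptions for illustration; the abstract does not say how intent is determined.

```python
# Hypothetical keyword sets standing in for a real intent model.
ACCEPT = {"yes", "accept", "sure", "okay"}
DECLINE = {"no", "decline", "pass", "skip"}


def determine_intent(utterance, heard_at, listen_seconds=10.0):
    """Classify a provider's reply captured during the listening period.

    utterance: transcribed audio input, or None if nothing was captured.
    heard_at: seconds elapsed since the voice prompt finished playing.
    """
    if utterance is None or heard_at > listen_seconds:
        return "no_response"
    cleaned = utterance.lower().replace(",", " ").replace(".", " ")
    words = set(cleaned.split())
    # Check decline first so "no, I can't accept" is not read as acceptance.
    if words & DECLINE:
        return "decline"
    if words & ACCEPT:
        return "accept"
    return "unclear"


intent = determine_intent("Yes, accept it.", heard_at=2.0)
```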
-
Publication No.: US11232800B2
Publication Date: 2022-01-25
Application No.: US16710189
Filing Date: 2019-12-11
Applicant: Google LLC
Inventor: Jae Lee
Abstract: The present disclosure provides for improved hot word detection in electronic devices, particularly small form factor devices such as wearables. The electronic device includes an onboard accelerometer to pick up voice in noisy conditions, and utilizes the accelerometer to confirm that a particular user intended to activate the hot word detection, thereby reducing false detection of other people's voices.
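One way to picture the confirmation step is to gate the microphone's hot word decision on vibration energy from the accelerometer. The RMS measure and the 0.05 threshold below are assumptions for illustration; the abstract does not state how the accelerometer signal is evaluated.

```python
def rms(samples):
    """Root-mean-square level of a window of accelerometer samples."""
    if not samples:
        return 0.0
    return (sum(s * s for s in samples) / len(samples)) ** 0.5


def confirm_hotword(mic_detected, accel_window, vibration_threshold=0.05):
    """Accept a hot word only if the wearer's own speech vibration coincides.

    mic_detected: True if the microphone-based detector fired.
    accel_window: accelerometer samples covering the same time span
                  (hypothetical units, threshold chosen arbitrarily).
    """
    return mic_detected and rms(accel_window) >= vibration_threshold


wearer_spoke = confirm_hotword(True, [0.10, -0.12, 0.09, -0.11])
bystander_spoke = confirm_hotword(True, [0.001, -0.002, 0.001, 0.0])
```

When a bystander says the hot word, the microphone may fire but the wearable picks up almost no body-conducted vibration, so the detection is rejected.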
-
Publication No.: US11205443B2
Publication Date: 2021-12-21
Application No.: US16047058
Filing Date: 2018-07-27
Abstract: Systems, methods, and computer-readable storage devices are disclosed for improved audio feature discovery using a neural network. One method includes: receiving a trained neural network model, the trained neural network configured to output an audio feature classification of audio data; deconstructing the trained neural network model to generate at least one saliency map, the at least one saliency map providing a successful classification of the audio feature; and extracting at least one visualization of the audio feature the trained neural network model relies on for classification based on the at least one saliency map.
-
Publication No.: US11189284B2
Publication Date: 2021-11-30
Application No.: US16600232
Filing Date: 2019-10-11
Applicant: LG ELECTRONICS INC.
Inventor: Ji Chan Maeng
Abstract: The present disclosure relates to an apparatus which communicates with a voice recognition device, and a method for controlling an apparatus with a voice recognition capability which operates in an Internet of Things environment configured by a 5G communication network. According to an exemplary embodiment of the present disclosure, an apparatus with a voice recognition capability includes a container which has one open surface and accommodates objects therein, a door which opens/closes the container, a sensor which senses an open/closed state of the door, a microphone which receives an external voice, a voice recognizer which recognizes a voice command received from the microphone, and a controller which controls an active state and an inactive state of the voice recognizer, in which the controller may predict whether the voice recognizer needs to be activated using a deep neural network model trained through machine learning.
-
Publication No.: US20210335377A1
Publication Date: 2021-10-28
Application No.: US17232807
Filing Date: 2021-04-16
Inventors: Fengyan Qi, Lei Miao
IPC Classification: G10L21/013, G10L25/00, G10L19/00, G10L21/028, G10L25/90
Abstract: A method and an apparatus for detecting correctness of a pitch period, where the method for detecting correctness of a pitch period includes determining, according to an initial pitch period of an input signal in a time domain, a pitch frequency bin of the input signal, where the initial pitch period is obtained by performing open-loop detection on the input signal, determining, based on an amplitude spectrum of the input signal in a frequency domain, a pitch period correctness decision parameter, associated with the pitch frequency bin, of the input signal, and determining correctness of the initial pitch period according to the pitch period correctness decision parameter.
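The flow from pitch period to frequency bin to decision parameter can be sketched as below. The mapping from a period of T samples to bin N/T follows from the DFT definition, but the decision parameter used here (peak amplitude near the pitch bin divided by the mean amplitude) and the threshold of 4.0 are assumptions made for the example, not the parameter defined in the application.

```python
import cmath
import math


def amplitude_spectrum(signal):
    """Amplitude of the first N/2 DFT bins (plain DFT, fine for short frames)."""
    n = len(signal)
    return [
        abs(sum(signal[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n)))
        for k in range(n // 2)
    ]


def pitch_bin(pitch_period, fft_size):
    """A period of T samples puts the fundamental at bin N / T."""
    return round(fft_size / pitch_period)


def pitch_period_is_correct(signal, pitch_period, ratio_threshold=4.0):
    """Illustrative check: is the spectrum near the pitch bin well above average?"""
    spectrum = amplitude_spectrum(signal)
    k = pitch_bin(pitch_period, len(signal))
    if not 1 <= k < len(spectrum) - 1:
        return False
    peak = max(spectrum[k - 1:k + 2])               # tolerate one bin of error
    mean = sum(spectrum[1:]) / (len(spectrum) - 1)  # skip the DC bin
    return peak >= ratio_threshold * mean           # assumed decision rule


# A 256-sample sine whose true period is 32 samples.
frame = [math.sin(2 * math.pi * t / 32) for t in range(256)]
correct = pitch_period_is_correct(frame, pitch_period=32)
doubled = pitch_period_is_correct(frame, pitch_period=64)
```

The second call models a pitch-doubling error from open-loop detection: a period of 64 samples maps to bin 4, where this frame has almost no energy, so the candidate is rejected.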