-
Publication No.: US12130901B2
Publication date: 2024-10-29
Application No.: US18511324
Filing date: 2023-11-16
Applicant: Q (CUE) LTD.
Inventor: Yonatan Wexler
IPC classification: G10L15/25 , A61B5/1171 , G06F16/532 , G06F21/32 , G06F40/40 , G06V20/50 , G06V40/16 , G10L13/02 , G10L15/08 , G10L15/20 , G10L15/22 , H04R1/02 , H04R1/10
CPC classification: G06F21/32 , A61B5/1176 , G06F16/532 , G06F40/40 , G06V20/50 , G06V40/166 , G06V40/171 , G06V40/172 , G06V40/176 , G10L13/02 , G10L15/08 , G10L15/20 , G10L15/22 , G10L15/25 , H04R1/028 , H04R1/10 , G10L2015/088 , G10L2015/223
Abstract: Systems, methods, and non-transitory computer-readable media including instructions for detecting and utilizing facial skin micromovements are disclosed. In some non-limiting embodiments, the detection of the facial skin micromovements occurs using a speech detection system that may include a wearable housing, a light source (either a coherent light source or a non-coherent light source), a light detector, and at least one processor. One or more processors may be configured to analyze light reflections received from a facial region to determine the facial skin micromovements, and extract meaning from the determined facial skin micromovements. Examples of meaning that may be extracted from the determined facial skin micromovements may include words spoken by the individual (either silently spoken or vocally spoken), an identification of the individual, an emotional state of the individual, a heart rate of the individual, a respiration rate of the individual, or any other biometric, emotion, or speech-related indicator.
-
Publication No.: US12128765B2
Publication date: 2024-10-29
Application No.: US17475828
Filing date: 2021-09-15
Inventor: Jihoon Kim
IPC classification: B60K35/00 , G06F3/14 , G06F16/27 , G06F40/279 , G10L15/08 , G10L15/22 , B60K35/10 , B60K35/22
CPC classification: B60K35/00 , G06F3/14 , G06F16/27 , G06F40/279 , G10L15/08 , G10L15/22 , B60K35/10 , B60K35/22 , B60K2360/148 , G10L2015/088 , G10L2015/223
Abstract: An embodiment vehicle includes an audio video navigation (AVN) device configured to execute an application, a display configured to display a screen of the application, an input device configured to receive a command from a user, and a processor configured to receive a backup command through the input device, in response to the backup command, generate snapshot data of the application being executed, extract a keyword based on the screen displayed on the display, generate metadata corresponding to the snapshot data and including a keyword, receive a restoration command that includes the keyword, the restoration command received through the input device, based on the received restoration command, select the metadata including the keyword, and restore data of the application based on the snapshot data corresponding to the selected metadata.
-
Publication No.: US12125475B2
Publication date: 2024-10-22
Application No.: US18310105
Filing date: 2023-05-01
Applicant: SATURN LICENSING LLC
Inventors: Tomoaki Takemura , Shinya Masunaga , Koji Fujita , Katsutoshi Ishiwata , Kenichi Ikenaga , Katsutoshi Kusumoto
IPC classification: G10L15/187 , G06F40/166 , G06F40/242 , G06F40/268 , G06F40/30 , G10L15/08 , G10L15/22 , G10L17/22 , G10L15/26
CPC classification: G10L15/08 , G06F40/166 , G06F40/242 , G06F40/268 , G06F40/30 , G10L15/22 , G10L17/22 , G10L2015/221 , G10L15/26
Abstract: There is provided an information processing device including an analysis unit configured to analyze a character string indicating contents of utterance obtained as a result of speech recognition, and a display control unit configured to display the character string indicating the contents of the utterance and an analysis result on a display screen.
-
Publication No.: US20240347057A1
Publication date: 2024-10-17
Application No.: US18404254
Filing date: 2024-01-04
Applicant: Sonos, Inc.
Inventor: Connor Smith
IPC classification: G10L15/22 , G10L15/07 , G10L15/08 , H04L43/0811
CPC classification: G10L15/22 , G10L15/07 , G10L15/08 , H04L43/0811 , G10L2015/088 , G10L2015/223
Abstract: As noted above, example techniques relate to offline voice control. A local voice input engine may process voice inputs locally when processing voice inputs via a cloud-based voice assistant service is not possible. Some techniques involve local (on-device) voice-assisted set-up of a cloud-based voice assistant service. Further example techniques involve local voice-assisted troubleshooting the cloud-based voice assistant service. Other techniques relate to interactions between local and cloud-based processing of voice inputs on a device that supports both local and cloud-based processing.
-
Publication No.: US20240345801A1
Publication date: 2024-10-17
Application No.: US18432733
Filing date: 2024-02-05
Applicant: Sonos, Inc.
Inventors: Dayn Wilberding , John Tolomei
IPC classification: G06F3/16 , G06F3/04817 , G06F3/0488 , G06F9/451 , G10L15/08 , G10L15/22 , G10L17/22 , H04L12/28 , H04N21/422 , H04N21/436 , H04N21/439
CPC classification: G06F3/167 , G06F3/04817 , G10L15/08 , G10L15/22 , H04L12/282 , H04N21/42203 , H04N21/43615 , H04N21/4394 , G06F3/0488 , G06F9/453 , G10L2015/088 , G10L2015/223 , G10L17/22
Abstract: Example techniques involve invoking voice assistance for a media playback system. In some embodiments, an NMD stores in memory a set of command information comprising a listing of playback commands and associated command criteria. The NMD captures a voice input and detects inclusion, within the voice input, of one or more particular playback commands from among the playback commands in the listing. In response, the NMD selects a local voice assistant that supports (a) one or more additional playback commands relative to a cloud-based VAS and (b) fewer non-playback commands relative to the cloud-based VAS, determines, via the local voice assistant, an intent in the captured voice input, and performs a response to the determined intent. The NMD forgoes selection of the cloud-based VAS when the local voice assistant is selected.
-
Publication No.: US20240339110A1
Publication date: 2024-10-10
Application No.: US18296181
Filing date: 2023-04-05
Inventors: Scott Kurtz , Philip Stick , Gary Skrabutenas , Christian Buchter
CPC classification: G10L15/08 , G10L15/05 , G10L15/22 , G10L15/30 , G10L2015/088 , G10L2015/223
Abstract: One or more portions of audio input may be detected. Timing data associated with the one or more portions of audio may be determined. Audio processing may be carried out based on the timing data.
-
Publication No.: US20240331682A1
Publication date: 2024-10-03
Application No.: US18621320
Filing date: 2024-03-29
Applicant: Mixhalo Corp.
Abstract: A method for generating and displaying contextual data using a mobile computing device at a live event includes receiving a data representation of a live audio signal corresponding to the live event via a wireless network. The method also includes processing the data representation of the live audio signal into a live audio stream. The method also includes generating first contextual data based on the live audio stream and a first machine learning model. The method also includes generating second contextual data based on the live audio stream and a second machine learning model. The method also includes generating for display on the mobile computing device at the live event the first contextual data and the second contextual data.
-
Publication No.: US20240331058A1
Publication date: 2024-10-03
Application No.: US18623449
Filing date: 2024-04-01
Applicant: Meta Platforms, Inc
IPC classification: G06Q50/00 , G06F3/01 , G06F3/16 , G06F9/451 , G06F9/48 , G06F9/54 , G06F16/332 , G06F16/9032 , G06F16/9536 , G06F18/2321 , G06F40/205 , G06F40/242 , G06F40/253 , G06F40/295 , G06F40/30 , G06F40/35 , G06F40/56 , G06N3/04 , G06N3/045 , G06N3/047 , G06N3/08 , G06N20/00 , G06Q10/109 , G06Q30/0601 , G06V10/20 , G06V10/764 , G06V10/82 , G06V20/00 , G06V20/20 , G06V20/30 , G06V20/40 , G06V40/16 , G06V40/20 , G10L15/06 , G10L15/08 , G10L15/16 , G10L15/18 , G10L15/22 , G10L15/30 , G10L15/32 , H04L51/18 , H04L51/212 , H04L51/222 , H04L51/224 , H04L51/52 , H04L67/306 , H04L67/75 , H04N7/14
CPC classification: G06Q50/01 , G06F3/011 , G06F3/013 , G06F9/453 , G06F9/485 , G06F9/4862 , G06F9/4881 , G06F9/547 , G06F16/3329 , G06F16/90332 , G06F16/9536 , G06F18/2321 , G06F40/205 , G06F40/242 , G06F40/253 , G06F40/295 , G06F40/30 , G06F40/35 , G06F40/56 , G06N3/04 , G06N3/045 , G06N3/047 , G06N3/08 , G06N20/00 , G06Q10/109 , G06Q30/0603 , G06Q30/0631 , G06Q30/0633 , G06Q30/0643 , G06V10/255 , G06V10/764 , G06V10/82 , G06V20/00 , G06V20/20 , G06V20/30 , G06V40/16 , G06V40/25 , G10L15/063 , G10L15/08 , G10L15/16 , G10L15/1815 , G10L15/1822 , G10L15/22 , G10L15/30 , G10L15/32 , H04L51/18 , H04L51/212 , H04L51/222 , H04L51/224 , H04L51/52 , H04L67/306 , H04L67/75 , H04N7/147 , G06F3/017 , G06F3/167 , G06V20/41 , G06V40/174 , G06V2201/10 , G10L2015/0631 , G10L2015/088 , G10L2015/223 , G10L2015/227 , G10L2015/228
Abstract: In one embodiment, a method includes receiving, at a client system, an audio input, where the audio input comprises a coreference to a target object, accessing visual data from one or more cameras associated with the client system, where the visual data comprises images portraying one or more objects, resolving the coreference to the target object from among the one or more objects, resolving the target object to a specific entity, and providing, at the client system, a response to the audio input, where the response comprises information about the specific entity.
-
Publication No.: US20240321264A1
Publication date: 2024-09-26
Application No.: US18679981
Filing date: 2024-05-31
Inventors: Jing Liu , Feng-Ju Chang , Athanasios Mouchtaris , Martin Radfar , Maurizio Omologo , Siegfried Kunzmann
CPC classification: G10L15/08 , G10L15/005 , G10L15/02 , G10L2015/088
Abstract: Techniques for performing automatic speech recognition (ASR) are described. In some embodiments, an ASR component integrates contextual information from user profile data into audio encoding data to predict a token(s) corresponding to a spoken input. The user profile data may include personalized words, such as contact names, device names, etc. The ASR component determines word embedding data using the personalized words. The ASR component is configured to apply attention to audio frames that are relevant to the personalized words based on processing the audio encoding data and the word embedding data.
-
Publication No.: US12100398B2
Publication date: 2024-09-24
Application No.: US18085867
Filing date: 2022-12-21
Applicant: GOOGLE LLC
Inventors: David Roy Schairer , Sumer Mohammed , Mark Spates, IV , Prem Kumar , Chi Yeung Jonathan Ng , Di Zhu , Steven Clark
CPC classification: G10L15/22 , G06F3/167 , G10L15/08 , G10L15/30 , G16Y40/10 , G16Y40/35 , H04L12/282 , H04W4/70 , G10L2015/088 , G10L2015/223
Abstract: Remote automated assistant component(s) generate client device notification(s) based on a received IoT state change notification that indicates a change in at least one state associated with at least one IoT device. The generated client device notification(s) can each indicate the change in state associated with the at least one IoT device, and can optionally indicate the at least one IoT device. Further, the remote automated assistant component(s) can identify candidate assistant client devices that are associated with the at least one IoT device, and determine whether each of the one or more of the candidate assistant client device(s) should render a corresponding client device notification. The remote automated assistant component(s) can then transmit a corresponding command to each of the assistant client device(s) it determines should render a corresponding client device notification, where each transmitted command causes the corresponding assistant client device to render the corresponding client device notification.
-