Patent search cpc:"G10L15/08" Page 1

1.

Granted invention patent
Personal presentation of prevocalization to improve articulation

Legal status: In force

Publication number: US12130901B2

Publication date: 2024-10-29

Application number: US18511324

Application date: 2023-11-16

Applicant: Q (CUE) LTD.

Inventor: Yonatan Wexler

IPC: G10L15/25 , A61B5/1171 , G06F16/532 , G06F21/32 , G06F40/40 , G06V20/50 , G06V40/16 , G10L13/02 , G10L15/08 , G10L15/20 , G10L15/22 , H04R1/02 , H04R1/10

CPC classification number: G06F21/32 , A61B5/1176 , G06F16/532 , G06F40/40 , G06V20/50 , G06V40/166 , G06V40/171 , G06V40/172 , G06V40/176 , G10L13/02 , G10L15/08 , G10L15/20 , G10L15/22 , G10L15/25 , H04R1/028 , H04R1/10 , G10L2015/088 , G10L2015/223

Abstract: Systems, methods, and non-transitory computer-readable media including instructions for detecting and utilizing facial skin micromovements are disclosed. In some non-limiting embodiments, the detection of the facial skin micromovements occurs using a speech detection system that may include a wearable housing, a light source (either a coherent light source or a non-coherent light source), a light detector, and at least one processor. One or more processors may be configured to analyze light reflections received from a facial region to determine the facial skin micromovements, and extract meaning from the determined facial skin micromovements. Examples of meaning that may be extracted from the determined facial skin micromovements may include words spoken by the individual (either silently spoken or vocally spoken), an identification of the individual, an emotional state of the individual, a heart rate of the individual, a respiration rate of the individual, or any other biometric, emotion, or speech-related indicator.

2.

Granted invention patent
Vehicle and method of controlling the same

Legal status: In force

Publication number: US12128765B2

Publication date: 2024-10-29

Application number: US17475828

Application date: 2021-09-15

Applicant: Hyundai Motor Company , Kia Corporation

Inventor: Jihoon Kim

IPC: B60K35/00 , G06F3/14 , G06F16/27 , G06F40/279 , G10L15/08 , G10L15/22 , B60K35/10 , B60K35/22

CPC classification number: B60K35/00 , G06F3/14 , G06F16/27 , G06F40/279 , G10L15/08 , G10L15/22 , B60K35/10 , B60K35/22 , B60K2360/148 , G10L2015/088 , G10L2015/223

Abstract: An embodiment vehicle includes an audio video navigation (AVN) device configured to execute an application, a display configured to display a screen of the application, an input device configured to receive a command from a user, and a processor configured to receive a backup command through the input device, in response to the backup command, generate snapshot data of the application being executed, extract a keyword based on the screen displayed on the display, generate metadata corresponding to the snapshot data and including a keyword, receive a restoration command that includes the keyword, the restoration command received through the input device, based on the received restoration command, select the metadata including the keyword, and restore data of the application based on the snapshot data corresponding to the selected metadata.

3.

Granted invention patent
Information processing device, information processing method, and program

Legal status: In force

Publication number: US12125475B2

Publication date: 2024-10-22

Application number: US18310105

Application date: 2023-05-01

Applicant: SATURN LICENSING LLC

Inventor: Tomoaki Takemura , Shinya Masunaga , Koji Fujita , Katsutoshi Ishiwata , Kenichi Ikenaga , Katsutoshi Kusumoto

IPC: G10L15/187 , G06F40/166 , G06F40/242 , G06F40/268 , G06F40/30 , G10L15/08 , G10L15/22 , G10L17/22 , G10L15/26

CPC classification number: G10L15/08 , G06F40/166 , G06F40/242 , G06F40/268 , G06F40/30 , G10L15/22 , G10L17/22 , G10L2015/221 , G10L15/26

Abstract: There is provided an information processing device including an analysis unit configured to analyze a character string indicating contents of utterance obtained as a result of speech recognition, and a display control unit configured to display the character string indicating the contents of the utterance and an analysis result on a display screen.

4.

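Published invention application
Offline Voice Control

Legal status: Under examination (published)

Publication number: US20240347057A1

Publication date: 2024-10-17

Application number: US18404254

Application date: 2024-01-04

Applicant: Sonos, Inc.

Inventor: Connor Smith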

IPC: G10L15/22 , G10L15/07 , G10L15/08 , H04L43/0811

CPC classification number: G10L15/22 , G10L15/07 , G10L15/08 , H04L43/0811 , G10L2015/088 , G10L2015/223

Abstract: Example techniques relate to offline voice control. A local voice input engine may process voice inputs locally when processing voice inputs via a cloud-based voice assistant service is not possible. Some techniques involve local (on-device) voice-assisted set-up of a cloud-based voice assistant service. Further example techniques involve local voice-assisted troubleshooting of the cloud-based voice assistant service. Other techniques relate to interactions between local and cloud-based processing of voice inputs on a device that supports both local and cloud-based processing.

5.

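Published invention application
MEDIA PLAYBACK SYSTEM WITH CONCURRENT VOICE ASSISTANCE

Legal status: Under examination (published)

Publication number: US20240345801A1

Publication date: 2024-10-17

Application number: US18432733

Application date: 2024-02-05

Applicant: Sonos, Inc.

Inventor: Dayn Wilberding , John Tolomei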

IPC: G06F3/16 , G06F3/04817 , G06F3/0488 , G06F9/451 , G10L15/08 , G10L15/22 , G10L17/22 , H04L12/28 , H04N21/422 , H04N21/436 , H04N21/439

CPC classification number: G06F3/167 , G06F3/04817 , G10L15/08 , G10L15/22 , H04L12/282 , H04N21/42203 , H04N21/43615 , H04N21/4394 , G06F3/0488 , G06F9/453 , G10L2015/088 , G10L2015/223 , G10L17/22

Abstract: Example techniques involve invoking voice assistance for a media playback system. In some embodiments, an NMD stores in memory a set of command information comprising a listing of playback commands and associated command criteria. The NMD captures a voice input and detects inclusion, within the voice input, of one or more particular playback commands from among the playback commands in the listing. In response, the NMD selects a local voice assistant that supports (a) one or more additional playback commands relative to a cloud-based VAS and (b) fewer non-playback commands relative to the cloud-based VAS, determines, via the local voice assistant, an intent in the captured voice input, and performs a response to the determined intent. The NMD forgoes selection of the cloud-based VAS when the local voice assistant is selected.

6.

Published invention application
METHODS AND SYSTEMS FOR VOICE CONTROL

Legal status: Under examination (published)

Publication number: US20240339110A1

Publication date: 2024-10-10

Application number: US18296181

Application date: 2023-04-05

Applicant: Comcast Cable Communications, LLC

Inventor: Scott Kurtz , Philip Stick , Gary Skrabutenas , Christian Buchter

IPC: G10L15/08 , G10L15/05 , G10L15/22 , G10L15/30

CPC classification number: G10L15/08 , G10L15/05 , G10L15/22 , G10L15/30 , G10L2015/088 , G10L2015/223

Abstract: One or more portions of audio input may be detected. Timing data associated with the one or more portions of audio may be determined. Audio processing may be carried out based on the timing data.

7.

Published invention application
SYSTEMS AND METHODS FOR REAL-TIME CONCERT TRANSCRIPTION AND USER-CAPTURED VIDEO TAGGING

Legal status: Under examination (published)

Publication number: US20240331682A1

Publication date: 2024-10-03

Application number: US18621320

Application date: 2024-03-29

Applicant: Mixhalo Corp.

Inventor: Ty Daniels , Charles E. Luckhardt, IV

IPC: G10L15/08 , G10L15/30 , G10L25/57 , G11B27/34

CPC classification number: G10L15/08 , G10L15/30 , G10L25/57 , G11B27/34

Abstract: A method for generating and displaying contextual data using a mobile computing device at a live event includes receiving a data representation of a live audio signal corresponding to the live event via a wireless network. The method also includes processing the data representation of the live audio signal into a live audio stream. The method also includes generating first contextual data based on the live audio stream and a first machine learning model. The method also includes generating second contextual data based on the live audio stream and a second machine learning model. The method also includes generating for display on the mobile computing device at the live event the first contextual data and the second contextual data.

8.

Published invention application
MULTIMODAL ENTITY AND COREFERENCE RESOLUTION FOR ASSISTANT SYSTEMS

Legal status: Under examination (published)

Publication number: US20240331058A1

Publication date: 2024-10-03

Application number: US18623449

Application date: 2024-04-01

Applicant: Meta Platforms, Inc.

Inventor: Shivani Poddar , Seungwhan Moon , Paul Anthony Crook , Rajen Subba

IPC: G06Q50/00 , G06F3/01 , G06F3/16 , G06F9/451 , G06F9/48 , G06F9/54 , G06F16/332 , G06F16/9032 , G06F16/9536 , G06F18/2321 , G06F40/205 , G06F40/242 , G06F40/253 , G06F40/295 , G06F40/30 , G06F40/35 , G06F40/56 , G06N3/04 , G06N3/045 , G06N3/047 , G06N3/08 , G06N20/00 , G06Q10/109 , G06Q30/0601 , G06V10/20 , G06V10/764 , G06V10/82 , G06V20/00 , G06V20/20 , G06V20/30 , G06V20/40 , G06V40/16 , G06V40/20 , G10L15/06 , G10L15/08 , G10L15/16 , G10L15/18 , G10L15/22 , G10L15/30 , G10L15/32 , H04L51/18 , H04L51/212 , H04L51/222 , H04L51/224 , H04L51/52 , H04L67/306 , H04L67/75 , H04N7/14

CPC classification number: G06Q50/01 , G06F3/011 , G06F3/013 , G06F9/453 , G06F9/485 , G06F9/4862 , G06F9/4881 , G06F9/547 , G06F16/3329 , G06F16/90332 , G06F16/9536 , G06F18/2321 , G06F40/205 , G06F40/242 , G06F40/253 , G06F40/295 , G06F40/30 , G06F40/35 , G06F40/56 , G06N3/04 , G06N3/045 , G06N3/047 , G06N3/08 , G06N20/00 , G06Q10/109 , G06Q30/0603 , G06Q30/0631 , G06Q30/0633 , G06Q30/0643 , G06V10/255 , G06V10/764 , G06V10/82 , G06V20/00 , G06V20/20 , G06V20/30 , G06V40/16 , G06V40/25 , G10L15/063 , G10L15/08 , G10L15/16 , G10L15/1815 , G10L15/1822 , G10L15/22 , G10L15/30 , G10L15/32 , H04L51/18 , H04L51/212 , H04L51/222 , H04L51/224 , H04L51/52 , H04L67/306 , H04L67/75 , H04N7/147 , G06F3/017 , G06F3/167 , G06V20/41 , G06V40/174 , G06V2201/10 , G10L2015/0631 , G10L2015/088 , G10L2015/223 , G10L2015/227 , G10L2015/228

Abstract: In one embodiment, a method includes receiving, at a client system, an audio input, where the audio input comprises a coreference to a target object, accessing visual data from one or more cameras associated with the client system, where the visual data comprises images portraying one or more objects, resolving the coreference to the target object from among the one or more objects, resolving the target object to a specific entity, and providing, at the client system, a response to the audio input, where the response comprises information about the specific entity.

9.

Published invention application
AUTOMATIC SPEECH RECOGNITION

Legal status: Under examination (published)

Publication number: US20240321264A1

Publication date: 2024-09-26

Application number: US18679981

Application date: 2024-05-31

Applicant: Amazon Technologies, Inc.

Inventor: Jing Liu , Feng-Ju Chang , Athanasios Mouchtaris , Martin Radfar , Maurizio Omologo , Siegfried Kunzmann

IPC: G10L15/08 , G10L15/00 , G10L15/02

CPC classification number: G10L15/08 , G10L15/005 , G10L15/02 , G10L2015/088

Abstract: Techniques for performing automatic speech recognition (ASR) are described. In some embodiments, an ASR component integrates contextual information from user profile data into audio encoding data to predict a token(s) corresponding to a spoken input. The user profile data may include personalized words, such as contact names, device names, etc. The ASR component determines word embedding data using the personalized words. The ASR component is configured to apply attention to audio frames that are relevant to the personalized words based on processing the audio encoding data and the word embedding data.

10.

Granted invention patent
Generating IoT-based notification(s) and provisioning of command(s) to cause automatic rendering of the IoT-based notification(s) by automated assistant client(s) of client device(s)

Legal status: In force

Publication number: US12100398B2

Publication date: 2024-09-24

Application number: US18085867

Application date: 2022-12-21

Applicant: GOOGLE LLC

Inventor: David Roy Schairer , Sumer Mohammed , Mark Spates, IV , Prem Kumar , Chi Yeung Jonathan Ng , Di Zhu , Steven Clark

IPC: G10L15/22 , G06F3/16 , G10L15/08 , G10L15/30 , G16Y40/10 , G16Y40/35 , H04L12/28 , H04W4/70

CPC classification number: G10L15/22 , G06F3/167 , G10L15/08 , G10L15/30 , G16Y40/10 , G16Y40/35 , H04L12/282 , H04W4/70 , G10L2015/088 , G10L2015/223

Abstract: Remote automated assistant component(s) generate client device notification(s) based on a received IoT state change notification that indicates a change in at least one state associated with at least one IoT device. The generated client device notification(s) can each indicate the change in state associated with the at least one IoT device, and can optionally indicate the at least one IoT device. Further, the remote automated assistant component(s) can identify candidate assistant client devices that are associated with the at least one IoT device, and determine whether each of the one or more candidate assistant client device(s) should render a corresponding client device notification. The remote automated assistant component(s) can then transmit a corresponding command to each of the assistant client device(s) it determines should render a corresponding client device notification, where each transmitted command causes the corresponding assistant client device to render the corresponding client device notification.
