Patent search cpc:"G10L2015/223" Page 1

1.

发明公开
Voice-based user interface system, method and device 审中-公开

公开(公告)号：US20240361981A1

公开(公告)日：2024-10-31

申请号：US18687913

申请日：2022-08-31

Applicant: SANDEEP KUMAR R

Inventor： SANDEEP KUMAR R

IPC: G06F3/16 , G06F3/0482 , G06F3/0484 , G06F9/451 , G10L15/18 , G10L15/22

CPC classification number: G06F3/167 , G06F3/0482 , G06F3/0484 , G06F9/451 , G10L15/18 , G10L15/22 , G10L2015/223

Abstract: Exemplary embodiments of the present disclosure are directed to a voice-based user interface system 10 comprising a voice assembly 12 for processing voice-inputs into voice commands comprising cluster commands and a computing device comprising a focus zone 22 defined within a display thereof. When a cluster 30, which comprises one or more user-selectable items, is within the focus zone 22 whereby said cluster 30 and thereby each of the one or more selectable items thereof are said to be focused, the reception of a cluster command by the computing device results in a corresponding focused item being selected.

2.

发明授权
Context-based deactivation of a recording device 有权

公开(公告)号：US12131736B2

公开(公告)日：2024-10-29

申请号：US17810719

申请日：2022-07-05

Applicant: Capital One Services, LLC

Inventor： Jeremy Goodsitt , Galen Rafferty , Samuel Sharpe , Grant Eden , Austin Walters , Anh Truong , Christopher Wallace

IPC: G10L15/22 , G06F9/451 , G10L15/18 , G10L15/08

CPC classification number: G10L15/22 , G06F9/453 , G10L15/18 , G10L2015/088 , G10L2015/223

Abstract: In some implementations, a recording device may obtain a settings configuration associated with deactivating an audio recording function or an audio processing function of the recording device, wherein the settings configuration indicates one or more deactivation events. The recording device may obtain first audio content associated with the recording device for identifying audio prompts associated with causing the recording device to perform one or more actions. The recording device may detect a deactivation event of the one or more deactivation events. The recording device may refrain from obtaining audio content based on detecting the deactivation event and until an activation event is detected. The recording device may obtain second audio content associated with the recording device based on detecting the activation event.

3.

发明授权
Contextual auto-completion for assistant systems 有权

公开(公告)号：US12131522B2

公开(公告)日：2024-10-29

申请号：US17077316

申请日：2020-10-22

Applicant: Meta Platforms, Inc.

Inventor： Jiedan Zhu , Fuchun Peng , Benoit F. Dumoulin , Xiaohu Liu , Rajen Subba , Mohsen Agsen , Michael Robert Hanson

IPC: G06F16/338 , G06F3/01 , G06F3/16 , G06F7/14 , G06F9/451 , G06F16/176 , G06F16/22 , G06F16/23 , G06F16/242 , G06F16/2455 , G06F16/2457 , G06F16/248 , G06F16/33 , G06F16/332 , G06F16/903 , G06F16/9032 , G06F16/9038 , G06F16/904 , G06F16/951 , G06F16/9535 , G06F18/2411 , G06F40/205 , G06F40/295 , G06F40/30 , G06F40/40 , G06N3/006 , G06N3/08 , G06N7/01 , G06N20/00 , G06Q50/00 , G06V10/764 , G06V10/82 , G06V20/10 , G06V40/20 , G10L15/02 , G10L15/06 , G10L15/07 , G10L15/16 , G10L15/18 , G10L15/183 , G10L15/187 , G10L15/22 , G10L15/26 , G10L17/06 , G10L17/22 , H04L5/02 , H04L12/28 , H04L41/00 , H04L41/22 , H04L43/0882 , H04L43/0894 , H04L51/02 , H04L51/18 , H04L51/216 , H04L51/52 , H04L67/306 , H04L67/50 , H04L67/5651 , H04L67/75 , H04W12/08 , G10L13/00 , G10L13/04 , H04L51/046 , H04L67/10 , H04L67/53

CPC classification number: G06V10/82 , G06F3/011 , G06F3/013 , G06F3/017 , G06F3/167 , G06F7/14 , G06F9/453 , G06F16/176 , G06F16/2255 , G06F16/2365 , G06F16/243 , G06F16/24552 , G06F16/24575 , G06F16/24578 , G06F16/248 , G06F16/3323 , G06F16/3329 , G06F16/3344 , G06F16/338 , G06F16/90332 , G06F16/90335 , G06F16/9038 , G06F16/904 , G06F16/951 , G06F16/9535 , G06F18/2411 , G06F40/205 , G06F40/295 , G06F40/30 , G06F40/40 , G06N3/006 , G06N3/08 , G06N7/01 , G06N20/00 , G06Q50/01 , G06V10/764 , G06V20/10 , G06V40/28 , G10L15/02 , G10L15/063 , G10L15/07 , G10L15/16 , G10L15/1815 , G10L15/1822 , G10L15/183 , G10L15/187 , G10L15/22 , G10L15/26 , G10L17/06 , G10L17/22 , H04L5/02 , H04L12/2816 , H04L41/20 , H04L41/22 , H04L43/0882 , H04L43/0894 , H04L51/02 , H04L51/18 , H04L51/216 , H04L51/52 , H04L67/306 , H04L67/535 , H04L67/5651 , H04L67/75 , H04W12/08 , G06F2216/13 , G10L13/00 , G10L13/04 , G10L2015/223 , G10L2015/225 , H04L51/046 , H04L67/10 , H04L67/53

Abstract: In one embodiment, a method includes receiving a first user input from a first user, wherein the first user input comprises a partial request, presenting one or more suggested intent auto-completions corresponding to the partial request, receiving a selection by the first user of a first suggested intent auto-completion of the suggested intent auto-completions and a second user input, presenting one or more suggested slot auto-completions corresponding to one or more candidate slot-hypotheses corresponding to the second user input, respectively, wherein each of the candidate slot-hypotheses comprise a slot-suggestion, and wherein each suggested slot auto-completion comprises the second user input and the corresponding candidate slot-hypothesis, receiving a selection by the first user of a first suggested slot auto-completion of the suggested slot auto-completions, and presenting execution results of one or more tasks corresponding to the first suggested intent auto-completion and the first suggested slot auto-completion.

4.

发明公开
OVERLAYING VISUAL CONTENT USING MODEL ADAPTATION 审中-公开

公开(公告)号：US20240355064A1

公开(公告)日：2024-10-24

申请号：US18497629

申请日：2023-10-30

Applicant: Snap Inc.

Inventor： Daria Skrypnyk , Matthew Hallberg

IPC: G06T19/00 , G06T15/04 , G06T17/20 , G06T19/20 , G10L15/18 , G10L15/22

CPC classification number: G06T19/006 , G06T15/04 , G06T17/20 , G06T19/20 , G10L15/1815 , G10L15/22 , G06T2219/2004 , G06T2219/2012 , G10L2015/223

Abstract: Described is a system for overlaying visual content onto a real-world object by identifying a prompt of a user indicating a user's intent, accessing an image template, wherein the image template includes placement of features within the image template, and processing a combination of data associated with the image template and the prompt using a generative machine learning model to generate a first populated image template in which one or more portions of the image template are populated with visual content representing the user's intent. The system then proceeds to access an image depicting a real-world object and overlay the first populated image template that includes the visual content representative of the user's intent on at least a portion of the real-world object based on the placement of the features of the image template.

5.

发明公开
GOLF CADDY SYSTEM 审中-公开

公开(公告)号：US20240350879A1

公开(公告)日：2024-10-24

申请号：US18136691

申请日：2023-04-19

Applicant: Matthew Zdunich

Inventor： Matthew Zdunich

IPC: A63B55/60 , A63B24/00 , A63B71/06 , B62B3/10 , B62B5/00 , B64U10/14 , B64U50/20 , B64U50/30 , G05D1/02 , G05D1/10 , G06T7/70 , G06V20/17 , G06V20/40 , G06V40/20 , G10L15/22 , H04N5/91 , H04N23/57 , H04R1/02

CPC classification number: A63B55/61 , A63B24/0006 , A63B24/0021 , A63B71/0622 , B62B3/102 , B62B5/00 , B62B5/004 , B62B5/0043 , B62B5/0069 , B64U10/14 , B64U50/20 , B64U50/30 , G06T7/70 , G06V20/17 , G06V20/42 , G06V40/23 , G10L15/22 , H04N5/91 , H04N23/57 , H04R1/028 , A63B2024/0028 , A63B2071/0625 , A63B2220/05 , A63B2220/806 , A63B2225/50 , B62B2202/406 , B64U2101/30 , G06T2207/10016 , G06T2207/10032 , G06T2207/30224 , G10L2015/223

Abstract: A golf caddy system for autonomously carrying a golf bag across a golf course, recording video of golf swings, and providing golfing advice to a user includes an autonomous motorized cart. The cart includes a processor with map data of the golf course in a memory of the processor and a global positioning system receiver for determining a current location of the cart. The cart also includes a camera which is configured to capture video of a golf swing.

6.

发明授权
Interruption model 有权

公开(公告)号：US12126871B1

公开(公告)日：2024-10-22

申请号：US16592506

申请日：2019-10-03

Applicant: Amazon Technologies, Inc.

Inventor： Jatin Bajaj , Clare Elizabeth Veladanda

IPC: H04N21/47 , G10L15/22 , H04N21/43 , H04N21/431 , H04N21/472

CPC classification number: H04N21/47 , G10L15/22 , H04N21/4302 , H04N21/4316 , H04N21/47217 , G10L2015/223

Abstract: Devices and techniques are generally described for an interruption model for a user device. In various examples, first metadata related to first content executing on a user device may be determined. In some examples, second metadata related to second content for execution by the user device may be determined. In various examples, an output configuration for the user device may be determined using the first metadata and the second metadata. In some examples, the output configuration may result from identification of the second content to the user device during execution of the first content.

7.

发明授权
Electronic device supporting improved voice activity detection 有权

公开(公告)号：US12125498B2

公开(公告)日：2024-10-22

申请号：US17570557

申请日：2022-01-07

Applicant: SAMSUNG ELECTRONICS CO., LTD.

Inventor： Seungbeom Ryu , Sungjae Park , Hyuk Oh , Myeungyong Choi , Junkwon Choi

IPC: G10L15/00 , G06N3/045 , G10L15/02 , G10L15/16 , G10L15/22 , G10L25/84 , H04R1/08 , H04R3/00

CPC classification number: G10L25/84 , G06N3/045 , G10L15/02 , G10L15/16 , G10L15/22 , H04R1/08 , H04R3/00 , G10L2015/223 , H04R2420/07

Abstract: According to various embodiments, an electronic device may include: a microphone; an audio connector; a wireless communication circuit; a processor operatively connected to the microphone, the audio connector, and the wireless communication circuit; and a memory operatively connected to the processor, wherein the memory may store instructions that, when executed, cause the processor to: receive a first audio signal through the microphone, the audio connector, or the wireless communication circuit, extract audio feature information from the first audio signal, and recognize a speech section in a second audio signal, received after the first audio signal through the microphone, the audio connector, or the wireless communication circuit, using the audio feature information.

8.

发明授权
Multi-modal interaction between users, automated assistants, and other computing services 有权

公开(公告)号：US12125486B2

公开(公告)日：2024-10-22

申请号：US18217326

申请日：2023-06-30

Applicant: GOOGLE LLC

Inventor： Ulas Kirazci , Adam Coimbra , Abraham Lee , Wei Dong , Thushan Amarasiriwardena

IPC: G10L15/22 , G06F3/16 , G06F9/448 , G10L13/027

CPC classification number: G10L15/22 , G06F3/167 , G06F9/4498 , G10L13/027 , G10L2015/223 , G10L2015/228

Abstract: Techniques are described herein for multi-modal interaction between users, automated assistants, and other computing services. In various implementations, a user may engage with the automated assistant in order to further engage with a third party computing service. In some implementations, the user may advance through dialog state machines associated with third party computing service using both verbal input modalities and input modalities other than verbal modalities, such as visual/tactile modalities.

9.

发明授权
Adaptively recognizing speech using key phrases 有权

公开(公告)号：US12125482B2

公开(公告)日：2024-10-22

申请号：US16692150

申请日：2019-11-22

Applicant: INTEL CORPORATION

Inventor： Krzysztof Czarnowski , Munir Nikolai Alexander Georges , Tobias Bocklet , Georg Stemmer

IPC: G10L15/22 , G06F16/638 , G10L15/02 , G10L15/16 , G10L17/00 , G10L25/78 , G10L15/08

CPC classification number: G10L15/22 , G06F16/638 , G10L15/02 , G10L15/16 , G10L17/00 , G10L25/78 , G10L2015/027 , G10L2015/088 , G10L2015/223

Abstract: An example apparatus for recognizing speech includes an audio receiver to receive a stream of audio. The apparatus also includes a key phrase detector to detect a key phrase in the stream of audio. The apparatus further includes a model adapter to dynamically adapt a model based on the detected key phrase. The apparatus also includes a query recognizer to detect a voice query following the key phrase in a stream of audio via the adapted model.