-
公开(公告)号:US20240361981A1
公开(公告)日:2024-10-31
申请号:US18687913
申请日:2022-08-31
Applicant: SANDEEP KUMAR R
Inventor: SANDEEP KUMAR R
IPC: G06F3/16 , G06F3/0482 , G06F3/0484 , G06F9/451 , G10L15/18 , G10L15/22
CPC classification number: G06F3/167 , G06F3/0482 , G06F3/0484 , G06F9/451 , G10L15/18 , G10L15/22 , G10L2015/223
Abstract: Exemplary embodiments of the present disclosure are directed to a voice-based user interface system 10 comprising a voice assembly 12 for processing voice-inputs into voice commands comprising cluster commands and a computing device comprising a focus zone 22 defined within a display thereof. When a cluster 30, which comprises one or more user-selectable items, is within the focus zone 22 whereby said cluster 30 and thereby each of the one or more selectable items thereof are said to be focused, the reception of a cluster command by the computing device results in a corresponding focused item being selected.
-
公开(公告)号:US12131736B2
公开(公告)日:2024-10-29
申请号:US17810719
申请日:2022-07-05
Applicant: Capital One Services, LLC
Inventor: Jeremy Goodsitt , Galen Rafferty , Samuel Sharpe , Grant Eden , Austin Walters , Anh Truong , Christopher Wallace
CPC classification number: G10L15/22 , G06F9/453 , G10L15/18 , G10L2015/088 , G10L2015/223
Abstract: In some implementations, a recording device may obtain a settings configuration associated with deactivating an audio recording function or an audio processing function of the recording device, wherein the settings configuration indicates one or more deactivation events. The recording device may obtain first audio content associated with the recording device for identifying audio prompts associated with causing the recording device to perform one or more actions. The recording device may detect a deactivation event of the one or more deactivation events. The recording device may refrain from obtaining audio content based on detecting the deactivation event and until an activation event is detected. The recording device may obtain second audio content associated with the recording device based on detecting the activation event.
-
公开(公告)号:US12131522B2
公开(公告)日:2024-10-29
申请号:US17077316
申请日:2020-10-22
Applicant: Meta Platforms, Inc.
Inventor: Jiedan Zhu , Fuchun Peng , Benoit F. Dumoulin , Xiaohu Liu , Rajen Subba , Mohsen Agsen , Michael Robert Hanson
IPC: G06F16/338 , G06F3/01 , G06F3/16 , G06F7/14 , G06F9/451 , G06F16/176 , G06F16/22 , G06F16/23 , G06F16/242 , G06F16/2455 , G06F16/2457 , G06F16/248 , G06F16/33 , G06F16/332 , G06F16/903 , G06F16/9032 , G06F16/9038 , G06F16/904 , G06F16/951 , G06F16/9535 , G06F18/2411 , G06F40/205 , G06F40/295 , G06F40/30 , G06F40/40 , G06N3/006 , G06N3/08 , G06N7/01 , G06N20/00 , G06Q50/00 , G06V10/764 , G06V10/82 , G06V20/10 , G06V40/20 , G10L15/02 , G10L15/06 , G10L15/07 , G10L15/16 , G10L15/18 , G10L15/183 , G10L15/187 , G10L15/22 , G10L15/26 , G10L17/06 , G10L17/22 , H04L5/02 , H04L12/28 , H04L41/00 , H04L41/22 , H04L43/0882 , H04L43/0894 , H04L51/02 , H04L51/18 , H04L51/216 , H04L51/52 , H04L67/306 , H04L67/50 , H04L67/5651 , H04L67/75 , H04W12/08 , G10L13/00 , G10L13/04 , H04L51/046 , H04L67/10 , H04L67/53
CPC classification number: G06V10/82 , G06F3/011 , G06F3/013 , G06F3/017 , G06F3/167 , G06F7/14 , G06F9/453 , G06F16/176 , G06F16/2255 , G06F16/2365 , G06F16/243 , G06F16/24552 , G06F16/24575 , G06F16/24578 , G06F16/248 , G06F16/3323 , G06F16/3329 , G06F16/3344 , G06F16/338 , G06F16/90332 , G06F16/90335 , G06F16/9038 , G06F16/904 , G06F16/951 , G06F16/9535 , G06F18/2411 , G06F40/205 , G06F40/295 , G06F40/30 , G06F40/40 , G06N3/006 , G06N3/08 , G06N7/01 , G06N20/00 , G06Q50/01 , G06V10/764 , G06V20/10 , G06V40/28 , G10L15/02 , G10L15/063 , G10L15/07 , G10L15/16 , G10L15/1815 , G10L15/1822 , G10L15/183 , G10L15/187 , G10L15/22 , G10L15/26 , G10L17/06 , G10L17/22 , H04L5/02 , H04L12/2816 , H04L41/20 , H04L41/22 , H04L43/0882 , H04L43/0894 , H04L51/02 , H04L51/18 , H04L51/216 , H04L51/52 , H04L67/306 , H04L67/535 , H04L67/5651 , H04L67/75 , H04W12/08 , G06F2216/13 , G10L13/00 , G10L13/04 , G10L2015/223 , G10L2015/225 , H04L51/046 , H04L67/10 , H04L67/53
Abstract: In one embodiment, a method includes receiving a first user input from a first user, wherein the first user input comprises a partial request, presenting one or more suggested intent auto-completions corresponding to the partial request, receiving a selection by the first user of a first suggested intent auto-completion of the suggested intent auto-completions and a second user input, presenting one or more suggested slot auto-completions corresponding to one or more candidate slot-hypotheses corresponding to the second user input, respectively, wherein each of the candidate slot-hypotheses comprise a slot-suggestion, and wherein each suggested slot auto-completion comprises the second user input and the corresponding candidate slot-hypothesis, receiving a selection by the first user of a first suggested slot auto-completion of the suggested slot auto-completions, and presenting execution results of one or more tasks corresponding to the first suggested intent auto-completion and the first suggested slot auto-completion.
-
公开(公告)号:US20240355064A1
公开(公告)日:2024-10-24
申请号:US18497629
申请日:2023-10-30
Applicant: Snap Inc.
Inventor: Daria Skrypnyk , Matthew Hallberg
CPC classification number: G06T19/006 , G06T15/04 , G06T17/20 , G06T19/20 , G10L15/1815 , G10L15/22 , G06T2219/2004 , G06T2219/2012 , G10L2015/223
Abstract: Described is a system for overlaying visual content onto a real-world object by identifying a prompt of a user indicating a user's intent, accessing an image template, wherein the image template includes placement of features within the image template, and processing a combination of data associated with the image template and the prompt using a generative machine learning model to generate a first populated image template in which one or more portions of the image template are populated with visual content representing the user's intent. The system then proceeds to access an image depicting a real-world object and overlay the first populated image template that includes the visual content representative of the user's intent on at least a portion of the real-world object based on the placement of the features of the image template.
-
公开(公告)号:US20240350879A1
公开(公告)日:2024-10-24
申请号:US18136691
申请日:2023-04-19
Applicant: Matthew Zdunich
Inventor: Matthew Zdunich
IPC: A63B55/60 , A63B24/00 , A63B71/06 , B62B3/10 , B62B5/00 , B64U10/14 , B64U50/20 , B64U50/30 , G05D1/02 , G05D1/10 , G06T7/70 , G06V20/17 , G06V20/40 , G06V40/20 , G10L15/22 , H04N5/91 , H04N23/57 , H04R1/02
CPC classification number: A63B55/61 , A63B24/0006 , A63B24/0021 , A63B71/0622 , B62B3/102 , B62B5/00 , B62B5/004 , B62B5/0043 , B62B5/0069 , B64U10/14 , B64U50/20 , B64U50/30 , G06T7/70 , G06V20/17 , G06V20/42 , G06V40/23 , G10L15/22 , H04N5/91 , H04N23/57 , H04R1/028 , A63B2024/0028 , A63B2071/0625 , A63B2220/05 , A63B2220/806 , A63B2225/50 , B62B2202/406 , B64U2101/30 , G06T2207/10016 , G06T2207/10032 , G06T2207/30224 , G10L2015/223
Abstract: A golf caddy system for autonomously carrying a golf bag across a golf course, recording video of golf swings, and providing golfing advice to a user includes an autonomous motorized cart. The cart includes a processor with map data of the golf course in a memory of the processor and a global positioning system receiver for determining a current location of the cart. The cart also includes a camera which is configured to capture video of a golf swing.
-
公开(公告)号:US12126871B1
公开(公告)日:2024-10-22
申请号:US16592506
申请日:2019-10-03
Applicant: Amazon Technologies, Inc.
Inventor: Jatin Bajaj , Clare Elizabeth Veladanda
IPC: H04N21/47 , G10L15/22 , H04N21/43 , H04N21/431 , H04N21/472
CPC classification number: H04N21/47 , G10L15/22 , H04N21/4302 , H04N21/4316 , H04N21/47217 , G10L2015/223
Abstract: Devices and techniques are generally described for an interruption model for a user device. In various examples, first metadata related to first content executing on a user device may be determined. In some examples, second metadata related to second content for execution by the user device may be determined. In various examples, an output configuration for the user device may be determined using the first metadata and the second metadata. In some examples, the output configuration may result from identification of the second content to the user device during execution of the first content.
-
公开(公告)号:US12125498B2
公开(公告)日:2024-10-22
申请号:US17570557
申请日:2022-01-07
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Seungbeom Ryu , Sungjae Park , Hyuk Oh , Myeungyong Choi , Junkwon Choi
CPC classification number: G10L25/84 , G06N3/045 , G10L15/02 , G10L15/16 , G10L15/22 , H04R1/08 , H04R3/00 , G10L2015/223 , H04R2420/07
Abstract: According to various embodiments, an electronic device may include: a microphone; an audio connector; a wireless communication circuit; a processor operatively connected to the microphone, the audio connector, and the wireless communication circuit; and a memory operatively connected to the processor, wherein the memory may store instructions that, when executed, cause the processor to: receive a first audio signal through the microphone, the audio connector, or the wireless communication circuit, extract audio feature information from the first audio signal, and recognize a speech section in a second audio signal, received after the first audio signal through the microphone, the audio connector, or the wireless communication circuit, using the audio feature information.
-
8.
公开(公告)号:US12125486B2
公开(公告)日:2024-10-22
申请号:US18217326
申请日:2023-06-30
Applicant: GOOGLE LLC
Inventor: Ulas Kirazci , Adam Coimbra , Abraham Lee , Wei Dong , Thushan Amarasiriwardena
IPC: G10L15/22 , G06F3/16 , G06F9/448 , G10L13/027
CPC classification number: G10L15/22 , G06F3/167 , G06F9/4498 , G10L13/027 , G10L2015/223 , G10L2015/228
Abstract: Techniques are described herein for multi-modal interaction between users, automated assistants, and other computing services. In various implementations, a user may engage with the automated assistant in order to further engage with a third party computing service. In some implementations, the user may advance through dialog state machines associated with third party computing service using both verbal input modalities and input modalities other than verbal modalities, such as visual/tactile modalities.
-
公开(公告)号:US12125482B2
公开(公告)日:2024-10-22
申请号:US16692150
申请日:2019-11-22
Applicant: INTEL CORPORATION
CPC classification number: G10L15/22 , G06F16/638 , G10L15/02 , G10L15/16 , G10L17/00 , G10L25/78 , G10L2015/027 , G10L2015/088 , G10L2015/223
Abstract: An example apparatus for recognizing speech includes an audio receiver to receive a stream of audio. The apparatus also includes a key phrase detector to detect a key phrase in the stream of audio. The apparatus further includes a model adapter to dynamically adapt a model based on the detected key phrase. The apparatus also includes a query recognizer to detect a voice query following the key phrase in a stream of audio via the adapted model.
-
公开(公告)号:US12125272B2
公开(公告)日:2024-10-22
申请号:US18449525
申请日:2023-08-14
Applicant: Meta Platforms Technologies, LLC
Inventor: Paul Anthony Crook , Francislav P. Penov , Rajen Subba , Xiaohu Liu
IPC: G06V10/82 , G06F3/01 , G06F3/16 , G06F7/14 , G06F9/451 , G06F16/176 , G06F16/22 , G06F16/23 , G06F16/242 , G06F16/2455 , G06F16/2457 , G06F16/248 , G06F16/33 , G06F16/332 , G06F16/338 , G06F16/903 , G06F16/9032 , G06F16/9038 , G06F16/904 , G06F16/951 , G06F16/9535 , G06F18/2411 , G06F40/205 , G06F40/295 , G06F40/30 , G06F40/40 , G06N3/006 , G06N3/08 , G06N7/01 , G06N20/00 , G06Q50/00 , G06V10/764 , G06V20/10 , G06V40/20 , G10L15/02 , G10L15/06 , G10L15/07 , G10L15/16 , G10L15/18 , G10L15/183 , G10L15/187 , G10L15/22 , G10L15/26 , G10L17/06 , G10L17/22 , H04L5/02 , H04L12/28 , H04L41/00 , H04L41/22 , H04L43/0882 , H04L43/0894 , H04L51/02 , H04L51/18 , H04L51/216 , H04L51/52 , H04L67/306 , H04L67/50 , H04L67/5651 , H04L67/75 , H04W12/08 , G10L13/00 , G10L13/04 , H04L51/046 , H04L67/10 , H04L67/53
CPC classification number: G06V10/82 , G06F3/011 , G06F3/013 , G06F3/017 , G06F3/167 , G06F7/14 , G06F9/453 , G06F16/176 , G06F16/2255 , G06F16/2365 , G06F16/243 , G06F16/24552 , G06F16/24575 , G06F16/24578 , G06F16/248 , G06F16/3323 , G06F16/3329 , G06F16/3344 , G06F16/338 , G06F16/90332 , G06F16/90335 , G06F16/9038 , G06F16/904 , G06F16/951 , G06F16/9535 , G06F18/2411 , G06F40/205 , G06F40/295 , G06F40/30 , G06F40/40 , G06N3/006 , G06N3/08 , G06N7/01 , G06N20/00 , G06Q50/01 , G06V10/764 , G06V20/10 , G06V40/28 , G10L15/02 , G10L15/063 , G10L15/07 , G10L15/16 , G10L15/1815 , G10L15/1822 , G10L15/183 , G10L15/187 , G10L15/22 , G10L15/26 , G10L17/06 , G10L17/22 , H04L5/02 , H04L12/2816 , H04L41/20 , H04L41/22 , H04L43/0882 , H04L43/0894 , H04L51/02 , H04L51/18 , H04L51/216 , H04L51/52 , H04L67/306 , H04L67/535 , H04L67/5651 , H04L67/75 , H04W12/08 , G06F2216/13 , G10L13/00 , G10L13/04 , G10L2015/223 , G10L2015/225 , H04L51/046 , H04L67/10 , H04L67/53
Abstract: In one embodiment, a method includes receiving a user request from a first user from a client system associated with a first user, wherein the user request comprise a gesture-input from the first user and a speech-input from the first user, determining an intent corresponding to the user request based on the gesture-input by a personalized gesture-classification model associated with the first user, executing one or more tasks based on the determined intent and the speech-input, and sending instructions for presenting execution results of the one or more tasks to the client system responsive the user request.
-
-
-
-
-
-
-
-
-