-
Publication number: US12062365B2
Publication date: 2024-08-13
Application number: US17514338
Application date: 2021-10-29
Applicant: SAMSUNG SDS CO., LTD.
Inventor: Hyun Jae Lee , Hyun Jin Choi , Jae Woong Yun , Ju Dong Kim , Bong Kyu Hwang , Seong Ho Joe , Young June Gwon
CPC classification number: G10L15/18 , G10L15/063 , G10L15/22 , G10L2015/223
Abstract: An apparatus for training a dialogue summary model according to an embodiment includes a parameter transferer configured to transfer one or more learning parameter values of a pre-trained natural language processing model to a sequence-to-sequence-based dialogue summary model, and a model trainer configured to train the dialogue summary model by using the transferred learning parameter values as initial values for learning parameters of each of an encoder and a decoder in the dialogue summary model.
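The parameter transfer described in this abstract can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: the dictionary layout, the parameter names (`embed`, `layer0.weight`, `cross_attn.weight`), and the rule "copy when name and length match" are all assumptions made for the example.

```python
import copy


def transfer_parameters(pretrained, model):
    """Return a copy of `model` whose encoder and decoder parameters are
    initialized from `pretrained` wherever a name and length match.

    Hypothetical layout: `pretrained` maps parameter names to lists of floats;
    `model` has "encoder" and "decoder" dicts with the same kind of mapping.
    """
    initialized = copy.deepcopy(model)  # leave the caller's model untouched
    for part in ("encoder", "decoder"):
        for name, value in initialized[part].items():
            source = pretrained.get(name)
            if source is not None and len(source) == len(value):
                # Transferred value becomes the initial value for training.
                initialized[part][name] = list(source)
    return initialized


# Toy pre-trained language model parameters (illustrative values only).
pretrained = {"embed": [0.1, 0.2], "layer0.weight": [0.3, 0.4]}

# Toy sequence-to-sequence summary model before initialization.
model = {
    "encoder": {"embed": [0.0, 0.0], "layer0.weight": [0.0, 0.0]},
    "decoder": {"embed": [0.0, 0.0], "cross_attn.weight": [0.0, 0.0]},
}

initialized = transfer_parameters(pretrained, model)
```

Parameters with no pre-trained counterpart (here the decoder's `cross_attn.weight`) keep their original initialization; training would then proceed from `initialized`.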
-
Publication number: US20240265920A1
Publication date: 2024-08-08
Application number: US18637799
Application date: 2024-04-17
Applicant: Kyndryl, Inc.
CPC classification number: G10L15/22 , G06F3/017 , G06F18/2155 , G10L15/30 , H04L12/282 , H04L12/2829 , G10L15/02 , G10L15/16 , G10L2015/223 , G10L2015/225 , G10L25/30
Abstract: The exemplary embodiments disclose a method, a computer program product, and a computer system for managing user commands. The exemplary embodiments may include a user giving one or more commands to one or more devices, collecting data of the one or more commands, extracting one or more features from the collected data, and determining which one or more of the commands should be executed on which one or more of the devices based on the extracted one or more features and one or more models.
-
Publication number: US20240265701A1
Publication date: 2024-08-08
Application number: US18627468
Application date: 2024-04-05
Applicant: JVCKENWOOD Corporation
Inventor: Kentaro Kodama
CPC classification number: G06V20/44 , G10L15/22 , G10L2015/223
Abstract: An on-vehicle recording control apparatus includes: a captured data acquisition unit configured to acquire video data captured by a camera that captures an image of surroundings of a vehicle; an operation controller configured to receive voice operation based on a voice command instructing event recording; a detection unit configured to detect a speech state of the voice command of the voice operation received by the operation controller; and a recording controller configured to: record the video data acquired by the captured data acquisition unit; generate event data while changing, based on the speech state of the voice command detected by the detection unit when the operation controller receives the voice operation for event recording, a period of the event data that is extracted from the video data; and store the generated event data.
-
Publication number: US12057219B2
Publication date: 2024-08-06
Application number: US17384142
Application date: 2021-07-23
Applicant: Cilag GmbH International
Inventor: Frederick E. Shelton, IV , Kevin Fiebig , Shane R. Adams
IPC: G16H20/40 , A61B17/00 , A61B18/12 , A61B34/00 , A61B34/10 , A61B34/20 , A61B34/30 , A61B34/32 , A61B90/00 , G05B13/02 , G06F3/14 , G06F3/16 , G06F9/48 , G06F9/54 , G06F13/40 , G06F16/21 , G06F16/28 , G06N20/00 , G06Q10/30 , G06T11/60 , G08B5/22 , G10L15/22 , G16H10/60 , G16H15/00 , G16H30/40 , G16H40/20 , G16H40/40 , G16H40/63 , G16H40/67 , G16H50/20 , G16H50/70 , H04L1/22 , H04L41/12 , H04L65/80 , H04L67/12 , H04L67/125 , H04N5/272 , H04N7/15 , A61B8/06 , A61B18/00 , G06F21/62 , G06F40/169 , G16H30/20 , H02J7/00
CPC classification number: G16H40/20 , A61B17/00 , A61B18/1206 , A61B34/10 , A61B34/20 , A61B34/25 , A61B34/30 , A61B34/32 , A61B90/08 , A61B90/37 , G05B13/0265 , G06F3/14 , G06F3/1423 , G06F3/167 , G06F9/4881 , G06F9/542 , G06F13/4068 , G06F16/211 , G06F16/284 , G06F16/285 , G06N20/00 , G06Q10/30 , G06T11/60 , G08B5/22 , G10L15/22 , G16H10/60 , G16H15/00 , G16H20/40 , G16H30/40 , G16H40/40 , G16H40/63 , G16H40/67 , G16H50/20 , G16H50/70 , H04L1/22 , H04L41/12 , H04L65/80 , H04L67/12 , H04L67/125 , H04N5/272 , H04N7/15 , A61B8/06 , A61B2017/00221 , A61B2018/00702 , A61B2018/00994 , A61B2034/2072 , A61B2034/254 , A61B2090/364 , A61B2090/365 , A61B2090/373 , G06F21/6245 , G06F40/169 , G10L2015/223 , G16H30/20 , H02J7/0063
Abstract: Systems, methods, and instrumentalities are disclosed for data processing and creating a record of the processing for archival in metadata associated with the results of the processing. The processing may include transformations of the data. Transforming the data may generate transformed data. The processes performed may be archived, for example, in metadata associated with the transformed data. The metadata may be annotated with information associated with previous transforms performed on the transformed data. The metadata may be stored with the transformed data.
-
Publication number: US12057116B2
Publication date: 2024-08-06
Application number: US17162007
Application date: 2021-01-29
Applicant: salesforce.com, inc.
Inventor: Juan Rodriguez , Michael Machado
CPC classification number: G10L15/22 , G06F9/453 , G10L15/26 , G10L15/32 , G10L2015/223
Abstract: The present disclosure is directed to techniques for executing a task or service using a virtual agent. A method includes: executing, using a virtual agent, one or more tiers of a plurality of tiers of machine learning analysis to identify a desired action to be performed based on a user command, the user command being received from an external computing device; responsive to the one or more tiers of the plurality of tiers of machine learning analysis identifying a plurality of actions associated with the user command, determining a series of inquiries to present via the external computing device, wherein each inquiry of the series of inquiries is selected based on a number of actions associated with each inquiry, and wherein each subsequent inquiry in the series of inquiries is based on a user response to a preceding inquiry; identifying, based on responses to the series of inquiries, the desired action to be performed; and executing the desired action to be performed.
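One way to picture the inquiry selection in this abstract is the sketch below. It assumes yes/no inquiries, each associated with a set of actions, and picks the inquiry whose matching-action count is closest to half of the remaining candidates. That halving heuristic, the function names, and the sample actions are assumptions for illustration; the abstract only states that selection depends on the number of actions associated with each inquiry.

```python
def pick_inquiry(candidates, inquiries):
    """Pick the inquiry that splits the remaining candidates most evenly.

    `inquiries` maps a question string to the set of actions it is
    associated with (hypothetical structure).
    """
    # Only inquiries that can actually narrow the candidate set are useful.
    useful = [
        (question, actions)
        for question, actions in inquiries.items()
        if 0 < len(candidates & actions) < len(candidates)
    ]
    if not useful:
        return None
    return min(
        useful,
        key=lambda item: abs(len(candidates & item[1]) - len(candidates) / 2),
    )[0]


def resolve_action(candidates, inquiries, answer):
    """Ask inquiries until one action remains or nothing can narrow further.

    `answer` is a callable returning True (yes) or False (no) for a question.
    """
    candidates = set(candidates)
    asked = []
    while len(candidates) > 1:
        question = pick_inquiry(candidates, inquiries)
        if question is None:
            break
        asked.append(question)
        # Each later inquiry depends on the response to the earlier one.
        if answer(question):
            candidates &= inquiries[question]
        else:
            candidates -= inquiries[question]
    return candidates, asked


actions = {"reset_password", "unlock_account", "update_email", "close_account"}
inquiries = {
    "Is this about signing in?": {"reset_password", "unlock_account"},
    "Did you forget your password?": {"reset_password"},
    "Do you want to close the account?": {"close_account"},
}
yes_answers = {"Is this about signing in?", "Did you forget your password?"}

remaining, asked = resolve_action(actions, inquiries, lambda q: q in yes_answers)
```

With these sample answers the loop asks two questions and is left with a single action, which the agent would then execute.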
-
Publication number: US20240257809A1
Publication date: 2024-08-01
Application number: US18627846
Application date: 2024-04-05
Applicant: Amazon Technologies, Inc.
Inventor: Ezekiel Wade Sanborn de Asis
CPC classification number: G10L15/22 , G06F3/165 , G10L2015/223
Abstract: A system is provided for modifying how an output is presented via a multi-device synchronous configuration based on detecting a speech characteristic in the user input. For example, if the user whispers a request, then the system may temporarily modify how the responsive output is presented to the user via multiple devices. In one example, the system may lower the volume on all devices presenting the output. In another example, the system may present the output via a single device rather than multiple devices. The system may also determine to operate in an alternate output mode based on certain non-audio data.
-
Publication number: US20240256599A1
Publication date: 2024-08-01
Application number: US18631952
Application date: 2024-04-10
Applicant: GOOGLE LLC
Inventor: Sowmya Subramanian , Benton Davis DeLoache , Lauren Clark , Rami Banna , Igor Benko
IPC: G06F16/635 , G10L15/22
CPC classification number: G06F16/635 , G10L15/22 , G10L2015/223
Abstract: Implementations are provided for providing responsive audio recordings to user queries that are prerecorded by human beings, rather than generated automatically using speech synthesis processing. In various implementations, a query provided by a user at an input component of a computing device may be used to search a corpus of voice recordings. From the searching, a plurality of candidate responsive voice recordings may be identified and ranked based on measures of credibility associated with speakers that created the candidate responsive voice recordings. Based on the ranking, one or more of the plurality of candidate responsive voice recordings may be provided for presentation to the user at an output component of the same computing device or a different computing device.
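The search-then-rank flow in this abstract might look like the sketch below. The keyword-overlap search, the `Recording` type, the speaker names, and the credibility scores are all hypothetical; the abstract does not specify how the corpus is searched or how credibility is measured.

```python
from dataclasses import dataclass


@dataclass
class Recording:
    speaker: str
    transcript: str  # assumed: each voice recording has a text transcript


def search_and_rank(query, corpus, credibility, top_k=1):
    """Find recordings sharing a word with the query, then rank them by
    the credibility score of the speaker who created each one."""
    terms = set(query.lower().split())
    candidates = [
        rec for rec in corpus if terms & set(rec.transcript.lower().split())
    ]
    ranked = sorted(
        candidates,
        key=lambda rec: credibility.get(rec.speaker, 0.0),
        reverse=True,
    )
    return ranked[:top_k]


corpus = [
    Recording("ann", "to prune roses cut above a bud"),
    Recording("bob", "roses need pruning in late winter"),
    Recording("cy", "tomatoes need full sun"),
]
credibility = {"ann": 0.6, "bob": 0.9, "cy": 0.8}  # illustrative scores

results = search_and_rank("prune roses", corpus, credibility, top_k=2)
speakers = [rec.speaker for rec in results]
```

Here `cy` is excluded by the search step despite a high score, and the two matching recordings are ordered by speaker credibility.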
-
Publication number: US20240256220A1
Publication date: 2024-08-01
Application number: US18634100
Application date: 2024-04-12
Applicant: Snap Inc.
Inventor: Joseph Timothy Fortier , Celia Nicole Mourkogiannis , Evan Spiegel , Kaveh Anvaripour
IPC: G06F3/16 , G06T11/00 , G06V10/44 , G06V10/764 , G06V20/10 , G06V20/20 , G06V20/64 , G06V40/16 , G10L15/08 , G10L15/22 , H04L51/046 , H04N23/60
CPC classification number: G06F3/167 , G06T11/00 , G06V10/454 , G06V10/764 , G06V20/10 , G06V20/20 , G06V20/64 , G06V40/161 , G06V40/168 , G06V40/174 , G10L15/08 , G10L15/22 , H04L51/046 , H04N23/60 , G06T2200/24 , G10L2015/088 , G10L2015/223
Abstract: Aspects of the present disclosure involve a system comprising a computer-readable storage medium storing a program and method for displaying augmented reality content. The program and method provide for causing, by a messaging application running on a device, a camera of the device to capture an image; receiving, by the messaging application, speech input to select augmented reality content for display with the image; determining at least one keyword included in the speech input; determining that the at least one keyword indicates an object depicted in the image and an action to perform with respect to the object; identifying, from plural augmented reality content items, an augmented reality content item that corresponds to performing the action with respect to the object; and displaying the augmented reality content item with the image.
-
Publication number: US12051415B1
Publication date: 2024-07-30
Application number: US18224259
Application date: 2023-07-20
Applicant: Amazon Technologies, Inc.
Inventor: Gonzalo Alvarez Barrio , Shantanu Vikas Kurhekar , Bharath Bhimanaik Kumar , Fred Torok , Frederic J Deramat
CPC classification number: G10L15/22 , G10L15/1815 , G10L15/30 , H04W8/005 , G10L2015/223 , G10L2015/228 , H04W88/08
Abstract: Systems and methods for integration of speech processing functionality with organization systems are disclosed. For example, a voice interface application may be created to enable a voice interface functionality for devices associated with an organization. Space identifiers of spaces of the organization may be created and associated with the voice interface application. Devices associated with the space identifiers may be enabled for utilizing the voice interface application and may be set up utilizing wireless network identifiers associated with the spaces and/or the organization.
-
Publication number: US12051412B2
Publication date: 2024-07-30
Application number: US17445530
Application date: 2021-08-20
Applicant: Preferred Networks, Inc.
Inventor: Kenta Yonekura , Hirochika Asai , Kota Nabeshima , Manabu Nagao
IPC: G10L15/02 , G10L15/22 , G10L21/0208 , G10L25/78 , G10L25/87
CPC classification number: G10L15/22 , G10L15/02 , G10L21/0208 , G10L25/87 , G10L2015/025 , G10L2015/223 , G10L2015/226 , G10L2025/783
Abstract: A control device includes at least one memory, and at least one processor configured to detect a voice segment from sound data, the sound data being detected while a controlled object operates, and stop the controlled object based on following conditions: a speaking speed is a predetermined speed threshold or greater, the speaking speed being calculated based on a portion of the sound data in the voice segment; and a length of the voice segment is a predetermined length threshold or less.
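The two-part stop condition in this abstract can be illustrated with the sketch below. It assumes speaking speed is measured in syllables per second and that a syllable count for the segment comes from some upstream estimator; the unit, the `VoiceSegment` type, and the threshold values are assumptions, not taken from the patent.

```python
from dataclasses import dataclass


@dataclass
class VoiceSegment:
    start_s: float
    end_s: float
    syllables: int  # hypothetical: produced by an upstream speech analyzer

    @property
    def length_s(self):
        return self.end_s - self.start_s


def should_stop(segment, speed_threshold=6.0, length_threshold_s=1.0):
    """Return True when the utterance is both fast and short.

    Both conditions from the abstract must hold: speaking speed at or above
    the speed threshold, and segment length at or below the length threshold.
    """
    if segment.length_s <= 0:
        return False  # empty or malformed segment, nothing to evaluate
    speed = segment.syllables / segment.length_s
    return speed >= speed_threshold and segment.length_s <= length_threshold_s


urgent_shout = should_stop(VoiceSegment(0.0, 0.5, 4))    # fast and short
calm_sentence = should_stop(VoiceSegment(0.0, 3.0, 9))   # slow and long
fast_monologue = should_stop(VoiceSegment(0.0, 2.0, 16)) # fast but too long
```

Requiring both conditions means ordinary fast speech in a long sentence does not halt the controlled object; only a brief, rapid utterance does.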
-