-
公开(公告)号:US12094451B1
公开(公告)日:2024-09-17
申请号:US17677614
申请日:2022-02-22
Applicant: Amazon Technologies, Inc.
Inventor: Tao Zhang , Jie Ding , Huili Chen
CPC classification number: G10L15/05 , G06N20/20 , G10L15/063 , G10L2015/0635
Abstract: A system may create a localized machine learning model including one or more customized local parameter values using a global model and variance data. The localized machine learning model may be used by a device or cohort of devices to perform evaluations of data. The localized model may be trained based off a global model that is adjusted and then trained a certain number of steps, where the number of steps is based at least in part on the variance data. The variance data may include variance data from other device cohorts which is received from a remote device, which can also re-train the global model using the variance data and/or the localized machine learning model(s).
-
公开(公告)号:US20240296016A1
公开(公告)日:2024-09-05
申请号:US18117382
申请日:2023-03-03
Applicant: Radhika Grover
Inventor: Radhika Grover
CPC classification number: G06F8/30 , G06F3/017 , G06F40/289 , G10L15/063 , G10L15/18 , G10L15/22 , G10L2015/0635 , G10L2015/088 , G10L2015/223
Abstract: Classification and preprocessing algorithms are applied to at least one reference document and a natural language command provided as input by a user to a development environment for computer programming. Approximate matching of keywords in user commands with data in the reference document is performed with a fuzzy hash map and used to develop information metrics that produce a ranking of candidate actions. Selection criteria are then used to select an action from multiple candidate actions to perform the desired operation in the development environment.
-
83.
公开(公告)号:US20240290319A1
公开(公告)日:2024-08-29
申请号:US18442974
申请日:2024-02-15
Applicant: JPMORGAN CHASE BANK, N.A.
Inventor: Peter PLANTINGA , Jaekwon YOO , Chandra DHIR
IPC: G10L15/06
CPC classification number: G10L15/063 , G10L2015/0635
Abstract: In some aspects, the techniques described herein relate to a method including: providing, to a parallel model training platform, a plurality of domain datasets; training, by the parallel model training platform, a plurality of generalist models in parallel, wherein each generalist model of the plurality of generalist models is trained in parallel using a corresponding one of the plurality of domain datasets, and wherein training the plurality of generalist models in parallel generates a corresponding expert model for each generalist model in the plurality of generalist models; executing, by the parallel model training platform, a model parameter averaging process, wherein the model parameter averaging process take each corresponding expert model as input; and generating, by the parallel model training platform and as output of the model parameter averaging process, an average-of-domain-experts (AoDE) model.
-
公开(公告)号:US12068001B2
公开(公告)日:2024-08-20
申请号:US18243800
申请日:2023-09-08
Applicant: Amazon Technologies, Inc.
Inventor: Harshavardhan Sundar , Sheetal Laad , Jialiang Bao , Ming Sun , Chao Wang , Chungnam Chan , Cengiz Erbas , Mathias Jourdain , Nipul Bharani , Aaron David Wirshba
CPC classification number: G10L25/51 , G10L15/063 , G10L15/22 , G10L25/78 , G10L2015/0635
Abstract: Techniques for detecting certain acoustic events from audio data are described. A system may perform event aggregation for certain types of events before sending an output to a device representing the event is detected. The system may bypass the event aggregation process for certain types of events that the system may detect with a high level of confidence. In such cases, the system may send an output to the device when the event is detected. The system may be used to detect acoustic events representing presence of a person or other harmful circumstances (such as, fire, smoke, etc.) in a home, an office, a store, or other types of indoor settings.
-
公开(公告)号:US20240269566A1
公开(公告)日:2024-08-15
申请号:US18442958
申请日:2024-02-15
Applicant: Interwoven Worlds Inc.
Inventor: Frederick HOWARD , Josef HALL
IPC: A63F13/67 , A63F13/54 , A63F13/56 , A63F13/79 , G10L13/02 , G10L15/06 , G10L15/18 , G10L15/183 , G10L15/22
CPC classification number: A63F13/67 , A63F13/54 , A63F13/56 , A63F13/79 , G10L13/02 , G10L15/063 , G10L15/1815 , G10L15/183 , G10L15/22 , G10L2015/0635 , G10L2015/223
Abstract: A method may include: receiving, by a computer program, a user speech or a user action from a user of the computer program, the computer program comprising a virtual agent; identifying a user intent from the user speech or the user action; retrieving saved user-specific memories, static data, and an application state for the computer program; generating a prompt based on the user speech or the user action, the saved user-specific memories, the static data, and the application state; providing the prompt to a text generation module and receiving a suggested action for the virtual agent; converting the suggested action into virtual agent speech and a virtual agent action, wherein the virtual agent outputs the virtual agent speech and takes the virtual agent action; and updating the saved user-specific memories, the static data, and/or the application state with the virtual agent speech and the virtual agent action.
-
公开(公告)号:US20240265924A1
公开(公告)日:2024-08-08
申请号:US18573846
申请日:2021-06-29
Applicant: Shujie LIU , Jinyu LI , Long ZHOU , Xie SUN , Microsoft Technology Licensing, LLC
Inventor: Jinyu LI , Long ZHOU , Xie SUN , Shujie LIU
CPC classification number: G10L15/32 , G10L15/005 , G10L15/063 , G10L15/30 , G10L2015/0635
Abstract: Embodiments are provided for building a configurable multilingual model. A computing system obtains a plurality of language-specific automatic speech recognition modules and a universal automatic speech recognition module trained on a multi-language training dataset comprising training data corresponding to each of the plurality of different languages. The computing system then compiles the universal automatic speech recognition module with the plurality of language-specific automatic speech recognition modules to generate a configurable multilingual model that is configured to selectively and dynamically utilize a sub-set of the plurality of language-specific automatic speech recognition modules with the universal automatic speech recognition module to process audio content in response to user input identifying one or more target languages associated with the audio content.
-
公开(公告)号:US12039975B2
公开(公告)日:2024-07-16
申请号:US17112512
申请日:2020-12-04
Applicant: Amazon Technologies, Inc.
Inventor: Prakash Krishnan , Arindam Mandal , Siddhartha Reddy Jonnalagadda , Nikko Strom , Ariya Rastrow , Shiv Naga Prasad Vitaladevuni , Angeliki Metallinou , Vincent Auvray , Minmin Shen , Josey Diego Sandoval , Rohit Prasad , Thomas Taylor , Amotz Maimon
IPC: G10L15/22 , G06F3/16 , G06F18/24 , G06V10/40 , G06V40/10 , G06V40/20 , G10L13/08 , G10L15/02 , G10L15/06 , G10L15/08 , G10L15/20 , G10L15/24
CPC classification number: G10L15/22 , G06F3/167 , G06F18/24 , G06V10/40 , G06V40/10 , G06V40/20 , G10L13/08 , G10L15/02 , G10L15/063 , G10L15/08 , G10L15/20 , G10L15/222 , G10L15/24 , G10L2015/0635 , G10L2015/088 , G10L2015/223 , G10L2015/227
Abstract: A natural language system may be configured to act as a participant in a conversation between two users. The system may determine when a user expression such as speech, a gesture, or the like is directed from one user to the other. The system may processing input data related the expression (such as audio data, input data, language processing result data, conversation context data, etc.) to determine if the system should interject a response to the user-to-user expression. If so, the system may process the input data to determine a response and output it. The system may track that response as part of the data related to the ongoing conversation.
-
公开(公告)号:US20240203401A1
公开(公告)日:2024-06-20
申请号:US18530702
申请日:2023-12-06
Applicant: Spotify AB
Inventor: Daniel Bromand
CPC classification number: G10L15/063 , G06F7/582 , G10L13/02 , G10L15/07 , G10L2015/0635
Abstract: Systems, methods, and devices for training and testing utterance based frameworks are disclosed. The training and testing can be conducting using synthetic utterance samples in addition to natural utterance samples. The synthetic utterance samples can be generated based on a vector space representation of natural utterances. In one method, a synthetic weight vector associated with a vector space is generated. An average representation of the vector space is added to the synthetic weight vector to form a synthetic feature vector. The synthetic feature vector is used to generate a synthetic voice sample. The synthetic voice sample is provided to the utterance-based framework as at least one of a testing or training sample.
-
公开(公告)号:US20240194194A1
公开(公告)日:2024-06-13
申请号:US18062691
申请日:2022-12-07
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventor: Praveen Venkateswaran , Nigel Steven Fernandez , Yara Rizk , Vatche Isahagian , Vinod Muthusamy
CPC classification number: G10L15/1815 , G10L15/063 , G10L15/22 , G10L2015/0635 , G10L2015/223
Abstract: According to one embodiment, a method, computer system, and computer program product for software agent synthesis is provided. The present invention may include generating one or more training examples from historical data and software agents; training, using the on the one or more training examples, a language model to synthesize a software agent based on a natural language input from a user; monitoring, using one or more input devices, for one or more natural language user inputs; responsive to identifying one or more natural language user inputs, synthesizing, using the trained language model, one or more software agents based on the one or more natural language user inputs; execute the one or more new software agents to carry out one or more tasks invoked by the one or more natural language user inputs.
-
公开(公告)号:US20240127795A1
公开(公告)日:2024-04-18
申请号:US18276769
申请日:2022-05-07
Inventor: Linhao DONG , Zejun MA
IPC: G10L15/06 , G10L15/065
CPC classification number: G10L15/063 , G10L15/065 , G10L2015/0635 , G10L19/04
Abstract: A model training method, a speech recognition method and apparatus, a medium, and a device are provided. The speech recognition model including an encoder, a CIF prediction sub-model and a CTC prediction sub-model. The model training method includes: encoding training speech data based on the encoder to obtain an acoustic vector sequence corresponding to the training speech data; obtaining an information amount sequence corresponding to the training speech data based on the acoustic vector sequence and the CIF prediction sub-model; obtaining a target probability sequence based on the acoustic vector sequence and the CTC prediction sub-model; determining a target loss of the speech recognition model based on the information amount sequence and the target probability sequence; and updating, in response to an updating condition being satisfied, a model parameter of the speech recognition model based on the target loss.
-
-
-
-
-
-
-
-
-