-
公开(公告)号:US12051438B1
公开(公告)日:2024-07-30
申请号:US17214399
申请日:2021-03-26
Applicant: T-Mobile USA, Inc.
Inventor: Yasmin Karimli , Ryan Cyrus Khamneian , Jie Hui , Antoine T. Tran
Abstract: Described herein are techniques, devices, and systems for training a machine learning model(s) and/or artificial intelligence algorithm(s) to determine where a mobile device (and, hence, a user of the mobile device) is located based on audio data associated with the mobile device and/or contextual data associated with the mobile device. The machine learning techniques may be used to determine contextual information about users, such as determining that a particular location is likely to be a user's home, office, or the like, based on movement patterns exhibited in the data associated with a user's mobile device. Once trained, the machine learning model(s) is usable to classify a mobile device as having been located at one of multiple candidate locations, such as indoors or outdoors, at a particular time. The described techniques can improve the accuracy of determining a mobile device's location, among other technical benefits.
-
2.
公开(公告)号:US20240221524A1
公开(公告)日:2024-07-04
申请号:US18091334
申请日:2022-12-29
Applicant: SUFIAN MUNIR INC.
Inventor: Zahid Nisar , FARHAN HASSAN , SUFIAN MUNIR
IPC: G09B7/00 , G06F40/253 , G06F40/279 , G06F40/35 , G06F40/40 , G10L15/14 , G10L15/18 , G10L15/187 , G10L15/19 , G10L15/22 , G10L15/30
CPC classification number: G09B7/00 , G06F40/253 , G06F40/279 , G06F40/35 , G06F40/40 , G10L15/14 , G10L15/1815 , G10L15/187 , G10L15/19 , G10L15/22 , G10L15/30 , G10L2015/088
Abstract: The embodiments herein disclose a method and system for intelligent interpretation of information to autonomously design in-class/hybrid/remote assessment. In an embodiment disclosed herein, involves picking up audio of the presenter from an audio input device such as microphone during an interaction. The interaction includes both either audio or video interaction. Further, the embodiment herein, involves extracting the key information present in the captured audio interaction and then use the extracted key information to intelligently generate assessments such as quiz questions, multiple-choice questions, and mathematical questions.
-
公开(公告)号:US20240153505A1
公开(公告)日:2024-05-09
申请号:US18490029
申请日:2023-10-19
Applicant: Amazon Technologies, Inc.
Inventor: Anjishnu Kumar , Xing Fan , Arpit Gupta , Ruhi Sarikaya
CPC classification number: G10L15/22 , G06F40/30 , G06N5/022 , G10L13/00 , G10L15/14 , G10L15/1815 , G10L17/00 , G06F40/295 , G10L2015/223
Abstract: Techniques for determining a command or intent likely to be subsequently invoked by a user of a system are described. A user inputs a command (either via a spoken utterance or textual input) to a system. The system determines content responsive to the command. The system also determines a second command or corresponding intent likely to be invoked by the user subsequent to the previous command. Such determination may involve analyzing pairs of intents, with each pair being associated with a probability that one intent of the pair will be invoked by a user subsequent to a second intent of the pair. The system then outputs first content responsive to the first command and second content soliciting the user as to whether the system to execute the second command.
-
公开(公告)号:US11961507B2
公开(公告)日:2024-04-16
申请号:US18116501
申请日:2023-03-02
Applicant: Rovi Guides, Inc.
Inventor: Jeffry Copps Robert Jose , Sindhuja Chonat Sri
IPC: G10L15/02 , G06F16/432 , G06F16/438 , G06F40/279 , G10L15/14 , G10L15/26 , H04M3/51
CPC classification number: G10L15/02 , G06F16/433 , G06F16/438 , G06F40/279 , G10L15/26 , G10L15/14 , H04M3/5116
Abstract: A transcription of a query for content discovery is generated, and a context of the query is identified, as well as a first plurality of candidate entities to which the query refers. A search is performed based on the context of the query and the first plurality of candidate entities, and results are generated for output. A transcription of a second voice query is then generated, and it is determined whether the second transcription includes a trigger term indicating a corrective query. If so, the context of the first query is retrieved. A second term of the second query similar to a term of the first query is identified, and a second plurality of candidate entities to which the second term refers is determined. A second search is performed based on the second plurality of candidates and the context, and new search results are generated for output.
-
公开(公告)号:US11854550B2
公开(公告)日:2023-12-26
申请号:US18148221
申请日:2022-12-29
Applicant: Magic Leap, Inc.
Inventor: Anthony Robert Sheeder , Colby Nelson Leider
CPC classification number: G10L15/22 , G06F3/013 , G10L15/14 , G10L15/25 , G10L15/30 , G10L2015/223 , G10L2015/227
Abstract: A method of presenting a signal to a speech processing engine is disclosed. According to an example of the method, an audio signal is received via a microphone. A portion of the audio signal is identified, and a probability is determined that the portion comprises speech directed by a user of the speech processing engine as input to the speech processing engine. In accordance with a determination that the probability exceeds a threshold, the portion of the audio signal is presented as input to the speech processing engine. In accordance with a determination that the probability does not exceed the threshold, the portion of the audio signal is not presented as input to the speech processing engine.
-
公开(公告)号:US11783837B2
公开(公告)日:2023-10-10
申请号:US16950653
申请日:2020-11-17
Applicant: Sorenson IP Holdings, LLC
Inventor: Michael Holm
IPC: G10L15/01 , G10L15/26 , G10L15/22 , H04L67/10 , G10L15/08 , G10L15/18 , G10L15/30 , G10L15/16 , G10L15/28 , G10L15/14
CPC classification number: G10L15/26 , G10L15/01 , G10L15/22 , G10L15/08 , G10L15/14 , G10L15/16 , G10L15/18 , G10L15/28 , G10L15/30 , H04L67/10
Abstract: According to one or more aspects of the present disclosure, operations related to selecting a transcription generation technique may be disclosed. In some embodiments, the operations may include obtaining multiple user ratings that each correspond to a different one of multiple transcriptions. Each transcription may be obtained using a first transcription generation technique and may correspond to a different one of multiple communication sessions. The operations may further include selecting, for a subsequent communication session that occurs after the multiple communication sessions, a second transcription generation technique based on the user ratings. In addition, the operations may include providing the subsequent transcription to a device during the subsequent communication session.
-
公开(公告)号:US11762886B2
公开(公告)日:2023-09-19
申请号:US17510224
申请日:2021-10-25
Applicant: Johnson Controls Technology Company
Inventor: Youngchoon Park , Sudhi R. Sinha , Vaidhyanathan Venkiteswaran , Vijaya S. Chennupati , Erik S. Paulson
IPC: G05B19/00 , G06F16/28 , G05B15/02 , G06F16/23 , G06F16/2458 , G06F16/00 , G06F3/01 , G10L15/14 , G10L15/22 , G10L15/30 , G10L25/63
CPC classification number: G06F16/288 , G05B15/02 , G06F3/01 , G06F16/00 , G06F16/23 , G06F16/2477 , G06F16/28 , G10L15/14 , G10L15/142 , G10L15/22 , G10L15/30 , G10L25/63 , G05B2219/2642
Abstract: One or more non-transitory computer readable media contain program instructions that, when executed, cause one or more processors to: receive first raw data including one or more first data points generated by a first object of a plurality of objects associated with one or more buildings; generate first input timeseries according to the one or more data points; access a database of interconnected smart entities, the smart entities including object entities representing each of the plurality of objects and data entities representing stored data, the smart entities being interconnected by relational objects indicating relationships between the smart entities; identify a first object entity representing the first object from a first identifier in the first input timeseries; identify a first data entity from a first relational object indicating a relationship between the first object entity and the first data entity; and store the first input timeseries in the first data entity.
-
公开(公告)号:US11736860B2
公开(公告)日:2023-08-22
申请号:US17562412
申请日:2021-12-27
Applicant: Sonos, Inc.
Inventor: Jonathan P. Lang , Mark Plagge , Simon Jarvis , Romi Kadri , Yean-Nian Willy Chen , Paul Andrew Bates , Luis Vega-Zayas , Christopher Butts , Nicholas A. J. Millington , Keith Corbin
IPC: H04R3/00 , H04S7/00 , H04L12/28 , G06F3/16 , H04R29/00 , H04W8/00 , H04W8/24 , H04R27/00 , G10L15/14 , G10L15/22 , H04R3/12 , G10L21/02 , H04W84/12
CPC classification number: H04R3/00 , G06F3/162 , G06F3/165 , G06F3/167 , G10L15/14 , G10L15/22 , H04L12/2803 , H04L12/2809 , H04R3/12 , H04R27/00 , H04R29/007 , H04S7/301 , H04S7/303 , H04W8/005 , H04W8/24 , G10L21/02 , G10L2015/223 , H04L2012/2849 , H04R2227/003 , H04R2227/005 , H04R2420/07 , H04W84/12
Abstract: Multiple aspects of systems and methods for voice control and related features and functionality for various embodiments of media playback devices, networked microphone devices, microphone-equipped media playback devices, and speaker-equipped networked microphone devices are disclosed and described herein, including but not limited to designating and managing default networked devices, audio response playback, room-corrected voice detection, content mixing, music service selection, metadata exchange between networked playback systems and networked microphone systems, handling loss of pairing between networked devices, actions based on user identification, and other voice control of networked devices.
-
公开(公告)号:US11735173B2
公开(公告)日:2023-08-22
申请号:US17328400
申请日:2021-05-24
Applicant: Google LLC
Inventor: Pu-sen Chao , Diego Melendo Casado , Ignacio Lopez Moreno , William Zhang
CPC classification number: G10L15/197 , G10L13/00 , G10L15/005 , G10L15/08 , G10L15/14 , G10L15/1822 , G10L15/22 , G10L15/30 , G10L2015/088 , G10L2015/223 , G10L2015/228
Abstract: Determining a language for speech recognition of a spoken utterance received via an automated assistant interface for interacting with an automated assistant. Implementations can enable multilingual interaction with the automated assistant, without necessitating a user explicitly designate a language to be utilized for each interaction. Implementations determine a user profile that corresponds to audio data that captures a spoken utterance, and utilize language(s), and optionally corresponding probabilities, assigned to the user profile in determining a language for speech recognition of the spoken utterance. Some implementations select only a subset of languages, assigned to the user profile, to utilize in speech recognition of a given spoken utterance of the user. Some implementations perform speech recognition in each of multiple languages assigned to the user profile, and utilize criteria to select only one of the speech recognitions as appropriate for generating and providing content that is responsive to the spoken utterance.
-
公开(公告)号:US20230215426A1
公开(公告)日:2023-07-06
申请号:US18183695
申请日:2023-03-14
Applicant: TD Ameritrade IP Company, Inc.
Inventor: Abhilash Krishnankutty NAIR , Amaris Yuseon Sim , Dayanand Narregudem , Drew David Riassetto , Logan Sommers Ahlstrom , Nafiseh Saberian , Stephen Filios , Ravindra Reddy Tappeta Venkata
CPC classification number: G10L15/1822 , G06F40/30 , G06Q30/0281 , G10L15/14 , G10L15/16 , G06F16/90332
Abstract: A method of operating a customer utterance analysis system includes obtaining a subset of utterances from among a first set of utterances. The method includes encoding, by a sentence encoder, the subset of utterances into multi-dimensional vectors. The method includes generating reduced-dimensionality vectors by reducing a dimensionality of the multi-dimensional vectors. Each vector of the reduced-dimensionality vectors corresponds to an utterance from among the subset of utterances. The method includes performing clustering on the reduced-dimensionality vectors. The method includes, based on the clustering performed on the reduced-dimensionality vectors, arranging the subset of utterances into clusters. The method includes obtaining labels for a least two clusters from among the clusters. The method includes generating training data based on the obtained labels. The method includes training a neural network model to predict an intent of an utterance based on the training data.
-
-
-
-
-
-
-
-
-