-
Publication No.: US12057108B2
Publication date: 2024-08-06
Application No.: US17071913
Filing date: 2020-10-15
Applicant: Collibra Belgium BV
Inventor(s): Michael Tandecki, Michael Maes, Anna Filipiak
Abstract: A system for classifying words in a batch of words can include at least one memory device storing instructions for causing at least one processor to create dictionary vectors for each of a plurality of dictionary words using a neural network (NN), store each dictionary vector along with a classification indicator corresponding to the associated dictionary word, and create word vectors for each word in a batch of words for classification using the NN. The closest matching dictionary vector is found for each word vector, and the classification indicator of the closest matching dictionary vector for each word vector in the batch is reported.
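The lookup described in this abstract can be sketched as a nearest-neighbor search over stored vectors. The sketch below is illustrative only: the abstract does not describe the neural network, so `embed` is a hypothetical stand-in that builds a bag of character trigrams, and cosine similarity is an assumed choice of "closest match". The function names and the sample dictionary are likewise assumptions.

```python
import math
from collections import Counter


def embed(word, n=3):
    """Hypothetical stand-in for the NN: a sparse bag of character n-grams."""
    padded = f"#{word.lower()}#"
    return Counter(padded[i:i + n] for i in range(max(1, len(padded) - n + 1)))


def cosine(a, b):
    """Cosine similarity between two sparse vectors (assumed distance measure)."""
    dot = sum(a[k] * b[k] for k in a if k in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def build_index(dictionary):
    """Store each dictionary vector alongside its classification indicator."""
    return [(embed(word), label) for word, label in dictionary.items()]


def classify_batch(words, index):
    """Report the label of the closest dictionary vector for each word."""
    results = []
    for word in words:
        vec = embed(word)
        _, label = max(index, key=lambda entry: cosine(vec, entry[0]))
        results.append(label)
    return results


# Sample data is invented for illustration.
index = build_index({
    "london": "city",
    "paris": "city",
    "john": "first_name",
    "maria": "first_name",
})
labels = classify_batch(["Londn", "Marie"], index)
```

With this toy embedding, misspelled or variant inputs still land on the label of the nearest dictionary word; a learned embedding would replace `embed` without changing the lookup.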
-
Publication No.: US20240184992A1
Publication date: 2024-06-06
Application No.: US18074653
Filing date: 2022-12-05
Abstract: Systems and methods for increasing accuracy in an online chat interface are provided. Methods may be executed via a machine-learning (ML)-based chat monitoring engine. Methods may include intercepting a request utterance. Methods may include computing, via a trained ML model, an intent of the request utterance; generating a target response to the request utterance; intercepting a response utterance; and calculating a difference between the response utterance and the target response. In response to the difference being less than a threshold difference, methods may include releasing the response utterance to be transmitted as a response message. In response to the difference being more than a threshold difference, methods may include preventing the response utterance from being transmitted as a response message, generating a revised response utterance that is less than a threshold difference apart from the target response, and transmitting the revised response as a response message to a remote computing device.
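The release-or-revise decision in this abstract can be sketched as a simple threshold gate. This is a minimal sketch under stated assumptions: the abstract does not define the difference measure, so `difference` uses `difflib.SequenceMatcher` from the Python standard library as a stand-in; the threshold value is arbitrary; and the "revised response" is simply the target response itself, which trivially satisfies the requirement of being within the threshold. The abstract also leaves the exactly-equal case unspecified; the sketch treats it as a block.

```python
from difflib import SequenceMatcher


def difference(a, b):
    """Stand-in difference measure in [0, 1]: 0 means identical strings."""
    return 1.0 - SequenceMatcher(None, a.lower(), b.lower()).ratio()


def gate_response(response, target, threshold=0.4):
    """Release the response if it is close to the target, else substitute a revision.

    Returns (message_to_send, action), where action is "released" or "revised".
    """
    if difference(response, target) < threshold:
        return response, "released"
    # Assumption: fall back to the target response as the revised utterance.
    return target, "revised"


target = "Your balance is $20."
ok = gate_response("Your balance is $20.", target)
blocked = gate_response("I like turtles", target)
```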
-
Publication No.: US20240135960A1
Publication date: 2024-04-25
Application No.: US18401231
Filing date: 2023-12-29
Applicant: Google LLC
Inventor(s): Chaitanya GHARPURE, Evan FISHER, Eric LIU, Peng YANG, Emily HOU, Victoria FANG
Abstract: The disclosure provides technology for enhancing the ability of a computing device to detect when a user has discontinued reading a text source. An example method includes receiving audio data comprising a spoken word associated with a text source, comparing the audio data with data of the text source, determining, based on the comparing, whether a segment of the audio data corresponds to a location of the text source, and responsive to determining that the segment of the audio data does not correspond to a location of the text source, transmitting a signal indicating that a user has discontinued reading the text source, the signal causing the comparing of the audio data with the data of the text source to cease.
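The comparison step in this abstract can be illustrated with a sliding-window match. The sketch assumes the audio segment has already been transcribed to a list of words, which the abstract does not state; the function names, the word-overlap score, and the `min_overlap` value are all hypothetical.

```python
def find_location(segment_words, text_words, min_overlap=0.6):
    """Return the start index in text_words that best matches the segment, or None."""
    n = len(segment_words)
    if n == 0:
        return None
    best_pos, best_score = None, 0.0
    for start in range(len(text_words) - n + 1):
        window = text_words[start:start + n]
        # Fraction of positions where the spoken word equals the text word.
        score = sum(a == b for a, b in zip(segment_words, window)) / n
        if score > best_score:
            best_pos, best_score = start, score
    return best_pos if best_score >= min_overlap else None


def reading_signal(segment_words, text_words):
    """Emit a 'discontinued' signal when the segment matches no location in the text."""
    location = find_location(segment_words, text_words)
    return "reading" if location is not None else "discontinued"


text = "once upon a time there was a small fox".split()
```

A caller would stop feeding segments to `find_location` once `reading_signal` returns `"discontinued"`, mirroring the abstract's signal that causes the comparison to cease.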
-
Publication No.: US20240105174A1
Publication date: 2024-03-28
Application No.: US18527928
Filing date: 2023-12-04
Inventor(s): Takahiro KAMAI, Katsunori DAIMO, Misaki DOI, Kousuke ITAKURA
CPC classification: G10L15/22, B60R16/0373, G10L15/01, G10L15/02, G10L15/10, G10L15/30, G10L2015/223
Abstract: A voice recognition device includes an estimation unit that compares a plurality of pieces of registration voice data stored in a database with input voice data uttered by a speaker who gets on a mobile body to estimate a registration command corresponding to the input command, a presentation unit that presents an estimation result, a second acquisition unit that acquires an error instruction indicating that the estimation result is an error, a determination unit that, in a case where the error instruction is acquired, determines a correct command corresponding to the input command based on an operation by the speaker, and a database management unit that stores the correct command and the input voice data in the database in association with each other.
-
Publication No.: US20240038230A1
Publication date: 2024-02-01
Application No.: US18377590
Filing date: 2023-10-06
Inventor(s): Nam-yeong KWON, Kyung-mi PARK
IPC classification: G10L15/22, G10L15/06, H04N21/422, H04N21/439, H04N21/482, G06F3/16, G10L15/02, G10L15/10
CPC classification: G10L15/22, G10L15/06, H04N21/42203, H04N21/4394, H04N21/482, G06F3/167, G10L15/02, G10L15/10, G10L15/187
Abstract: A display apparatus includes an input unit configured to receive a user command; an output unit configured to output a registration suitability determination result for the user command; and a processor configured to generate phonetic symbols for the user command, analyze the generated phonetic symbols to determine registration suitability for the user command, and control the output unit to output the registration suitability determination result for the user command. Therefore, the display apparatus may register a user command which is resistant to misrecognition and guarantees a high recognition rate among user commands defined by a user.
-
Publication No.: US11887586B2
Publication date: 2024-01-30
Application No.: US17191512
Filing date: 2021-03-03
Applicant: Spotify AB
Inventor(s): Vidhya Murali, Aaron Paul Harmon
CPC classification: G10L15/1815, G10L15/05, G10L15/10, G10L15/1822, G10L2015/088
Abstract: A method includes retrieving a plurality of transcripts from a database. Each transcript in the plurality of transcripts corresponds to audio from a media content item of a plurality of media content items that are provided by a media providing service. The method also includes applying each transcript of the plurality of transcripts to a trained computational model, and receiving a user request for information regarding a topic. The method further includes, in response to the user request, identifying a transcript from the database that is relevant to the topic, and a position within the transcript that is relevant to the topic. The method also includes providing, by the media providing service, at least a portion of a media content item corresponding to the identified transcript, beginning at a starting position that is based on the position within the identified transcript that is relevant to the topic.
-
Publication No.: US11854572B2
Publication date: 2023-12-26
Application No.: US17302981
Filing date: 2021-05-18
CPC classification: G10L25/18, G10L15/10, G10L15/14, G10L19/008, G10L19/0204, G10L25/90
Abstract: Computer-implemented methods, computer program products, and computer systems for mitigating frequency loss may include one or more processors configured for receiving first audio data corresponding to unobstructed user utterances, receiving second audio data corresponding to first obstructed user utterances, generating a frequency loss (FL) model representing frequency loss between the first audio data and the second audio data, receiving third audio data corresponding to one or more second obstructed user utterances, processing the third audio data using the FL model to generate fourth audio data corresponding to a frequency loss mitigated version of the second obstructed user utterances, and transmitting the fourth audio data to a recipient computing device. The first obstructed user utterances are obstructed by a facemask and the one or more second obstructed user utterances are obstructed by the facemask. The FL model may be executed as an audio plugin in a web conferencing program.
-
Publication No.: US11763814B2
Publication date: 2023-09-19
Application No.: US17353678
Filing date: 2021-06-21
Applicant: Logitech Europe S.A.
CPC classification: G10L15/22, G10L15/10, G10L15/32, G10L2015/223
Abstract: A digitized audio command is decoded to generate audio features. An in-domain confidence score is calculated for a model trained by a limited set of peripheral device commands. An out-domain confidence score is calculated for a model trained without the peripheral device commands. The best score determines whether to process the audio locally or at a remote server. In some embodiments, a likelihood ratio (LR) of the in-domain and out-domain confidence scores is calculated. Based on the likelihood ratio, a locally decoded audio command is performed, or the audio features are sent to a remote server for processing to determine the audio command.
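The likelihood-ratio routing in this abstract can be sketched as a one-line comparison. The sketch assumes both confidence scores are log-likelihoods, so the ratio becomes a difference; the abstract specifies neither the score scale nor the threshold, so `log_lr_threshold` and the function name are hypothetical.

```python
def route_command(in_domain_logp, out_domain_logp, log_lr_threshold=0.0):
    """Decide where to process an audio command from two model scores.

    Assumes both scores are log-likelihoods, so the log likelihood ratio
    is their difference. Returns "local" or "remote".
    """
    log_lr = in_domain_logp - out_domain_logp
    # A high ratio means the limited peripheral-command model explains the
    # audio better, so the locally decoded command can be performed.
    return "local" if log_lr >= log_lr_threshold else "remote"


local_case = route_command(-12.0, -20.0)   # in-domain model fits better
remote_case = route_command(-25.0, -14.0)  # out-domain model fits better
```

Raising `log_lr_threshold` sends more borderline commands to the remote server, trading latency for a lower risk of executing a misrecognized local command.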
-
Publication No.: US20190214023A1
Publication date: 2019-07-11
Application No.: US16330131
Filing date: 2017-08-04
Applicant: SONY CORPORATION
Inventor(s): Keigo IHARA
CPC classification: G10L17/22, G06Q30/02, G06Q30/06, G10L15/10, G10L17/00, G10L17/005, G10L17/06, H04R1/406, H04R3/005
Abstract: [Object] To provide an information processing device and an information processing method that can collect speech voice of a user, and recognize a specific user on the basis of the number of speeches performed by the user within a predetermined period. [Solution] An information processing device including: a communication unit capable of receiving voice information regarding voice collected by a plurality of microphones disposed discretely; and a control unit configured to determine a user identified on the basis of voice information regarding voice collected by a specific microphone among the plurality of microphones, the voice information having been received via the communication unit, to be a specific user that has performed speech a predefined number of times or more within at least a certain period of time, and control voice information to be transmitted to the specific user, to be transmitted to a speaker corresponding to the specific microphone, via the communication unit.
-
Publication No.: US20180261219A1
Publication date: 2018-09-13
Application No.: US15914893
Filing date: 2018-03-07
Applicant: SalesBoost, LLC
Inventor(s): Margaret L BROOKS
CPC classification: G10L15/22, G06F16/683, G06Q10/06398, G10L15/10, G10L25/48, G10L2015/225
Abstract: A method for performing voice analysis includes storing, in a database, a simulation file for conducting a training session with a user, the simulation file including at least a script. The method includes further storing, in the database, desired attributes associated with the simulation file. The method also includes retrieving, by a server, the simulation file from the database and providing, by a client application, a user interface to conduct the voice analysis using the simulation file from the database. The method further includes receiving, at the client application, one or more voice impressions from a user and analyzing, at an audio analysis tool, at least one of the voice impressions of the user. The method additionally includes determining, at the audio analysis tool, attributes of the at least one voice impression in response to analyzing the at least one voice impression and comparing, at the audio analysis tool, the determined attributes to the desired attributes associated with the simulation file. The method provides, by the client application, feedback to the user based on the comparison.
-