Patent search: cpc:"G10L15/10", page 1

1.

Granted invention patent
Methods and systems for word edit distance embedding (in force)

Publication No.: US12057108B2

Publication date: 2024-08-06

Application No.: US17071913

Filing date: 2020-10-15

Applicant: Collibra Belgium BV

Inventors: Michael Tandecki, Michael Maes, Anna Filipiak

IPC classification: G10L15/16, G06N3/08, G10L15/10

CPC classification: G10L15/16, G06N3/08, G10L15/10

Abstract: A system for classifying words in a batch of words can include at least one memory device storing instructions for causing at least one processor to create dictionary vectors for each of a plurality of dictionary words using a neural network (NN), store each dictionary vector along with a classification indicator corresponding to the associated dictionary word, and create word vectors for each word in a batch of words for classification using the NN. The closest matching dictionary vector is found for each word vector, and the classification indicator of the closest matching dictionary vector for each word vector in the batch is reported.
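
The lookup step described in this abstract can be illustrated with a minimal sketch. It is not the patented method: `toy_embed` is a hypothetical letter-count stand-in for the neural network, cosine distance is an assumed similarity measure, and the dictionary entries and labels are invented for illustration.

```python
import math

def toy_embed(word, dims=26):
    """Hypothetical stand-in for the NN: a letter-count vector (assumption)."""
    vec = [0.0] * dims
    for ch in word.lower():
        if "a" <= ch <= "z":
            vec[ord(ch) - ord("a")] += 1.0
    return vec

def cosine_distance(a, b):
    """Cosine distance between two vectors; 1.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    if norm == 0.0:
        return 1.0
    return 1.0 - dot / norm

def build_index(dictionary):
    """Store each dictionary vector with its classification indicator."""
    return [(toy_embed(word), label) for word, label in dictionary.items()]

def classify_batch(words, index):
    """Report the label of the closest dictionary vector for each word."""
    labels = []
    for word in words:
        vec = toy_embed(word)
        _, label = min(index, key=lambda entry: cosine_distance(vec, entry[0]))
        labels.append(label)
    return labels

# Invented example data: misspelled inputs still land on the nearest entry.
index = build_index({"belgium": "country", "michael": "first_name", "october": "month"})
print(classify_batch(["belgum", "micheal", "octobr"], index))
```

A learned embedding would replace `toy_embed`; the index-then-nearest-neighbor structure is the only part taken from the abstract.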

2.

Published invention application
MACHINE LEARNING (ML)-BASED DUAL LAYER CONVERSATIONAL ASSIST SYSTEM (under examination, published)

Publication No.: US20240184992A1

Publication date: 2024-06-06

Application No.: US18074653

Filing date: 2022-12-05

Applicant: Bank of America Corporation

Inventors: Ramakrishna R. Yannam, Ion Gerald McCusker, Prejish Thomas, Ravisha Andar

IPC classification: G06F40/35, G10L15/10, G10L15/14, G10L15/22, H04L51/21

CPC classification: G06F40/35, G10L15/10, G10L15/14, G10L15/22, H04L51/21

Abstract: Systems and methods for increasing accuracy in an online chat interface are provided. Methods may be executed via a machine-learning (ML)-based chat monitoring engine. Methods may include intercepting a request utterance. Methods may include computing, via a trained ML model, an intent of the request utterance; generating a target response to the request utterance; intercepting a response utterance; and calculating a difference between the response utterance and the target response. In response to the difference being less than a threshold difference, methods may include releasing the response utterance to be transmitted as a response message. In response to the difference being more than a threshold difference, methods may include preventing the response utterance from being transmitted as a response message, generating a revised response utterance that is less than a threshold difference apart from the target response, and transmitting the revised response as a response message to a remote computing device.
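
The release-or-revise decision in this abstract can be sketched as follows. This is an illustration under stated assumptions, not the claimed system: the difference measure (one minus the `difflib.SequenceMatcher` ratio), the threshold value, and the choice to substitute the target response itself as the revised utterance are all hypothetical.

```python
from difflib import SequenceMatcher

def difference(response, target):
    """Assumed difference measure: 0.0 for identical text, up to 1.0."""
    return 1.0 - SequenceMatcher(None, response.lower(), target.lower()).ratio()

def gate_response(response, target, threshold=0.4):
    """Return (message_to_send, released_unchanged).

    Below the threshold the intercepted response is released as-is.
    Otherwise it is withheld and, as a simplifying assumption, the target
    response (difference 0.0 from itself) is sent as the revised utterance.
    """
    if difference(response, target) < threshold:
        return response, True
    return target, False

# Invented example utterances.
target = "Your card was mailed on Monday."
print(gate_response("Your card was mailed on Monday.", target))
print(gate_response("I do not know.", target))
```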

3.

Published invention application
ALGORITHMIC DETERMINATION OF A STORY READERS DISCONTINUATION OF READING (under examination, published)

Publication No.: US20240135960A1

Publication date: 2024-04-25

Application No.: US18401231

Filing date: 2023-12-29

Applicant: Google LLC

Inventors: Chaitanya GHARPURE, Evan FISHER, Eric LIU, Peng YANG, Emily HOU, Victoria FANG

IPC classification: G10L25/87, G06F3/0483, G10L15/02, G10L15/04, G10L15/10, G10L15/26

CPC classification: G10L25/87, G06F3/0483, G10L15/02, G10L15/04, G10L15/10, G10L15/26

Abstract: The disclosure provides technology for enhancing the ability of a computing device to detect when a user has discontinued reading a text source. An example method includes receiving audio data comprising a spoken word associated with a text source, comparing the audio data with data of the text source, determining, based on the comparing, whether a segment of the audio data corresponds to a location of the text source, and responsive to determining that the segment of the audio data does not correspond to a location of the text source, transmitting a signal indicating that a user has discontinued reading the text source, the signal causing to cease the comparing of the audio data with the data of the text source.

4.

Published invention application
VOICE RECOGNITION DEVICE, VOICE RECOGNITION METHOD, AND NON-TRANSITORY COMPUTER READABLE RECORDING MEDIUM (under examination, published)

Publication No.: US20240105174A1

Publication date: 2024-03-28

Application No.: US18527928

Filing date: 2023-12-04

Applicant: Panasonic Intellectual Property Corporation of America

Inventors: Takahiro KAMAI, Katsunori DAIMO, Misaki DOI, Kousuke ITAKURA

IPC classification: G10L15/22, B60R16/037, G10L15/01, G10L15/02, G10L15/10, G10L15/30

CPC classification: G10L15/22, B60R16/0373, G10L15/01, G10L15/02, G10L15/10, G10L15/30, G10L2015/223

Abstract: A voice recognition device includes an estimation unit that compares a plurality of pieces of registration voice data stored in a database with input voice data uttered by a speaker who gets on a mobile body to estimate a registration command corresponding to the input command, a presentation unit that presents an estimation result, a second acquisition unit that acquires an error instruction indicating that the estimation result is an error, a determination unit that, in a case where the error instruction is acquired, determines a correct command corresponding to the input command based on an operation by the speaker, and a database management unit that stores the correct command and the input voice data in the database in association with each other.

5.

Published invention application
DISPLAY APPARATUS AND METHOD FOR REGISTRATION OF USER COMMAND (under examination, published)

Publication No.: US20240038230A1

Publication date: 2024-02-01

Application No.: US18377590

Filing date: 2023-10-06

Applicant: Samsung Electronics Co., Ltd.

Inventors: Nam-yeong KWON, Kyung-mi PARK

IPC classification: G10L15/22, G10L15/06, H04N21/422, H04N21/439, H04N21/482, G06F3/16, G10L15/02, G10L15/10

CPC classification: G10L15/22, G10L15/06, H04N21/42203, H04N21/4394, H04N21/482, G06F3/167, G10L15/02, G10L15/10, G10L15/187

Abstract: A display apparatus includes an input unit configured to receive a user command; an output unit configured to output a registration suitability determination result for the user command; and a processor configured to generate phonetic symbols for the user command, analyze the generated phonetic symbols to determine registration suitability for the user command, and control the output unit to output the registration suitability determination result for the user command. Therefore, the display apparatus may register a user command which is resistant to misrecognition and guarantees a high recognition rate among user commands defined by a user.

6.

Granted invention patent
Systems and methods for providing responses from media content (in force)

Publication No.: US11887586B2

Publication date: 2024-01-30

Application No.: US17191512

Filing date: 2021-03-03

Applicant: Spotify AB

Inventors: Vidhya Murali, Aaron Paul Harmon

IPC classification: G10L15/26, G10L15/18, G10L15/10, G10L15/05, G10L15/08

CPC classification: G10L15/1815, G10L15/05, G10L15/10, G10L15/1822, G10L2015/088

Abstract: A method includes retrieving a plurality of transcripts from a database. Each transcript in the plurality of transcripts corresponds to audio from a media content item of a plurality of media content items that are provided by a media providing service. The method also includes applying each transcript of the plurality of transcripts to a trained computational model, and receiving a user request for information regarding a topic. The method further includes, in response to the user request, identifying a transcript from the database that is relevant to the topic, and a position within the transcript that is relevant to the topic. The method also includes providing, by the media providing service, at least a portion of a media content item corresponding to the identified transcript, beginning at a starting position that is based on the position within the identified transcript that is relevant to the topic.

7.

Granted invention patent
Mitigating voice frequency loss (in force)

Publication No.: US11854572B2

Publication date: 2023-12-26

Application No.: US17302981

Filing date: 2021-05-18

Applicant: International Business Machines Corporation

Inventors: Mary D. Swift, Irene Lizeth Manotas Gutiérrez, Kelley Anders, Jonathan D. Dunne

IPC classification: G10L25/18, G10L19/008, G10L25/90, G10L19/02, G10L15/10, G10L15/14

CPC classification: G10L25/18, G10L15/10, G10L15/14, G10L19/008, G10L19/0204, G10L25/90

Abstract: Computer-implemented methods, computer program products, and computer systems for mitigating frequency loss may include one or more processors configured for receiving first audio data corresponding to unobstructed user utterances, receiving second audio data corresponding to first obstructed user utterances, generating a frequency loss (FL) model representing frequency loss between the first audio data and the second audio data, receiving third audio data corresponding to one or more second obstructed user utterances, processing the third audio data using the FL model to generate fourth audio data corresponding to a frequency loss mitigated version of the second obstructed user utterances, and transmitting the fourth audio data to a recipient computing device. The first obstructed user utterances are obstructed by a facemask and the one or more second obstructed user utterances are obstructed by the facemask. The FL model may be executed as an audio plugin in a web conferencing program.

8.

Granted invention patent
Hybrid voice command processing (in force)

Publication No.: US11763814B2

Publication date: 2023-09-19

Application No.: US17353678

Filing date: 2021-06-21

Applicant: Logitech Europe S.A.

Inventors: Arash Salarian, Milos Cernak, Pablo Mainar, Jean-Michael Chardon, Niccolò Antonello

IPC classification: G10L15/32, G10L15/22, G10L15/10

CPC classification: G10L15/22, G10L15/10, G10L15/32, G10L2015/223

Abstract: A digitized audio command is decoded to generate audio features. An in-domain confidence score is calculated for a model trained by a limited set of peripheral device commands. An out-domain confidence score is calculated for a model trained without the peripheral device commands. The best score determines whether to process the audio locally or at a remote server. In some embodiments, a likelihood ratio (LR) of the in-domain and out-domain confidence scores is calculated. Based on the likelihood ratio, a locally decoded audio command is performed, or the audio features are sent to a remote server for processing to determine the audio command.
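
The likelihood-ratio routing mentioned in this abstract can be sketched in a few lines. The sketch assumes the two confidence scores are log-likelihoods, so the log of the ratio is their difference; the function name, the threshold, and the example scores are hypothetical and not taken from the patent.

```python
def route_command(in_domain_score, out_domain_score, log_lr_threshold=0.0):
    """Choose where to process an audio command.

    Both scores are assumed to be log-likelihoods, so the log-likelihood
    ratio is in_domain_score - out_domain_score. A ratio above the
    threshold favors the in-domain (peripheral command) model and the
    command is handled locally; otherwise the features go to a server.
    """
    log_lr = in_domain_score - out_domain_score
    return "local" if log_lr > log_lr_threshold else "remote"

# Invented scores: the first favors the in-domain model, the second does not.
print(route_command(-12.0, -20.0))  # local
print(route_command(-25.0, -18.0))  # remote
```

Raising `log_lr_threshold` sends more borderline commands to the remote server; the appropriate value would have to be tuned on real data.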

9.

Invention application
INFORMATION PROCESSING DEVICE AND INFORMATION PROCESSING METHOD (under examination, published)

Publication No.: US20190214023A1

Publication date: 2019-07-11

Application No.: US16330131

Filing date: 2017-08-04

Applicant: SONY CORPORATION

Inventors: Keigo IHARA

IPC classification: G10L17/22, G10L17/00, H04R1/40, H04R3/00, G10L17/06

CPC classification: G10L17/22, G06Q30/02, G06Q30/06, G10L15/10, G10L17/00, G10L17/005, G10L17/06, H04R1/406, H04R3/005

Abstract: [Object] To provide an information processing device and an information processing method that can collect speech voice of a user, and recognize a specific user on the basis of the number of speeches performed by the user within a predetermined period. [Solution] An information processing device including: a communication unit capable of receiving voice information regarding voice collected by a plurality of microphones disposed discretely; and a control unit configured to determine a user identified on the basis of voice information regarding voice collected by a specific microphone among the plurality of microphones, the voice information having been received via the communication unit, to be a specific user that has performed speech a predefined number of times or more within at least a certain period of time, and control voice information to be transmitted to the specific user, to be transmitted to a speaker corresponding to the specific microphone, via the communication unit.

10.

Invention application
VOICE ANALYSIS TRAINING SYSTEM (under examination, published)

Publication No.: US20180261219A1

Publication date: 2018-09-13

Application No.: US15914893

Filing date: 2018-03-07

Applicant: SalesBoost, LLC

Inventors: Margaret L BROOKS

IPC classification: G10L15/22, G10L15/10, G06F17/30, G06Q10/06

CPC classification: G10L15/22, G06F16/683, G06Q10/06398, G10L15/10, G10L25/48, G10L2015/225

Abstract: A method for performing voice analysis includes storing, in a database, a simulation file for conducting a training session with a user, the simulation file including at least a script. The method includes further storing, in the database, desired attributes associated with the simulation file. The method also includes retrieving, by a server, the simulation file from the database and providing, by a client application, a user interface to conduct the voice analysis using the simulation file from the database. The method further includes receiving, at the client application, one or more voice impressions from a user and analyzing, at an audio analysis tool, at least one of the voice impressions of the user. The method additionally includes determining, at the audio analysis tool, attributes of the at least one voice impression in response to analyzing the at least one voice impression and comparing, at the audio analysis tool, the determined attributes to the desired attributes associated with the simulation file. The method provides, by the client application, feedback to the user based on the comparison.
