专利检索 ipc:G10L13/04 第 11 页

101.

发明申请
SYSTEMS AND METHODS FOR RECOGNIZING A SPEECH OF A SPEAKER 审中-公开

公开(公告)号：US20200211544A1

公开(公告)日：2020-07-02

申请号：US16583688

申请日：2019-09-26

申请人： RingCentral, Inc.

发明人： Ilya Vladimirovish Mikhailov

IPC分类号： G10L15/22 , G10L15/04 , G10L15/16 , G10L25/90 , G10L15/24 , G10L13/04 , G10L15/30 , G06N3/08

摘要： Systems, methods, and computer readable media comprising instructions executable by a processor, for recognizing speech within a received audio signal segment the audio signal to isolate the speech based on a speaker audio profile, determine from the audio signal a command, a first score reflecting confidence in determining the command, and a second score reflecting a potential error in determining the command, and cause the command to be executed if the first score is above a first threshold value and the second score is below a second threshold value.

102.

发明申请
TEXT-TO-SPEECH FROM MEDIA CONTENT ITEM SNIPPETS 审中-公开

公开(公告)号：US20200211531A1

公开(公告)日：2020-07-02

申请号：US16235776

申请日：2018-12-28

申请人： Rohit Kumar , Henrik Lindström , Henriette Cramer , Sarah Mennicken , Sravana Reddy , Jennifer Thom-Santelli

发明人： Rohit Kumar , Henrik Lindström , Henriette Cramer , Sarah Mennicken , Sravana Reddy , Jennifer Thom-Santelli

IPC分类号： G10L13/04 , G06F16/683

摘要： A text-to-speech engine creates audio output that includes synthesized speech and one or more media content item snippets. The input text is obtained and partitioned into text sets. A track having lyrics that match a part of one of the text sets is identified. The location of the track's audio that contains the lyric is extracted based on forced alignment data. The extracted audio is combined with synthesized speech corresponding to the remainder of the input text to form audio output.

103.

发明授权
System and method for managing calls of an automated call management system 有权

公开(公告)号：US10694038B2

公开(公告)日：2020-06-23

申请号：US16016453

申请日：2018-06-22

申请人： Replicant Solutions, Inc.

发明人： Jack Phillip Abraham , Benjamin Gleitzman

IPC分类号： H04M3/51 , H04M3/22 , G10L13/04 , H04M3/42 , H04M3/523 , G10L13/00

摘要： Systems and methods for managing a call between a contact, a conversation bot, and a human agent are disclosed. The method selects a conversation bot associated with a particular human agent from multiple conversation bots that are each associated with a different human agent. Each conversation bot can be a model trained using conversation data recorded during conversations conducted by the particular human agent with which it is associated. The method connects an audio call with a human contact, and generates audio during the call based upon a voice of the particular human agent. The method determines that a transition criterion is satisfied, and selects a selected human agent from amongst a plurality of available human agents. When the transition criterion is satisfied, the method enables a selected human agent to participate on the call, and continues the call between the selected human agent and the human contact.

104.

发明授权
Application-independent content translation 有权

公开(公告)号：US10692494B2

公开(公告)日：2020-06-23

申请号：US15975784

申请日：2018-05-10

申请人： Sattam Dasgupta

发明人： Sattam Dasgupta

IPC分类号： G10L15/22 , G06F3/16 , G06F3/0481 , G06F3/0484 , G10L13/04 , G10L15/26 , G06F3/0482 , G10L13/00

摘要： Techniques for providing application-independent content translation in an electronic device are disclosed. In one embodiment, a trigger may be received to activate a first application. Upon receiving the trigger to activate the first application, the first application may be enabled to display at least one visual indicator associated with the first application on a graphical user interface associated with a second application. The first application and the second application are to simultaneously run in an electronic device and the at least one visual indicator may be superimposed on the graphical user interface. Further, content on the graphical user interface may be translated from text-to-speech or speech-to-text in response to selecting the at least one visual indicator.

105.

发明授权
Synchronization method for visual information and auditory information and information processing device 有权

公开(公告)号：US10691898B2

公开(公告)日：2020-06-23

申请号：US15771460

申请日：2015-10-29

申请人： Hitachi, Ltd.

发明人： Qinghua Sun , Takeshi Homma , Takashi Sumiyoshi , Masahito Togami

IPC分类号： G06F40/40 , H04N21/2343 , H04N21/234 , H04N21/44 , H04N21/439 , H04N21/233 , H04N21/4402 , G06F40/45 , G06F40/58 , G06F40/194 , B25J9/16 , G10L13/04 , G10L15/22 , G10L21/055 , G10L13/00 , G10L15/26

摘要： Disclosed is a method for synchronizing visual information and auditory information characterized by extracting visual information included in video, recognizing auditory information in a first language that is included in a speech in the first language, associating the visual information with the auditory information in the first language, translating the auditory information in the first language to auditory information in a second language, and editing at least one of the visual information with the auditory information in the second language so as to associate the visual information and the auditory information in the second language with each other.

106.

发明授权
Method and system for text-to-speech synthesis 有权

公开(公告)号：US10685644B2

公开(公告)日：2020-06-16

申请号：US16027337

申请日：2018-07-04

申请人： YANDEX EUROPE AG

发明人： Vladimir Vladimirovich Kirichenko , Petr Vladislavovich Luferenko

IPC分类号： G10L13/04 , G10L13/08 , G10L13/02 , G06F17/16 , G10L15/187 , G10L13/00 , G10L13/06 , G06N20/00 , G06F40/205

摘要： There is disclosed a method of generating a text-to-speech (TTS) training set for training a Machine Learning Algorithm (MLA) for generating machine-spoken utterances The method is executable by a server. The method includes generating a synthetic word based on merging separate phonemes from each of two words of a corpus of pre-recorded utterances, the merging being done using the common phoneme as a merging anchor, the merging resulting in at least two synthetic words. The synthetic words and assessor labels are used to train a classifier to predict a quality parameter associated with a new synthetic phonemes-based word, the quality parameter being representative of whether the new synthetic phonemes-based word is naturally sounding (based on acoustic features of generated synthetic words utterances). The classifier is then used to generate training objects for the MLA and to use the MLA to process the corpus of pre-recorded utterances into their respective vectors.

107.

发明授权
Conversation-aware proactive notifications for a voice interface device 有权

公开(公告)号：US10679608B2

公开(公告)日：2020-06-09

申请号：US15841284

申请日：2017-12-13

申请人： GOOGLE LLC

发明人： Kenneth Mixter , Daniel Colish , Tuan Nguyen

IPC分类号： G10L13/04 , H04L12/28 , G06F3/16 , G10L15/22 , G10L15/26 , H04L12/58 , H04L29/08

摘要： A method for proactive notifications in a voice interface device includes: receiving a first user voice request for an action with an future performance time; assigning the first user voice request to a voice assistant service for performance; subsequent to the receiving, receiving a second user voice request and in response to the second user voice request initiating a conversation with the user; and during the conversation: receiving a notification from the voice assistant service of performance of the action; triggering a first audible announcement to the user to indicate a transition from the conversation and interrupting the conversation; triggering a second audible announcement to the user to indicate performance of the action; and triggering a third audible announcement to the user to indicate a transition back to the conversation and rejoining the conversation.

108.

发明授权
Delivery order relaying system using TTS and method therefor 审中-公开

公开(公告)号：US10672093B2

公开(公告)日：2020-06-02

申请号：US15559398

申请日：2016-03-10

申请人： WOOWA BROTHERS CO., LTD.

发明人： Nak Jeong Jeong , Chan Jung Kim

IPC分类号： G10L13/04 , G06Q50/28 , G06Q50/12 , G10L13/00 , G06Q10/08 , G06Q10/04 , G06F3/16 , G06Q10/06 , G06Q30/06 , H04M1/725 , H04W4/14

摘要： A delivery order relaying system is disclosed. The system comprises: an order receiving module for receiving a delivery order, to be processed, which is transmitted from an orderer terminal; a TTS module for generating, through TTS, a voice delivery order corresponding to the delivery order to be processed; a voice output module for connecting a phone call to a vendor corresponding to the delivery order to be processed, and outputting the voice delivery order through the connected phone call; a response receiving module for receiving, from the vendor through the phone call, an order response to the delivery order to be processed; and a delivery order response module for transmitting order processing result information, corresponding to the order response, to the orderer terminal having transmitted the delivery order to be processed.

109.

发明授权
Haptic communication system using broad-band stimuli 有权

公开(公告)号：US10665129B2

公开(公告)日：2020-05-26

申请号：US15949344

申请日：2018-04-10

申请人： Facebook, Inc.

发明人： Robert Turcott

IPC分类号： G09B21/00 , G01L5/00 , G06N20/00 , G06N3/04 , G06N3/08 , G08B6/00 , G09B21/04 , G10L15/02 , G10L15/22 , G10L13/04 , G10L21/02 , G10L21/0272 , G06F3/01 , G06F3/16 , G10L25/18 , G10L25/48 , G10L19/00 , G10L15/16 , G10L21/06

摘要： A haptic communication system includes a broadband signal generator to extract parameters from sensor signals describing a message for transmission to a user. Broadband carrier signals are generated by aggregating a plurality of frequency components. Actuator signals are generated by encoding the parameters from the sensor signals into the broadband carrier signals. One or more cutaneous actuators are communicatively coupled to the broadband signal generator to receive the actuator signals. Haptic vibrations are generated corresponding to the actuator signals on a body of the user to communicate the message to the user.

110.

发明授权
Haptic communication using inside body illusions 有权

公开(公告)号：US10650701B2

公开(公告)日：2020-05-12

申请号：US15949837

申请日：2018-04-10

申请人： Facebook, Inc.

发明人： Ali Israr

IPC分类号： G08B6/00 , G09B21/00 , G01L5/00 , G06N20/00 , G06N3/04 , G06N3/08 , G09B21/04 , G10L15/02 , G10L15/22 , G10L13/04 , G10L21/02 , G10L21/0272 , G06F3/01 , G06F3/16 , G10L25/18 , G10L25/48 , G10L19/00 , G10L15/16 , G10L21/06

摘要： Embodiments relate to operating multiple cutaneous actuators to provide the sensation of motions or actions occurring within the body. A part of receiving user's body (e.g., limb or head) is placed between the cutaneous actuators. The cutaneous actuators are operated in sequence, causing the illusion of motions or actions occurring inside the body part, as opposed to patches of skin where the cutaneous actuators are located. By differing the time interval between the activation of the cutaneous actuators and/or amplitude of vibrations generated by the cutaneous actuators.

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类