-
公开(公告)号:US20200211544A1
公开(公告)日:2020-07-02
申请号:US16583688
申请日:2019-09-26
申请人: RingCentral, Inc.
IPC分类号: G10L15/22 , G10L15/04 , G10L15/16 , G10L25/90 , G10L15/24 , G10L13/04 , G10L15/30 , G06N3/08
摘要: Systems, methods, and computer readable media comprising instructions executable by a processor, for recognizing speech within a received audio signal segment the audio signal to isolate the speech based on a speaker audio profile, determine from the audio signal a command, a first score reflecting confidence in determining the command, and a second score reflecting a potential error in determining the command, and cause the command to be executed if the first score is above a first threshold value and the second score is below a second threshold value.
-
公开(公告)号:US20200211531A1
公开(公告)日:2020-07-02
申请号:US16235776
申请日:2018-12-28
申请人: Rohit Kumar , Henrik Lindström , Henriette Cramer , Sarah Mennicken , Sravana Reddy , Jennifer Thom-Santelli
发明人: Rohit Kumar , Henrik Lindström , Henriette Cramer , Sarah Mennicken , Sravana Reddy , Jennifer Thom-Santelli
IPC分类号: G10L13/04 , G06F16/683
摘要: A text-to-speech engine creates audio output that includes synthesized speech and one or more media content item snippets. The input text is obtained and partitioned into text sets. A track having lyrics that match a part of one of the text sets is identified. The location of the track's audio that contains the lyric is extracted based on forced alignment data. The extracted audio is combined with synthesized speech corresponding to the remainder of the input text to form audio output.
-
公开(公告)号:US10694038B2
公开(公告)日:2020-06-23
申请号:US16016453
申请日:2018-06-22
摘要: Systems and methods for managing a call between a contact, a conversation bot, and a human agent are disclosed. The method selects a conversation bot associated with a particular human agent from multiple conversation bots that are each associated with a different human agent. Each conversation bot can be a model trained using conversation data recorded during conversations conducted by the particular human agent with which it is associated. The method connects an audio call with a human contact, and generates audio during the call based upon a voice of the particular human agent. The method determines that a transition criterion is satisfied, and selects a selected human agent from amongst a plurality of available human agents. When the transition criterion is satisfied, the method enables a selected human agent to participate on the call, and continues the call between the selected human agent and the human contact.
-
公开(公告)号:US10692494B2
公开(公告)日:2020-06-23
申请号:US15975784
申请日:2018-05-10
申请人: Sattam Dasgupta
发明人: Sattam Dasgupta
IPC分类号: G10L15/22 , G06F3/16 , G06F3/0481 , G06F3/0484 , G10L13/04 , G10L15/26 , G06F3/0482 , G10L13/00
摘要: Techniques for providing application-independent content translation in an electronic device are disclosed. In one embodiment, a trigger may be received to activate a first application. Upon receiving the trigger to activate the first application, the first application may be enabled to display at least one visual indicator associated with the first application on a graphical user interface associated with a second application. The first application and the second application are to simultaneously run in an electronic device and the at least one visual indicator may be superimposed on the graphical user interface. Further, content on the graphical user interface may be translated from text-to-speech or speech-to-text in response to selecting the at least one visual indicator.
-
105.
公开(公告)号:US10691898B2
公开(公告)日:2020-06-23
申请号:US15771460
申请日:2015-10-29
申请人: Hitachi, Ltd.
发明人: Qinghua Sun , Takeshi Homma , Takashi Sumiyoshi , Masahito Togami
IPC分类号: G06F40/40 , H04N21/2343 , H04N21/234 , H04N21/44 , H04N21/439 , H04N21/233 , H04N21/4402 , G06F40/45 , G06F40/58 , G06F40/194 , B25J9/16 , G10L13/04 , G10L15/22 , G10L21/055 , G10L13/00 , G10L15/26
摘要: Disclosed is a method for synchronizing visual information and auditory information characterized by extracting visual information included in video, recognizing auditory information in a first language that is included in a speech in the first language, associating the visual information with the auditory information in the first language, translating the auditory information in the first language to auditory information in a second language, and editing at least one of the visual information with the auditory information in the second language so as to associate the visual information and the auditory information in the second language with each other.
-
公开(公告)号:US10685644B2
公开(公告)日:2020-06-16
申请号:US16027337
申请日:2018-07-04
申请人: YANDEX EUROPE AG
IPC分类号: G10L13/04 , G10L13/08 , G10L13/02 , G06F17/16 , G10L15/187 , G10L13/00 , G10L13/06 , G06N20/00 , G06F40/205
摘要: There is disclosed a method of generating a text-to-speech (TTS) training set for training a Machine Learning Algorithm (MLA) for generating machine-spoken utterances The method is executable by a server. The method includes generating a synthetic word based on merging separate phonemes from each of two words of a corpus of pre-recorded utterances, the merging being done using the common phoneme as a merging anchor, the merging resulting in at least two synthetic words. The synthetic words and assessor labels are used to train a classifier to predict a quality parameter associated with a new synthetic phonemes-based word, the quality parameter being representative of whether the new synthetic phonemes-based word is naturally sounding (based on acoustic features of generated synthetic words utterances). The classifier is then used to generate training objects for the MLA and to use the MLA to process the corpus of pre-recorded utterances into their respective vectors.
-
公开(公告)号:US10679608B2
公开(公告)日:2020-06-09
申请号:US15841284
申请日:2017-12-13
申请人: GOOGLE LLC
发明人: Kenneth Mixter , Daniel Colish , Tuan Nguyen
摘要: A method for proactive notifications in a voice interface device includes: receiving a first user voice request for an action with an future performance time; assigning the first user voice request to a voice assistant service for performance; subsequent to the receiving, receiving a second user voice request and in response to the second user voice request initiating a conversation with the user; and during the conversation: receiving a notification from the voice assistant service of performance of the action; triggering a first audible announcement to the user to indicate a transition from the conversation and interrupting the conversation; triggering a second audible announcement to the user to indicate performance of the action; and triggering a third audible announcement to the user to indicate a transition back to the conversation and rejoining the conversation.
-
公开(公告)号:US10672093B2
公开(公告)日:2020-06-02
申请号:US15559398
申请日:2016-03-10
发明人: Nak Jeong Jeong , Chan Jung Kim
IPC分类号: G10L13/04 , G06Q50/28 , G06Q50/12 , G10L13/00 , G06Q10/08 , G06Q10/04 , G06F3/16 , G06Q10/06 , G06Q30/06 , H04M1/725 , H04W4/14
摘要: A delivery order relaying system is disclosed. The system comprises: an order receiving module for receiving a delivery order, to be processed, which is transmitted from an orderer terminal; a TTS module for generating, through TTS, a voice delivery order corresponding to the delivery order to be processed; a voice output module for connecting a phone call to a vendor corresponding to the delivery order to be processed, and outputting the voice delivery order through the connected phone call; a response receiving module for receiving, from the vendor through the phone call, an order response to the delivery order to be processed; and a delivery order response module for transmitting order processing result information, corresponding to the order response, to the orderer terminal having transmitted the delivery order to be processed.
-
公开(公告)号:US10665129B2
公开(公告)日:2020-05-26
申请号:US15949344
申请日:2018-04-10
申请人: Facebook, Inc.
发明人: Robert Turcott
IPC分类号: G09B21/00 , G01L5/00 , G06N20/00 , G06N3/04 , G06N3/08 , G08B6/00 , G09B21/04 , G10L15/02 , G10L15/22 , G10L13/04 , G10L21/02 , G10L21/0272 , G06F3/01 , G06F3/16 , G10L25/18 , G10L25/48 , G10L19/00 , G10L15/16 , G10L21/06
摘要: A haptic communication system includes a broadband signal generator to extract parameters from sensor signals describing a message for transmission to a user. Broadband carrier signals are generated by aggregating a plurality of frequency components. Actuator signals are generated by encoding the parameters from the sensor signals into the broadband carrier signals. One or more cutaneous actuators are communicatively coupled to the broadband signal generator to receive the actuator signals. Haptic vibrations are generated corresponding to the actuator signals on a body of the user to communicate the message to the user.
-
公开(公告)号:US10650701B2
公开(公告)日:2020-05-12
申请号:US15949837
申请日:2018-04-10
申请人: Facebook, Inc.
发明人: Ali Israr
IPC分类号: G08B6/00 , G09B21/00 , G01L5/00 , G06N20/00 , G06N3/04 , G06N3/08 , G09B21/04 , G10L15/02 , G10L15/22 , G10L13/04 , G10L21/02 , G10L21/0272 , G06F3/01 , G06F3/16 , G10L25/18 , G10L25/48 , G10L19/00 , G10L15/16 , G10L21/06
摘要: Embodiments relate to operating multiple cutaneous actuators to provide the sensation of motions or actions occurring within the body. A part of receiving user's body (e.g., limb or head) is placed between the cutaneous actuators. The cutaneous actuators are operated in sequence, causing the illusion of motions or actions occurring inside the body part, as opposed to patches of skin where the cutaneous actuators are located. By differing the time interval between the activation of the cutaneous actuators and/or amplitude of vibrations generated by the cutaneous actuators.
-
-
-
-
-
-
-
-
-