-
公开(公告)号:US20240363115A1
公开(公告)日:2024-10-31
申请号:US18770316
申请日:2024-07-11
Applicant: GOOGLE LLC
Inventor: Jonathan Hayden Gomes , Shashank Goel , Oscar Armando Azucena , Patrick Berny , Keun-Young Park , Matthew William Crowley
IPC: G10L15/22 , G06F3/14 , G06F3/16 , G10L15/08 , G10L15/16 , G10L15/30 , G10L21/0208 , H04M1/27 , H04R3/00
CPC classification number: G10L15/22 , G06F3/14 , G06F3/165 , G10L15/16 , G10L21/0208 , H04M1/271 , H04R3/00 , G10L2015/088 , G10L15/30 , G10L2021/02082
Abstract: Techniques are described herein for concurrent voice assistants. A method includes: providing first and second automated assistants with access to one or more microphones; receiving, from the first automated assistant, an indication that the first automated assistant has initiated a first session, and in response: continuing providing, to the first automated assistant, access to the one or more microphones; discontinuing providing, to the second automated assistant, access to the one or more microphones; and preventing the second automated assistant from accessing one or more portions of an output audio data stream; receiving, from the first automated assistant, an indication that the first session has ended, and in response: continuing providing, to the first automated assistant, access to the one or more microphones; resuming providing, to the second automated assistant, access to the one or more microphones; and resuming providing, to the second automated assistant, the output audio data stream.
-
公开(公告)号:US20240212688A1
公开(公告)日:2024-06-27
申请号:US18436911
申请日:2024-02-08
Applicant: PINDROP SECURITY, INC.
Inventor: Ellie KHOURY , Matthew GARLAND
IPC: G10L17/00 , G06N7/01 , G10L15/07 , G10L15/19 , G10L15/26 , G10L17/04 , G10L17/08 , G10L17/24 , H04M1/27
CPC classification number: G10L17/00 , G06N7/01 , G10L15/07 , G10L15/19 , G10L15/26 , G10L17/04 , G10L17/08 , G10L17/24 , H04M1/271 , H04M2203/40
Abstract: Utterances of at least two speakers in a speech signal may be distinguished and the associated speaker identified by use of diarization together with automatic speech recognition of identifying words and phrases commonly in the speech signal. The diarization process clusters turns of the conversation while recognized special form phrases and entity names identify the speakers. A trained probabilistic model deduces which entity name(s) correspond to the clusters.
-
公开(公告)号:US12008996B2
公开(公告)日:2024-06-11
申请号:US17717326
申请日:2022-04-11
Applicant: Microsoft Technology Licensing, LLC
Inventor: Richard Breuer , Thomas Moser , Christoph Gilles , Hans Haustetter
CPC classification number: G10L17/00 , G10L17/24 , G10L17/26 , H04M1/271 , H04M3/533 , H04M2203/6027 , H04M2250/74
Abstract: A system, method and computer-readable storage device are disclosed signing a voicemail and confirming an identity of the speaker. A method includes receiving a request to verify a speaker associated with a communication to a recipient, receiving first data from the speaker in connection with the communication, accessing second data associated with the speaker to verify the speaker, determining whether a match exists between the first data and the second data to yield a determination, retrieving a communication address of the recipient, generating a notification for the recipient, wherein the notification reports on the determination and transmitting the notification to the recipient at the communication address.
-
公开(公告)号:US11990126B2
公开(公告)日:2024-05-21
申请号:US17750983
申请日:2022-05-23
Applicant: Google LLC
Inventor: Raunaq Shah , Matt Van Der Staay
IPC: G10L15/22 , G06F3/16 , G10L15/28 , G10L15/30 , H04M1/27 , H04M3/493 , H04N21/20 , H04N21/239 , H04N21/40 , H04N21/41 , H04N21/4147 , H04N21/422 , H04N21/47 , H04N21/4722 , H04N21/45 , H04N21/475
CPC classification number: G10L15/22 , G06F3/167 , G10L15/28 , G10L15/30 , H04M1/271 , H04M3/493 , H04N21/20 , H04N21/2393 , H04N21/40 , H04N21/4104 , H04N21/4112 , H04N21/4147 , H04N21/42203 , H04N21/42204 , H04N21/47 , H04N21/4722 , G10L2015/223 , H04N21/42206 , H04N21/4532 , H04N21/4751
Abstract: A method is implemented to move media content display between two media output devices. A server system determines in a voice message recorded by an electronic device a media transfer request that includes a user voice command to transfer media content to a destination media output device and a user voice designation of the destination media output device. The server system then obtains from a source cast device instant media play information including information of a media play application, the media content that is being played, and a temporal position. The server system further identifies a destination cast device associated in a user domain coupled to the destination media output device, and sends to the destination cast device a media play request including the instant media play information, thereby enabling the destination cast device to execute the media play application for playing the media content from the temporal location.
-
公开(公告)号:US20240137435A1
公开(公告)日:2024-04-25
申请号:US18537386
申请日:2023-12-12
Applicant: GOOGLE TECHNOLOGY HOLDINGS LLC
Inventor: Kazuhiro Ondo , Michael P. Labowicz , Hideki Yoshino , Andrew K. Wells
CPC classification number: H04M1/6041 , G06F16/60 , G10L15/20 , G10L15/28 , H04M1/271 , H04M2250/74
Abstract: A method on a mobile device for a wireless network is described. An audio input is monitored for a trigger phrase spoken by a user of the mobile device. A command phrase spoken by the user after the trigger phrase is buffered. The command phrase corresponds to a call command and a call parameter. A set of target contacts associated with the mobile device is selected based on respective voice validation scores and respective contact confidence scores. The respective voice validation scores are based on the call parameter. The respective contact confidence scores are based on a user context associated with the user. A call to a priority contact of the set of target contacts is automatically placed if the voice validation score of the priority contact meets a validation threshold and the contact confidence score of the priority contact meets a confidence threshold.
-
公开(公告)号:US20230326462A1
公开(公告)日:2023-10-12
申请号:US18329138
申请日:2023-06-05
Applicant: Pindrop Security, Inc.
Inventor: Elie KHOURY , Matthew GARLAND
IPC: G10L17/00 , H04M1/27 , G10L17/24 , G10L15/19 , G10L17/08 , G06N7/01 , G10L15/07 , G10L15/26 , G10L17/04
CPC classification number: G10L17/00 , H04M1/271 , G10L17/24 , G10L15/19 , G10L17/08 , G06N7/01 , G10L15/07 , G10L15/26 , G10L17/04 , H04M2203/40
Abstract: Utterances of at least two speakers in a speech signal may be distinguished and the associated speaker identified by use of diarization together with automatic speech recognition of identifying words and phrases commonly in the speech signal. The diarization process clusters turns of the conversation while recognized special form phrases and entity names identify the speakers. A trained probabilistic model deduces which entity name(s) correspond to the clusters.
-
公开(公告)号:US20190222684A1
公开(公告)日:2019-07-18
申请号:US16303326
申请日:2016-05-20
Applicant: Huawei Technologies Co., Ltd.
IPC: H04M1/27 , H04M1/60 , H04M1/2745 , H04M1/725 , H04M3/493
CPC classification number: H04M1/271 , H04M1/05 , H04M1/2745 , H04M1/27455 , H04M1/6041 , H04M1/725 , H04M1/72519 , H04M1/72527 , H04M1/7253 , H04M1/72563 , H04M1/72583 , H04M3/4938 , H04M2250/02 , H04M2250/74
Abstract: An interaction method in a call and a device, where the method includes displaying a voice assistant icon on a call screen of the wearable device when a phone number of a communication peer device is a service number converting received voice information into a dual tone multi frequency (DTMF) tone when the voice assistant icon is activated, and sending the DTMF tone to the communication peer device. Therefore, in a process of making or answering a service call using the wearable device, under a specific trigger condition, voice information of a user is converted into a DTMF tone to be sent to the communication peer device. In this way, when the user needs to enter a digit or a symbol in a call process, it can be ensured that the user enters correct information, and user experience is improved.
-
8.
公开(公告)号:US20180261224A1
公开(公告)日:2018-09-13
申请号:US15647844
申请日:2017-07-12
Applicant: Jetvox Acoustic Corp.
Inventor: To-Teng HUANG , Shih-Yuan Chen
IPC: G10L15/26 , H04M1/27 , H04N21/422 , G10L15/08 , G10L15/22
CPC classification number: G10L15/265 , G08C17/02 , G08C2201/31 , G10L15/083 , G10L15/22 , G10L2015/223 , H04M1/271 , H04M1/72533 , H04M2250/74 , H04N21/4126 , H04N21/42203 , H04N21/43637
Abstract: A wireless voice-controlled system includes a wearable voice transmitting-receiving device including a voice-receiving unit, a first wireless transmitting-receiving unit, and a first processor and a controlled electrical device including a second wireless transmitting-receiving unit and a second processor. The voice-receiving unit receives a voice instruction and converts the voice instruction into an audio signal. The first wireless transmitting-receiving unit receives the audio signal, wirelessly transmits the audio signal out, and receives a text signal corresponding to the audio signal. The first processor receives the text signal, generates a control signal according to the text signal, and wirelessly transmits the control signal to the first wireless transmitting-receiving unit. The second wireless transmitting-receiving unit is in wireless communication with the first wireless transmitting-receiving unit and wirelessly receives the control signal. The second processor receives the control signal and performs an operation according to the control signal.
-
9.
公开(公告)号:US20180197550A1
公开(公告)日:2018-07-12
申请号:US15917624
申请日:2018-03-10
Applicant: Shyh-Jye Wang , Chi-Ping Chung
Inventor: Shyh-Jye Wang , Chi-Ping Chung
CPC classification number: G10L17/22 , G10L15/005 , G10L15/08 , G10L15/22 , H04M1/271 , H04M1/6075 , H04M1/72519 , H04M1/72552 , H04M2201/40 , H04M2250/74 , H04W52/0254 , H04W52/028 , Y02D70/00 , Y02D70/142 , Y02D70/144 , Y02D70/146 , Y02D70/164 , Y02D70/26
Abstract: The embodiments provided herein are directed to a system and method of message-triggered voice command interface in portable electronic devices. The voice command interface is normally not activated until a message (e.g., an e-mail, a text message, or a voice mail) has been received by a portable electronic device. The arriving of a message is used to trigger the voice command interface by activating one or more speech recognition routines in a predetermined time period corresponding to the one or more speech recognition routines. The voice command interface come to an end when the predetermined time period expires or the user has no further commands.
-
公开(公告)号:US09967382B2
公开(公告)日:2018-05-08
申请号:US15392323
申请日:2016-12-28
Applicant: Amazon Technologies, Inc.
Inventor: Gregory Michael Hart , Brian Oliver , Adrian Hurditch , Nawdesh Uppal , Reza Abdollahi
CPC classification number: H04M1/271 , G05D23/1919 , H04M1/2535 , H04M1/6033 , H04M11/045 , H04M2201/40 , H04M2250/74 , H04W84/042
Abstract: A system capable of connecting a home telephone circuit connected to a Public Switched Telephone Network (PSTN) to a server via a data network using an adapter. The system may enable a telephone connected to the home telephone circuit to perform voice commands by sending audio data from the telephone to the server via the data network and the server determining the voice commands included in the audio data. Based on the voice command, the server may send an instruction to the adapter via the data network, the instruction causing the adapter to initiate a telephone call over the PSTN. Additionally, the server may send an instruction to any device associated with a user profile corresponding to the adapter. Thus, the system may enable the telephone to control a number of devices within the home using the voice commands.
-
-
-
-
-
-
-
-
-