-
公开(公告)号:US11004453B2
公开(公告)日:2021-05-11
申请号:US15945014
申请日:2018-04-04
发明人: Erich Adams
摘要: Techniques for avoiding wake word self-triggering are provided. In one embodiment, an electronic device can receive an audio-out signal to be output as audio via a speaker of the device and can attempt to recognize a wake word in the audio-out signal using a first recognizer. If the wake word is recognized in the audio-out signal, the electronic device can further determine whether a wake word match is made using a second recognizer with respect to a mic-in audio signal captured via a microphone of the device at approximately the same time that the audio-out signal is output via the speaker. If so, the electronic device can ignore the wake word match made using the second recognizer.
-
公开(公告)号:US10705789B2
公开(公告)日:2020-07-07
申请号:US16045560
申请日:2018-07-25
发明人: Todd F. Mozer
IPC分类号: G10L15/00 , G06F3/16 , G10L15/22 , G10L25/84 , G06Q50/26 , G10L15/08 , G10L13/00 , G10L13/10 , G10L13/033 , G10L25/21
摘要: Techniques for implementing dynamic volume adjustment by a virtual assistant are provided. In one embodiment, the virtual assistant can receive a voice query or command from a user, recognize the content of the voice query or command, process the voice query or command based on the recognized content, and determine an auditory response to be output to the user. The virtual assistant can then identify a plurality of criteria for automatically determining an output volume level for the response, where the plurality of criteria including content-based criteria and environment-based criteria, calculate values for the plurality of criteria, and combine the values to determine the output volume level. The virtual assistant can subsequently cause the auditory response to be output to the user at the determined output volume level.
-
公开(公告)号:US20190005222A1
公开(公告)日:2019-01-03
申请号:US16124121
申请日:2018-09-06
发明人: Matthew Wilder
摘要: Techniques for implementing face-controlled liveness verification are provided. In one embodiment, a computing device can present, to a user, a sequential series of targets on a graphical user interface (GUI) of the computing device, where each target is a visual element designed to direct the user's attention to a location in the GUI. The computing device can further determine whether the user has successfully hit each target, where the determining comprises tracking movement of a virtual pointer controlled by the user's gaze or face pose and checking whether the user has moved the virtual pointer over each target. If the user has successfully hit each target, the computing device can conclude that the user is a live subject.
-
公开(公告)号:US20180060552A1
公开(公告)日:2018-03-01
申请号:US15247292
申请日:2016-08-25
发明人: Bryan Pellom , Gordon Haupt , Karl Ridgeway
CPC分类号: G06F21/32 , G06F17/30743
摘要: Techniques for implementing voice-based liveness verification are provided. In one embodiment, a computing device can present a series of challenge prompts to a user being authenticated, where each challenge prompt corresponds to a request to utter a liveness passphrase that is randomly selected from a set of liveness passphrases that have been previously enrolled by an enrolled user of the computing device. The computing device can then receive utterances from the user in response to the series of challenge prompts and, if each utterance matches its corresponding enrolled liveness passphrase, can conclude that the user is a live subject.
-
公开(公告)号:US09142219B2
公开(公告)日:2015-09-22
申请号:US14280261
申请日:2014-05-16
发明人: Todd F. Mozer
CPC分类号: G10L17/22 , G10L15/22 , G10L17/00 , G10L2015/227
摘要: In one embodiment, a method includes receiving an acoustic input signal at a speech recognizer. A user is identified that is speaking based on the acoustic input signal. The method then determines speaker-specific information previously stored for the user and a set of responses based on the recognized acoustic input signal and the speaker-specific information for the user. It is determined if the response should be output and the response is outputted if it is determined the response should be output.
摘要翻译: 在一个实施例中,一种方法包括在语音识别器处接收声输入信号。 基于声输入信号识别正在说话的用户。 该方法然后基于识别的声输入信号和用户的说话者特定信息确定先前为用户存储的说话者特定信息和一组响应。 确定是否应该输出响应,如果确定应该输出响应,则确定响应是否被输出。
-
6.
公开(公告)号:US20140257812A1
公开(公告)日:2014-09-11
申请号:US14280261
申请日:2014-05-16
发明人: Todd F. Mozer
IPC分类号: G10L17/22
CPC分类号: G10L17/22 , G10L15/22 , G10L17/00 , G10L2015/227
摘要: In one embodiment, a method includes receiving an acoustic input signal at a speech recognizer. A user is identified that is speaking based on the acoustic input signal. The method then determines speaker-specific information previously stored for the user and a set of responses based on the recognized acoustic input signal and the speaker-specific information for the user. It is determined if the response should be output and the response is outputted if it is determined the response should be output.
摘要翻译: 在一个实施例中,一种方法包括在语音识别器处接收声输入信号。 基于声输入信号识别正在说话的用户。 该方法然后基于识别的声输入信号和用户的说话者特定信息确定先前为用户存储的说话者特定信息和一组响应。 确定是否应该输出响应,如果确定应该输出响应,则确定响应是否被输出。
-
公开(公告)号:US20230229803A1
公开(公告)日:2023-07-20
申请号:US17579383
申请日:2022-01-19
发明人: Todd Mozer , Pieter Vermeulen , Jonathan Welch
IPC分类号: G06F21/62 , G10L15/02 , G10L21/007 , G06T5/00 , G06V40/16 , G06V20/70 , G06T7/194 , G06V20/62 , G06K9/62
CPC分类号: G06F21/6245 , G10L15/02 , G10L21/007 , G06T5/002 , G06V40/171 , G06V20/70 , G06T7/194 , G06V20/63 , G06K9/6256 , G06K9/6253 , G06T2207/30196 , G06T2207/20081
摘要: Techniques for sanitizing personally identifiable information (PII) from audio and visual data are provided. For instance, in a scenario where the data comprises an audio signal with speech uttered by a person P, these techniques can include removing/obfuscating/transforming speech-related PII in the audio signal such as pitch and acoustic cues associated with P's vocal tract shape and/or vocal actuators (e.g., lips, nasal air bypass, teeth, tongue, etc.) while allowing the content of the speech to remain recognizable. Further, in a scenario where the data comprises a still image or video in which a person P appears, these techniques can include removing/obfuscating/transforming visual PII in the image or video such as P's biological features and indicators of P's location/belongings/data while allowing the general nature of the image or video to remain discernable. Through this PII sanitization process, the privacy of individuals portrayed in the audio or visual data can be preserved.
-
公开(公告)号:US10482230B2
公开(公告)日:2019-11-19
申请号:US16124121
申请日:2018-09-06
发明人: Matthew Wilder
摘要: Techniques for implementing face-controlled liveness verification are provided. In one embodiment, a computing device can present, to a user, a sequential series of targets on a graphical user interface (GUI) of the computing device, where each target is a visual element designed to direct the user's attention to a location in the GUI. The computing device can further determine whether the user has successfully hit each target, where the determining comprises tracking movement of a virtual pointer controlled by the user's gaze or face pose and checking whether the user has moved the virtual pointer over each target. If the user has successfully hit each target, the computing device can conclude that the user is a live subject.
-
公开(公告)号:US10248770B2
公开(公告)日:2019-04-02
申请号:US14450528
申请日:2014-08-04
IPC分类号: G06F21/32
摘要: Techniques for unobtrusively verifying the identity of a user of a computing device are provided. In one embodiment, the computing device can establish one or more verification models for verifying the user's identity, where at least a subset of the one or more verification models is based on enrollment data that is collected in an unobtrusive manner from the user. The computing device can then verify the user's identity using the one or more verification models.
-
公开(公告)号:US20170311261A1
公开(公告)日:2017-10-26
申请号:US15463805
申请日:2017-03-20
发明人: Todd F. Mozer , Bryan Pellom
CPC分类号: H04W52/0229 , G06N20/00 , H04W8/24 , Y02D70/142 , Y02D70/144 , Y02D70/164 , Y02D70/26
摘要: Smart listening modes for supporting quasi always-on listening on an electronic device are provided. In one embodiment, the electronic device can determine that a user is likely to utter a voice trigger in order to access the always-on listening functionality of the electronic device. In response to this determination, the electronic device can automatically enable the always-on listening functionality. Similarly, the electronic device can determine that a user is no longer likely to utter the voice trigger in order to access the always-on listening functionality of the electronic device. In response to this second determination, the electronic device can automatically disable the always-on listening functionality.
-
-
-
-
-
-
-
-
-