ACCIDENTAL VOICE TRIGGER AVOIDANCE USING THERMAL DATA

    公开(公告)号:US20230031145A1

    公开(公告)日:2023-02-02

    申请号:US17388673

    申请日:2021-07-29

    发明人: Ganesh Narayanan

    摘要: Methods and systems for processing voice commands are disclosed. A voice controlled device may receive audio data comprising a voice command. Location information indicative of the source of the audio data may be determined. One or more devices may be caused to determine signals based on the location information. The one or more devices may receive thermal data in response to the signals. The thermal data may be analyzed to determine if the thermal data indicates the presence of a person at the expected location. If a person is detected, then the audio data may processed to cause the voice command to be executed.

    IMAGE CAPTURE DEVICE THAT REDUCES GAPS BETWEEN CAPTURES

    公开(公告)号:US20230017212A1

    公开(公告)日:2023-01-19

    申请号:US17953029

    申请日:2022-09-26

    申请人: GoPro, Inc.

    IPC分类号: H04N5/235 G10L15/24

    摘要: After a command to stop recording a video is received, an image capture device may buffer footage in a buffer memory. The buffer memory may be used as a post-capture cache. The footage buffered in the buffer memory may be appended to the end of previously captured footage, appended to the beginning of subsequently captured footage, and/or used to bridge two separately captured footage.

    Language model biasing modulation

    公开(公告)号:US11532299B2

    公开(公告)日:2022-12-20

    申请号:US16896779

    申请日:2020-06-09

    申请人: Google LLC

    摘要: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for modulating language model biasing. In some implementations, context data is received. A likely context associated with a user is determined based on at least a portion of the context data. One or more language model biasing parameters based at least on the likely context associated with the user is selected. A context confidence score associated with the likely context based on at least a portion of the context data is determined. One or more language model teasing parameters based at least on the context confidence score is adjusted. A baseline language model based at least on the one or more of the adjusted language model biasing parameters is biased. The baseline language model is provided for use by an automated speech recognizer (ASR).

    Speech Interaction Method, Apparatus, and System

    公开(公告)号:US20220358919A1

    公开(公告)日:2022-11-10

    申请号:US17619055

    申请日:2020-06-12

    发明人: Wenhua Xu

    IPC分类号: G10L15/22 G10L15/24

    摘要: A speech interaction method includes receiving, by a server, a first play message, where the first play message includes an identifier of first audio content corresponding to a first non-speech instruction. The server determines a first intent and first slot information that correspond to the first non-speech instruction. In response to the first play message, the server instructs a playback device to play the first audio content. The server receives a first speech instruction input by a user into the playback device, where a second intent or second slot information or both in the first speech instruction are incomplete. The server determines, based on the first intent and the first slot information, the second intent and the second slot information that correspond to the first speech instruction, and the server, based on the second intent and the second slot information, instructs the playback device to play second audio content.

    SYSTEMS AND METHODS FOR RECOGNIZING A SPEECH OF A SPEAKER

    公开(公告)号:US20220351729A1

    公开(公告)日:2022-11-03

    申请号:US17813367

    申请日:2022-07-19

    申请人: RingCentral, Inc.

    摘要: A method for recognizing speech within a received audio signal includes separating, using a computer-based neural network model, a speech from an audio signal based on a speaker's audio profile, determining a command from the speech, determining, from the audio signal, a first score reflecting a percentage of confidence in determining the command based on a frequency of using the command by the speaker, determining, from the audio signal, a second score reflecting a percentage of importance of the command, and causing the command to be executed if the first score is above a first threshold value and the second score is below a second threshold value.

    DETERMINING CONVERSATION ANALYSIS INDICATORS FOR A MULTIPARTY CONVERSATION

    公开(公告)号:US20220343911A1

    公开(公告)日:2022-10-27

    申请号:US17811868

    申请日:2022-07-11

    申请人: BetterUp, Inc.

    摘要: Technology is provided for conversation analysis. The technology includes, receiving multiple utterance representations, where each utterance representation represents a portion of a conversation performed by at least two users, and each utterance representation is associated with video data, acoustic data, and text data. The technology further includes generating a first utterance output by applying video data, acoustic data, and text data of the first utterance representation to a respective video processing part of the machine learning system to generate video, text, and acoustic-based outputs. A second utterance output is further generated for a second user. Conversation analysis indicators are generated by applying, to a sequential machine learning system the combined speaker features and a previous state of the sequential machine learning system.

    SYSTEMS AND METHODS FOR AUTOMATING VOICE COMMANDS

    公开(公告)号:US20220246147A1

    公开(公告)日:2022-08-04

    申请号:US17546838

    申请日:2021-12-09

    申请人: Rovi Guides, Inc.

    摘要: A method of detecting establishment of a voice communication between a first voice communication equipment and a second voice communication equipment and automating requests for content. The method includes analyzing the voice communication to identify a request for content, analyzing the voice communication to identify an affirmative response to the request for content, and correlating the request for content with a first user account and correlating the affirmative response with a second user account. In response to identifying the affirmative response and based upon at least one of the first user account or the second user account, identifying from a data storage, the requested content and causing the transmission of the requested content.