Keyword detection apparatus, keyword detection method, and program

    Publication No.: US12131730B2

    Publication date: 2024-10-29

    Application No.: US17298368

    Filing date: 2019-11-19

    CPC classification number: G10L15/18 G10L15/16 G10L2015/088

    Abstract: A keyword is extracted robustly despite a voice recognition result including an error. A model storage unit 10 stores a keyword extraction model that accepts word vector representations of a plurality of words as an input and extracts and outputs a word vector representation of a word to be extracted as a keyword. A speech detection unit 11 detects a speech part from a voice signal. A voice recognition unit 12 executes voice recognition on the speech part of the voice signal and outputs a confusion network which is a voice recognition result. A word vector representation generating unit 13 generates a word vector representation including reliability of voice recognition with regard to each candidate word for each confusion set. A keyword extraction unit 14 inputs the word vector representation of the candidate word to the keyword extraction model in descending order of the reliability and obtains the word vector representation of the keyword.
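The input preparation described in this abstract can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: the toy `EMBEDDINGS` table, the `build_model_inputs` helper, and the choice to append the reliability as the last vector element are all assumptions made for the example, and the keyword extraction model itself is omitted.

```python
from typing import Dict, List, Tuple

# Hypothetical toy embeddings; a real system would use trained word vectors.
EMBEDDINGS: Dict[str, List[float]] = {
    "weather": [0.9, 0.1],
    "whether": [0.2, 0.7],
    "tokyo": [0.1, 0.9],
    "today": [0.5, 0.5],
}

# One confusion set: competing candidate words with their recognition reliability.
ConfusionSet = List[Tuple[str, float]]


def build_model_inputs(confusion_network: List[ConfusionSet]) -> List[List[float]]:
    """Return one vector per candidate: embedding plus reliability as the last element.

    Within each confusion set, candidates are ordered by descending reliability.
    """
    inputs: List[List[float]] = []
    for confusion_set in confusion_network:
        ranked = sorted(confusion_set, key=lambda c: c[1], reverse=True)
        for word, reliability in ranked:
            embedding = EMBEDDINGS.get(word, [0.0, 0.0])  # unknown words map to zeros
            inputs.append(embedding + [reliability])
    return inputs


# Assumed example: two confusion sets from a recognizer that misheard some words.
network = [
    [("whether", 0.3), ("weather", 0.7)],
    [("tokyo", 0.6), ("today", 0.4)],
]
vectors = build_model_inputs(network)
```

Carrying the reliability inside each vector lets a downstream model weigh low-confidence candidates instead of discarding them, which is the robustness the abstract claims.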

    Offline Voice Control
    8.
    Published patent application

    Publication No.: US20240347057A1

    Publication date: 2024-10-17

    Application No.: US18404254

    Filing date: 2024-01-04

    Applicant: Sonos, Inc.

    Inventor: Connor Smith

    Abstract: Example techniques relate to offline voice control. A local voice input engine may process voice inputs locally when processing voice inputs via a cloud-based voice assistant service is not possible. Some techniques involve local (on-device) voice-assisted set-up of a cloud-based voice assistant service. Further example techniques involve local voice-assisted troubleshooting of the cloud-based voice assistant service. Other techniques relate to interactions between local and cloud-based processing of voice inputs on a device that supports both local and cloud-based processing.
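The local fallback behaviour described in this abstract might look like the sketch below. It is only an illustration under stated assumptions: `route_voice_input`, the handler callables, and the use of `ConnectionError` to signal an unreachable service are hypothetical names chosen for the example, not an API from the application.

```python
from typing import Callable, Tuple


def route_voice_input(
    utterance: str,
    cloud_available: bool,
    cloud_handler: Callable[[str], str],
    local_handler: Callable[[str], str],
) -> Tuple[str, str]:
    """Prefer the cloud service; fall back to the local engine when it is unreachable."""
    if cloud_available:
        try:
            return "cloud", cloud_handler(utterance)
        except ConnectionError:
            # The connection dropped mid-request; fall through to local processing.
            pass
    return "local", local_handler(utterance)


def failing_cloud(utterance: str) -> str:
    """Stand-in for a cloud voice assistant service that cannot be reached."""
    raise ConnectionError("voice assistant service unreachable")


def local_engine(utterance: str) -> str:
    """Stand-in for the on-device voice input engine."""
    return f"handled locally: {utterance}"


result = route_voice_input("pause", True, failing_cloud, local_engine)
```

Returning the route taken alongside the response makes it easy for a caller to tell the user that a request was served offline.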

    Input detection windowing
    10.
    Granted patent

    Publication No.: US12119000B2

    Publication date: 2024-10-15

    Application No.: US18316434

    Filing date: 2023-05-12

    Applicant: Sonos, Inc.

    Abstract: A device, such as a Network Microphone Device or a playback device, detects an event associated with the device or a system comprising the device. In response, an input detection window is opened for a given time period. During the given time period the device is arranged to receive an input sound data stream representing sound detected by a microphone. The input sound data stream is analyzed for a plurality of keywords and/or a wake-word for a Voice Assistant Service (VAS) and, based on the analysis, it is determined that the input sound data stream includes voice input data comprising a keyword or a wake-word for a VAS. In response, the device takes appropriate action such as causing the media playback system to perform a command corresponding to the keyword or sending at least part of the input sound data stream to the VAS.
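The windowing logic in this abstract can be sketched as a small state holder. This is a simplified illustration with several assumptions: the class name `InputDetectionWindow`, the returned action strings, and the use of a text transcript in place of the input sound data stream are all hypothetical, and timestamps are passed in explicitly so the behaviour does not depend on a wall clock.

```python
from typing import Optional, Sequence


class InputDetectionWindow:
    """Accepts keywords or a wake-word only for a fixed period after an event."""

    def __init__(self, duration_s: float, keywords: Sequence[str], wake_word: str) -> None:
        self.duration_s = duration_s
        self.keywords = [k.lower() for k in keywords]
        self.wake_word = wake_word.lower()
        self.closes_at: Optional[float] = None  # None means the window was never opened

    def open(self, now: float) -> None:
        """Open the window in response to a detected event."""
        self.closes_at = now + self.duration_s

    def is_open(self, now: float) -> bool:
        return self.closes_at is not None and now < self.closes_at

    def handle(self, transcript: str, now: float) -> Optional[str]:
        """Return the action to take, or None if the window is closed or nothing matched."""
        if not self.is_open(now):
            return None
        text = transcript.lower()
        if self.wake_word in text:
            return "forward_to_vas"  # send the input on to the voice assistant service
        for keyword in self.keywords:
            if keyword in text:
                return f"command:{keyword}"  # perform the command locally
        return None


window = InputDetectionWindow(duration_s=5.0, keywords=["pause", "skip"], wake_word="hey assistant")
window.open(now=100.0)  # an event, such as a track change, opens the window
```

Limiting detection to a short period after a relevant event keeps the device from reacting to the same words in unrelated conversation.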
