SIGNAL PROCESSING DEVICE, SIGNAL PROCESSING METHOD, AND SIGNAL PROCESSING PROGRAM

    公开(公告)号:US20240321273A1

    公开(公告)日:2024-09-26

    申请号:US18575327

    申请日:2021-07-02

    IPC分类号: G10L15/22 G10L25/51 G10L25/87

    CPC分类号: G10L15/22 G10L25/51 G10L25/87

    摘要: A signal processing device includes circuitry configured to receive, together with a voice recognition result of an utterance section of an utterance input to each of a plurality of microphones, an input of time information of a start time and an end time of each utterance and information regarding an appearance time of each word in the voice recognition result; detect whether there is an overlap in time of utterance sections in each pair of voice recognition results by combining voice recognition results of two utterances from utterance sections of utterances input to each of the microphones; calculate a similarity of voice recognition result for each pair having an overlap in time of utterance sections; compare the similarity with a predetermined threshold; and reject an utterance having a shorter length of the voice recognition result as a wraparound utterance for a pair in which the similarity exceeds the threshold.

    Voice filtering other speakers from calls and audio messages

    公开(公告)号:US12087297B2

    公开(公告)日:2024-09-10

    申请号:US17930822

    申请日:2022-09-09

    申请人: Google LLC

    摘要: A method includes receiving a first instance of raw audio data corresponding to a voice-based command and receiving a second instance of the raw audio data corresponding to an utterance of audible contents for an audio-based communication spoken by a user. When a voice filtering recognition routine determines to activate voice filtering for at least the voice of the user, the method also includes obtaining a respective speaker embedding of the user and processing, using the respective speaker embedding, the second instance of the raw audio data to generate enhanced audio data for the audio-based communication that isolates the utterance of the audible contents spoken by the user and excludes at least a portion of the one or more additional sounds that are not spoken by the user The method also includes executing.

    Method for controlling ambient sound and electronic device therefor

    公开(公告)号:US12033628B2

    公开(公告)日:2024-07-09

    申请号:US17545257

    申请日:2021-12-08

    IPC分类号: G10L15/22 G10L25/87 H04R3/00

    摘要: A wireless audio device is provided. The wireless audio device includes an audio receiving circuit, an audio output circuit, an acceleration sensor, a communication circuit, a processor, and a memory. The memory may store instructions that, when executed by the processor, cause the wireless audio device to detect an utterance of a user of the wireless audio device by using the acceleration sensor, enter a dialog mode in which at least some of ambient sounds received by the audio receiving circuit are output through the audio output circuit, in response to detecting the utterance of the user, and end the dialog mode if no voice is detected for a specified time or longer by using the audio receiving circuit in the dialog mode.

    Method for real-time redaction of sensitive information from audio stream

    公开(公告)号:US11875819B2

    公开(公告)日:2024-01-16

    申请号:US17447628

    申请日:2021-09-14

    发明人: Ravi Kappagantu

    摘要: A method for redacting sensitive information from an audio stream, such as a voice signal in a telephone call, in real time is provided. The method includes: receiving an audio stream; conveying the audio stream through a channel that includes a valve; detecting, from within the audio stream, a first event that indicates an onset of sensitive information; closing the valve so that the conveying of the audio stream through the channel is temporarily stopped; detecting, from within the audio stream, a second event that indicates an ending of the sensitive information; and reopening the valve so that the conveying of the audio stream through the channel is resumed. The sensitive information may include payment card industry (PCI) information, such as a card number and/or a card verification value (CVV).

    INDICATOR FOR AVOIDING SPEECH CONFLICTION IN A COMMUNICATIONS SESSION WHEN NETWORK LATENCY IS HIGH

    公开(公告)号:US20240007512A1

    公开(公告)日:2024-01-04

    申请号:US17813340

    申请日:2022-07-19

    摘要: A computing system includes first and second client computing devices accessing a communications network to establish a communications session. The first client computing device operates an audio analysis agent to determine network latency within the communications session based on communications with an audio analysis agent in the second client computing device. In response to the network latency exceeding a latency threshold, audio input from a user of the first client computing device is analyzed to determine a speaking status of the user. The audio analysis agent generates an indicator command message for the second client computing device based on the determined speaking status of the user. The second client computing device displays an indicator based on the indicator command message indicating when a user of the second client computing device can speak to avoid speech confliction with the user of said first client computing device.

    DIRECTION BASED END-POINTING FOR SPEECH RECOGNITION

    公开(公告)号:US20230395095A1

    公开(公告)日:2023-12-07

    申请号:US18182811

    申请日:2023-03-13

    IPC分类号: G10L25/87 G10L15/00

    CPC分类号: G10L25/87 G10L15/00 G10L25/78

    摘要: A speech recognition system utilizing automatic speech recognition techniques such as end-pointing techniques in conjunction with beamforming and/or signal processing to isolate speech from one or more speaking users from multiple received audio signals and to detect the beginning and/or end of the speech based at least in part on the isolation. Audio capture devices such as microphones may be arranged in a beamforming array to receive the multiple audio signals. Multiple audio sources including speech may be identified in different beams and processed.