SYSTEMS AND METHODS FOR REMOTE REAL-TIME AUDIO MONITORING

    公开(公告)号:US20240257823A1

    公开(公告)日:2024-08-01

    申请号:US18521676

    申请日:2023-11-28

    申请人: MIXHalo Corp.

    摘要: A method for remotely monitoring audio signal variance in real-time by a cloud-based virtual host communicative coupled to an audio server computing device includes receiving and processing network packets that contain an audio signal. The method also includes calculating an audio signal variance based on the processed network packets containing the audio signal. The method also includes determining whether the audio signal variance is below a threshold and, in response to determining that the audio signal variance is below the threshold, generating an alert indicating that the audio signal variance is below the threshold.

    Signal separation apparatus, signal separation method and program

    公开(公告)号:US11922966B2

    公开(公告)日:2024-03-05

    申请号:US17276256

    申请日:2019-10-01

    发明人: Hiroshi Sawada

    摘要: A signal separation device for acquiring a source signal from a mixed signal observed by a plurality of sensors includes: a database that stores feature information of a clean signal; separation matrix calculation means for repeatedly performing processes of, based on a separated signal obtained by multiplication of a mixed signal converted into a time-frequency representation by a separation matrix and on the feature information stored in the database, calculating a parameter to be used for an objective function for optimizing the separation matrix, and calculating a separation matrix for minimizing the objective function using the parameter; and output means for outputting a separated signal calculated using the optimized separation matrix obtained by the separation matrix calculation means.

    Voice-controlled management of user profiles

    公开(公告)号:US11727939B2

    公开(公告)日:2023-08-15

    申请号:US17568931

    申请日:2022-01-05

    摘要: A network node in a communication network receives, from a user equipment, a cluster of audio segments. The network node calculates a first confidence measure representing a first probability that a first speaker model represents a speaker of the cluster of audio segments. The network node also calculates a second confidence measure representing a second probability that a second speaker model represents the speaker of the cluster of audio segments. In response to the first confidence measure and the second confidence measure both representing probabilities that are higher than a target probability, the network node updates a first user profile associated with the first speaker model and a second user profile associated with the second speaker model based on a user preference assigned to the cluster of audio segments.

    Adaptive diarization model and user interface

    公开(公告)号:US11710496B2

    公开(公告)日:2023-07-25

    申请号:US17596861

    申请日:2019-07-01

    申请人: Google LLC

    摘要: A computing device receives a first audio waveform representing a first utterance and a second utterance. The computing device receives identity data indicating that the first utterance corresponds to a first speaker and the second utterance corresponds to a second speaker. The computing device determines, based on the first utterance, the second utterance, and the identity data, a diarization model configured to distinguish between utterances by the first speaker and utterances by the second speaker. The computing device receives, exclusively of receiving further identity data indicating a source speaker of a third utterance, a second audio waveform representing the third utterance. The computing device determines, by way of the diarization model and independently of the further identity data of the first type, the source speaker of the third utterance. The computing device updates the diarization model based on the third utterance and the determined source speaker.