TRACKING BEATS AND DOWNBEATS OF VOICES IN REAL TIME

    公开(公告)号:US20240395231A1

    公开(公告)日:2024-11-28

    申请号:US18200924

    申请日:2023-05-23

    Applicant: Lemon Inc.

    Abstract: The present disclosure describes techniques for tracking beats and downbeats of audio, such as human voices, in real time. Audio may be received in real time. The audio may be split into a sequence of segments. A sequence of audio features representing the sequence of segments of the audio may be extracted. A continuous sequence of activations indicative of probabilities of beats or downbeats occurring in the sequence of segments of the audio may be generated using a machine learning model with causal mechanisms. Timings of the beats or the downbeats occurring in the sequence of segments of the audio may be determined based on the continuous sequence of activations by fusing local rhythmic information with respect to each instant segment with information indicative of beats or downbeats in previous segments among the sequence of segments.

Patent Agency Ranking