Publication No.: US20240395231A1
Publication Date: 2024-11-28
Application No.: US18200924
Filing Date: 2023-05-23
Applicant: Lemon Inc.
Inventors: Yun-Ning HUNG, Ju-Chiang WANG, Mojtaba HEYDARI
IPC: G10H1/00
Abstract: The present disclosure describes techniques for tracking beats and downbeats of audio, such as human voices, in real time. Audio may be received in real time. The audio may be split into a sequence of segments. A sequence of audio features representing the sequence of segments of the audio may be extracted. A continuous sequence of activations indicative of probabilities of beats or downbeats occurring in the sequence of segments of the audio may be generated using a machine learning model with causal mechanisms. Timings of the beats or the downbeats occurring in the sequence of segments of the audio may be determined based on the continuous sequence of activations by fusing local rhythmic information with respect to each instant segment with information indicative of beats or downbeats in previous segments among the sequence of segments.
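The pipeline outlined in the abstract (segment the audio, extract features, produce causal activations, then fuse each local activation with earlier beat decisions) can be illustrated with a minimal sketch. This is not the method claimed in the application: the energy feature, the `CausalActivation` stand-in for the machine learning model, the `OnlineBeatDecoder` fusion rule, and all names and parameters (`hop`, `period`, `tolerance`, `threshold`) are assumptions chosen only to make the data flow concrete. Downbeats are not handled.

```python
from typing import Iterator, List, Optional, Sequence


def split_into_segments(samples: Sequence[float], hop: int) -> Iterator[Sequence[float]]:
    """Yield consecutive, non-overlapping segments of `hop` samples."""
    for start in range(0, len(samples) - hop + 1, hop):
        yield samples[start:start + hop]


def segment_energy(segment: Sequence[float]) -> float:
    """Toy audio feature (assumption): mean squared amplitude of one segment."""
    return sum(x * x for x in segment) / len(segment)


class CausalActivation:
    """Hypothetical stand-in for the causal model.

    It sees only the current and the previous feature, never future ones.
    """

    def __init__(self) -> None:
        self.prev = 0.0

    def step(self, feature: float) -> float:
        rise = max(0.0, feature - self.prev)  # keep only increases in energy
        self.prev = feature
        return rise / (1.0 + rise)  # squash into [0, 1)


class OnlineBeatDecoder:
    """Fuse the local activation with the timing of the previous beat."""

    def __init__(self, period: int, tolerance: int = 1, threshold: float = 0.3) -> None:
        self.period = period        # assumed beat period, in segments
        self.tolerance = tolerance  # allowed deviation, in segments
        self.threshold = threshold  # minimum activation to consider a beat
        self.last_beat: Optional[int] = None
        self.index = -1

    def step(self, activation: float) -> bool:
        self.index += 1
        if activation < self.threshold:
            return False  # local evidence too weak
        if self.last_beat is not None:
            gap = self.index - self.last_beat
            offset = gap % self.period
            phase_error = min(offset, self.period - offset)
            # Reject peaks that arrive too early or off the expected grid.
            if gap < self.period - self.tolerance or phase_error > self.tolerance:
                return False
        self.last_beat = self.index
        return True


def track_beats(samples: Sequence[float], hop: int, period: int) -> List[int]:
    """Return the indices of segments judged to contain a beat."""
    model = CausalActivation()
    decoder = OnlineBeatDecoder(period)
    beats = []
    for i, segment in enumerate(split_into_segments(samples, hop)):
        activation = model.step(segment_energy(segment))
        if decoder.step(activation):
            beats.append(i)
    return beats
```

Each segment is processed as it arrives and the decision for segment `i` depends only on segments up to `i`, which is what makes the sketch usable in a streaming setting. A segment index converts to a time in seconds as `index * hop / sample_rate`.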