Patent search ap:("NVIDIA CORPORATION") AND inv:"Jagadeesh BALAM" Page 1

1.

发明申请
PROBABILISTIC GENERATION OF SPEAKER DIARIZATION DATA 有权

公开(公告)号：US20250061883A1

公开(公告)日：2025-02-20

申请号：US18526600

申请日：2023-12-01

Applicant: NVIDIA CORPORATION

Inventor： Tae Jin PARK , He HUANG , Jagadeesh BALAM

IPC: G10L13/02

Abstract: In various examples, a technique for generating a simulated multi-speaker recording includes determining a first rate at which a first speech-based attribute occurs within a first portion of the simulated multi-speaker recording. The technique also includes computing a first difference between the first rate and a first target rate for the first speech-based attribute. The technique further includes determining, based at least on the first difference, a second rate at which the first speech-based attribute is to occur within a second portion of the simulated multi-speaker recording and generating the second portion of the simulated multi-speaker recording based at least on the second rate.

Patent Agency Ranking