PROBABILISTIC GENERATION OF SPEAKER DIARIZATION DATA

Invention Application

US20250061883A1 PROBABILISTIC GENERATION OF SPEAKER DIARIZATION DATA 有权

Please log in to see more content

Patent Title: PROBABILISTIC GENERATION OF SPEAKER DIARIZATION DATA
Application No.: US18526600

Application Date: 2023-12-01
Publication No.: US20250061883A1

Publication Date: 2025-02-20
Inventor: Tae Jin PARK , He HUANG , Jagadeesh BALAM
Applicant: NVIDIA CORPORATION
Applicant Address: US CA Santa Clara
Assignee: NVIDIA CORPORATION
Current Assignee: NVIDIA CORPORATION
Current Assignee Address: US CA Santa Clara
Main IPC: G10L13/02
IPC: G10L13/02

PROBABILISTIC GENERATION OF SPEAKER DIARIZATION DATA

Abstract:

In various examples, a technique for generating a simulated multi-speaker recording includes determining a first rate at which a first speech-based attribute occurs within a first portion of the simulated multi-speaker recording. The technique also includes computing a first difference between the first rate and a first target rate for the first speech-based attribute. The technique further includes determining, based at least on the first difference, a second rate at which the first speech-based attribute is to occur within a second portion of the simulated multi-speaker recording and generating the second portion of the simulated multi-speaker recording based at least on the second rate.

Information query

Global Dossier Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L13/00	语音合成；文本-语音合成系统
G10L13/02	.产生合成语音的方法；语音合成设备