METHOD AND SYSTEM FOR CONVERSATION TRANSCRIPTION WITH METADATA

Invention Publication

US20240331702A1 METHOD AND SYSTEM FOR CONVERSATION TRANSCRIPTION WITH METADATA 审中-公开

Please log in to see more content

Patent Title: METHOD AND SYSTEM FOR CONVERSATION TRANSCRIPTION WITH METADATA
Application No.: US18743562

Application Date: 2024-06-14
Publication No.: US20240331702A1

Publication Date: 2024-10-03
Inventor: Kiersten L. BRADLEY , Ethan COEYTAUX , Ziming YIN
Applicant: SoundHound AI IP, LLC.
Applicant Address: US CA Santa Clara
Assignee: SoundHound AI IP, LLC.
Current Assignee: SoundHound AI IP, LLC.
Current Assignee Address: US CA Santa Clara
Main IPC: G10L15/26
IPC: G10L15/26 ; G06F40/134 ; G06F40/166 ; G06F40/284 ; G10L15/02 ; G10L15/06 ; G10L15/07

METHOD AND SYSTEM FOR CONVERSATION TRANSCRIPTION WITH METADATA

Abstract:

Methods and systems for enabling an efficient review of meeting content via a metadata-enriched, speaker-attributed transcript are disclosed. By incorporating speaker diarization and other metadata, the system can provide a structured and effective way to review and/or edit the transcript. One type of metadata can be image or video data to represent the meeting content. Furthermore, the present subject matter utilizes a multimodal diarization model to identify and label different speakers. The system can synchronize various sources of data, e.g., audio channel data, voice feature vectors, acoustic beamforming, image identification, and extrinsic data, to implement speaker diarization.

Public/Granted literature

US3212114A Multiple part fastener assembly machine Public/Granted day:1965-10-19

Information query

Global Dossier Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L15/00	语音识别（G10L17/00优先）
G10L15/26	.语音—正文识别系统（G10L15/08优先）