Invention Publication
- Patent Title: METHOD AND SYSTEM FOR CONVERSATION TRANSCRIPTION WITH METADATA
-
Application No.: US18743562Application Date: 2024-06-14
-
Publication No.: US20240331702A1Publication Date: 2024-10-03
- Inventor: Kiersten L. BRADLEY , Ethan COEYTAUX , Ziming YIN
- Applicant: SoundHound AI IP, LLC.
- Applicant Address: US CA Santa Clara
- Assignee: SoundHound AI IP, LLC.
- Current Assignee: SoundHound AI IP, LLC.
- Current Assignee Address: US CA Santa Clara
- Main IPC: G10L15/26
- IPC: G10L15/26 ; G06F40/134 ; G06F40/166 ; G06F40/284 ; G10L15/02 ; G10L15/06 ; G10L15/07

Abstract:
Methods and systems for enabling an efficient review of meeting content via a metadata-enriched, speaker-attributed transcript are disclosed. By incorporating speaker diarization and other metadata, the system can provide a structured and effective way to review and/or edit the transcript. One type of metadata can be image or video data to represent the meeting content. Furthermore, the present subject matter utilizes a multimodal diarization model to identify and label different speakers. The system can synchronize various sources of data, e.g., audio channel data, voice feature vectors, acoustic beamforming, image identification, and extrinsic data, to implement speaker diarization.
Public/Granted literature
- US3212114A Multiple part fastener assembly machine Public/Granted day:1965-10-19
Information query