Invention Grant
- Patent Title: Systems and methods for audio processing
-
Application No.: US17038135Application Date: 2020-09-30
-
Publication No.: US11646032B2Publication Date: 2023-05-09
- Inventor: Balarajan Balasubramaniam , Prasanth Subendran , Uthayasanker Thayasivam , Ketharan Suntharam , Sarangan Janakan , Kanthasamy Jathusan , Balakrishnan Sathiyakugan
- Applicant: Medixin Inc.
- Applicant Address: CA Toronto
- Assignee: MEDIXIN INC.
- Current Assignee: MEDIXIN INC.
- Current Assignee Address: CA Kitchener
- Agency: Own Innovation
- Agent James W. Hinton
- Main IPC: G10L15/26
- IPC: G10L15/26 ; G10L15/16 ; G06N3/08 ; G16H80/00 ; G16H10/60 ; G10L15/02

Abstract:
A method of electronically documenting a conversation is provided. The method includes capturing audio of a conversation between a first speaker and a second speaker; generating conversation audio data from the captured audio; and segmenting the conversation audio data into a plurality of utterances according to a speaker segmentation technique. The method further includes, for each utterance: storing time data indicating the chronological position of the utterance in the conversation; passing the utterance to a neural network model, the neural network model configured to receive the utterance as an input and generate a feature representation of the utterance as an output; assigning the utterance feature representation to a first speaker cluster or a second speaker cluster according to a clustering technique; assigning a speaker identifier to the utterance based on the cluster assignment of the utterance; and generating a text representation of the utterance.
Public/Granted literature
- US20210272571A1 SYSTEMS AND METHODS FOR AUDIO PROCESSING Public/Granted day:2021-09-02
Information query