APPROACHES OF AUGMENTING OUTPUTS FROM SPEECH RECOGNITION

    公开(公告)号:US20230326453A1

    公开(公告)日:2023-10-12

    申请号:US17741992

    申请日:2022-05-11

    CPC classification number: G10L15/08 G10L17/02 G10L15/04 G10L15/22

    Abstract: Computing systems methods, and non-transitory storage media are provided for obtaining an audio stream, converting the audio stream to an intermediate representation, performing diarization on the audio stream, separating the audio stream into individual speech constructs, performing speech recognition on the individual speech constructs by mapping each of the individual speech constructs, or consecutive individual speech constructs, to entries within a dictionary, to generate a transcription of the audio stream, generating an output indicative of the transcription and a result of the diarization, transforming the output into an object-based representation, and performing one or more operations on the object-based representation

Patent Agency Ranking