APPROACHES OF AUGMENTING OUTPUTS FROM SPEECH RECOGNITION

    Publication No.: EP4258258A2

    Publication date: 2023-10-11

    Application No.: EP23166354.3

    Application date: 2023-04-03

    Abstract: Computing systems, methods, and non-transitory storage media are provided for obtaining an audio stream, converting the audio stream to an intermediate representation, performing diarization on the audio stream, separating the audio stream into individual speech constructs, performing speech recognition on the individual speech constructs by mapping each of the individual speech constructs, or consecutive individual speech constructs, to entries within a dictionary, to generate a transcription of the audio stream, generating an output indicative of the transcription and a result of the diarization, transforming the output into an object-based representation, and performing one or more operations on the object-based representation.

    APPROACHES OF AUGMENTING OUTPUTS FROM SPEECH RECOGNITION

    Publication No.: EP4258258A3

    Publication date: 2023-11-01

    Application No.: EP23166354.3

    Application date: 2023-04-03

    Abstract: Computing systems, methods, and non-transitory storage media are provided for obtaining an audio stream, converting the audio stream to an intermediate representation, performing diarization on the audio stream, separating the audio stream into individual speech constructs, performing speech recognition on the individual speech constructs by mapping each of the individual speech constructs, or consecutive individual speech constructs, to entries within a dictionary, to generate a transcription of the audio stream, generating an output indicative of the transcription and a result of the diarization, transforming the output into an object-based representation, and performing one or more operations on the object-based representation.
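
    Illustrative sketch (not taken from the patent text): the abstract's later steps, namely mapping single or consecutive speech constructs to dictionary entries, combining the transcription with the diarization result, transforming that output into an object-based representation, and operating on it, might look roughly like the following. Everything here is an assumption made for illustration: the phoneme-like construct labels, the tiny DICTIONARY, the greedy longest-match strategy, the Utterance class, and the by_speaker operation are all hypothetical, and audio capture and diarization are replaced by hand-written segments.

```python
from dataclasses import dataclass

# Hypothetical dictionary: runs of consecutive speech constructs
# (phoneme-like labels, assumed for illustration) mapped to words.
DICTIONARY = {
    ("HH", "AY"): "hi",
    ("DH", "EH", "R"): "there",
    ("OW", "K", "EY"): "okay",
}


@dataclass
class Utterance:
    """Assumed object-based representation of one diarized segment."""
    speaker: str
    start: float
    end: float
    text: str


def transcribe(constructs, dictionary=DICTIONARY):
    """Map constructs to words by greedy longest match (an assumed strategy)."""
    max_len = max(len(key) for key in dictionary)
    words, i = [], 0
    while i < len(constructs):
        # Try the longest run of consecutive constructs first.
        for n in range(min(max_len, len(constructs) - i), 0, -1):
            key = tuple(constructs[i:i + n])
            if key in dictionary:
                words.append(dictionary[key])
                i += n
                break
        else:
            # No entry matched at this position: emit a marker, advance by one.
            words.append("<unk>")
            i += 1
    return " ".join(words)


def to_objects(segments):
    """Combine diarization results and transcription into Utterance objects."""
    return [
        Utterance(s["speaker"], s["start"], s["end"], transcribe(s["constructs"]))
        for s in segments
    ]


def by_speaker(utterances, speaker):
    """One example operation on the object-based representation."""
    return [u for u in utterances if u.speaker == speaker]


# Hand-written stand-in for diarization output (speaker labels and times).
segments = [
    {"speaker": "S1", "start": 0.0, "end": 1.2,
     "constructs": ["HH", "AY", "DH", "EH", "R"]},
    {"speaker": "S2", "start": 1.3, "end": 2.0,
     "constructs": ["OW", "K", "EY"]},
]
utterances = to_objects(segments)
```

    Once the output is held as objects rather than flat text, operations such as filtering by speaker, searching, or summarizing per time range become ordinary data manipulation; by_speaker(utterances, "S2") is one such example.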