SPEECH PROCESSING DEVICE AND OPERATION METHOD THEREOF

    Publication No.: US20230377593A1

    Publication Date: 2023-11-23

    Application No.: US18029060

    Filing Date: 2021-09-24

    Inventor: Jungmin KIM

    Abstract: Disclosed is a speech processing device. The speech processing device comprises: a speech reception circuit configured to receive a speech signal associated with speech uttered by speakers; a speech processing circuit configured to perform sound source separation on the speech signal on the basis of a sound source position of the speech so as to generate a separated speech signal associated with the speech, and to generate a translation result for the speech by using the separated speech signal; a memory; and an output circuit configured to output the translation result for the speech, wherein the sequence in which translation results are output is determined on the basis of an utterance time point of the speech.
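
    The output ordering described in this abstract can be illustrated with a minimal sketch. It assumes each translation result carries the start time of its utterance in seconds; the `TranslationResult` type, its field names, and the seat labels are hypothetical and do not come from the patent.

```python
from dataclasses import dataclass


@dataclass
class TranslationResult:
    speaker_position: str   # hypothetical label for the sound source position
    utterance_start: float  # utterance time point, assumed to be in seconds
    text: str               # translated text


def order_for_output(results):
    """Return the results sorted by utterance time point, earliest first."""
    return sorted(results, key=lambda r: r.utterance_start)


# Results may finish translating out of order; they are output in speaking order.
results = [
    TranslationResult("seat_2", 3.4, "Nice to meet you."),
    TranslationResult("seat_1", 1.2, "Hello."),
    TranslationResult("seat_3", 2.0, "Good morning."),
]
ordered = order_for_output(results)
for r in ordered:
    print(f"{r.utterance_start:.1f}s [{r.speaker_position}] {r.text}")
```

    Sorting on the utterance time point rather than on translation completion time keeps the output in the order in which the speakers actually spoke.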

    VOICE PROCESSING APPARATUS FOR PROCESSING VOICES, VOICE PROCESSING SYSTEM, AND VOICE PROCESSING METHOD

    Publication No.: US20240257824A1

    Publication Date: 2024-08-01

    Application No.: US18564596

    Filing Date: 2022-05-20

    Inventor: Jungmin KIM

    Abstract: Disclosed is a voice processing apparatus for processing voices of a plurality of speakers. The voice processing apparatus comprises: a microphone configured to generate voice signals in response to the voices of the plurality of speakers; a communication circuit configured to transmit and receive data; a memory; and a processor, wherein the processor, on the basis of instructions stored in the memory, performs sound source separation on the voice signals on the basis of the sound source position of each of the voices, generates separated voice signals associated with each of the voices according to the sound source separation, determines output modes corresponding to the sound source positions of the voices, and uses the communication circuit to output the separated voice signals according to the determined output modes.

    VOICE PROCESSING DEVICE FOR PROCESSING VOICE SIGNAL AND VOICE PROCESSING SYSTEM COMPRISING SAME

    Publication No.: US20230325608A1

    Publication Date: 2023-10-12

    Application No.: US18022255

    Filing Date: 2021-08-18

    Inventor: Jungmin KIM

    Abstract: A voice processing device is disclosed. The voice processing device comprises: a voice data receiving circuit configured to receive input voice data associated with voices of speakers; a memory configured to store source language data; a voice data output circuit configured to output output voice data associated with the voices of the speakers; and a processor configured to generate a control command for outputting the output voice data, wherein the processor uses the input voice data to generate first speaker position data indicating a position of a first speaker among the speakers and first output voice data associated with a voice of the first speaker, reads first source language data corresponding to the first speaker position data from the memory, and transmits, to the voice data output circuit, a control command for outputting the first output voice data to a translation environment for translating a first source language indicated by the first source language data.
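
    The position-to-language lookup in this abstract can be sketched as follows. This is only an illustration under stated assumptions: the position labels, the language codes, the table contents, and the shape of the control command are all hypothetical, since the abstract does not specify them.

```python
# Hypothetical table standing in for the source language data held in memory:
# each speaker position is associated with the language spoken there.
SOURCE_LANGUAGE_BY_POSITION = {
    "P1": "ko",
    "P2": "en",
}


def build_control_command(speaker_position, output_voice_data):
    """Build a command that routes voice data to the translation
    environment for the source language registered at this position."""
    language = SOURCE_LANGUAGE_BY_POSITION.get(speaker_position)
    if language is None:
        raise KeyError(f"no source language stored for {speaker_position!r}")
    return {
        "translation_environment": f"translator:{language}",  # assumed naming
        "source_language": language,
        "voice_data": output_voice_data,
    }


command = build_control_command("P1", b"\x00\x01")
print(command["translation_environment"])
```

    Keying the lookup on position rather than on speaker identity means the device needs no speaker recognition to choose a translation path, which matches the position-based reading of the abstract.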

    VOICE PROCESSING DEVICE FOR PROCESSING VOICES OF SPEAKERS

    Publication No.: US20230260509A1

    Publication Date: 2023-08-17

    Application No.: US18022498

    Filing Date: 2021-08-23

    Inventor: Jungmin KIM

    IPC Classification: G10L15/22 G10L17/04 B60W50/08

    Abstract: Disclosed is a voice processing device. The voice processing device comprises: a voice data reception circuit configured to receive input voice data associated with the voice of a speaker; a wireless signal reception circuit configured to receive a wireless signal including a terminal ID from a speaker terminal of the speaker; a memory; and a processor configured to generate terminal location data indicating the location of the speaker terminal on the basis of the wireless signal, and to match the generated terminal location data with the terminal ID and store them in the memory, wherein the processor uses the input voice data to generate first speaker location data indicating a first location and first output voice data associated with a first voice spoken at the first location, and matches a first terminal ID corresponding to the first speaker location data with the first output voice data.
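
    One way to picture the matching step is a nearest-location lookup. The sketch below assumes 2D coordinates in metres, a hypothetical stored table of terminal locations, and a hypothetical distance threshold; the abstract does not state how the correspondence between speaker location and terminal location is actually decided.

```python
import math

# Hypothetical stored pairs of terminal ID and terminal location (metres).
terminal_locations = {
    "terminal-A": (0.0, 1.0),
    "terminal-B": (2.0, 1.0),
}


def match_terminal_id(speaker_location, terminals, max_distance=0.5):
    """Return the ID of the terminal closest to the speaker location,
    or None if no terminal lies within max_distance (assumed threshold)."""
    best_id, best_distance = None, float("inf")
    for terminal_id, location in terminals.items():
        distance = math.dist(speaker_location, location)
        if distance < best_distance:
            best_id, best_distance = terminal_id, distance
    return best_id if best_distance <= max_distance else None


# Voice localized near (1.9, 1.1) is attributed to the nearby terminal.
matched = match_terminal_id((1.9, 1.1), terminal_locations)
print(matched)
```

    Returning `None` when nothing is close enough avoids attributing a voice to a terminal that merely happens to be the least distant one.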

    ELECTRONIC DEVICE AND OPERATING METHOD FOR ELECTRONIC DEVICE

    Publication No.: US20230300533A1

    Publication Date: 2023-09-21

    Application No.: US18022974

    Filing Date: 2021-08-24

    Inventor: Jungmin KIM

    IPC Classification: H04R5/04 G10L15/26 G06T7/70

    CPC Classification: H04R5/04 G10L15/26 G06T7/70

    Abstract: An electronic device is disclosed. The electronic device comprises: an image data receiving circuit configured to receive, from a camera, input image data associated with an image captured by the camera; a voice data receiving circuit configured to receive input voice data associated with the voices of speakers; a memory configured to store transform parameters for projecting a space coordinate system onto an image coordinate system on the image; and a processor configured to determine a speaker's spatial location from the input voice data, convert the spatial location into the speaker's image location on the image, and insert, into the input image, text associated with the speaker's voice according to the image location, so as to generate output image data.
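
    The conversion from spatial location to image location can be sketched with a standard pinhole camera projection. This is an assumption made for illustration: the abstract only says that transform parameters are stored, so the choice of focal lengths (`fx`, `fy`) and principal point (`cx`, `cy`) as those parameters, and of a camera-centred coordinate system, is hypothetical.

```python
def project_to_image(point, fx, fy, cx, cy):
    """Project a 3D point in camera coordinates (x right, y down, z forward)
    onto pixel coordinates using a pinhole model."""
    x, y, z = point
    if z <= 0:
        raise ValueError("point must be in front of the camera (z > 0)")
    u = fx * x / z + cx  # horizontal pixel coordinate
    v = fy * y / z + cy  # vertical pixel coordinate
    return (u, v)


# A speaker localized 2 m in front of the camera, slightly right and above centre.
speaker_location = (0.5, -0.25, 2.0)
image_location = project_to_image(speaker_location, fx=800, fy=800, cx=640, cy=360)
print(image_location)
```

    The resulting pixel coordinates could then serve as the anchor at which the text for that speaker is drawn on the frame.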

    DEVICE FOR PROCESSING VOICE AND OPERATION METHOD THEREOF

    Publication No.: US20230290355A1

    Publication Date: 2023-09-14

    Application No.: US18015472

    Filing Date: 2021-07-09

    Inventor: Jungmin KIM

    IPC Classification: G10L17/06 G10L25/78 G10L15/00

    Abstract: Disclosed is a voice processing device. The voice processing device comprises a memory and a processor configured to perform sound source isolation on voice signals associated with the voices of speakers on the basis of the sound source positions of the respective voices. The processor is configured to: generate sound source position information indicating the sound source positions of the respective voices using the voice signals associated with the voices; generate isolated voice signals associated with the voices of the respective speakers from the voice signals on the basis of the sound source position information; and match the isolated voice signals with the sound source position information and store them in the memory.

    MOBILE TERMINAL CAPABLE OF PROCESSING VOICE AND OPERATION METHOD THEREFOR

    Publication No.: US20230377594A1

    Publication Date: 2023-11-23

    Application No.: US18034626

    Filing Date: 2021-10-27

    Inventor: Jungmin KIM

    Abstract: A mobile terminal is disclosed. The mobile terminal comprises: a microphone configured to generate a voice signal in response to voices of speakers; a processor configured to generate a separated voice signal associated with each of the voices by performing sound source separation on the voice signal on the basis of a sound source location of each of the voices, and to output a translation result for each of the voices on the basis of the separated voice signal; and a memory configured to store source language information indicating source languages, which are the languages in which the speakers' voices are uttered. The processor outputs translation results in which the languages of the voices of the speakers have been translated from the source languages into a target language, on the basis of the source language information and the separated voice signal.

    VOICE PROCESSING DEVICE AND OPERATING METHOD THEREFOR

    Publication No.: US20230377592A1

    Publication Date: 2023-11-23

    Application No.: US18028175

    Filing Date: 2021-09-24

    Inventor: Jungmin KIM

    Abstract: A voice processing device is disclosed. The voice processing device comprises: a voice processing circuit configured to generate, by sound source isolation, an isolated voice signal associated with each of the voices spoken at a plurality of sound source locations in a vehicle, and to output an interpretation result for the respective voices on the basis of the isolated voice signal; a memory configured to store source language information indicating a source language and target language information indicating a target language in order to interpret the voice associated with the isolated voice signal; and a communication circuit configured to output the interpretation result, wherein the voice processing circuit generates the interpretation result in which the language of the voice corresponding to the isolated voice signal is interpreted from the source language into the target language.