ELECTRONIC DEVICE AND CONTROLLING METHOD OF ELECTRONIC DEVICE

    公开(公告)号:US20240312457A1

    公开(公告)日:2024-09-19

    申请号:US18669069

    申请日:2024-05-20

    IPC分类号: G10L15/22 G10L15/06

    CPC分类号: G10L15/22 G10L15/063

    摘要: Provided are an electronic device and a method of controlling an electronic device. The electronic device includes: a memory storing at least one instruction; and at least one processor configured to execute the at least one instruction, wherein one or more of the at least one processor is configured to: acquire a first vector corresponding to each of a plurality of sections of a voice signal by inputting the voice signal to a common encoder based on acquiring the voice signal; acquire a second vector corresponding to each of the plurality of sections and independent on a context of the voice signal by inputting the first vector into a first individual encoder; acquire a phoneme sequence corresponding to the second vector by inputting the second vector into a first decoder; acquire a third vector corresponding to at least two sections among the plurality of sections and dependent on the context of the voice signal by inputting the first vectors into a second individual encoder; acquire a sub-word sequence corresponding to the third vector by inputting the third vector into a second decoder; and acquire text information corresponding to the plurality of sections by correcting the sub-word sequence based on the phoneme sequence, through a text information acquisition module.

    VOICE RECOGNITION DEVICE AND METHOD

    公开(公告)号:US20220005481A1

    公开(公告)日:2022-01-06

    申请号:US17296806

    申请日:2019-11-22

    IPC分类号: G10L17/02 G10L25/21 G10L17/04

    摘要: The disclosure relates to an electronic apparatus for recognizing user voice and a method of recognizing, by the electronic apparatus, the user voice. According to an embodiment, the method of recognizing the user voice includes obtaining an audio signal segmented into a plurality of frame units, determining an energy component for each filter bank by applying a filter bank distributed according to a preset scale to a frequency spectrum of the audio signal segmented into the frame units, smoothing the determined energy component for each filter bank, extracting a feature vector of the audio signal based on the smoothed energy component for each filter bank, and recognizing the user voice in the audio signal by inputting the extracted feature vector to a voice recognition model.

    METHOD AND DEVICE FOR SPEECH RECOGNITION
    4.
    发明申请

    公开(公告)号:US20200234713A1

    公开(公告)日:2020-07-23

    申请号:US16750274

    申请日:2020-01-23

    摘要: Provided are an electronic device for recognizing speech of a user, and a method, performed by the electronic device, of recognizing speech. The method includes obtaining an audio signal based on a speech input based on the audio signal being input, obtaining an output value of a first automatic speech recognition (ASR) model that outputs a character string at a first level; obtaining an output value of a second ASR model that outputs a character string at a second level corresponding to the audio signal based on the output value of the first ASR model based on the audio signal being input; and recognizing the speech from the output value of the second ASR model.