Method and device for focusing sound source

    公开(公告)号:US11010124B2

    公开(公告)日:2021-05-18

    申请号:US16703768

    申请日:2019-12-04

    Abstract: Disclosed are a sound source focus method and device in which the sound source focus device, in a 5G communication environment by amplifying and outputting a sound source signal of a user's object of interest extracted from an acoustic signal included in video content by executing a loaded artificial intelligence (AI) algorithm and/or machine learning algorithm. The sound source focus method includes playing video content including a video signal including at least one moving object and the acoustic signal in which sound sources output by the object are mixed, determining the user's object of interest from the video signal, acquiring unique sound source information about the user's object of interest, extracting an actual sound source for the user's object of interest corresponding to the unique sound source information from the acoustic signal, and outputting the actual sound source extracted for the user's object of interest.

    Method for synthesized speech generation using emotion information correction and apparatus

    公开(公告)号:US11636845B2

    公开(公告)日:2023-04-25

    申请号:US16928815

    申请日:2020-07-14

    Abstract: A method includes generating first synthesized speech by using text and a first emotion vector configured for the text, extracting a second emotion vector included in the first synthesized speech, determining whether correction of the second emotion information vector is needed by comparing a loss value calculated by using the first emotion information vector and the second emotion information vector with a preconfigured threshold, re-performing speech synthesis by using a third emotion information vector generated by correcting the second emotion information vector, and outputting the generated synthesized speech, thereby configuring emotion information of speech in a more effective manner. A speech synthesis apparatus may be associated with an artificial intelligence module, drone (unmanned aerial vehicle, UAV), robot, augmented reality (AR) devices, virtual reality (VR) devices, devices related to 5G services, and the like.

    Speech synthesizer using artificial intelligence, method of operating speech synthesizer and computer-readable recording medium

    公开(公告)号:US11443732B2

    公开(公告)日:2022-09-13

    申请号:US16499816

    申请日:2019-02-15

    Abstract: A speech synthesizer includes a memory configured to store a plurality of sentences and prior information of a word classified into a minor class among a plurality of classes with respect to each sentence, and a processor configured to determine an oversampling rate of the word based on the prior information, determine the number of times of oversampling of the word using the determined oversampling rate and generate sentences including the word by the determined number of times of oversampling. The plurality of classes includes a first class corresponding to first reading break, a second class corresponding to second reading break greater than the first break and a third class corresponding to third reading break greater than the second break, and the minor class has a smallest count among the first to third classes in one sentence.

    Voice synthesis device
    9.
    发明授权

    公开(公告)号:US11120785B2

    公开(公告)日:2021-09-14

    申请号:US16547323

    申请日:2019-08-21

    Abstract: A voice synthesis device which includes a database configured to store a voice and a text corresponding to the voice and a processor configured to extract characteristic information and a tone of a first-language voice stored in the database, classify an utterance style of an utterer on basis of the extracted characteristic information, generate utterer analysis information including the utterance style and the tone, translate a text corresponding to the first-language voice into a second language, and synthesize the text, translated into the second language, in a second-language voice by using the utterer analysis information.

    Speech synthesis method and apparatus based on emotion information

    公开(公告)号:US11074904B2

    公开(公告)日:2021-07-27

    申请号:US16593161

    申请日:2019-10-04

    Abstract: A speech synthesis method and apparatus based on emotion information are disclosed. A speech synthesis method based on emotion information extracts speech synthesis target text from received data and determines whether the received data includes situation explanation information. First metadata corresponding to first emotion information is generated on the basis of the situation explanation information. When the extracted data does not include situation explanation information, second metadata corresponding to second emotion information generated on the basis of semantic analysis and context analysis is generated. One of the first metadata and the second metadata is added to the speech synthesis target text to synthesize speech corresponding to the extracted data. A speech synthesis apparatus of this disclosure may be associated with an artificial intelligence module, drone (unmanned aerial vehicle, UAV), robot, augmented reality (AR) devices, virtual reality (VR) devices, devices related to 5G services, and the like.

Patent Agency Ranking