SPEECH SYNTHESIS SYSTEM AND METHOD WITH ADJUSTABLE UTTERANCE LENGTH

    公开(公告)号:US20250149023A1

    公开(公告)日:2025-05-08

    申请号:US18390216

    申请日:2023-12-20

    Abstract: There is provided a speech synthesis system and method with an adjustable utterance length. The speech synthesis method according to an embodiment predicts a duration of each phoneme corresponding to a speech mask from the speech mask and a text to be synthesized with the speech mask, encodes the text to be synthesized and extracts a text sequence which is expressed by feature information of the text, generates a speech frame sequence by regulating a length of each phoneme of the text sequence according to the predicted duration of each phoneme corresponding to the speech mask, and synthesizes a speech from the generated speech frame sequence. Accordingly, a length of a speech to be synthesized can be freely regulated as a user desires by regulating a length of a speech mask.

    METHOD AND SYSTEM FOR ACQUIRING VISUAL EXPLANATION INFORMATION INDEPENDENT OF PURPOSE, TYPE, AND STRUCTURE OF VISUAL INTELLIGENCE MODEL

    公开(公告)号:US20250095341A1

    公开(公告)日:2025-03-20

    申请号:US18741942

    申请日:2024-06-13

    Abstract: There are provided a method and a system for acquiring visual explanation information independent of the purpose, type, and structure of a visual intelligence model. The visual explanation information acquisition system of the visual intelligence model according to an embodiment may input N transformed images which are generated by diversifying an input image to a deep learning-based visual intelligence model and may acquire outputted results, may generate attributes of the visual intelligence model from the acquired results, may derive, from losses of the visual intelligence model which are calculated from the generated attributes, basic data for generating a visual explanation map for visually explaining a result derivation rationale of the visual intelligence model, and may generate a visual explanation map from the derived basic data. Accordingly, visual explanation information may be acquired from various visual intelligence models through one system independently of the purpose, type, and structure of the visual intelligence model.

Patent Agency Ranking