EMOTION-BASED TEXT TO SPEECH
    1.
    发明公开

    公开(公告)号:US20230252972A1

    公开(公告)日:2023-08-10

    申请号:US17667128

    申请日:2022-02-08

    Applicant: Snap Inc.

    CPC classification number: G10L13/08 G10L25/18 G10L13/047 G06F3/0482 G10L13/033

    Abstract: Systems and methods are provided for providing emotion-based text to speech. The systems and methods perform operations comprising accessing a text string; storing a plurality of embeddings associated with a plurality of speakers, a first embedding for a first speaker being associated with a first emotion and a second embedding for a second speaker of the plurality of speakers being associated with a second emotion; selecting the first speaker to speak one or more words of the text string; determining that the one or more words are associated with the second emotion; generating, based on the first embedding and the second embedding, a third embedding for the first speaker associated with the second emotion; and applying the third embedding and the text string to a vocoder to generate an audio stream comprising the one or more words being spoken by the first speaker with the second emotion.

    EMOTION-BASED TEXT TO SPEECH
    2.
    发明申请

    公开(公告)号:US20250029595A1

    公开(公告)日:2025-01-23

    申请号:US18906853

    申请日:2024-10-04

    Applicant: Snap Inc.

    Abstract: Systems and methods are provided for providing emotion-based text to speech. The systems and methods perform operations comprising accessing a text string; storing a plurality of embeddings associated with a plurality of speakers, a first embedding for a first speaker being associated with a first emotion and a second embedding for a second speaker of the plurality of speakers being associated with a second emotion; selecting the first speaker to speak one or more words of the text string; determining that the one or more words are associated with the second emotion; generating, based on the first embedding and the second embedding, a third embedding for the first speaker associated with the second emotion; and applying the third embedding and the text string to a vocoder to generate an audio stream comprising the one or more words being spoken by the first speaker with the second emotion.

    Emotion-based text to speech
    3.
    发明授权

    公开(公告)号:US12142257B2

    公开(公告)日:2024-11-12

    申请号:US17667128

    申请日:2022-02-08

    Applicant: Snap Inc.

    Abstract: Systems and methods are provided for providing emotion-based text to speech. The systems and methods perform operations comprising accessing a text string; storing a plurality of embeddings associated with a plurality of speakers, a first embedding for a first speaker being associated with a first emotion and a second embedding for a second speaker of the plurality of speakers being associated with a second emotion; selecting the first speaker to speak one or more words of the text string; determining that the one or more words are associated with the second emotion; generating, based on the first embedding and the second embedding, a third embedding for the first speaker associated with the second emotion; and applying the third embedding and the text string to a vocoder to generate an audio stream comprising the one or more words being spoken by the first speaker with the second emotion.

Patent Agency Ranking