Patent search ap:("Snap Inc.") AND inv:"Liron Harazi" Page 1

1.

发明公开
EMOTION-BASED TEXT TO SPEECH 审中-公开

公开(公告)号：US20230252972A1

公开(公告)日：2023-08-10

申请号：US17667128

申请日：2022-02-08

Applicant: Snap Inc.

Inventor： Liron Harazi , Jackie Assa , Alan Bekker

IPC: G10L13/08 , G10L25/18 , G10L13/047 , G06F3/0482 , G10L13/033

CPC classification number: G10L13/08 , G10L25/18 , G10L13/047 , G06F3/0482 , G10L13/033

Abstract: Systems and methods are provided for providing emotion-based text to speech. The systems and methods perform operations comprising accessing a text string; storing a plurality of embeddings associated with a plurality of speakers, a first embedding for a first speaker being associated with a first emotion and a second embedding for a second speaker of the plurality of speakers being associated with a second emotion; selecting the first speaker to speak one or more words of the text string; determining that the one or more words are associated with the second emotion; generating, based on the first embedding and the second embedding, a third embedding for the first speaker associated with the second emotion; and applying the third embedding and the text string to a vocoder to generate an audio stream comprising the one or more words being spoken by the first speaker with the second emotion.

2.

发明申请
EMOTION-BASED TEXT TO SPEECH 有权

公开(公告)号：US20250029595A1

公开(公告)日：2025-01-23

申请号：US18906853

申请日：2024-10-04

Applicant: Snap Inc.

Inventor： Liron Harazi , Jacob Assa , Alan Bekker

IPC: G10L13/08 , G06F3/0482 , G10L13/033 , G10L13/047 , G10L25/18

Abstract: Systems and methods are provided for providing emotion-based text to speech. The systems and methods perform operations comprising accessing a text string; storing a plurality of embeddings associated with a plurality of speakers, a first embedding for a first speaker being associated with a first emotion and a second embedding for a second speaker of the plurality of speakers being associated with a second emotion; selecting the first speaker to speak one or more words of the text string; determining that the one or more words are associated with the second emotion; generating, based on the first embedding and the second embedding, a third embedding for the first speaker associated with the second emotion; and applying the third embedding and the text string to a vocoder to generate an audio stream comprising the one or more words being spoken by the first speaker with the second emotion.

3.

发明授权
Emotion-based text to speech 有权

公开(公告)号：US12142257B2

公开(公告)日：2024-11-12

申请号：US17667128

申请日：2022-02-08

Applicant: Snap Inc.

Inventor： Liron Harazi , Jacob Assa , Alan Bekker

IPC: G10L13/08 , G06F3/0482 , G10L13/033 , G10L13/047 , G10L25/18

Abstract: Systems and methods are provided for providing emotion-based text to speech. The systems and methods perform operations comprising accessing a text string; storing a plurality of embeddings associated with a plurality of speakers, a first embedding for a first speaker being associated with a first emotion and a second embedding for a second speaker of the plurality of speakers being associated with a second emotion; selecting the first speaker to speak one or more words of the text string; determining that the one or more words are associated with the second emotion; generating, based on the first embedding and the second embedding, a third embedding for the first speaker associated with the second emotion; and applying the third embedding and the text string to a vocoder to generate an audio stream comprising the one or more words being spoken by the first speaker with the second emotion.

Patent Agency Ranking