Patent search ap:("Google LLC") AND inv:"Andros Tjandra" Page 1

1.

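Invention application
Unsupervised Learning of Disentangled Speech Content and Style Representation (In force)

Publication (announcement) number: US20220189456A1

Publication (announcement) date: 2022-06-16

Application number: US17455667

Filing date: 2021-11-18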

Applicant: Google LLC

Inventor: Ruoming Pang, Andros Tjandra, Yu Zhang, Shigeki Karita

IPC: G10L13/027, G10L21/0308

Abstract: A linguistic content and speaking style disentanglement model includes a content encoder, a style encoder, and a decoder. The content encoder is configured to receive input speech as input and generate a latent representation of linguistic content for the input speech as output. The content encoder is trained to disentangle speaking style information from the latent representation of linguistic content. The style encoder is configured to receive the input speech as input and generate a latent representation of speaking style for the input speech as output. The style encoder is trained to disentangle linguistic content information from the latent representation of speaking style. The decoder is configured to generate output speech based on the latent representation of linguistic content for the input speech and the latent representation of speaking style for the same or different input speech.

2.

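Invention publication
Unsupervised Learning of Disentangled Speech Content and Style Representation (Under examination, published)

Publication (announcement) number: US20240312449A1

Publication (announcement) date: 2024-09-19

Application number: US18676743

Filing date: 2024-05-29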

Applicant: Google LLC

Inventor: Ruoming Pang, Andros Tjandra, Yu Zhang, Shigeki Karita

IPC: G10L13/027, G10L21/0308

CPC classification number: G10L13/027, G10L21/0308

Abstract: A linguistic content and speaking style disentanglement model includes a content encoder, a style encoder, and a decoder. The content encoder is configured to receive input speech as input and generate a latent representation of linguistic content for the input speech as output. The content encoder is trained to disentangle speaking style information from the latent representation of linguistic content. The style encoder is configured to receive the input speech as input and generate a latent representation of speaking style for the input speech as output. The style encoder is trained to disentangle linguistic content information from the latent representation of speaking style. The decoder is configured to generate output speech based on the latent representation of linguistic content for the input speech and the latent representation of speaking style for the same or different input speech.

3.

Invention grant
Unsupervised learning of disentangled speech content and style representation (In force)

Publication (announcement) number: US12027151B2

Publication (announcement) date: 2024-07-02

Application number: US17455667

Filing date: 2021-11-18

Applicant: Google LLC

Inventor: Ruoming Pang, Andros Tjandra, Yu Zhang, Shigeki Karita

IPC: G10L13/027, G10L21/0308

CPC classification number: G10L13/027, G10L21/0308

Abstract: A linguistic content and speaking style disentanglement model includes a content encoder, a style encoder, and a decoder. The content encoder is configured to receive input speech as input and generate a latent representation of linguistic content for the input speech as output. The content encoder is trained to disentangle speaking style information from the latent representation of linguistic content. The style encoder is configured to receive the input speech as input and generate a latent representation of speaking style for the input speech as output. The style encoder is trained to disentangle linguistic content information from the latent representation of speaking style. The decoder is configured to generate output speech based on the latent representation of linguistic content for the input speech and the latent representation of speaking style for the same or different input speech.
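
Illustrative note: the abstract shared by the three records above describes a data flow with two encoders reading the same input speech and one decoder combining their outputs. The following is a minimal sketch of that data flow only, not the patented implementation. Everything in it is an assumption made for illustration: the class and method names, the feature dimensions, the untrained random linear maps standing in for the encoders and decoder, and in particular the choice of one content latent per frame and one mean-pooled style latent per utterance. It performs no training and therefore does not actually disentangle anything.

```python
import random


def make_linear(in_dim, out_dim, rng):
    """Return a random out_dim x in_dim weight matrix (untrained placeholder)."""
    return [[rng.uniform(-1.0, 1.0) for _ in range(in_dim)] for _ in range(out_dim)]


def apply_linear(weights, vec):
    """Multiply a weight matrix by a single feature vector."""
    return [sum(w * x for w, x in zip(row, vec)) for row in weights]


class DisentanglementSketch:
    """Hypothetical data-flow sketch: content encoder, style encoder, decoder."""

    def __init__(self, feat_dim=4, content_dim=3, style_dim=2, seed=0):
        rng = random.Random(seed)
        self.w_content = make_linear(feat_dim, content_dim, rng)
        self.w_style = make_linear(feat_dim, style_dim, rng)
        self.w_decoder = make_linear(content_dim + style_dim, feat_dim, rng)

    def encode_content(self, frames):
        # Assumption: one content latent per input frame.
        return [apply_linear(self.w_content, f) for f in frames]

    def encode_style(self, frames):
        # Assumption: one style latent per utterance, mean-pooled over time.
        per_frame = [apply_linear(self.w_style, f) for f in frames]
        return [sum(col) / len(per_frame) for col in zip(*per_frame)]

    def decode(self, content, style):
        # Each output frame is computed from its content latent
        # concatenated with the utterance-level style latent.
        return [apply_linear(self.w_decoder, c + style) for c in content]


# Toy "speech": lists of 4-dimensional feature frames with made-up values.
rng = random.Random(1)
speech_a = [[rng.random() for _ in range(4)] for _ in range(5)]
speech_b = [[rng.random() for _ in range(4)] for _ in range(7)]

model = DisentanglementSketch()

# Style taken from the same input speech (reconstruction path).
reconstructed = model.decode(model.encode_content(speech_a),
                             model.encode_style(speech_a))

# Style taken from a different input speech (style-transfer path).
transferred = model.decode(model.encode_content(speech_a),
                           model.encode_style(speech_b))
```

Both calls to `decode` reuse the content latents of `speech_a`; only the source of the style latent changes, which mirrors the abstract's "same or different input speech" wording. The output length follows the content input (5 frames) regardless of how long the style input is.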
