- 专利标题: Generating facial position data based on audio data
-
申请号: US16394515申请日: 2019-04-25
-
公开(公告)号: US11049308B2公开(公告)日: 2021-06-29
- 发明人: Jorge del Val Santos , Linus Gisslén , Martin Singh-Blom , Kristoffer Sjöö , Mattias Teye
- 申请人: Electronic Arts Inc.
- 申请人地址: US CA Redwood City
- 专利权人: Electronic Arts Inc.
- 当前专利权人: Electronic Arts Inc.
- 当前专利权人地址: US CA Redwood City
- 代理机构: Middleton Reutlinger
- 主分类号: G06T13/20
- IPC分类号: G06T13/20 ; G06N3/08 ; G06N20/20 ; G06T13/40
摘要:
A computer-implemented method for generating a machine-learned model to generate facial position data based on audio data comprising training a conditional variational autoencoder having an encoder and decoder. The training comprises receiving a set of training data items, each training data item comprising a facial position descriptor and an audio descriptor; processing one or more of the training data items using the encoder to obtain distribution parameters; sampling a latent vector from a latent space distribution based on the distribution parameters; processing the latent vector and the audio descriptor using the decoder to obtain a facial position output; calculating a loss value based at least in part on a comparison of the facial position output and the facial position descriptor of at least one of the one or more training data items; and updating parameters of the conditional variational autoencoder based at least in part on the calculated loss value.
公开/授权文献
- US20200302667A1 Generating Facial Position Data based on Audio Data 公开/授权日:2020-09-24
信息查询