Publication No.: US20240220530A1
Publication Date: 2024-07-04
Application No.: US18089710
Filing Date: 2022-12-28
Applicant: ADOBE INC.
Inventor: Julia Lepley WILKINS, Oriol NIETO-CABALLERO, Justin SALAMON
IPC: G06F16/432, G06V20/40, G10L15/26
CPC classification number: G06F16/433, G06F16/434, G06V20/46, G10L15/26
Abstract: A sound effects system recommends sound effects using a multi-modal embedding space for projecting visuals, text, and audio. Given an input query comprising a visual (i.e., an image/video) and/or text, an encoder generates a query embedding in the multi-modal embedding space in which sound effects have been projected into sound effect embeddings. A relevant sound effect embedding in the multi-modal space is identified using the query embedding, and a recommendation is provided for a sound effect corresponding to the sound effect embedding.
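The retrieval step described in the abstract can be illustrated with a minimal sketch. It assumes the query and the sound effects have already been encoded into the same embedding space; the encoders themselves are not shown. The use of cosine similarity as the relevance measure, the function names, the file names, and the three-dimensional toy vectors are all hypothetical choices made for illustration and are not taken from the application.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors (assumed relevance measure)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def recommend_sound_effects(query_embedding, sound_effect_embeddings, top_k=1):
    """Rank sound effects by similarity of their embeddings to the query embedding."""
    scored = [
        (name, cosine_similarity(query_embedding, embedding))
        for name, embedding in sound_effect_embeddings.items()
    ]
    # Highest similarity first; keep only the top_k candidates.
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_k]


# Hypothetical, hand-written embeddings standing in for encoder outputs.
SOUND_EFFECTS = {
    "door_slam.wav": [0.9, 0.1, 0.0],
    "rain_loop.wav": [0.0, 0.2, 0.95],
    "dog_bark.wav": [0.1, 0.95, 0.1],
}

# Hypothetical query embedding, e.g. produced from an image or a text prompt.
query = [0.05, 0.9, 0.2]

top = recommend_sound_effects(query, SOUND_EFFECTS, top_k=2)
for name, score in top:
    print(f"{name}: {score:.3f}")
```

With these toy vectors the query lies closest to `dog_bark.wav`, followed by `rain_loop.wav`. A real system would likely replace the linear scan with an approximate nearest-neighbor index once the sound effect library grows large.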