Invention Application
- Patent Title: MAPPING VISUAL TAGS TO SOUND TAGS USING TEXT SIMILARITY
-
Application No.: US16399640Application Date: 2019-04-30
-
Publication No.: US20200349387A1Publication Date: 2020-11-05
- Inventor: Sudha Krishnamurthy
- Applicant: Sony Interactive Entertainment Inc.
- Main IPC: G06K9/62
- IPC: G06K9/62 ; G06F16/75 ; G06F16/783 ; G06N20/10 ; G06N3/08

Abstract:
Sound effects (SFX) are registered in a database for efficient search and retrieval. This may be accomplished by classifying SFX and using a machine learning engine to output a first of the classified SFX for a first computer simulation based on learned correlations between video attributes of the first computer simulation and the classified SFX. Subsequently, videos without sound may be processed for object, action, and caption recognition to generate video tags which are semantically matched with SFX tags to associate SFX with the video.
Public/Granted literature
- US11030479B2 Mapping visual tags to sound tags using text similarity Public/Granted day:2021-06-08
Information query