-
Publication No.: US20230156294A1
Publication Date: 2023-05-18
Application No.: US18097900
Filing Date: 2023-01-17
Applicant: Ben Avi Ingel, Ron Zass
Inventor: Ben Avi Ingel, Ron Zass
IPC Classification: H04N21/81, H04N21/475, H04N21/2668, H04N21/458, G06F40/58, G10L13/033, G10L13/08, G10L13/10, G10L13/00
CPC Classification: H04N21/8126, G06F40/58, G10L13/00, G10L13/10, G10L13/033, G10L13/086, H04N21/458, H04N21/2668, H04N21/4755
Abstract: Methods, systems, and computer-readable media for generating videos with characters indicating regions of images are provided. For example, an image containing a first region may be received. At least one characteristic of a character may be obtained. A script containing a first segment of the script may be received. The first segment of the script may be related to the first region of the image. The at least one characteristic of a character and the script may be used to generate a video of the character presenting the script and at least part of the image, where the character visually indicates the first region of the image while presenting the first segment of the script.
-
Publication No.: US20240276072A1
Publication Date: 2024-08-15
Application No.: US18643486
Filing Date: 2024-04-23
Applicant: Ben Avi Ingel, Ron Zass
Inventor: Ben Avi Ingel, Ron Zass
IPC Classification: H04N21/81, G06F40/58, G10L13/00, G10L13/033, G10L13/08, G10L13/10, H04N21/2668, H04N21/458, H04N21/475
CPC Classification: H04N21/8126, G06F40/58, G10L13/00, G10L13/033, G10L13/086, G10L13/10, H04N21/2668, H04N21/458, H04N21/4755
Abstract: Systems, methods and non-transitory computer readable media for artificially generating translated media streams are provided. A media stream of an individual speaking in an origin language may be received. The individual may be associated with a particular voice. The received media stream may be analyzed to determine characteristics of the particular voice of the individual. A synthesized voice may be determined based on the determined characteristics of the particular voice of the individual. The synthesized voice may sound substantially identical to the particular voice. A translated media stream that includes, for each word spoken in the origin language in the media stream, a respective at least one word in the target language articulated using the synthesized voice, may be generated.
-