SYSTEM (EMBODIMENTS) FOR HARMONIOUSLY COMBINING VIDEO FILES AND AUDIO FILES AND CORRESPONDING METHOD

    公开(公告)号:EP4178206A1

    公开(公告)日:2023-05-10

    申请号:EP20943661.7

    申请日:2020-08-04

    申请人: Harmix Inc.

    摘要: The invention relates to computer systems, in particular, to systems which enable to process large data sets by means of artificial intelligence technologies, and may be used to create video clips with a video and a music combined in a harmonious fashion. Variants of a system for providing a harmonious combination of video files and audio files are proposed, the system comprises: at least one server, at least one user computing device, and the at least one server further comprises an intelligent system that comprises an artificial intelligence component having instruments to learn one or more machine learning and data analysis algorithms in order to provide a harmonious combination of the video files and the audio files, the intelligent system comprises: data collection and analysis modules to learn and to operate machine learning and data analysis models; analysis modules; audio parameters and video parameters recommendation modules; audio files and video files search modules; audio files and video files generation modules; synchronization modules, wherein the video parameters are characteristics of the video file: objects, actions, a mood of the video, an activity and peaks, a frame illumination change, a change of colors, a scene change, a movement speed of a background relative to a foreground in the video file, a sequence of frames and a metadata of the video file, the audio parameters are parameters of the audio file: a genre, a tempo, an energy level, an activity and peaks, a mood, an acousticness, a rhythmicity and an instrumentality of a music, a number of sounds and noises, a digital acoustic signal and a metadata of the audio file. A method for providing a harmonious combination of video files and audio files is proposed, the method comprises the steps of: uploading at least one video file or audio file to the intelligent system for providing a harmonious combination of video files and audio files; analyzing said video file or audio file; detecting parameters of a video stream or an audio stream; predicting corresponding audio parameters or video parameters; searching for at least one audio file that comprises the predicted audio parameters or at least one video file that comprises the predicted video parameters within databases; generating at least one audio file that comprises the predicted audio parameters or at least one video file that comprises the predicted video parameters; assembling and synchronizing the audio file found within the databases or the generated audio file and the video file received from the user computing device, or assembling and synchronizing the video file found within the databases or the generated video file and the audio file received from the user computing device, returning a video clip created by the intelligent system to the user computing device.

    SPEECH INFORMATION PROCESSING METHOD AND DEVICE, STORAGE MEDIUM, AND ELECTRONIC DEVICE

    公开(公告)号:EP4006747A1

    公开(公告)日:2022-06-01

    申请号:EP20843234.4

    申请日:2020-05-22

    申请人: ZTE Corporation

    IPC分类号: G06F16/63

    摘要: Provided are a method and a device for processing voice information, a storage medium and an electronic apparatus. The method comprises: searching whether there is voice information matching first question sentence voice information inputted by a user (S202); in a case that a searching result is no, performing semantic analysis on the first question sentence voice information, and generating second question sentence voice information according to the first question sentence voice information and a semantic analysis result of the first question sentence voice information (S204); searching the second question sentence voice information, and determining, according to a searching result, voice information of a rhetorical question sentence corresponding to the second question sentence voice information (S206); and determining, according to an answer of the user to the rhetorical question sentence, question sentence voice information to be fed back to the user (S208). The present invention solves the problem in the related art of limitation in recovering omission-type semantic loss on the basis of a semantic understanding result of a knowledge graph, thereby improving the user experience.