摘要:
A method of automatically generating a digital soundtrack for playback in an environment comprising live speech audio generated by one or more persons speaking in the environment, the method executed by a processing device or devices having associated memory. The method comprises syntactically and/or semantically analysing an incoming text data stream or streams representing or corresponding to the live speech audio in portions to generate an emotional profile for each text portion of the text data stream(s) in the context of a continuous emotion model. The method further comprises generating in real-time a customised soundtrack for the live speech audio comprising music tracks that are played back in the environment in real-time with the live speech audio. Each music track is selected for playback in the soundtrack based at least partly on the determined emotional profile or profiles associated with the most recently processed portion or portions of text from the text data stream(s).
摘要:
A method of automatically generating a digital soundtrack intended for synchronised playback with associated speech audio, the method executed by a processing device or devices having associated memory. The method comprises syntactically and/or semantically analysing text representing or corresponding to the speech audio at a text segment level to generate an emotional profile for each text segment in the context of a continuous emotion model. The method further comprises generating a soundtrack for the speech audio comprising one or more audio regions that are configured or selected for playback during corresponding speech regions of the speech audio, and wherein the audio configured for playback in the audio regions is based on or a function of the emotional profile of one or more of the text segments within the respective speech regions.
摘要:
A method of automatically generating a digital soundtrack intended for synchronised playback with the reading of an associated text, the method executed by a processing device or devices having associated memory. The method comprises syntactically and/or semantically analysing the text at a text segment level to generate an emotional profile for each text segment in the context of a continuous emotion model. The method further comprises generating a soundtrack for the text comprising one or more audio regions that are configured or selected for playback during corresponding text regions of the text, and wherein the audio configured for playback in the audio regions is based on or a function of the emotional profile of one or more of the text segments within the respective text regions.