Abstract:
A music reproduction apparatus (EM) includes: a reproduction section (RP) for reproducing user-selected music piece data; a generation section (SP1) for generating control information including music piece information identifying a music piece to be reproduced and reproduced position information indicative of the position in the music piece data being reproduced by the reproduction section; a modulation section (SP2) for outputting, on the basis of the generated control information, an audio signal of a predetermined frequency band for carrying the control information; and an output section (13) for transmitting the audio signal generated by the modulation section to the outside. An information processing apparatus (DS) includes: a storage section (STd) storing a plurality of sets of display content; a reception section (7) for receiving the audio signal; an extraction section (DM, EX) for demodulating the control information included in the received audio signal and extracting the music piece information and reproduced position information included in the demodulated control information; and a display control section (CTd) for identifying one of the sets of display content, stored in the storage section, in accordance with the extracted music piece information and displaying a part of the identified display content in accordance with the extracted reproduced position information.
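The display-side lookup described in this abstract can be illustrated with a minimal sketch. It assumes the control information has already been demodulated into a piece identifier and a position in seconds; the store layout, the identifiers, and the function name `select_part` are hypothetical and do not come from the abstract.

```python
from bisect import bisect_right

# Hypothetical store: music piece ID -> list of (start_seconds, content part),
# sorted by start time. IDs and parts are made up for illustration.
CONTENT_STORE = {
    "piece-001": [(0.0, "Intro"), (12.5, "Verse 1"), (40.0, "Chorus")],
    "piece-002": [(0.0, "Page 1"), (30.0, "Page 2")],
}


def select_part(piece_id, position_seconds):
    """Return the content part covering the reproduced position, or None."""
    parts = CONTENT_STORE.get(piece_id)
    if not parts:
        return None  # unknown music piece
    starts = [start for start, _ in parts]
    # Index of the last part whose start time is <= the reproduced position.
    index = bisect_right(starts, position_seconds) - 1
    if index < 0:
        return None  # position lies before the first part
    return parts[index][1]


print(select_part("piece-001", 15.0))  # Verse 1
```

A sorted list with binary search keeps the lookup cheap when the position information arrives repeatedly during reproduction.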
Abstract:
A method of automatically generating a digital soundtrack for playback in an environment comprising live speech audio generated by one or more persons speaking in the environment, the method executed by a processing device or devices having associated memory. The method comprises syntactically and/or semantically analysing an incoming text data stream or streams representing or corresponding to the live speech audio in portions to generate an emotional profile for each text portion of the text data stream(s) in the context of a continuous emotion model. The method further comprises generating in real-time a customised soundtrack for the live speech audio comprising music tracks that are played back in the environment in real-time with the live speech audio. Each music track is selected for playback in the soundtrack based at least partly on the determined emotional profile or profiles associated with the most recently processed portion or portions of text from the text data stream(s).
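One possible reading of the track-selection step is sketched below. It assumes the continuous emotion model is a two-dimensional valence/arousal plane and that each track carries a point in that plane; both assumptions, the catalogue, and the function names are hypothetical and not stated in the abstract.

```python
import math

# Hypothetical catalogue: track name -> (valence, arousal), each in [-1, 1].
TRACKS = {
    "calm_piano": (0.4, -0.6),
    "tense_strings": (-0.7, 0.6),
    "upbeat_pop": (0.8, 0.7),
}


def recent_profile(profiles, window=2):
    """Average the emotional profiles of the most recent text portions."""
    recent = profiles[-window:]
    if not recent:
        raise ValueError("at least one profile is required")
    valence = sum(p[0] for p in recent) / len(recent)
    arousal = sum(p[1] for p in recent) / len(recent)
    return (valence, arousal)


def select_track(profile, tracks=TRACKS):
    """Pick the track whose emotion point is closest to the profile."""
    return min(tracks, key=lambda name: math.dist(profile, tracks[name]))


profiles = [(0.0, 0.0), (1.0, 1.0), (0.5, 0.0)]
print(select_track(recent_profile(profiles)))  # upbeat_pop
```

Nearest-neighbour matching is only one way to map a profile to a track; a real system might also weigh track history or transitions.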
Abstract:
One aspect of the present invention provides a technology that can give a singer more enjoyment from the act of singing. According to one aspect of the present invention, a scoring device includes: an acquisition unit configured to acquire image data in which a singer is photographed; a detector configured to detect, from the image data acquired by the acquisition unit, a feature associated with an expression or a facial motion during singing as a facial feature of the singer; a calculator configured to calculate a score for the singing action of the singer based on the feature detected by the detector; and an output unit configured to output the score.
Abstract:
The present invention aims to carry out an automatic performance in synchronization with a video distributed by a moving image distribution server while suppressing the influence of the state of the communication path passing through that server. An automatic performance device includes: a second performance data receiving section 312 that receives performance data transmitted from a server device 20 without passing through a moving image distribution server 40, the server device 20 storing the performance data, which comprises a group of performance information from an instrument terminal 10 and date and time information indicating when the performance indicated by the performance information was performed; a synchronization signal receiving section 314 that receives a synchronization signal transmitted from the moving image distribution server 40 through a transmission path of a sound signal; and a reproduction unit 316 that reproduces the performance information of the received performance data in synchronization with the video distributed at the time the synchronization signal is distributed, at a timing corresponding to the date and time indicated by the date and time information of the performance data received by the second performance data receiving section 312 and the date and time indicated by the synchronization signal received by the synchronization signal receiving section 314.
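The timing relationship between the two timestamps can be sketched as follows. The sketch assumes the synchronization signal carries the recording date and time of the video frame currently being shown, so an event is due once the difference between its own timestamp and the synchronization timestamp has elapsed. The message strings and function names are hypothetical.

```python
from datetime import datetime, timedelta


def playback_delay(event_time, sync_time):
    """Seconds to wait after the sync signal arrives before playing the event."""
    return (event_time - sync_time).total_seconds()


def due_events(events, sync_time, window_seconds):
    """Events recorded within window_seconds after sync_time, with their delays."""
    due = []
    for event_time, message in events:
        delay = playback_delay(event_time, sync_time)
        if 0.0 <= delay < window_seconds:
            due.append((delay, message))
    return sorted(due)


base = datetime(2020, 1, 1, 12, 0, 10)
events = [
    (base - timedelta(seconds=1), "note_on C4"),        # already in the past
    (base + timedelta(milliseconds=500), "note_on E4"),  # due in 0.5 s
    (base + timedelta(seconds=2), "note_off E4"),        # outside the window
]
print(due_events(events, sync_time=base, window_seconds=2.0))
```

Because the performance data arrives on a separate path, only the small synchronization timestamp depends on the video path, which is what keeps the playback aligned with the picture.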
Abstract:
Methods and systems are provided for synchronizing audio with a corresponding textual transcription and for determining confidence values for the timing synchronization. Audio and a corresponding text (e.g., a transcript) may be synchronized in a forward and a reverse direction using speech recognition to output time-annotated, synchronized audio-lyrics data. Metrics can be computed to quantify and/or qualify a confidence of the synchronization. Based on the metrics, example embodiments describe methods for enhancing an automated synchronization process to possibly adapt Hidden Markov Models (HMMs) to the synchronized audio for use during the speech recognition. Other examples describe methods for selecting an appropriate HMM for use.
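A simple confidence metric of the kind this abstract mentions could compare the word timestamps from the forward pass with those from the reverse pass. The sketch below is an assumption about what such a metric might look like, not the metric used in the described methods; the function name and the tolerance value are hypothetical.

```python
def agreement_confidence(forward, reverse, tolerance=0.25):
    """Fraction of words whose forward and reverse timestamps agree.

    forward and reverse hold one start time in seconds per word, in the
    same word order; tolerance is the largest accepted difference.
    """
    if len(forward) != len(reverse):
        raise ValueError("both passes must cover the same words")
    if not forward:
        return 0.0
    agreeing = sum(1 for f, r in zip(forward, reverse) if abs(f - r) <= tolerance)
    return agreeing / len(forward)


forward_times = [0.5, 1.2, 2.0, 3.4]
reverse_times = [0.55, 1.25, 2.9, 3.45]
print(agreement_confidence(forward_times, reverse_times))  # 0.75
```

A low value would flag a recording whose alignment deserves a second look, for example with a different or adapted acoustic model.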
Abstract:
A method includes receiving a user selection of a musical piece; providing performance cues to a user to perform musical events on a musical instrument, wherein the performance cues are synchronized to expert performance data of the musical piece; receiving audio data corresponding to musical events performed by the user on the musical instrument; detecting fundamental frequencies associated with the user-performed musical events; determining an extent to which the user-performed musical events have been correctly or incorrectly performed; providing real-time or near real-time audio feedback and/or visual feedback indicating the extent to which the user-performed musical events have been correctly or incorrectly performed; and using the expert performance data as real-time or near real-time audio and/or video feedback by controlling an output level of the expert performance data output to the user during a session.
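The fundamental-frequency detection and correctness check can be sketched with a plain autocorrelation peak search. This is one common technique chosen for illustration; the abstract does not say which detector is used. The search range, the 50-cent tolerance, and all function names are assumptions.

```python
import math


def estimate_f0(samples, sample_rate, fmin=200.0, fmax=1000.0):
    """Estimate the fundamental frequency from the autocorrelation peak."""
    min_lag = int(sample_rate / fmax)
    max_lag = int(sample_rate / fmin)
    span = len(samples) - max_lag  # same number of terms for every lag
    if span <= 0:
        raise ValueError("not enough samples for the requested fmin")
    best_lag, best_score = min_lag, float("-inf")
    for lag in range(min_lag, max_lag + 1):
        score = sum(samples[n] * samples[n + lag] for n in range(span))
        if score > best_score:
            best_lag, best_score = lag, score
    return sample_rate / best_lag


def cents_off(played_hz, target_hz):
    """Signed pitch error in cents (100 cents = one semitone)."""
    return 1200.0 * math.log2(played_hz / target_hz)


def is_correct(played_hz, target_hz, tolerance_cents=50.0):
    """Treat a note as correct when it is within the tolerance of the target."""
    return abs(cents_off(played_hz, target_hz)) <= tolerance_cents


# Synthetic test tone: 440 Hz sine sampled at 8 kHz.
rate = 8000
tone = [math.sin(2 * math.pi * 440.0 * n / rate) for n in range(4096)]
f0 = estimate_f0(tone, rate)
print(is_correct(f0, 440.0))  # True
```

Integer lags limit the frequency resolution, so the estimate lands near, not exactly on, 440 Hz; interpolating around the peak would tighten it.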
Abstract:
In an improved method for reproducing visualizations (a, b) in synchronization with music, music (28) is received and analyzed. From the analysis of the received music (28), music (28) to be received in the future is predicted. The visualizations (a, b) are synchronized with the predicted music (30). The synchronized visualizations (a, b) are output together with the music (28) received in the future. A corresponding device (10) is also provided.
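The prediction step can be illustrated with a very small sketch that extrapolates future beat times from beats already observed. It assumes the analysis yields a list of beat timestamps and that the tempo stays roughly constant; neither detail is given in the abstract, and the function name is hypothetical.

```python
def predict_beats(beat_times, count):
    """Extrapolate future beat times from the mean interval of past beats."""
    if len(beat_times) < 2:
        raise ValueError("need at least two observed beats")
    intervals = [b - a for a, b in zip(beat_times, beat_times[1:])]
    period = sum(intervals) / len(intervals)
    last = beat_times[-1]
    return [last + period * k for k in range(1, count + 1)]


observed = [0.0, 0.5, 1.0, 1.5]  # beat times in seconds (120 BPM)
print(predict_beats(observed, 2))  # [2.0, 2.5]
```

Scheduling visual changes at the predicted times lets them coincide with the beat when it arrives, without waiting for the beat to be detected first.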