Abstract:
A music-synchronized control platform to synchronize music playback, music-synchronized performance, or both with music across a plurality of devices. A master control section of at least one music-synchronized control master controls a master music playback player or a master music-synchronized performer by converting a timing in the reference clock time corresponding to an in-content playback timing, which is obtained from the status of a virtual music playback player, to a timing in the individual clock time corresponding to the in-content playback timing. The slave control section of each of the plurality of music-synchronized control slaves controls a slave music playback player or a slave music-synchronized performer by converting a timing in the reference clock time corresponding to the in-content playback timing, which is obtained from the status of the virtual music playback player, to a timing in the individual clock time corresponding to the in-content playback timing.
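The timing conversion described in this abstract can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration and is not taken from the abstract: the `VirtualPlayerStatus` fields, the constant playback rate, and the constant offset between the reference clock and a device's individual clock (a real system would have to estimate the offset and might also model drift).

```python
from dataclasses import dataclass


@dataclass
class VirtualPlayerStatus:
    """Hypothetical status of the virtual music playback player."""
    ref_time_at_anchor: float     # reference clock time (s) when the status was taken
    content_pos_at_anchor: float  # in-content playback position (s) at that moment
    rate: float = 1.0             # playback rate (1.0 = normal speed)


def content_to_reference(status, content_pos):
    """Reference clock time at which `content_pos` is played."""
    elapsed = (content_pos - status.content_pos_at_anchor) / status.rate
    return status.ref_time_at_anchor + elapsed


def reference_to_individual(ref_time, clock_offset):
    """Convert a reference clock time to a device's individual clock time.

    `clock_offset` is (individual clock - reference clock), assumed constant.
    """
    return ref_time + clock_offset


def schedule_on_device(status, content_pos, clock_offset):
    """Individual clock time at which a device should reach `content_pos`."""
    return reference_to_individual(content_to_reference(status, content_pos), clock_offset)


status = VirtualPlayerStatus(ref_time_at_anchor=100.0, content_pos_at_anchor=10.0)
# A device whose clock runs 3 s behind the reference clock.
local_time = schedule_on_device(status, content_pos=12.5, clock_offset=-3.0)
```

Under these assumptions the master and every slave apply the same two-step conversion, each with its own `clock_offset`, so that all devices reach the same in-content position at the same physical moment.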
Abstract:
For high-accuracy analysis and high-quality synthesis of voice sound (singing and speech), provided herein are a system and a method for estimating, from an audio signal, spectral envelopes and group delays for sound analysis and synthesis with high accuracy and high temporal resolution. An estimation system of spectral envelopes and group delays includes a fundamental frequency estimation section, an amplitude spectrum acquisition section, a group delay extraction section, a spectral envelope integration section, and a group delay integration section. The spectral envelope integration section sequentially obtains a spectral envelope for sound synthesis by averaging overlapped spectra. The group delay integration section selects, from a plurality of group delays, a group delay corresponding to the maximum envelope of each frequency component of the spectral envelope and integrates the group delays thus selected to sequentially obtain a group delay for sound synthesis.
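The two integration steps can be sketched as follows. This is a simplified illustration, not the patented procedure: it assumes the overlapped frames are already aligned on a common frequency grid, uses a plain arithmetic mean for the envelope, and picks, for each frequency bin, the group delay of the frame with the largest amplitude at that bin. The function name `integrate` and the list-of-lists layout are hypothetical.

```python
def integrate(spectra, group_delays):
    """Integrate overlapped frames into one envelope and one group delay.

    spectra[k][f]      : amplitude of overlapped frame k at frequency bin f
    group_delays[k][f] : group delay of frame k at frequency bin f
    Returns (envelope, delay), each a list with one value per frequency bin.
    """
    n_frames = len(spectra)
    n_bins = len(spectra[0])
    envelope = []
    delay = []
    for f in range(n_bins):
        column = [spectra[k][f] for k in range(n_frames)]
        # Spectral envelope: average of the overlapped spectra at this bin.
        envelope.append(sum(column) / n_frames)
        # Group delay: taken from the frame that attains the maximum at this bin.
        k_max = max(range(n_frames), key=lambda k: column[k])
        delay.append(group_delays[k_max][f])
    return envelope, delay


spectra = [[1.0, 4.0], [3.0, 2.0]]
group_delays = [[0.1, 0.2], [0.3, 0.4]]
envelope, delay = integrate(spectra, group_delays)
```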
Abstract:
A singing synthesis section for generating singing by integrating into one singing a plurality of vocals sung by a singer a plurality of times, or vocals in which the parts that he/she does not like have been sung again. A music audio signal playback section plays back the music audio signal from a signal portion or its immediately preceding signal corresponding to a character in the lyrics when the character displayed on the display screen is selected by a character selecting section. An estimation and analysis data storing section automatically aligns the lyrics with the vocal, decomposes the vocal into three elements (pitch, power, and timbre), and stores them. A data selecting section allows the user to select each of the three elements for respective time periods of phonemes. The data editing section modifies the time periods of the three elements in alignment with the modified time periods of the phonemes.
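The per-phoneme selection of the three elements can be pictured with a small data-structure sketch. The layout is an assumption for illustration only: each take is a list of per-phoneme records, and the user's selection names, for every phoneme, which take supplies each element. The function name `integrate_takes` and the record keys are hypothetical.

```python
ELEMENTS = ("pitch", "power", "timbre")


def integrate_takes(takes, selection):
    """Combine several takes into one, element by element and phoneme by phoneme.

    takes[t][p]  : dict with keys "pitch", "power", "timbre" for take t, phoneme p
    selection[p] : dict mapping each element name to the index of the chosen take
    """
    integrated = []
    for p, choice in enumerate(selection):
        integrated.append({e: takes[choice[e]][p][e] for e in ELEMENTS})
    return integrated


takes = [
    [{"pitch": "p0a", "power": "w0a", "timbre": "t0a"}],  # take 0, one phoneme
    [{"pitch": "p1a", "power": "w1a", "timbre": "t1a"}],  # take 1, one phoneme
]
# For the single phoneme: pitch from take 1, power and timbre from take 0.
selection = [{"pitch": 1, "power": 0, "timbre": 0}]
result = integrate_takes(takes, selection)
```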
Abstract:
A system for generating topic inference information of lyrics that can provide information more useful for topic interpretation of lyrics. A device for learning topic numbers performs an operation of updating and learning topic numbers, in which an operation of updating topic numbers is performed on all of a plurality of lyrics data of each of a plurality of artists a predetermined number of times. The operation of updating topic numbers updates the topic number assigned to a given lyrics data of a given artist using a random number generator having a deviation of appearance probability corresponding to a probability distribution over topic numbers. An outputting device outputs the topic numbers of the plurality of lyrics data for each of the plurality of artists, and a probability distribution over words for each of the topic numbers.
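The update loop can be sketched as a sampling procedure: for each lyrics item, draw a new topic number from a random number generator weighted by a probability distribution over topic numbers, and repeat over all lyrics of all artists a fixed number of times. The abstract does not say how that distribution is computed, so the sketch takes it from a hypothetical callback `topic_probs`; the function names and the dictionary layout are also assumptions.

```python
import random


def sample_topic(probs, rng):
    """Draw a topic number with probability proportional to probs[k]."""
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]


def learn_topic_numbers(assignments, topic_probs, n_iterations, seed=0):
    """Repeatedly resample the topic number of every lyrics item.

    assignments : {artist: [topic number per lyrics item]}, updated in place
    topic_probs : hypothetical callback (artist, index, assignments) -> weights
                  over topic numbers for that lyrics item
    """
    rng = random.Random(seed)
    for _ in range(n_iterations):
        for artist, topics in assignments.items():
            for i in range(len(topics)):
                probs = topic_probs(artist, i, assignments)
                topics[i] = sample_topic(probs, rng)
    return assignments


# Toy distribution that puts all probability mass on topic 2.
always_topic_2 = lambda artist, i, assignments: [0.0, 0.0, 1.0]
learned = learn_topic_numbers({"artist_a": [0, 1], "artist_b": [0]}, always_topic_2, 3)
```

In a real topic model the callback would depend on the current assignments of the other lyrics and on word counts, which is what makes the repeated sweeps meaningful.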
Abstract:
A system, method, and computer program for estimation of a target value, which can change the aggregation of estimation results based on the degrees of confidence, taking the nature of an input observation signal into consideration. An unknown observation signal is input to a plurality of regression models. A plurality of estimated values are respectively obtained by the plurality of regression models corresponding to the plurality of features of the unknown observation signal. A target value of the unknown observation signal is estimated by aggregation of the estimated values. The estimating section calculates weights to be applied to the estimation results output from the regression models, based on the degrees of confidence with respect to the inputs into the regression models. The target value of the unknown observation signal is then estimated through the aggregation by calculating a weighted sum of the estimation results output from the regression models.
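The aggregation step amounts to a confidence-weighted sum. The sketch below assumes the simplest possible weighting, in which each weight is the model's degree of confidence divided by the total; the abstract does not specify how weights are derived from the degrees of confidence, so this normalization and the function name `aggregate` are illustrative assumptions.

```python
def aggregate(estimates, confidences):
    """Weighted sum of per-model estimates, with weights from confidences.

    estimates   : one estimated value per regression model
    confidences : one non-negative degree of confidence per model
    """
    total = sum(confidences)
    if total <= 0:
        raise ValueError("at least one confidence must be positive")
    # Assumption: weights are the confidences normalized to sum to 1.
    weights = [c / total for c in confidences]
    return sum(w * y for w, y in zip(weights, estimates))


# Two models estimate 10.0 and 20.0; the second is three times as confident.
target = aggregate([10.0, 20.0], [1.0, 3.0])
```

Because the confidences are computed per input, the same set of regression models can be weighted differently for different observation signals.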
Abstract:
A system for multifaceted singing analysis for retrieval of songs or music including singing voices having some relationship in latent semantics with a singing voice included in one particular song or music. A topic analyzing processor uses a topic model to analyze a plurality of vocal symbolic time series obtained for a plurality of musical audio signals. The topic analyzing processor generates a vocal topic distribution for each of the musical audio signals, the vocal topic distribution being composed of a plurality of vocal topics each indicating a relationship of one of the musical audio signals with the other musical audio signals. The topic analyzing processor also generates a vocal symbol distribution for each of the vocal topics, the vocal symbol distribution indicating occurrence probabilities for the vocal symbols. A multifaceted singing analyzing processor analyzes the singing voices included in the musical audio signals from multifaceted viewpoints.