Abstract:
A wavetable speech synthesis apparatus includes a wavetable memory for defining a plurality of primitive speech sounds. The primitive speech elements are individually assigned to a memory cell designated by an instrument identification in the wavetable memory. Various primitive speech elements are defined and selected from among sound bites, entire words and phrases, frequently-occurring syllables, phonemes or smaller atomic speech elements. The primitive speech elements generate primitive sounds that are played back at a selected pitch, duration, attack velocity and envelope, sustain, and decay velocity and envelope. Various types of speaker qualities or identities are assigned to different frequency ranges of the speech elements. The wavetable memory includes a speech sample database and a speech reference database. The speech sample database supplies speech signals that are processed by the wavetable synthesizer according to information contained in the speech reference database. Reference information in the speech reference database includes various dictionaries, context lists, algorithms, and heuristic rules for guiding decisions relating to selection of primitive speech element, duration, volume and other parameters. The dictionaries store of sampled words and phonics and an encoding designating the pronunciation of the words and phonics. The context lists encode emphasis, lift and emotion that are expressed using variations in volume and addition of vibrato and tremolo.
Abstract:
An audio data format in which an instrument is described using a combination of sound samples and articulation instructions which determine modifications made to the sound sample is provided. The instruments form a first, initial layer, with a second layer having presets which can user defined to provide additional articulation instructions which can modify the articulation instructions at the instrument level. The articulation instructions are specified using various parameters. The present invention provides a format in which all of the parameters are specified in units which relate to a physical phenomena, and thus are not tied to any particular machine for creating or playing the audio samples. The articulation parameters include generators and modulators, which provide a connection between a real-time signal and a generator. The parameter units are specified in perceptually additive units, to make the data portable and easily edited. New units are defined to give perceptual additive parameters throughout.
Abstract:
A sound effect-creating device for imparting a sound effect to a musical tone produced by performance modulates an analog or digital signal indicative of the musical tone at a predetermined modulation frequency to impart a modulation (vibration) effect to the musical tone. The predetermined modulation frequency is set based on the repetition period of a timing clock of a MIDI signal received from an external electronic musical instrument. Alternatively, the repetition period of a timing clock of a MIDI signal transmitted to an external electronic musical instrument is determined based on a modulation parameter used in setting the predetermined modulation frequency.
Abstract:
In an electronic musical instrument which is arranged to produce musical tones belonging to the selected one of many different types of musical instruments and having a number of player-operable selection keys respectively corresponding to the number of types of musical instrument sounds desired, the improvement wherein means is provided for generating a sample tone belonging to that type of musical instrument specified by one of the above-mentioned selection keys at a prescribed pitch and period simply by depressing only once a particular selection key, without taking the trouble of successively depressing individual performance keys, thereby facilitating the selection of a desired type of musical instrument by the player.
Abstract:
An audio signal processing system for enhancing audio signals such as music by providing reverberation, generation of stereophonic or quadraphonic effects, chorusing, flanging, pitch change, vibrato and various combinations of these effects. An audio signal is passed through a compressor, to increase its signal-to-noise ratio,and then through a delay circuit. The delayed audio signal is passed through an expandor to restore the original input level. Preferably, before being applied to the compressor the audio signal passes through a pre-emphasis circuit and, after passing through the expandor, through a de-emphasis circuit, thereby emphasizing desired frequencies. In some embodiments, the processed output signal is blended with the received audio signal to provide a blended audio signal, and in some embodiments the processed audio signal is fed back to the system input for recirculation, providing reverberation. The delay circuit is preferably an analog shift register.
Abstract:
A delayed vibrato signal generating arrangement comprises an envelope signal generator having a capacitor, a switching device for discharging the capacitor at the instant of key depression, and two charging circuits providing different charging time constants for charging the capacitor to produce an envelope signal; a semiconductor unidirectional device coupled to the envelope signal generator; and a vibrato signal generator coupled to the unidirectional device. The capacitor is relatively rapidly charged to a predetermined potential level by both the two charging circuits and then gradually charged by only one of the charging circuits. The signal portion of the envelope signal substantially below the predetermined potential level is clipped off by the unidirectional device to provide a time delay for delayed vibrator. The time delay is virtually determined by the time constant for charging the capacitor to the predetermined potential level.
Abstract:
A delayed vibrato signal generating arrangement in which an envelope signal which changes from a first potential level to a second potential level at the instant of key depression, and thereafter varies gradually from the second potential level to the first potential level is generated by an envelope signal generator, the peak portion of such envelope signal is clipped off by a clipper, and then a vibrato signal with a gradually increasing amplitude is generated by a vibrato signal generator driven by the output of the clipper. The vibrato signal generator is so arranged as not to operate at the clipped level of the output of the clipper, thereby giving a delay time for delayed vibrato.
Abstract:
In an audio signal transmitting system having an audio signal transferring device and a loudspeaker to reproduce audio signals, the system includes a vibrato device consisting of a charge transfer device (CTD) connected to the transferring device, a clock pulse oscillator to produce clock pulses and supply the clock pulses to the CTD, and an oscillator to supply low frequency signals. The clock pulse oscillator is controlled in its frequency in response to the low frequency signals, so that delayed and frequency-modulated audio signals are reproduced by the loudspeaker.
Abstract:
An electronic musical instrument includes an operation unit that receives a user performance; and at least one processor. wherein the at least one processor performs the following: in accordance with a user operation specifying a chord on the operation unit, obtaining lyric data of a lyric and obtaining a plurality of pieces of waveform data respectively corresponding to a plurality of pitches indicated by the specified chord; inputting the obtained lyric data to a trained model that has been trained and learned singing voices of a singer so as to cause the trained model to output acoustic feature data in response thereto; synthesizing each of the plurality of pieces of waveform data with the acoustic feature data so as to generate a plurality of pieces of synthesized waveform data; and outputting a polyphonic synthesized singing voice based on the generated plurality of pieces of synthesized waveform data.
Abstract:
Embodiments of the present disclosure can provide systems, methods, and computer-readable medium for implementing user interfaces for interacting with a virtual instrument. For example, first touch input indicating a string location of a plurality of string locations within the note selection area. Audio output corresponding to the sting location may be presented on a speaker based at least in part on the first touch input. Second touch input corresponding to an ornamental interface element of the user interface may be received. In response to the first and second touch input, a series of two or more audio outputs may be presented on the speaker according to a predetermined pattern.