摘要:
An audio data segmentation apparatus for segmenting of audio data including for supplying audio data, dividing the audio data supplied into audio clips of a predetermined length, discriminating the audio clips into predetermined audio classes, the audio classes identifying a kind of audio data included in the respective audio clip and segmenting for segmenting the audio data into audio meta patterns based on a sequence of audio classes of consecutive audio clips, each meta pattern being allocated to a predetermined type of contents of the audio data. It is difficult to achieve good results with known methods for segmentation of audio data into meta patterns since the rules for the allocation of the meta patterns are dissatisfying. This problem is solved by the inventive audio data segmentation apparatus further including a program database including program data units to identify a certain kind of program, a plurality of respective audio meta patterns being allocated to each program data unit, wherein the segmenting segments the audio data into corresponding audio meta patterns on the basis of the program data units of the program database 5.
摘要:
The present invention discloses an apparatus for automatic extraction of important events in audio signals comprising: signal input means for supplying audio signals; audio signal fragmenting means for partitioning audio signals supplied by the signal input means into audio fragments of a predetermined length and for allocating a sequence of one or more audio fragments to a respective audio window; feature extracting means for analysing acoustic characteristics of the audio signals comprised in the audio fragments and for analysing acoustic characteristics of the audio signals comprised in the audio windows; and important event extraction means for extracting important events in audio signals supplied by the audio signal fragmenting means based on predetermined important event classifying rules depending on acoustic characteristics of the audio signals comprised in the audio fragments and on acoustic characteristics of the audio signals comprised in the audio windows, wherein each important event extracted by the important event extraction means comprises a discrete sequence of cohesive audio fragments corresponding to an important event included in the audio signals.
摘要:
An apparatus for automatic dissection of segmented audio signals, wherein at least one information signal for identifying programs included in said audio signals and for identifying contents included in said programs. Content detection device detects programs and contents belonging to the respective programs in the information signal. Program weighting device weights each program includes in the information signal based on the contents of the respective program detected by the content detection device. Program ranking device indentifies programmers of the same category and ranking said programs based on a weighting result for each program provided by the program weighting device.
摘要:
A user profile and/or the suggestions computed based thereon are obtained taking a special set of user features into account. The user features are defined to represent a typical general behaviour of an individual user in respect to the application where the user profile is used. In other words, for each application where a user profile is used a special set of user features are defined which are able to represent a typical general behaviour of an individual user. Based on these user features the weights in the list of word-weight pairs or weighted keywords which represents the user profile are computed or influenced during the creation of the user profile, and/or a multi-user profile is split during the creation of an individual user profile from a multi-user profile, and/or during specification of a suggestion a user history which is used to create the user profile, and/or the user profile, and/or the suggestion results are filtered.
摘要:
An apparatus for classifying audio signals comprises audio signal clipping means for partitioning audio signals into audio clips, and class discrimination means for discriminating the audio clips provided by the audio signal clipping means into predetermined audio classes based on predetermined audio class classifying rules, by analysing acoustic characteristics of the audio signals comprised in the audio clips, wherein a predetermined audio class classifying rule is provided for each audio class, and each audio class represents a respective kind of audio signals comprised in the corresponding audio clip. The determination process to find acceptable audio class classifying rules for each audio class according to the prior art is depending on both the used raw audio signals and the personal experience of the person conducting the determination process. Thus, the determination process usually is very difficult, time consuming and subjective. Furthermore, there is a high risk that not all possible peculiarities of the different programmes and the different categories the audio signal can belong to is sufficiently accounted for. This problem is solved in the inventive apparatus for classifying audio signals by class discrimination means calculating an audio class confidence value for each audio class assigned to an audio clip, wherein the audio class confidence value indicates the likelihood the respective audio class characterises the respective kind of audio signals comprised in the respective audio clip correctly. Furthermore, the class discrimination means use acoustic characteristics of audio clips of audio classes having a high audio class confidence value to train the respective audio class classifying rule.
摘要:
A user profile and/or the suggestions computed based thereon are obtained taking a special set of user features into account. The user features are defined to represent a typical general behaviour of an individual user in respect to the application where the user profile is used. In other words, for each application where a user profile is used a special set of user features are defined which are able to represent a typical general behaviour of an individual user. Based on these user features the weights in the list of word-weight pairs or weighted keywords which represents the user profile are computed or influenced during the creation of the user profile, and/or a mufti-user profile is split during the creation of an individual user profile from a mufti-user profile, and/or during specification of a suggestion a user history which is used to create the user profile, and/or the user profile, and/or the suggestion results are filtered.
摘要:
To increase the recognition rate and quality in a process of recognizing speech an approximative set of pronunciation rules (APR) for a current pronunciation (CP) of a current speaker is determined in a given pronunciation space (PS) and then applied to a current pronunciation lexicon (CL) so as to perform a speaker specific adaptation of said current lexicon (CL).
摘要:
Apparatus and method for automatic dissection of segmented audio signals According to the present invention, an apparatus for automatic dissection of segmented audio signals, wherein at least one information signal for identifying programmes included in said audio signals and for identifying contents included in said programmes is provided, comprises: content detection means for detecting programmes and contents belonging to the respective programmes in the information signal; programme weighting means for weighting each programme comprised in the information signal based on the contents of the respective programme detected by the content detection means; and programme ranking means for identifying programmes of the same category and ranking said programmes based on a weighting result for each programme provided by the programme weighting means.
摘要:
An audio data segmentation apparatus for segmenting of audio data comprises audio data input means for supplying audio data, audio data clipping means for dividing the audio data supplied by the audio data input means into audio clips of a predetermined length, class discrimination means for discriminating the audio clips supplied by the audio data clipping means into predetermined audio classes, the audio classes identifying a kind of audio data included in the respective audio clip and segmenting means for segmenting the audio data into audio meta patterns based on a sequence of audio classes of consecutive audio clips, each meta pattern being allocated to a predetermined type of contents of the audio data. It is difficult to achieve good results with known methods for segmentation of audio data into meta patterns since the rules for the allocation of the meta patterns are dissatisfying. This problem is solved by the inventive audio data segmentation apparatus further comprising a programme database comprising programme data units to identify a certain kind of programme, a plurality of respective audio meta patterns being allocated to each programme data unit, wherein the segmenting means segments the audio data into corresponding audio meta patterns on the basis of the programme data units of the programme database 5.
摘要:
Home networks low-cost digital interfaces are introduced that integrate entertainment, communication and computing electronics into consumer multimedia. Normally, these are low-cost, easy to use systems, since they allow the user to remove or add any kind of network devices with the bus being active. To improve the user interface a speech unit (2) is proposed that enables all devices (11) connected to the bus system (31) to be controlled by a single speech recognition device. The properties of this device, e.g. the vocabulary can be dynamically and actively extended by the consumer devices (11) connected to the bus system (31). The proposed technology is independent from a specific bus standard, e.g. the IEEE 1394 standard, and is well-suited for all kinds of wired wireless home networks. The speech unit (2) receives data and messages from the device. The speech unit (2) recognizes speaker-dependent commands. A Speech synthesizer synthesizes messages. A remotely controllable device (11) has access to a medium which may be a CD-ROM. The device may ask for a logical name or identifier.