摘要:
Embodiments of the invention relate to a method for video retrieval by providing a first audio video file, determining a first identifier of a first piece of music in the first audio video file, looking up for the first identifier first meta data in a music database, in which said first meta data are associated with said first identifier, providing second meta data of a second piece of music from the music database. The second piece of music is included in a second audio video file. Further the method is conducted by determining a similarity measure by comparing the first and second meta data, and providing the second audio video file or an identifier thereof depending on the similarity measure. Further embodiments relate to a server, a user device and a system and a computer program product for video retrieval.
摘要:
To improve the performance and the recognition rate of a method for recognizing speech in a dialogue system, or the like, it is suggested to derive emotion information data (EID) from speech input (SI) being descriptive for an emotional state of a speaker or a change thereof based upon which a process of recognition is chosen and/or designed.
摘要:
A user profile and/or the suggestions computed based thereon are obtained taking a special set of user features into account. The user features are defined to represent a typical general behaviour of an individual user in respect to the application where the user profile is used. In other words, for each application where a user profile is used a special set of user features are defined which are able to represent a typical general behaviour of an individual user. Based on these user features the weights in the list of word-weight pairs or weighted keywords which represents the user profile are computed or influenced during the creation of the user profile, and/or a mufti-user profile is split during the creation of an individual user profile from a mufti-user profile, and/or during specification of a suggestion a user history which is used to create the user profile, and/or the user profile, and/or the suggestion results are filtered.
摘要:
Method for providing an overview of pieces of music, comprising: providing at least two pieces of music; determining at least two sections of said pieces of music, wherein one of said sections is determined from one of said pieces of music and another of said sections is determined from another of said pieces of music; and arranging said sections in a sequence.
摘要:
A method for classifying music includes providing music classification data, providing an unclassified piece of music to be classified, and deriving for each music class within the music classification data a respective Gish distance value. A finite set of a finite number of Gish distance values is descriptive for the relation of the unclassified piece of music to be classified with respect to a discrete and finite set of a finite number of music classes. Alternatively, for a given piece of music to be classified, music classification data of a n-tuple of at least three numbers are obtained, which are representative for the mood of the piece of music. From the n-tuple of numbers of the music classification data a pair of two dimensional coordinate values are determined, which are representative for the mood of the piece of music.
摘要:
To reduce the error rate when classifying emotions from an acoustical speech input (SI) only, it is suggested to include a process of speaker identification to obtain certain speaker identification data (SID) on the basis of which the process of recognizing an emotional state is adapted and/or configured. In particular, speaker-specific feature extractors (FE) and/or emotion classifiers (EC) are selected based on said speaker identification data (SID).
摘要:
A user profile and/or the suggestions computed based thereon are obtained taking a special set of user features into account. The user features are defined to represent a typical general behaviour of an individual user in respect to the application where the user profile is used. In other words, for each application where a user profile is used a special set of user features are defined which are able to represent a typical general behaviour of an individual user. Based on these user features the weights in the list of word-weight pairs or weighted keywords which represents the user profile are computed or influenced during the creation of the user profile, and/or a multi-user profile is split during the creation of an individual user profile from a multi-user profile, and/or during specification of a suggestion a user history which is used to create the user profile, and/or the user profile, and/or the suggestion results are filtered.
摘要:
An apparatus for classifying audio signals comprises audio signal clipping means for partitioning audio signals into audio clips, and class discrimination means for discriminating the audio clips provided by the audio signal clipping means into predetermined audio classes based on predetermined audio class classifying rules, by analysing acoustic characteristics of the audio signals comprised in the audio clips, wherein a predetermined audio class classifying rule is provided for each audio class, and each audio class represents a respective kind of audio signals comprised in the corresponding audio clip. The determination process to find acceptable audio class classifying rules for each audio class according to the prior art is depending on both the used raw audio signals and the personal experience of the person conducting the determination process. Thus, the determination process usually is very difficult, time consuming and subjective. Furthermore, there is a high risk that not all possible peculiarities of the different programmes and the different categories the audio signal can belong to is sufficiently accounted for. This problem is solved in the inventive apparatus for classifying audio signals by class discrimination means calculating an audio class confidence value for each audio class assigned to an audio clip, wherein the audio class confidence value indicates the likelihood the respective audio class characterises the respective kind of audio signals comprised in the respective audio clip correctly. Furthermore, the class discrimination means use acoustic characteristics of audio clips of audio classes having a high audio class confidence value to train the respective audio class classifying rule.
摘要:
A method for determining a sentiment, including determining, from a text including formatting information related to parts of the text, a sentiment expressed by at least one of the parts, wherein the sentiment is determined automatically using a microprocessor and depends on formatting information related to the at least one of the parts.
摘要:
An auto-injector for administering a dose of a liquid medicament (M) is presented having an elongate case arranged to be held by a user, a tubular chassis telescoped in the case and biased against the case so as to protrude from the case. The injector also includes a syringe with a hollow injection needle, a drive spring and a plunger for forwarding load of the drive spring to a stopper of the syringe. A trigger button is distally arranged in or on the case, wherein the trigger button is initially coupled to the case in a manner to protrude distally from the case and interlocked to the chassis in a manner preventing the trigger button from disengaging the case.