Abstract:
An audio processing device, including a feature calculation unit, a boundary calculation unit, and a judgment unit, detects points of change in audio features from an audio signal in AV content. The feature calculation unit calculates, for each unit section of the audio signal, section feature data expressing features of the audio signal in the unit section. The boundary calculation unit calculates, for each target unit section among the unit sections of the audio signal, a piece of boundary information relating to at least one boundary of a similarity section. The similarity section consists of consecutive unit sections, inclusive of the target unit section, each of which has similar section feature data. The judgment unit calculates a priority for each boundary indicated by one or more of the pieces of boundary information and judges whether the boundary is a scene change point based on the priority.
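The boundary calculation and judgment steps can be pictured with the following minimal sketch. It assumes each unit section is already summarized as a numeric feature vector, uses cosine similarity as the similarity test, and treats the priority of a boundary as the number of target sections whose similarity section ends there. None of these choices is stated in the abstract, and all function names and thresholds are hypothetical.

```python
import math
from collections import Counter


def cosine(a, b):
    """Cosine similarity of two feature vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0


def similarity_section(features, target, threshold):
    """Return inclusive (start, end) of the run of consecutive unit sections
    around `target` whose features are similar to the target's features."""
    start = target
    while start > 0 and cosine(features[start - 1], features[target]) >= threshold:
        start -= 1
    end = target
    while end < len(features) - 1 and cosine(features[end + 1], features[target]) >= threshold:
        end += 1
    return start, end


def scene_change_points(features, sim_threshold=0.9, min_votes=2):
    """Assumed priority rule: count how many target sections indicate each boundary."""
    votes = Counter()
    for target in range(len(features)):
        start, end = similarity_section(features, target, sim_threshold)
        votes[start] += 1    # boundary just before section `start`
        votes[end + 1] += 1  # boundary just after section `end`
    # The very beginning and end of the signal are not scene changes.
    return sorted(b for b, n in votes.items() if n >= min_votes and 0 < b < len(features))
```

Counting votes is only one possible priority measure; a weighted score based on how sharply the features change at the boundary would fit the same structure.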
Abstract:
The present invention aims to provide a data processing device that yields a categorization result satisfactory to a user, even when user data includes an object specific to the user. The data processing device stores, in a storage unit, model data pieces each indicating detection counts of feature amounts; judges, for each target data piece, whether the target data piece is a non-categorization data piece including an uncategorizable object, using the model data pieces and the detection count of each of at least two feature amounts detected in the target data piece; when two or more of the target data pieces are judged to be non-categorization data pieces, specifies at least two feature amounts that are each included, and detected the same number of times, in a predetermined number or more of the non-categorization data pieces; and newly creates a model data piece from the specified feature amounts using a class creation method and stores it in the storage unit.
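The flow above can be sketched as follows, purely as an illustration. The sketch assumes each data piece and each model is a mapping from a feature name to its detection count, and it substitutes a simple tally for the unspecified class creation method. The matching rule in `fits_model` and all names are hypothetical.

```python
from collections import Counter


def fits_model(counts, model):
    """Assumed rule: a data piece fits a model when every feature in the model
    is detected at least as many times as the model's count."""
    return all(counts.get(feature, 0) >= n for feature, n in model.items())


def non_categorization_pieces(pieces, models):
    """Data pieces that fit none of the stored models."""
    return [p for p in pieces if not any(fits_model(p, m) for m in models)]


def create_model(uncategorized, min_pieces):
    """Build a new model from (feature, count) pairs shared by at least
    `min_pieces` uncategorized pieces; return None if no model can be made."""
    if len(uncategorized) < 2:
        return None
    tally = Counter(
        (feature, n) for counts in uncategorized for feature, n in counts.items()
    )
    shared = {feature: n for (feature, n), k in tally.items() if k >= min_pieces}
    # The abstract requires at least two specified feature amounts.
    return shared if len(shared) >= 2 else None
```

A new model returned by `create_model` would then be appended to the stored models so that later data pieces containing the user-specific object can be categorized.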
Abstract:
The invention aims to provide a user interface for efficiently displaying desired content from among a large number of contents. An operation location and an operation amount of an operation made on an operation member are detected. Based on the operation location, one content is selected from among a plurality of contents arranged in sequence, and a display unit displays the selected content. When the operation location moves while the selected content is displayed, the display unit displays another content whose position in the sequence differs from that of the selected content by a number based on the operation amount detected by a detection unit.
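As a rough illustration of this selection scheme, the sketch below maps a normalized operation location to a position in the sequence and then jumps by a step derived from the operation amount. The normalization to the range 0.0 to 1.0, the linear step rule, and the names `select_index`, `jump`, and `contents_per_unit` are assumptions made for the example.

```python
def select_index(location, num_contents):
    """Map a normalized operation location in [0.0, 1.0] to a content index."""
    index = int(location * num_contents)
    return min(max(index, 0), num_contents - 1)


def jump(current, direction, amount, num_contents, contents_per_unit=1):
    """Move in `direction` (+1 or -1) by a step that grows with the operation
    amount, clamped to the ends of the sequence."""
    step = max(1, int(amount * contents_per_unit))
    target = current + direction * step
    return min(max(target, 0), num_contents - 1)
```

With this rule a light operation steps through neighbouring contents one at a time, while a larger operation amount skips further along the sequence.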
Abstract:
An imaging device of the present invention comprises: an imaging mechanism; a sound acquisition unit operable to acquire sound data that includes information reflecting an imaging environment; and a setting unit operable to, based on the sound data acquired by the sound acquisition unit, select and set one or more setting values for controlling the imaging mechanism.
Abstract:
A video transmission system is provided that achieves uninterrupted video transmission, even when there are large bit rate fluctuations due to receiving terminal movement and the like, in a network such as a wireless network in which transmission bit rate fluctuations occur. In this system, when a video receiving apparatus (receiving terminal) 150 is moving, a video transmitting apparatus (transmitting terminal) 100 lowers the base layer bit rate of layered-coded data to the limit. When the base layer bit rate is lowered in this way, either the bit rate of the lowest enhancement layer is raised, which suppresses the effect that lowering the base layer has on the received image quality of other terminals, or the lowest enhancement layer is divided finely, which improves adjustability to the available bit rate under bit rate fluctuations.
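The first of the two options, raising the lowest enhancement layer by the amount taken from the base layer, might look like the following sketch. It assumes the enhancement layers initially share the remaining bit rate equally and that rates are given in kbps. The function name and parameters are hypothetical and not taken from the source.

```python
def allocate_layers(total_kbps, base_kbps, min_base_kbps, num_enh, receiver_moving):
    """Return [base, enh1, ..., enhN] bit rates in kbps.

    Assumption: enhancement layers start with an equal share of what the
    base layer leaves over.
    """
    enh_each = (total_kbps - base_kbps) / num_enh
    layers = [base_kbps] + [enh_each] * num_enh
    if receiver_moving:
        # Lower the base layer to its limit and give the difference to the
        # lowest enhancement layer, so base + lowest enhancement is unchanged.
        shifted = max(0, base_kbps - min_base_kbps)
        layers[0] = base_kbps - shifted
        layers[1] += shifted
    return layers
```

Because the sum of the base layer and the lowest enhancement layer stays the same, a terminal that receives both layers sees the same total rate as before the shift.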
Abstract:
An interesting section extracting device extracts, from a video file, an interesting section that is of interest to a user, with reference to an audio signal included in the video file, such that a specified time is included in the interesting section. The interesting section extracting device includes an interface device that obtains the specified time, and a likelihood vector generating unit that calculates, in one-to-one correspondence with first unit sections of the audio signal, likelihoods for anchor models that respectively represent features of a plurality of types of sound pieces, and generates likelihood vectors having the calculated likelihoods as components. An interesting section extracting unit calculates, by using the likelihood vectors, a first feature section as a candidate for the interesting section to be extracted, and extracts, as the interesting section, a part of the first feature section that includes the specified time.
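A simplified sketch of the likelihood vector idea follows. It assumes one scalar feature per unit section, models each anchor as a one-dimensional Gaussian, and grows the section outward from the specified time while the likelihood vectors stay close to the one at that time. The abstract does not specify the form of the anchor models or the extraction rule, so these are illustrative assumptions, as are all names.

```python
import math


def gaussian_likelihood(x, mean, var):
    """Density of a 1-D Gaussian, used here as a stand-in anchor model."""
    return math.exp(-((x - mean) ** 2) / (2 * var)) / math.sqrt(2 * math.pi * var)


def likelihood_vector(feature, anchors):
    """One normalized likelihood per anchor model, given as (mean, variance)."""
    raw = [gaussian_likelihood(feature, mean, var) for mean, var in anchors]
    total = sum(raw)
    return [r / total for r in raw]


def extract_section(vectors, specified, max_dist):
    """Grow a section around the unit section at index `specified` while
    neighbouring likelihood vectors stay within `max_dist` of its vector."""
    ref = vectors[specified]
    start = specified
    while start > 0 and math.dist(vectors[start - 1], ref) <= max_dist:
        start -= 1
    end = specified
    while end < len(vectors) - 1 and math.dist(vectors[end + 1], ref) <= max_dist:
        end += 1
    return start, end


# Usage with two hypothetical anchors and five unit sections.
anchors = [(0.0, 1.0), (5.0, 1.0)]
vectors = [likelihood_vector(f, anchors) for f in [5.0, 0.1, 0.0, 0.2, 5.1]]
section = extract_section(vectors, specified=2, max_dist=0.1)
```

The returned pair is an inclusive range of unit-section indices that contains the specified time by construction.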
Abstract:
Provided is an image processing device for associating images with objects appearing in the images, while reducing burden on the user. The image processing device: stores, for each of events, a photographic attribute indicating a photographic condition predicted to be met with respect to an image photographed in the event; stores an object predicted to appear in an image photographed in the event; extracts from a collection of photographed images a photographic attribute that is common among a predetermined number of photographed images in the collection, based on pieces of photography-related information of the respective photographed images; specifies an object stored for an event corresponding to the extracted photographic attribute; and conducts a process on the collection of photographed images to associate each photographed image containing the specified object with the object.
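The association process can be illustrated with the sketch below. It assumes each image record carries a single photographic attribute label, that the stored event data is a small lookup table, and that object detection is supplied by a caller-provided function. The table contents, the record layout, and the names `EVENTS`, `common_attribute`, and `tag_collection` are all hypothetical.

```python
from collections import Counter

# Hypothetical event table: photographic attribute -> (event, expected object).
EVENTS = {
    "night_outdoor": ("fireworks festival", "fireworks"),
    "indoor_flash": ("birthday party", "cake"),
}


def common_attribute(images, min_count):
    """Return the attribute shared by at least `min_count` images, if any."""
    if not images:
        return None
    tally = Counter(img["attribute"] for img in images)
    attribute, count = tally.most_common(1)[0]
    return attribute if count >= min_count else None


def tag_collection(images, min_count, contains):
    """Associate each image that contains the event's expected object with it.

    `contains(image, obj)` is an assumed detector callback returning a bool.
    """
    attribute = common_attribute(images, min_count)
    if attribute not in EVENTS:
        return {}
    _event, obj = EVENTS[attribute]
    return {img["name"]: obj for img in images if contains(img, obj)}
```

Only the object expected for the inferred event is searched for, which is how the user is spared from labelling each image by hand.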
Abstract:
An image information processing apparatus comprising: an extraction unit that extracts an object from a photographed image; a calculation unit that calculates an orientation of the object as exhibited in the image; and a provision unit that provides a tag to the image according to the orientation of the object.
Abstract:
The image classification apparatus extracts first features of each received image (S22) and second features of an image relevant to each received image (S25). Subsequently, the image classification apparatus obtains a third feature by calculation using the locality of the extracted first and second features, the third feature being distinctive of a target object of each received image (S26), and creates model data based on the obtained third feature (S27).