Abstract:
A voice retrieval apparatus executes processes of: converting a retrieval string into a phoneme string; obtaining, from a time length memory, a continuous time length for each phoneme contained in the converted phoneme string; deriving a plurality of time lengths corresponding to a plurality of utterance rates as candidate utterance time lengths of voices corresponding to the retrieval string based on the obtained continuous time length; specifying, for each of the plurality of time lengths, a plurality of likelihood obtainment segments having the derived time length within a time length of a retrieval sound signal; obtaining a likelihood showing a plausibility that the specified likelihood obtainment segment specified is a segment where the voices are uttered; and identifying, based on the obtained likelihood, for each of the specified likelihood obtainment segments, an estimation segment where utterance of the voices is estimated in the retrieval sound signal.
Abstract:
An exercise assisting apparatus includes at least one memory and at least one processor configured to execute a program loaded in the memory. The processor resamples arm-swing trajectory data pieces on a plurality of human subjects with a predetermined number of samples. The processor generates a distance matrix on the basis of the minimum distance between two point groups after association between individual points, the two point groups being selected from among the resampled arm-swing trajectory data pieces. The processor generates clustering data through classification of the values contained in the distance matrix into a certain number of clusters.
Abstract:
A search word acquiring unit acquires a search word. A converting unit converts the search word into a phoneme sequence. An output probability acquiring unit acquires, for each frame, an output probability of a feature quantity of a target voice signal being output from each phoneme included in the phoneme sequence. A relative calculating unit executes relative calculation of the output probability acquired from each phoneme by the output probability acquirer, based on an output probability acquired from another phoneme included in the phoneme sequence. A zone designating unit successively designates a likelihood acquisition zones. A likelihood calculating unit acquires a likelihood indicating how likely a likelihood acquisition zone designated by the zone designator is a zone in which voice corresponding to the search word is spoken. An identifying unit identifies from the target voice signal an estimated zone for which the voice corresponding to the search word is estimated to be spoken, based on the likelihood acquired by the likelihood acquiring unit.
Abstract:
A measurement device includes at least one processor that acquires acceleration data indicating a temporal transition of acceleration of a subject when the subject is moving by performing a moving action, detects a target timing that is a timing when the subject performs a target action in accordance with a temporal transition of the acceleration of the subject in a detection target direction, the temporal transition being indicated by the acceleration data, and in accordance with reference data indicating a temporal transition of the acceleration of the subject when the subject performs the target action in the moving action, detects the target timing using a first detection method when the moving action is a first moving action, and detects the target timing using a second detection method different from the first detection method when the moving action is a second moving action different from the first moving action. At least one of the detection target direction or the reference data is different between when the target timing is detected using the first detection method and when the target timing is detected using the second detection method.
Abstract:
An audio interval detection apparatus comprising a processor and a storage storing instructions that, when executed by the processor, control the processor to: detect from a target audio signal a specified audio interval including a specified audio signal representing a state of a phoneme of a same consonant produced continuously over a period longer than a specified time, and by eliminating, from the target audio signal at least the detected specified audio interval, detect from the target audio signal an utterance audio interval that includes a speech utterance signal representing a speech utterance uttered by a speaker.
Abstract:
A voice retrieval apparatus executes processes of: obtaining, from a time length memory, a continuous time length for each phoneme contained in a phoneme string of a retrieval string; obtaining user-specified information on an utterance rate; changing the continuous time length for each obtained phoneme in accordance with the obtained information; deriving, based on the changed continuous time length, an utterance time length of voices corresponding to the retrieval string; specifying a plurality of likelihood obtainment segments of the derived utterance time length in a time length of a retrieval sound signal; obtaining a likelihood showing a plausibility that the specified likelihood obtainment segment is a segment where the voices are uttered; and identifying, based on the obtained likelihood, an estimation segment where, within the retrieval sound signal, utterance of the voices is estimated, the estimation segment being identified for each specified likelihood obtainment segment.