Abstract:
An apparatus is disclosed that comprises a processor and a memory that cause the apparatus to perform: receiving a video indicating a motion; generating a set of scalar representations of movement based, at least in part, on at least part of the video; and identifying at least one predetermined motion that correlates to the set of scalar representations of movement.
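One way to picture "scalar representations of movement" is a single number per pair of consecutive frames. The sketch below is only an illustration under stated assumptions: the abstract does not say how the scalars are computed, so the mean absolute pixel difference, the function name frame_motion_scalars, and the representation of frames as nested lists of grayscale values are all hypothetical choices.

```python
def frame_motion_scalars(frames):
    """Return one scalar per consecutive frame pair.

    Assumption (not from the abstract): each frame is a list of rows of
    grayscale values, and movement is summarized as the mean absolute
    difference between a frame and the one before it.
    """
    scalars = []
    for prev, cur in zip(frames, frames[1:]):
        total = 0
        count = 0
        for row_prev, row_cur in zip(prev, cur):
            for p, c in zip(row_prev, row_cur):
                total += abs(c - p)
                count += 1
        scalars.append(total / count if count else 0.0)
    return scalars


if __name__ == "__main__":
    still = [[0, 0], [0, 0]]
    moved = [[0, 0], [0, 4]]
    print(frame_motion_scalars([still, moved, moved]))
```

A sequence of such scalars could then be compared against stored sequences for predetermined motions; that comparison step is left out here.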
Abstract:
An apparatus is disclosed that comprises a processor and memory configured to cause the apparatus to perform at least the following: receiving a video indicating a motion; generating a set of normalized representations of movement based, at least in part, on the video; evaluating a reference set of representations with respect to the set of normalized representations of the movement; and determining, based at least in part on the evaluation, that at least one predetermined motion correlates to the set of normalized representations of the movement.
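The normalize, evaluate, and determine steps can be sketched as follows. This is a minimal illustration, not the claimed method: the abstract does not define the normalization or the evaluation, so zero-mean unit-norm scaling, a dot-product correlation score, the 0.8 threshold, and the names normalize, correlation, and best_match are all assumptions.

```python
import math


def normalize(values):
    """Shift to zero mean and scale to unit Euclidean norm (assumed scheme)."""
    mean = sum(values) / len(values)
    centered = [v - mean for v in values]
    norm = math.sqrt(sum(c * c for c in centered))
    if norm == 0:
        # A constant sequence carries no movement pattern to compare.
        return [0.0] * len(values)
    return [c / norm for c in centered]


def correlation(a, b):
    """Dot product of the two normalized sequences, in the range [-1, 1]."""
    return sum(x * y for x, y in zip(normalize(a), normalize(b)))


def best_match(movement, references, threshold=0.8):
    """Return the name of the best-correlating reference, or None.

    `references` maps a hypothetical motion name to a reference sequence of
    the same length as `movement`.
    """
    best_name, best_score = None, threshold
    for name, reference in references.items():
        score = correlation(movement, reference)
        if score >= best_score:
            best_name, best_score = name, score
    return best_name


if __name__ == "__main__":
    refs = {"wave": [0, 1, 0, -1, 0, 1, 0, -1], "raise": [0, 1, 2, 3, 4, 5, 6, 7]}
    print(best_match([0, 2, 4, 6, 8, 10, 12, 14], refs))
```

Because both sequences are normalized before comparison, a movement performed with twice the amplitude still matches the same reference.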
Abstract:
A platform that is configured to be removably placed symmetrically on or about a user's head has at least a first transducer configured to capture vibration of the user's skull or facial movement generated by the user's voice activity and to detect the user's speaking activity. This first transducer converts the vibration or facial movement into a first electrical audio signal. The electrical audio signal from the first transducer is processed by circuitry or embodied software as voiced frames and/or as unvoiced frames, in which the voiced frames and/or the unvoiced frames are defined based at least on the first electrical audio signal. Multiple embodiments follow from this: one in which the first transducer is a vibration sensor; one in which voice is captured by an air microphone and the filtering adaptation differs for the voiced versus unvoiced frames as defined by the first transducer; and one with at least three air microphones.
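The idea of defining voiced and unvoiced frames from the first transducer's signal can be sketched as a per-frame decision. The abstract does not state the decision rule, so the short-time energy test, the fixed non-overlapping frames, and the name label_frames below are assumptions chosen only for illustration.

```python
def label_frames(signal, frame_len, energy_threshold):
    """Label each full frame of the transducer signal as voiced or unvoiced.

    Assumption (not from the abstract): a frame counts as voiced when its
    mean squared amplitude reaches `energy_threshold`. Trailing samples that
    do not fill a whole frame are ignored.
    """
    labels = []
    for start in range(0, len(signal) - frame_len + 1, frame_len):
        frame = signal[start:start + frame_len]
        energy = sum(s * s for s in frame) / frame_len
        labels.append("voiced" if energy >= energy_threshold else "unvoiced")
    return labels


if __name__ == "__main__":
    signal = [0.0] * 4 + [1.0, -1.0, 1.0, -1.0] + [0.1] * 4
    print(label_frames(signal, frame_len=4, energy_threshold=0.5))
```

In an embodiment with an air microphone, labels like these could select which filter adaptation is applied to the corresponding microphone frame.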
Abstract:
A system is disclosed for forming frames of color video signals. A video color encoder and a multichannel keyer are employed in generating color frames from monochrome frames. Colors from a pseudo-color generator can be automatically inserted in portions of a frame that are not being colorized by an operator with operator-selected colors.
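The automatic fill-in of pseudo-colors can be pictured per pixel: use the operator's color where one was chosen, otherwise derive a color from the monochrome level. This is a hedged sketch only. The abstract describes an encoder and multichannel keyer rather than software, and the palette lookup, the 0 to 255 gray range, and the name colorize are assumptions.

```python
def colorize(gray, operator_colors, palette):
    """Build a color frame from a monochrome frame.

    gray            -- rows of gray levels in 0..255 (assumed range)
    operator_colors -- dict mapping (row, col) to an operator-selected RGB tuple
    palette         -- list of RGB tuples used as the pseudo-color lookup
    """
    out = []
    for y, row in enumerate(gray):
        out_row = []
        for x, level in enumerate(row):
            chosen = operator_colors.get((y, x))
            if chosen is None:
                # Not colorized by the operator: fall back to a pseudo-color
                # picked by splitting the gray range evenly across the palette.
                chosen = palette[level * len(palette) // 256]
            out_row.append(chosen)
        out.append(out_row)
    return out


if __name__ == "__main__":
    palette = [(0, 0, 255), (0, 255, 0), (255, 0, 0), (255, 255, 0)]
    print(colorize([[0, 255], [128, 64]], {(0, 0): (1, 2, 3)}, palette))
```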
Abstract:
An apparatus is disclosed that includes an air-conduction transducer and a bone conduction transducer. The air-conduction transducer is configured to convert a first frequency band component of an electrical audio signal into acoustic energy to be delivered to an ear canal of a user. The bone conduction transducer is configured to convert a second, at least partially different, frequency band component of the electrical audio signal into mechanical energy to be delivered to a skull of the user. The apparatus is configured to deliver both forms of energy to the user at substantially the same time to provide a combined audio delivery result to the user.
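Splitting one audio signal into two frequency band components can be sketched with a simple complementary filter pair. The abstract does not say how the bands are derived or which band feeds which transducer, so the one-pole low-pass filter, the smoothing factor alpha, and the name split_bands are assumptions.

```python
def split_bands(samples, alpha):
    """Split samples into a low band and a complementary high band.

    Assumption (not from the abstract): the low band is a one-pole low-pass
    filter with smoothing factor `alpha` in (0, 1], and the high band is the
    remainder, so low[i] + high[i] reconstructs samples[i].
    """
    low, high = [], []
    state = 0.0
    for x in samples:
        state += alpha * (x - state)  # exponential smoothing step
        low.append(state)
        high.append(x - state)
    return low, high


if __name__ == "__main__":
    low, high = split_bands([1.0] * 50, alpha=0.5)
    print(low[-1], high[-1])
```

Each band would then drive its own transducer in the same sample period, which is one way to read delivery at substantially the same time.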
Abstract:
A method and system of identity masking to obscure identities corresponding to face regions in an image is disclosed. A face detector is applied to detect a set of possible face regions in the image. An identity masker then processes the detected face regions with identity masking techniques in order to obscure the identities corresponding to the regions. For example, a motion blur algorithm can blur a detected face region as if it were in motion, such that the blurred region cannot be recognized as the original identity. Alternatively, a face replacement algorithm can replace the detected face region with a substitute facial image to obscure the corresponding identity.
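The motion-blur masking step can be sketched on a grayscale image once a face region is known. Face detection itself is out of scope here: the box is assumed to come from some detector, and the horizontal averaging kernel, the (top, left, bottom, right) box layout, and the name motion_blur_region are illustrative assumptions, not the algorithm in the disclosure.

```python
def motion_blur_region(image, box, length=5):
    """Return a copy of `image` with a horizontal motion blur inside `box`.

    image  -- rows of grayscale values
    box    -- (top, left, bottom, right), bottom and right exclusive; assumed
              to be supplied by a separate face detector
    length -- number of pixels averaged along the simulated motion direction
    """
    top, left, bottom, right = box
    out = [row[:] for row in image]  # leave the input untouched
    for y in range(top, bottom):
        for x in range(left, right):
            # Average this pixel with its right-hand neighbours, clipped to
            # the box so pixels outside the face region never leak in.
            xs = range(x, min(x + length, right))
            out[y][x] = sum(image[y][i] for i in xs) / len(xs)
    return out


if __name__ == "__main__":
    print(motion_blur_region([[9, 0, 8, 0, 9]], (0, 1, 1, 4), length=2))
```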