Abstract:
Techniques are described for identifying blurred images and recognizing text. One or more images of text may be captured. A change of movement associated with each image of the one or more images may be calculated. The change of movement associated with an image of the one or more images represents a change in an amount of acceleration of the device used to capture the image while the image was being captured. A steady image may be selected from the one or more images to use for text recognition. The steady image can be selected using the variances of acceleration associated with each image of the one or more images.
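The selection step can be illustrated with a minimal sketch. It assumes each captured image comes with a list of accelerometer magnitude samples recorded during its exposure; that data layout and the function name `select_steady_image` are hypothetical and are not taken from the abstract.

```python
from statistics import pvariance

def select_steady_image(captures):
    """Return the index of the capture whose acceleration varied least.

    `captures` holds one list of accelerometer magnitudes per image
    (an assumed layout; the abstract does not specify one).
    """
    if not captures:
        raise ValueError("no captures to choose from")
    # Variance of acceleration during each exposure; lower means steadier.
    variances = [pvariance(samples) for samples in captures]
    return min(range(len(captures)), key=variances.__getitem__)

# Illustrative readings in m/s^2 for three captured images.
captures = [
    [9.8, 10.6, 9.1, 10.9],    # device shaking during capture
    [9.8, 9.81, 9.79, 9.8],    # device held still
    [9.8, 9.5, 10.2, 9.9],
]
steady_index = select_steady_image(captures)
```

The image at `steady_index` would then be passed on to text recognition.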
Abstract:
Signal separation techniques based on frequency dependency are described. In one implementation, a blind signal separation process is provided that avoids the permutation problem of previous signal separation processes. In the process, two or more signal sources are provided, with each signal source having recognized frequency dependencies. The process uses these inter-frequency dependencies to more robustly separate the source signals. The process receives a set of mixed input signals, and samples each input signal using a rolling window process. The sampled data is transformed into the frequency domain, which provides channel inputs to the inter-frequency dependent separation process. Since frequency dependencies have been defined for each source, the process is able to use these dependencies to more accurately separate the signals. The process can use a learning algorithm that preserves frequency dependencies within each source signal, and can remove dependencies between or among the signal sources.
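The front end of such a process (rolling-window sampling followed by a transform into the frequency domain) might look like the sketch below. It covers only that front end, not the separation or the learning algorithm. The window size, hop size, Hann taper, and all function names are assumptions made for illustration.

```python
import cmath
import math

def frames(signal, size, hop):
    """Yield overlapping windows of `size` samples, advancing by `hop`."""
    for start in range(0, len(signal) - size + 1, hop):
        yield signal[start:start + size]

def dft(frame):
    """Naive DFT of a Hann-tapered frame; returns bins 0..n/2."""
    n = len(frame)
    tapered = [x * (0.5 - 0.5 * math.cos(2 * math.pi * i / n))
               for i, x in enumerate(frame)]
    return [sum(tapered[t] * cmath.exp(-2j * math.pi * k * t / n)
                for t in range(n))
            for k in range(n // 2 + 1)]

def to_frequency_domain(channels, size=8, hop=4):
    """Return spectra[channel][frame][bin] for each mixed input channel."""
    return [[dft(f) for f in frames(ch, size, hop)] for ch in channels]

# Two illustrative mixed input channels of 16 samples each.
tone = [math.cos(2 * math.pi * 2 * t / 8) for t in range(16)]
mixed = [tone, [0.5 * x for x in tone]]
spectra = to_frequency_domain(mixed)
```

Each frame yields one vector of frequency bins per channel. A separation stage that exploits inter-frequency dependencies would operate on these vectors jointly across bins rather than on each bin in isolation.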
Abstract:
A method for identifying mobile devices in a similar sound environment is disclosed. Each of at least two mobile devices captures an input sound and extracts a sound signature from the input sound. Further, the mobile device extracts a sound feature from the input sound and determines a reliability value based on the sound feature. The reliability value may refer to a probability of a normal sound class given the sound feature. A server receives a packet including the sound signatures and reliability values from the mobile devices. A similarity value between sound signatures from a pair of the mobile devices is determined based on corresponding reliability values from the pair of mobile devices. Specifically, the sound signatures are weighted by the corresponding reliability values. The server identifies mobile devices in a similar sound environment based on the similarity values.
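One way the server-side comparison could work is sketched below. The abstract says only that the sound signatures are weighted by their reliability values; the specific choice here (scaling each unit-normalized signature by its reliability and taking the inner product), the threshold, and every name in the code are assumptions for illustration.

```python
import math
from itertools import combinations

def weighted_similarity(sig_a, rel_a, sig_b, rel_b):
    """Inner product of unit-normalized signatures scaled by reliability."""
    norm_a = math.sqrt(sum(x * x for x in sig_a))
    norm_b = math.sqrt(sum(x * x for x in sig_b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    weighted_a = [rel_a * x / norm_a for x in sig_a]
    weighted_b = [rel_b * x / norm_b for x in sig_b]
    return sum(x * y for x, y in zip(weighted_a, weighted_b))

def similar_pairs(packets, threshold=0.5):
    """packets maps a device id to (signature, reliability).

    Returns the device pairs whose weighted similarity reaches `threshold`.
    """
    return [
        (a, b)
        for a, b in combinations(sorted(packets), 2)
        if weighted_similarity(*packets[a], *packets[b]) >= threshold
    ]

# Hypothetical packets received from three mobile devices.
packets = {
    "phone_a": ([1.0, 0.0, 1.0], 0.9),
    "phone_b": ([1.0, 0.1, 0.9], 0.8),
    "phone_c": ([0.0, 1.0, 0.0], 0.9),
}
pairs = similar_pairs(packets)
```

With this weighting, a pair of devices scores highly only when their signatures agree and both devices report reliable sound, so a signature captured under an unreliable sound class contributes little to the match.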
Abstract:
Methods, apparatuses, systems, and computer-readable media for rejecting false positive detection and tracking of image objects are presented. According to one or more aspects, a computing device may implement embodiments of the invention to use the movement of the mobile device for distinguishing false positives from true movement of the mobile device depicted in the field of view of the camera. In one embodiment, the actual movement of the mobile device may be measured using multi-modal sensor data from inertial sensors such as accelerometers and gyroscopes. In another embodiment, the actual movement of the mobile device is calculated using the global movement of the mobile device with reference to other objects in the field of view of the camera.
Abstract:
The present disclosure relates to a method for processing a sound source in a terminal capable of receiving an outside sound source through at least two microphones and displaying a position of the outside sound source on a display unit, and to a terminal using the same. The terminal includes: at least two microphones configured to receive an outside sound source; a display unit configured to display predetermined data; and a controller configured to obtain position information of the sound source using at least one of an amplitude, a phase and a period of the outside sound source received from the at least two microphones, and to control the display unit to display the position information of the sound source.
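As an illustration of how a controller might derive position information from the phase (arrival time) difference between two microphones, the sketch below estimates the inter-microphone delay by cross-correlation and converts it to a bearing. The far-field geometry, the speed of sound, the sign convention, and all names are assumptions; the abstract does not prescribe a particular estimation method.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s in air at room temperature (assumed)

def estimate_delay(left, right, max_lag):
    """Return the lag in samples that maximizes the cross-correlation.

    A positive lag means the sound reached the right microphone later.
    """
    def corr(lag):
        return sum(left[i] * right[i + lag]
                   for i in range(len(left))
                   if 0 <= i + lag < len(right))
    return max(range(-max_lag, max_lag + 1), key=corr)

def bearing_degrees(lag, sample_rate, mic_spacing):
    """Far-field bearing from broadside; positive is toward the left mic."""
    ratio = SPEED_OF_SOUND * lag / (sample_rate * mic_spacing)
    ratio = max(-1.0, min(1.0, ratio))  # clamp against estimation noise
    return math.degrees(math.asin(ratio))

# Illustrative click that reaches the right microphone 3 samples late.
left = [0.0] * 32
right = [0.0] * 32
left[10] = 1.0
right[13] = 1.0

lag = estimate_delay(left, right, max_lag=8)
angle = bearing_degrees(lag, sample_rate=48000, mic_spacing=0.15)
```

The resulting angle is the kind of position information that could be drawn on the display unit. With only two microphones it resolves direction along a single axis and cannot distinguish front from back.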