摘要:
A method for providing information for a conference at one or more locations is disclosed. One or more mobile devices monitor one or more starting requirements of the conference and transmit input sound information to a server when the one or more starting requirements of the conference is detected. The one or more starting requirements may include a starting time of the conference, a location of the conference, and/or acoustic characteristics of a conference environment. The server generates conference information based on the input sound information from each mobile device and transmits the conference information to each mobile device. The conference information may include information on attendees, a current speaker among the attendees, an arrangement of the attendees, and/or a meeting log of attendee participation at the conference.
摘要:
A system and method for separating a mixture of audio signal into desired audio signals (430) (e.g., speech) and a noise sign (440) is disclosed. Microphones (310, 320) are positioned to receive the mixed audio signals, and an independent component analysis (ICA) processes (212) the sound mixture using stability constraints. The ICA process (508) uses predefined characteristics of the desired speech signal to identify and isolate a target sound signal (430). Filter coefficients are adapted with a learning rule and filter weight update dynamics are stabilized to assist convergence to a stable separated ICA signal result. The separated signals may be peripherally-processed to further reduce noise effects using post-processing (214) and pre-processing (220, 230) techniques and information. The proposed system is designed and easily adaptable for implementation on DSP units or CPUs in audio communication hardware environments.
摘要:
The present invention provides a process for separating a good quality information signal from a noisy acoustic environment. The separation process uses a set of at least two spaced-apart transducers to capture noise and information components. The transducer signals, which have both a noise and information component, are received into a separation process. The separation process generates one channel that is substantially only noise, and another channel that is a combination of noise and information. An identification process is used to identify which channel has the information component. The noise signal is then used to set process characteristics that are applied to the combination signal to efficiently reduce or eliminate the noise component. In this way, the noise is effectively removed from the combination signal to generate a good qualify information signal. The information signal may be, for example, a speech signal, a seismic signal, a sonar signal, or other acoustic signal.
摘要:
A method and apparatus for efficiently encoding images using a set of non-orthogonal basis functions, thereby allowing reduction of file size, shorter transmission time, and improved accuracy. The non-orthogonal basis functions include homogenous color basis functions, luminance-encoding basis functions that have luminance edges and chromatic basis functions that exhibit color opponency. Some of the basis functions are non-orthogonal with respect to each other. Using these basis functions, a source vector is calculated to provide a number of coefficients, each coefficient associated with one basis function. The source vector is compressed by selecting a subset of the calculated coefficients, thereby providing an encoded vector. Because the method is highly efficient, the image data is substantially represented by a small number of coefficients. In some embodiments, the non-orthogonal basis functions include two or more classes. A wavelet approach can also be utilized.
摘要:
The present invention provides a process for separating a good quality information signal from a noisy acoustic environment. The separation process uses a set of at least two spaced-apart transducers to capture noise and information components. The transducer signals, which have both a noise and information component, are received into a separation process. The separation process generates one channel that is substantially only noise, and another channel that is a combination of noise and information. An identification process is used to identify which channel has the information component. The noise signal is then used to set process characteristics that are applied to the combination signal to efficiently reduce or eliminate the noise component. In this way, the noise is effectively removed from the combination signal to generate a good qualify information signal. The information signal may be, for example, a speech signal, a seismic signal, a sonar signal, or other acoustic signal.
摘要:
Embodiments of the invention describe methods and apparatus for performing context-sensitive OCR. A device obtains an image using a camera coupled to the device. The device identifies a portion of the image comprising a graphical object. The device infers a context associated with the image and selects a group of graphical objects based on the context associated with the image. Improved OCR results are generated using the group of graphical objects. Input from various sensors including microphone, GPS, and camera, along with user inputs including voice, touch, and user usage patterns may be used in inferring the user context and selecting dictionaries that are most relevant to the inferred contexts.
摘要:
A voice activity detection (VAD) system includes a first voice activity detector, a second voice activity detector and control logic. The first voice activity detector is included in a device and produces a first VAD signal. The second voice activity detector is located externally to the device and produces a second VAD signal. The control logic combines the first and second VAD signals into a VAD output signal. Voice activity may be detected based on the VAD output signal. The second VAD signal can be represented as a flag included in a packet containing digitized audio. The packet can be transmitted to the device from the externally located VAD over a wireless link.
摘要:
Estimating a location of a mobile device is performed by comparing environmental information, such as environmental sound, associated with the mobile device with that of other devices to determine if the environmental information is similar enough to conclude that the mobile device is in a comparable location as another device. The devices may be in comparable locations in that they are in geographically similar locations (e.g., same store, same street, same city, etc.). The devices may be in comparable locations even though they are located in geographically dissimilar locations because the environmental information of the two locations demonstrates that the devices are in the same perceived location. With knowledge that the devices are in comparable locations, and with knowledge of the location of one of the devices, certain actions, such as targeted advertising, may be taken with respect to another device that is within a comparable location.
摘要:
This disclosure describes techniques that can improve and possibly accelerate the generation of augmented reality (AR) information with respect to objects that appear in images of a video sequence. To do so, the techniques of this disclosure capture and use information about the eyes of a user of a video device. The video device may include two different cameras. A first camera is oriented to capture a sequence of images (e.g., video) outward from a user. A second camera is oriented to capture images of the eyes of the user when the first camera captures images outward from the user. The eyes of the user, as captured by one or more images of the second camera, may be used to generate a probability map, and the probability map may be used to prioritize objects in the first image for AR processing.
摘要:
A method of scanning an image of a document with a portable electronic device includes interactively indicating in substantially real time on a user interface of the portable electronic device, an instruction for capturing at least one portion of an image to enhance quality. The indication is in response to identifying degradation associated with the portion(s) of the image. The method also includes capturing the portion(s) of the image with the portable electronic device according to the instruction. The method further includes stitching the captured portion(s) of the image in place of a degraded portion of a reference image corresponding to the document, to create a corrected stitched image of the document.