SYSTEM AND METHOD FOR PROVIDING CONFERENCE INFORMATION
    1.
    Patent application
    Status: Pending (published)

    Publication No.: US20120142324A1

    Publication date: 2012-06-07

    Application No.: US13289437

    Filing date: 2011-11-04

    IPC classification: H04M3/42

    Abstract: A method for providing information for a conference at one or more locations is disclosed. One or more mobile devices monitor one or more starting requirements of the conference and transmit input sound information to a server when the one or more starting requirements are detected. The one or more starting requirements may include a starting time of the conference, a location of the conference, and/or acoustic characteristics of a conference environment. The server generates conference information based on the input sound information from each mobile device and transmits the conference information to each mobile device. The conference information may include information on attendees, a current speaker among the attendees, an arrangement of the attendees, and/or a meeting log of attendee participation at the conference.

    System and method for speech processing using independent component analysis under stability constraints
    2.
    Granted patent
    Status: Active

    Publication No.: US07383178B2

    Publication date: 2008-06-03

    Application No.: US10537985

    Filing date: 2003-12-11

    IPC classification: G10L21/02

    CPC classification: G10L21/0272

    Abstract: A system and method for separating a mixture of audio signals into desired audio signals (430) (e.g., speech) and a noise signal (440) is disclosed. Microphones (310, 320) are positioned to receive the mixed audio signals, and an independent component analysis (ICA) process (212) operates on the sound mixture using stability constraints. The ICA process (508) uses predefined characteristics of the desired speech signal to identify and isolate a target sound signal (430). Filter coefficients are adapted with a learning rule, and filter weight update dynamics are stabilized to assist convergence to a stable separated ICA signal result. The separated signals may be peripherally processed to further reduce noise effects using post-processing (214) and pre-processing (220, 230) techniques and information. The proposed system is designed for, and easily adaptable to, implementation on DSP units or CPUs in audio communication hardware environments.

    Separation of target acoustic signals in a multi-transducer arrangement
    3.
    Granted patent
    Status: Active

    Publication No.: US07366662B2

    Publication date: 2008-04-29

    Application No.: US11463376

    Filing date: 2006-08-09

    IPC classification: G10L21/02

    Abstract: The present invention provides a process for separating a good quality information signal from a noisy acoustic environment. The separation process uses a set of at least two spaced-apart transducers to capture noise and information components. The transducer signals, which have both a noise component and an information component, are received into a separation process. The separation process generates one channel that is substantially only noise, and another channel that is a combination of noise and information. An identification process is used to identify which channel has the information component. The noise signal is then used to set process characteristics that are applied to the combination signal to efficiently reduce or eliminate the noise component. In this way, the noise is effectively removed from the combination signal to generate a good quality information signal. The information signal may be, for example, a speech signal, a seismic signal, a sonar signal, or other acoustic signal.

    Method and apparatus for efficiently encoding chromatic images using non-orthogonal basis functions
    4.
    Granted patent
    Status: Expired

    Publication No.: US07286712B2

    Publication date: 2007-10-23

    Application No.: US11086802

    Filing date: 2005-03-21

    IPC classification: G06K9/36, G06K9/46

    Abstract: A method and apparatus are disclosed for efficiently encoding images using a set of non-orthogonal basis functions, thereby allowing reduction of file size, shorter transmission time, and improved accuracy. The non-orthogonal basis functions include homogeneous color basis functions, luminance-encoding basis functions that have luminance edges, and chromatic basis functions that exhibit color opponency. Some of the basis functions are non-orthogonal with respect to each other. Using these basis functions, a source vector is calculated to provide a number of coefficients, each coefficient associated with one basis function. The source vector is compressed by selecting a subset of the calculated coefficients, thereby providing an encoded vector. Because the method is highly efficient, the image data is substantially represented by a small number of coefficients. In some embodiments, the non-orthogonal basis functions include two or more classes. A wavelet approach can also be utilized.
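
    The two steps named in the abstract (calculating a source vector against a non-orthogonal basis, then keeping a subset of its coefficients) can be sketched as follows. This is a minimal illustration, not the patented method: the 2x2 basis, the function names `source_vector_2d` and `keep_largest`, and the largest-magnitude selection rule are all assumptions made for the example.

```python
def source_vector_2d(basis, pixel):
    """Solve basis @ s = pixel for a 2x2 basis (hypothetical toy case).

    `basis` is given as two rows; its columns are the basis functions,
    which do not have to be orthogonal. Cramer's rule is used because
    a non-orthogonal basis cannot be inverted by simple projection.
    """
    (a, b), (c, d) = basis
    det = a * d - b * c
    if det == 0:
        raise ValueError("basis functions are linearly dependent")
    x, y = pixel
    return [(d * x - b * y) / det, (a * y - c * x) / det]


def keep_largest(coeffs, k):
    """Keep the k largest-magnitude coefficients as (index, value) pairs.

    The selection criterion is an assumption; the abstract only says
    that a subset of the coefficients is selected.
    """
    ranked = sorted(range(len(coeffs)), key=lambda i: abs(coeffs[i]), reverse=True)
    return [(i, coeffs[i]) for i in sorted(ranked[:k])]


# Columns (1, 0) and (1, 1) are not orthogonal to each other.
basis = [[1, 1], [0, 1]]
coeffs = source_vector_2d(basis, (3, 2))
encoded = keep_largest(coeffs, 1)
```

    Storing (index, value) pairs lets a decoder know which basis function each retained coefficient belongs to.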

    Separation of target acoustic signals in a multi-transducer arrangement
    5.
    Patent application
    Status: Active

    Publication No.: US20070038442A1

    Publication date: 2007-02-15

    Application No.: US11463376

    Filing date: 2006-08-09

    IPC classification: G10L15/20

    Abstract: The present invention provides a process for separating a good quality information signal from a noisy acoustic environment. The separation process uses a set of at least two spaced-apart transducers to capture noise and information components. The transducer signals, which have both a noise component and an information component, are received into a separation process. The separation process generates one channel that is substantially only noise, and another channel that is a combination of noise and information. An identification process is used to identify which channel has the information component. The noise signal is then used to set process characteristics that are applied to the combination signal to efficiently reduce or eliminate the noise component. In this way, the noise is effectively removed from the combination signal to generate a good quality information signal. The information signal may be, for example, a speech signal, a seismic signal, a sonar signal, or other acoustic signal.

    Camera OCR with context information
    6.
    Granted patent
    Status: Active

    Publication No.: US09082035B2

    Publication date: 2015-07-14

    Application No.: US13450016

    Filing date: 2012-04-18

    Abstract: Embodiments of the invention describe methods and apparatus for performing context-sensitive OCR. A device obtains an image using a camera coupled to the device. The device identifies a portion of the image comprising a graphical object. The device infers a context associated with the image and selects a group of graphical objects based on the context associated with the image. Improved OCR results are generated using the group of graphical objects. Input from various sensors including microphone, GPS, and camera, along with user inputs including voice, touch, and user usage patterns may be used in inferring the user context and selecting dictionaries that are most relevant to the inferred contexts.

    Voice activity detection based on plural voice activity detectors
    7.
    Granted patent
    Status: Active

    Publication No.: US08626498B2

    Publication date: 2014-01-07

    Application No.: US12711943

    Filing date: 2010-02-24

    Applicant: Te-Won Lee

    Inventor: Te-Won Lee

    IPC classification: G10L25/93

    CPC classification: G10L25/78

    Abstract: A voice activity detection (VAD) system includes a first voice activity detector, a second voice activity detector and control logic. The first voice activity detector is included in a device and produces a first VAD signal. The second voice activity detector is located externally to the device and produces a second VAD signal. The control logic combines the first and second VAD signals into a VAD output signal. Voice activity may be detected based on the VAD output signal. The second VAD signal can be represented as a flag included in a packet containing digitized audio. The packet can be transmitted to the device from the externally located VAD over a wireless link.
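
    The control logic described above can be sketched as a small function that merges the device's own VAD decision with the flag carried in a received packet. The abstract does not say how the two signals are combined, so the `AudioPacket` layout, the `combine_vad` name, and the AND/OR modes below are assumptions chosen for illustration.

```python
from dataclasses import dataclass


@dataclass
class AudioPacket:
    """Hypothetical packet from the external detector: audio plus a VAD flag."""
    samples: bytes
    vad_flag: bool


def combine_vad(internal_vad: bool, packet: AudioPacket, mode: str = "and") -> bool:
    """Combine the first (internal) and second (external) VAD signals.

    mode="and" reports voice only when both detectors agree;
    mode="or" reports voice when either detector fires.
    Both rules are illustrative assumptions.
    """
    if mode == "and":
        return internal_vad and packet.vad_flag
    if mode == "or":
        return internal_vad or packet.vad_flag
    raise ValueError(f"unknown mode: {mode!r}")


packet = AudioPacket(samples=b"\x00\x01", vad_flag=False)
strict = combine_vad(True, packet)            # both detectors must agree
lenient = combine_vad(True, packet, "or")     # either detector suffices
```

    An AND rule trades missed detections for fewer false alarms, while an OR rule does the opposite; which is appropriate depends on the application.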

    Mobile device location estimation using environmental information
    8.
    Granted patent
    Status: Active

    Publication No.: US08606293B2

    Publication date: 2013-12-10

    Application No.: US12898647

    Filing date: 2010-10-05

    IPC classification: H04M3/00, H04W24/00

    Abstract: A location of a mobile device is estimated by comparing environmental information, such as environmental sound, associated with the mobile device with that of other devices to determine whether the environmental information is similar enough to conclude that the mobile device is in a location comparable to that of another device. The devices may be in comparable locations in that they are in geographically similar locations (e.g., same store, same street, same city, etc.). The devices may be in comparable locations even though they are located in geographically dissimilar locations because the environmental information of the two locations demonstrates that the devices are in the same perceived location. With knowledge that the devices are in comparable locations, and with knowledge of the location of one of the devices, certain actions, such as targeted advertising, may be taken with respect to another device that is within a comparable location.
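
    The comparison step can be sketched as a similarity test between sound feature vectors. The abstract does not specify the features or the similarity measure, so the cosine similarity, the 0.9 threshold, and the names `cosine_similarity` and `estimate_location` are assumptions made for this example.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def estimate_location(target, references, threshold=0.9):
    """Return the label of the most similar reference device, or None.

    `references` maps a known location label to the sound features of a
    device at that location. The threshold is a hypothetical tuning value
    deciding when two environments count as "similar enough".
    """
    best_label, best_score = None, threshold
    for label, features in references.items():
        score = cosine_similarity(target, features)
        if score >= best_score:
            best_label, best_score = label, score
    return best_label


references = {"cafe": [1.0, 0.0, 0.0], "street": [0.0, 1.0, 0.0]}
match = estimate_location([0.9, 0.1, 0.0], references)
no_match = estimate_location([1.0, 1.0, 1.0], references)
```

    Returning None when no reference clears the threshold keeps the estimator from assigning a location on weak evidence.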

    Augmented reality processing based on eye capture in handheld device
    9.
    Granted patent
    Status: Active

    Publication No.: US08514295B2

    Publication date: 2013-08-20

    Application No.: US12971121

    Filing date: 2010-12-17

    Applicant: Te-Won Lee

    Inventor: Te-Won Lee

    IPC classification: H04N5/228

    Abstract: This disclosure describes techniques that can improve and possibly accelerate the generation of augmented reality (AR) information with respect to objects that appear in images of a video sequence. To do so, the techniques of this disclosure capture and use information about the eyes of a user of a video device. The video device may include two different cameras. A first camera is oriented to capture a sequence of images (e.g., video) outward from a user. A second camera is oriented to capture images of the eyes of the user when the first camera captures images outward from the user. The eyes of the user, as captured by one or more images of the second camera, may be used to generate a probability map, and the probability map may be used to prioritize objects in the first image for AR processing.
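
    The prioritization step can be sketched as ranking detected objects by how much of the probability map falls inside each object's bounding box. How the map is built from the eye images is not covered here; the grid representation, the box format, and the `prioritize` name are assumptions for illustration only.

```python
def prioritize(objects, prob_map):
    """Order object names by the gaze probability mass inside their boxes.

    `objects` maps a name to a half-open box (row0, col0, row1, col1);
    `prob_map` is a list of rows of per-cell probabilities. Both layouts
    are hypothetical.
    """
    def score(box):
        row0, col0, row1, col1 = box
        return sum(
            prob_map[r][c]
            for r in range(row0, row1)
            for c in range(col0, col1)
        )

    return sorted(objects, key=lambda name: score(objects[name]), reverse=True)


# Toy 2x2 map: the user is most likely looking at the bottom-right cell.
prob_map = [[0.0, 0.1],
            [0.2, 0.7]]
objects = {"sign": (0, 0, 1, 2), "face": (1, 0, 2, 2)}
order = prioritize(objects, prob_map)
```

    Objects at the front of the returned list would be handed to AR processing first.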

    MOBILE FAX MACHINE WITH IMAGE STITCHING AND DEGRADATION REMOVAL PROCESSING
    10.
    Patent application
    Status: Pending (published)

    Publication No.: US20130027757A1

    Publication date: 2013-01-31

    Application No.: US13194872

    Filing date: 2011-07-29

    IPC classification: H04N1/387

    Abstract: A method of scanning an image of a document with a portable electronic device includes interactively indicating, in substantially real time on a user interface of the portable electronic device, an instruction for capturing at least one portion of an image to enhance quality. The indication is in response to identifying degradation associated with the portion(s) of the image. The method also includes capturing the portion(s) of the image with the portable electronic device according to the instruction. The method further includes stitching the captured portion(s) of the image in place of a degraded portion of a reference image corresponding to the document, to create a corrected stitched image of the document.
