TOUCHLESS SENSING AND GESTURE RECOGNITION USING CONTINUOUS WAVE ULTRASOUND SIGNALS
    1.
    Patent application
    Status: In force

    Publication No.: US20120001875A1

    Publication date: 2012-01-05

    Application No.: US12885151

    Filing date: 2010-09-17

    IPC classification: G06F3/043

    Abstract: The embodiments provide systems and methods for touchless sensing and gesture recognition using continuous wave sound signals. Continuous wave sound, such as ultrasound, emitted by a transmitter may reflect from an object, and be received by one or more sound receivers. Sound signals may be temporally encoded. Received sound signals may be processed to determine a channel impulse response or calculate time of flight. Determined channel impulse responses may be processed to extract recognizable features or angles. Extracted features may be compared to a database of features to identify a user input gesture associated with the matched feature. Angles of channel impulse response curves may be associated with an input gesture. Time of flight values from each receiver may be used to determine coordinates of the reflecting object. Embodiments may be implemented as part of a graphical user interface. Embodiments may be used to determine a location of an emitter.
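
The abstract says that sound signals may be temporally encoded and that received signals may be processed to calculate time of flight. The following is a minimal sketch of one way this could be done, assuming a known pseudo-random +/-1 code and a single dominant echo: the received samples are cross-correlated with the code, and the lag of the strongest peak is taken as the delay. The function names, the 64-chip code, the 192 kHz sample rate, and the 343 m/s speed of sound are illustrative assumptions, not details from the patent.

```python
import random

SPEED_OF_SOUND_M_S = 343.0  # assumed speed of sound in air at about 20 degrees C


def cross_correlate(received, code):
    """Slide the known code over the received samples; one score per lag."""
    n_lags = len(received) - len(code) + 1
    return [
        sum(received[lag + i] * code[i] for i in range(len(code)))
        for lag in range(n_lags)
    ]


def estimate_time_of_flight(received, code, sample_rate_hz):
    """Return (delay_s, path_length_m) for the strongest echo of the code."""
    scores = cross_correlate(received, code)
    best_lag = max(range(len(scores)), key=lambda lag: scores[lag])
    delay_s = best_lag / sample_rate_hz
    return delay_s, delay_s * SPEED_OF_SOUND_M_S


# Synthetic check: the code comes back attenuated, 96 samples after emission.
rng = random.Random(0)
code = [rng.choice((-1.0, 1.0)) for _ in range(64)]
true_lag = 96
received = [0.0] * 400
for i, chip in enumerate(code):
    received[true_lag + i] += 0.5 * chip

sample_rate_hz = 192_000
delay_s, path_m = estimate_time_of_flight(received, code, sample_rate_hz)
print(f"delay = {delay_s * 1e6:.1f} us, path length = {path_m:.4f} m")
```

The path length here is the full transmitter-to-object-to-receiver distance. Turning several such values, one per receiver, into object coordinates requires the receiver geometry, which the abstract does not specify.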

    Touchless sensing and gesture recognition using continuous wave ultrasound signals
    2.
    Granted patent
    Status: In force

    Publication No.: US08907929B2

    Publication date: 2014-12-09

    Application No.: US12885151

    Filing date: 2010-09-17

    Abstract: The embodiments provide systems and methods for touchless sensing and gesture recognition using continuous wave sound signals. Continuous wave sound, such as ultrasound, emitted by a transmitter may reflect from an object, and be received by one or more sound receivers. Sound signals may be temporally encoded. Received sound signals may be processed to determine a channel impulse response or calculate time of flight. Determined channel impulse responses may be processed to extract recognizable features or angles. Extracted features may be compared to a database of features to identify a user input gesture associated with the matched feature. Angles of channel impulse response curves may be associated with an input gesture. Time of flight values from each receiver may be used to determine coordinates of the reflecting object. Embodiments may be implemented as part of a graphical user interface. Embodiments may be used to determine a location of an emitter.

    SYSTEM AND METHOD FOR PROVIDING CONFERENCE INFORMATION
    3.
    Patent application
    Status: Under examination (published)

    Publication No.: US20120142324A1

    Publication date: 2012-06-07

    Application No.: US13289437

    Filing date: 2011-11-04

    IPC classification: H04M3/42

    Abstract: A method for providing information for a conference at one or more locations is disclosed. One or more mobile devices monitor one or more starting requirements of the conference and transmit input sound information to a server when the one or more starting requirements of the conference are detected. The one or more starting requirements may include a starting time of the conference, a location of the conference, and/or acoustic characteristics of a conference environment. The server generates conference information based on the input sound information from each mobile device and transmits the conference information to each mobile device. The conference information may include information on attendees, a current speaker among the attendees, an arrangement of the attendees, and/or a meeting log of attendee participation at the conference.

    System and method for speech processing using independent component analysis under stability constraints
    4.
    Granted patent
    Status: In force

    Publication No.: US07383178B2

    Publication date: 2008-06-03

    Application No.: US10537985

    Filing date: 2003-12-11

    IPC classification: G10L21/02

    CPC classification: G10L21/0272

    Abstract: A system and method for separating a mixture of audio signals into desired audio signals (430) (e.g., speech) and a noise signal (440) are disclosed. Microphones (310, 320) are positioned to receive the mixed audio signals, and an independent component analysis (ICA) process (212) processes the sound mixture using stability constraints. The ICA process (508) uses predefined characteristics of the desired speech signal to identify and isolate a target sound signal (430). Filter coefficients are adapted with a learning rule, and filter weight update dynamics are stabilized to assist convergence to a stable separated ICA signal result. The separated signals may be peripherally processed to further reduce noise effects using post-processing (214) and pre-processing (220, 230) techniques and information. The proposed system is designed to be easily adaptable for implementation on DSP units or CPUs in audio communication hardware environments.

    Separation of target acoustic signals in a multi-transducer arrangement
    5.
    Granted patent
    Status: In force

    Publication No.: US07366662B2

    Publication date: 2008-04-29

    Application No.: US11463376

    Filing date: 2006-08-09

    IPC classification: G10L21/02

    Abstract: The present invention provides a process for separating a good-quality information signal from a noisy acoustic environment. The separation process uses a set of at least two spaced-apart transducers to capture noise and information components. The transducer signals, which have both a noise and an information component, are received into a separation process. The separation process generates one channel that is substantially only noise, and another channel that is a combination of noise and information. An identification process is used to identify which channel has the information component. The noise signal is then used to set process characteristics that are applied to the combination signal to efficiently reduce or eliminate the noise component. In this way, the noise is effectively removed from the combination signal to generate a good-quality information signal. The information signal may be, for example, a speech signal, a seismic signal, a sonar signal, or other acoustic signal.

    Method and apparatus for efficiently encoding chromatic images using non-orthogonal basis functions
    6.
    Granted patent
    Status: Lapsed

    Publication No.: US07286712B2

    Publication date: 2007-10-23

    Application No.: US11086802

    Filing date: 2005-03-21

    IPC classification: G06K9/36, G06K9/46

    Abstract: A method and apparatus are disclosed for efficiently encoding images using a set of non-orthogonal basis functions, thereby allowing reduction of file size, shorter transmission time, and improved accuracy. The non-orthogonal basis functions include homogeneous color basis functions, luminance-encoding basis functions that have luminance edges, and chromatic basis functions that exhibit color opponency. Some of the basis functions are non-orthogonal with respect to each other. Using these basis functions, a source vector is calculated to provide a number of coefficients, each coefficient associated with one basis function. The source vector is compressed by selecting a subset of the calculated coefficients, thereby providing an encoded vector. Because the method is highly efficient, the image data is substantially represented by a small number of coefficients. In some embodiments, the non-orthogonal basis functions include two or more classes. A wavelet approach can also be utilized.
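
The encoding step described in the abstract (compute one coefficient per basis function, then keep only a subset) can be illustrated with a toy example. This is a minimal sketch under stated assumptions: the three 3-element basis vectors are made up for illustration, the coefficients are found by solving a small linear system, and the subset is chosen by largest magnitude. The patent's actual basis functions and selection rule are not given in the abstract, and the names `solve`, `encode`, and `decode` are hypothetical.

```python
def solve(matrix, rhs):
    """Solve matrix @ x = rhs by Gauss-Jordan elimination with partial pivoting."""
    n = len(rhs)
    aug = [list(row) + [value] for row, value in zip(matrix, rhs)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for row in range(n):
            if row != col:
                factor = aug[row][col] / aug[col][col]
                aug[row] = [a - factor * b for a, b in zip(aug[row], aug[col])]
    return [aug[i][n] / aug[i][i] for i in range(n)]


def encode(basis_functions, pixels, keep):
    """Compute one coefficient per basis function, then keep the `keep` largest."""
    n = len(pixels)
    # Build the matrix whose columns are the basis functions.
    matrix = [[basis_functions[j][i] for j in range(n)] for i in range(n)]
    coeffs = solve(matrix, pixels)
    kept = sorted(range(n), key=lambda j: abs(coeffs[j]), reverse=True)[:keep]
    return {j: coeffs[j] for j in sorted(kept)}


def decode(basis_functions, encoded, size):
    """Rebuild an approximation from the retained coefficients only."""
    out = [0.0] * size
    for j, coeff in encoded.items():
        for i in range(size):
            out[i] += coeff * basis_functions[j][i]
    return out


# Hypothetical basis: the first and third vectors are not orthogonal
# (their dot product is 2), but the three are linearly independent.
basis = [
    [1.0, 1.0, 1.0],   # flat (homogeneous) component
    [1.0, 0.0, -1.0],  # edge-like component
    [1.0, 1.0, 0.0],
]
pixels = [5.1, 5.0, 4.9]  # mostly flat, with a faint gradient

encoded = encode(basis, pixels, keep=1)
approx = decode(basis, encoded, len(pixels))
print(encoded, approx)
```

Keeping a single coefficient here discards the faint gradient and retains the flat component, which is the trade-off the abstract describes: a small number of coefficients substantially represents the data.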

    Separation of target acoustic signals in a multi-transducer arrangement
    7.
    Patent application
    Status: In force

    Publication No.: US20070038442A1

    Publication date: 2007-02-15

    Application No.: US11463376

    Filing date: 2006-08-09

    IPC classification: G10L15/20

    Abstract: The present invention provides a process for separating a good-quality information signal from a noisy acoustic environment. The separation process uses a set of at least two spaced-apart transducers to capture noise and information components. The transducer signals, which have both a noise and an information component, are received into a separation process. The separation process generates one channel that is substantially only noise, and another channel that is a combination of noise and information. An identification process is used to identify which channel has the information component. The noise signal is then used to set process characteristics that are applied to the combination signal to efficiently reduce or eliminate the noise component. In this way, the noise is effectively removed from the combination signal to generate a good-quality information signal. The information signal may be, for example, a speech signal, a seismic signal, a sonar signal, or other acoustic signal.

    Camera OCR with context information
    8.
    Granted patent
    Status: In force

    Publication No.: US09082035B2

    Publication date: 2015-07-14

    Application No.: US13450016

    Filing date: 2012-04-18

    Abstract: Embodiments of the invention describe methods and apparatus for performing context-sensitive OCR. A device obtains an image using a camera coupled to the device. The device identifies a portion of the image comprising a graphical object. The device infers a context associated with the image and selects a group of graphical objects based on the context associated with the image. Improved OCR results are generated using the group of graphical objects. Input from various sensors including microphone, GPS, and camera, along with user inputs including voice, touch, and user usage patterns may be used in inferring the user context and selecting dictionaries that are most relevant to the inferred contexts.

    Voice activity detection based on plural voice activity detectors
    9.
    Granted patent
    Status: In force

    Publication No.: US08626498B2

    Publication date: 2014-01-07

    Application No.: US12711943

    Filing date: 2010-02-24

    Applicant: Te-Won Lee

    Inventor: Te-Won Lee

    IPC classification: G10L25/93

    CPC classification: G10L25/78

    Abstract: A voice activity detection (VAD) system includes a first voice activity detector, a second voice activity detector and control logic. The first voice activity detector is included in a device and produces a first VAD signal. The second voice activity detector is located externally to the device and produces a second VAD signal. The control logic combines the first and second VAD signals into a VAD output signal. Voice activity may be detected based on the VAD output signal. The second VAD signal can be represented as a flag included in a packet containing digitized audio. The packet can be transmitted to the device from the externally located VAD over a wireless link.
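
The flag-in-packet idea and the control logic in this abstract can be sketched as follows. This is only an illustration under stated assumptions: the packet layout (one flag byte, a 16-bit sample count, then 16-bit PCM samples) and the AND/OR combination modes are hypothetical choices, since the abstract does not say how the packet is laid out or how the two VAD signals are combined.

```python
import struct
from dataclasses import dataclass


@dataclass
class AudioPacket:
    vad_flag: bool   # second (external) VAD decision carried with the audio
    samples: tuple   # digitized audio, 16-bit signed PCM


def pack_packet(vad_flag, samples):
    """Serialize as: 1-byte flag, 2-byte sample count, then the samples."""
    return struct.pack(f"<BH{len(samples)}h", int(vad_flag), len(samples), *samples)


def unpack_packet(payload):
    """Parse a payload produced by pack_packet."""
    flag, count = struct.unpack_from("<BH", payload, 0)
    samples = struct.unpack_from(f"<{count}h", payload, 3)
    return AudioPacket(bool(flag), samples)


def combine_vad(internal_vad, external_vad, mode="and"):
    """Control logic: merge the two VAD signals into one output signal."""
    if mode == "and":   # both detectors must agree that voice is present
        return internal_vad and external_vad
    if mode == "or":    # either detector is enough
        return internal_vad or external_vad
    raise ValueError(f"unknown mode: {mode}")


# The external detector (for example, in a headset) flags voice activity.
payload = pack_packet(True, [100, -200, 300])
packet = unpack_packet(payload)

internal_vad = False  # the device's own detector heard no voice
print(combine_vad(internal_vad, packet.vad_flag, mode="and"))
print(combine_vad(internal_vad, packet.vad_flag, mode="or"))
```

An AND rule favors fewer false alarms, while an OR rule favors fewer missed detections. Which one suits a given device is a design decision outside what the abstract states.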

    Mobile device location estimation using environmental information
    10.
    Granted patent
    Status: In force

    Publication No.: US08606293B2

    Publication date: 2013-12-10

    Application No.: US12898647

    Filing date: 2010-10-05

    IPC classification: H04M3/00, H04W24/00

    Abstract: Estimating a location of a mobile device is performed by comparing environmental information, such as environmental sound, associated with the mobile device with that of other devices to determine if the environmental information is similar enough to conclude that the mobile device is in a location comparable to that of another device. The devices may be in comparable locations in that they are in geographically similar locations (e.g., same store, same street, same city, etc.). The devices may be in comparable locations even though they are located in geographically dissimilar locations because the environmental information of the two locations demonstrates that the devices are in the same perceived location. With knowledge that the devices are in comparable locations, and with knowledge of the location of one of the devices, certain actions, such as targeted advertising, may be taken with respect to another device that is within a comparable location.
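
The comparison step in this abstract can be sketched as a similarity test between per-device sound feature vectors. This is a minimal illustration, assuming each device reports a short numeric feature vector and that cosine similarity with a fixed threshold decides whether two environments are "similar enough". The feature values, the 0.9 threshold, and the function names are hypothetical, and the abstract does not name a similarity measure.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two feature vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def infer_location(target_features, other_devices, known_locations, threshold=0.9):
    """Return the known location of the most similar device, or None."""
    best_device = None
    best_score = threshold
    for device_id, features in other_devices.items():
        score = cosine_similarity(target_features, features)
        if score >= best_score and device_id in known_locations:
            best_device, best_score = device_id, score
    return known_locations[best_device] if best_device is not None else None


# Hypothetical environmental-sound features reported by three devices.
target = [0.90, 0.10, 0.40]
others = {
    "phone_a": [0.88, 0.12, 0.41],  # similar ambience to the target
    "phone_b": [0.10, 0.90, 0.20],  # very different ambience
}
known_locations = {"phone_a": "cafe", "phone_b": "train station"}

location = infer_location(target, others, known_locations)
print(location)
```

If no device clears the threshold, the function returns `None` rather than guessing, which matches the abstract's condition that the environments be similar enough before any conclusion is drawn.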
