-
公开(公告)号:US20090001165A1
公开(公告)日:2009-01-01
申请号:US11772069
申请日:2007-06-29
申请人: Chunhui Zhang , Zhouchen Lin , Zhengyou Zhang , Shi Han
发明人: Chunhui Zhang , Zhouchen Lin , Zhengyou Zhang , Shi Han
IPC分类号: G06K7/10
CPC分类号: G06K7/1093 , G06T3/608 , G06T2207/20061
摘要: Systems and methods for 2-D barcode recognition are described. In one aspect, the systems and methods use a charge coupled camera capturing device to capture a digital image of a 3-D scene. The systems and methods evaluate the digital image to localize and segment a 2-D barcode from the digital image of the 3-D scene. The 2-D barcode is rectified to remove non-uniform lighting and correct any perspective distortion. The rectified 2-D barcode is divided into multiple uniform cells to generate a 2-D matrix array of symbols. A barcode processing application evaluates the 2-D matrix array of symbols to present data to the user.
摘要翻译: 描述了用于二维条形码识别的系统和方法。 在一个方面,所述系统和方法使用电荷耦合的摄像机捕捉设备来捕获3-D场景的数字图像。 系统和方法评估数字图像,以从3-D场景的数字图像中定位和分割二维条形码。 二维条形码整流,以消除不均匀的照明并纠正任何透视失真。 经整流的二维条形码被分成多个均匀的单元格,以产生符号的二维矩阵阵列。 条形码处理应用程序评估符号的二维矩阵数组以向用户呈现数据。
-
公开(公告)号:US20080195389A1
公开(公告)日:2008-08-14
申请号:US11674139
申请日:2007-02-12
申请人: Zhengyou Zhang , Amarnag Subramaya
发明人: Zhengyou Zhang , Amarnag Subramaya
IPC分类号: G10L15/00
摘要: A text-dependent speaker verification technique that uses a generic speaker-independent speech recognizer for robust speaker verification, and uses the acoustical model of a speaker-independent speech recognizer as a background model. Instead of using a likelihood ratio test (LRT) at the utterance level (e.g., the sentence level), which is typical of most speaker verification systems, the present text-dependent speaker verification technique uses weighted sum of likelihood ratios at the sub-unit level (word, tri-phone, or phone) as well as at the utterance level.
摘要翻译: 一种文本相关的扬声器验证技术,其使用一般的与扬声器无关的语音识别器进行强大的扬声器验证,并使用与扬声器无关的语音识别器的声学模型作为背景模型。 现在的文本相关说明者验证技术不是在大多数说话人验证系统的典型的话语级别(例如,句子级别)上使用似然比检验(LRT),而是使用子单元中的似然比加权和 水平(单词,三话电话或电话)以及话语水平。
-
133.
公开(公告)号:US07406303B2
公开(公告)日:2008-07-29
申请号:US11228710
申请日:2005-09-16
申请人: Li Deng , Zhengyou Zhang , Zicheng Liu , Amarnag Subramanya
发明人: Li Deng , Zhengyou Zhang , Zicheng Liu , Amarnag Subramanya
CPC分类号: G10L21/0208 , G10L2021/02165
摘要: A synthesized alternative sensor signal is produced from an alternative sensor signal. The synthesized alternative sensor signal is computed using vocal tract resonances estimated based on the alternative sensor signal, and using a waveform synthesis technique that converts the estimated vocal tract resonance sequence into a spectral magnitude sequence. The synthesized alternative sensor signal and the alternative sensor signal are used to estimate a clean speech value.
摘要翻译: 从替代传感器信号产生合成的替代传感器信号。 使用基于替代传感器信号估计的声道共振来计算合成的替代传感器信号,并且使用将估计的声道共振序列转换为频谱幅度序列的波形合成技术。 合成的替代传感器信号和替代传感器信号用于估计干净的语音值。
-
公开(公告)号:US20080154714A1
公开(公告)日:2008-06-26
申请号:US11614391
申请日:2006-12-21
申请人: Zicheng Liu , Philip A. Chou , Zhengyou Zhang
发明人: Zicheng Liu , Philip A. Chou , Zhengyou Zhang
CPC分类号: G01C21/30 , G06Q30/02 , G06Q30/0224 , G06Q30/0236
摘要: A technique for providing and receiving personalized e-coupons is presented. In general, the technique involves an e-coupon provider sending e-coupons to a user of a mobile communication device, such as a cellular telephone or PDA, which are personalized in various ways so as to make them attractive to the user. In one embodiment, the e-coupons are provided based on location information received from the mobile communication device. In another embodiment, the e-coupons are provided based on the user's purchasing history. The mobile communication device that receives e-coupons from the provider includes an e-coupon handler program to facilitate the procurement and receipt of the e-coupons. In general, the e-coupon handler receives e-coupons and displays them to the user on a display of the mobile communication device. The e-coupons could have been requested by the e-coupon handler, or pushed to it by the e-coupon provider.
摘要翻译: 提出了一种提供和接收个性化电子优惠券的技术。 通常,该技术涉及电子优惠券提供者向诸如蜂窝电话或PDA的移动通信设备的用户发送电子优惠券,其以各种方式被个性化以便使其对用户具有吸引力。 在一个实施例中,基于从移动通信设备接收的位置信息来提供电子优惠券。 在另一个实施例中,基于用户的购买历史来提供电子优惠券。 从提供商接收电子优惠券的移动通信设备包括电子优惠券处理程序,以便于采购和接收电子优惠券。 通常,电子优惠券处理程序接收电子优惠券并将其显示给移动通信设备的显示器上的用户。 电子优惠券可能已被电子优惠券处理者要求,或由电子优惠券提供商推送给该优惠券。
-
公开(公告)号:US07383181B2
公开(公告)日:2008-06-03
申请号:US10629278
申请日:2003-07-29
IPC分类号: G10L15/00
摘要: The present invention combines a conventional audio microphone with an additional speech sensor that provides a speech sensor signal based on an input. The speech sensor signal is generated based on an action undertaken by a speaker during speech, such as facial movement, bone vibration, throat vibration, throat impedance changes, etc. A speech detector component receives an input from the speech sensor and outputs a speech detection signal indicative of whether a user is speaking. The speech detector generates the speech detection signal based on the microphone signal and the speech sensor signal.
摘要翻译: 本发明将常规音频麦克风与基于输入提供语音传感器信号的附加话音传感器组合。 语音传感器信号基于语音中的扬声器在诸如面部运动,骨骼振动,喉部振动,喉部阻抗变化等中的动作而产生。语音检测器组件从语音传感器接收输入并输出语音检测 指示用户是否正在说话的信号。 语音检测器基于麦克风信号和语音传感器信号产生语音检测信号。
-
公开(公告)号:US07308115B2
公开(公告)日:2007-12-11
申请号:US11182629
申请日:2005-07-14
申请人: Zhengyou Zhang , Ying Shan
发明人: Zhengyou Zhang , Ying Shan
IPC分类号: G06K9/00
CPC分类号: G06T7/251 , G06T7/80 , G06T2207/10016 , G06T2207/30244
摘要: An incremental motion estimation system and process for estimating the camera pose parameters associated with each image of a long image sequence. Unlike previous approaches, which rely on point matches across three or more views, the present system and process also includes those points shared only by two views. The problem is formulated as a series of localized bundle adjustments in such a way that the estimated camera motions in the whole sequence are consistent with each other. The result of the inclusion of two-view matching points and the localized bundle adjustment approach is more accurate estimates of the camera pose parameters for each image in the sequence than previous incremental techniques, and providing an accuracy approaching that of global bundle adjustment techniques except with processing times about 100 to 700 times faster than the global approaches.
摘要翻译: 用于估计与长图像序列的每个图像相关联的相机姿态参数的增量运动估计系统和过程。 不同于以上三种或更多视图中依赖点匹配的方法,本系统和过程也包括仅由两个视图共享的点。 该问题被形成为一系列本地化的束调整,使得整个序列中估计的摄像机运动彼此一致。 包含双视点匹配点和局部束调整方法的结果是对序列中每个图像的相机姿态参数的估计值比以前的增量技术更准确,并提供接近全局束调整技术的精度,除了 处理时间比全球方法快100到700倍。
-
公开(公告)号:US07274388B2
公开(公告)日:2007-09-25
申请号:US11275456
申请日:2006-01-05
申请人: Zhengyou Zhang
发明人: Zhengyou Zhang
IPC分类号: H04N17/00
CPC分类号: H04N17/002
摘要: Calibration for a camera is achieved by receiving images of a calibration object whose geometry is one-dimension in space. The received images show the calibration object in several distinct positions. Calibration for the camera is then calculated based on the received images of the calibration object.
-
公开(公告)号:US20070172144A1
公开(公告)日:2007-07-26
申请号:US11340313
申请日:2006-01-26
申请人: Zhengyou Zhang , An Xu , Chunhui Zhang
发明人: Zhengyou Zhang , An Xu , Chunhui Zhang
IPC分类号: G06K9/40
CPC分类号: G06K9/40
摘要: A video clip is processed by selecting a plurality of video frames of the video clip. A plurality of the pixels of the selected video frames are modified to form modified video frames. The modification to each of the plurality of the pixels is based on the intensity of the pixel, a change in the intensity of the pixel from the corresponding pixel in at least one related video frame, and the intensity of the corresponding pixel. A second video clip is formed that comprises the modified video clips.
摘要翻译: 通过选择视频剪辑的多个视频帧来处理视频剪辑。 所选择的视频帧的多个像素被修改以形成修改的视频帧。 对多个像素中的每一个的修改基于像素的强度,来自至少一个相关视频帧中的对应像素的像素强度的变化以及相应像素的强度。 形成包括经修改的视频剪辑的第二视频剪辑。
-
139.
公开(公告)号:US20070150263A1
公开(公告)日:2007-06-28
申请号:US11317269
申请日:2005-12-23
IPC分类号: G10L21/00
CPC分类号: G10L21/0208 , G10L15/20
摘要: A frame of a speech signal is converted into the spectral domain to identify a plurality of frequency components and an energy value for the frame is determined. The plurality of frequency components is divided by the energy value for the frame to form energy-normalized frequency components. A model is then constructed from the energy-normalized frequency components and can be used for speech recognition and speech enhancement.
摘要翻译: 语音信号的帧被转换成频谱域以识别多个频率分量,并确定该帧的能量值。 将多个频率分量除以帧的能量值以形成能量归一化频率分量。 然后从能量归一化的频率分量构建模型,并可用于语音识别和语音增强。
-
公开(公告)号:US20070126755A1
公开(公告)日:2007-06-07
申请号:US11565596
申请日:2006-11-30
申请人: Zhengyou Zhang , Ross Cutler , Zicheng Liu , Anoop Gupta , Li-wei He
发明人: Zhengyou Zhang , Ross Cutler , Zicheng Liu , Anoop Gupta , Li-wei He
IPC分类号: G09G5/00
CPC分类号: G06F17/30843 , G06Q10/1095 , G11B27/105 , G11B27/28 , G11B27/34 , H04N5/77 , H04N9/806
摘要: A system that captures both whiteboard content and audio signals of a meeting using a digital camera and a microphone. The system can be retrofit to any existing whiteboard. It computes the time stamps of pen strokes on the whiteboard by analyzing the sequence of captured snapshots. It also automatically produces a set of key frames representing all the written content on the whiteboard before each erasure. The whiteboard content serves as a visual index to efficiently browse the audio meeting. The system not only captures the whiteboard content, but also helps the users to view and manage the captured meeting content efficiently and securely.
摘要翻译: 使用数码相机和麦克风捕获会议的白板内容和音频信号的系统。 该系统可以改装任何现有的白板。 它通过分析捕获的快照的顺序来计算白板上笔划的时间戳。 它也会在每次擦除之前自动产生代表白板上所有写入内容的一组关键帧。 白板内容作为视觉索引,有效地浏览音频会议。 该系统不仅可以捕获白板内容,还可以帮助用户有效,安全地查看和管理所捕获的会议内容。
-
-
-
-
-
-
-
-
-