Method and apparatus for reducing noise corruption from an alternative sensor signal during multi-sensory speech enhancement
    73.
    Granted invention patent
    Method and apparatus for reducing noise corruption from an alternative sensor signal during multi-sensory speech enhancement (in force)

    Publication number: US07590529B2

    Publication date: 2009-09-15

    Application number: US11050936

    Filing date: 2005-02-04

    IPC classification: G10L21/02 G10L15/20

    Abstract: A method and apparatus classify a portion of an alternative sensor signal as either containing noise or not containing noise. The portions of the alternative sensor signal that are classified as containing noise are not used to estimate a portion of a clean speech signal and the channel response associated with the alternative sensor. The portions of the alternative sensor signal that are classified as not containing noise are used to estimate a portion of a clean speech signal and the channel response associated with the alternative sensor.
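
The gating idea in the abstract above can be illustrated with a minimal Python sketch. Everything specific here is an assumption made for illustration, not the patented method: the energy-threshold classifier, the frame layout, the scalar least-squares gain used as a stand-in for the channel response, and the use of the air-microphone samples as a proxy for clean speech. The only point shown is that frames classified as containing noise are skipped when the estimate is accumulated.

```python
from typing import Callable, Iterable, Sequence, Tuple

# Hypothetical frame layout: (alternative-sensor samples, air-microphone samples).
Frame = Tuple[Sequence[float], Sequence[float]]


def frame_energy(samples: Sequence[float]) -> float:
    """Mean squared amplitude of one frame."""
    return sum(s * s for s in samples) / len(samples)


def make_energy_classifier(max_energy: float) -> Callable[[Sequence[float]], bool]:
    """Assumed classifier: a frame 'contains noise' if its energy exceeds max_energy."""
    def contains_noise(alt: Sequence[float]) -> bool:
        return frame_energy(alt) > max_energy
    return contains_noise


def estimate_channel_gain(
    frames: Iterable[Frame],
    contains_noise: Callable[[Sequence[float]], bool],
) -> Tuple[float, int]:
    """Least-squares scalar gain from air-microphone to alternative sensor,
    accumulated only over frames classified as not containing noise.
    Returns (gain, number of frames used)."""
    num = 0.0
    den = 0.0
    used = 0
    for alt, air in frames:
        if contains_noise(alt):
            continue  # noisy frames contribute nothing to the estimate
        num += sum(a * m for a, m in zip(alt, air))
        den += sum(m * m for m in air)
        used += 1
    gain = num / den if den > 0.0 else 0.0
    return gain, used
```

A caller would build a classifier once, for example `make_energy_classifier(5.0)`, and pass it together with the frame sequence to `estimate_channel_gain`.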

    EFFICIENT IMAGE DISPLAYING
    74.
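    Invention patent application
    EFFICIENT IMAGE DISPLAYING (under examination, published)

    Publication number: US20090220165A1

    Publication date: 2009-09-03

    Application number: US12039741

    Filing date: 2008-02-29

    IPC classification: G06K9/42 G06K9/32

    CPC classification: G06F16/9577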

    Abstract: Efficient image display on a display screen (e.g., in terms of number, space, resolution, and/or distortion) is facilitated by implementing one or more specialized select and pack routines for images. That is, representative images are selected from an image database, based on desired resolution and distortion, then resized and packed into a display arrangement that enhances use of display screen space. This allows, for example, images to be sent to a user from an image database more quickly, with more desirable resolution, and with less distortion than with traditional display techniques.

    MANAGEMENT OF SPLIT AUDIO/VIDEO STREAMS
    75.
    Invention patent application
    MANAGEMENT OF SPLIT AUDIO/VIDEO STREAMS (in force)

    Publication number: US20090172779A1

    Publication date: 2009-07-02

    Application number: US11968194

    Filing date: 2008-01-02

    IPC classification: G06F21/00

    CPC classification: G06F21/6209

    Abstract: Described herein is a method that includes receiving multiple requests for access to an exposed media object, wherein the exposed media object represents a live media stream that is being generated by a media source. The method also includes receiving data associated with each entity that provided a request; determining, for each entity, whether that entity is authorized to access the media stream based at least in part upon the received data; and splitting the media stream into multiple media streams, wherein the number of media streams corresponds to the number of authorized entities. The method also includes automatically applying at least one policy to at least one of the split media streams based at least in part upon the received data.
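
The steps in the abstract above (collect requests, authorize each entity, split one stream per authorized entity, attach policies) can be sketched as follows. This is a minimal illustration under stated assumptions: the `Request` and `SplitStream` types, the credential dictionary, and the two callbacks are all hypothetical names, and no real media API is involved.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, Iterable, List


@dataclass
class Request:
    entity: str                  # who is asking for the exposed media object
    credentials: Dict[str, str]  # hypothetical data received with the request


@dataclass
class SplitStream:
    entity: str                  # the authorized entity this copy is destined for
    policies: List[str] = field(default_factory=list)


def split_stream(
    requests: Iterable[Request],
    is_authorized: Callable[[Request], bool],
    choose_policies: Callable[[Request], List[str]],
) -> List[SplitStream]:
    """Create one split stream per authorized entity and attach the policies
    selected from the data received with that entity's request."""
    streams: List[SplitStream] = []
    for req in requests:
        if not is_authorized(req):
            continue  # unauthorized entities get no stream
        streams.append(SplitStream(req.entity, choose_policies(req)))
    return streams
```

The number of returned streams equals the number of authorized entities, which matches the correspondence the abstract describes; what a policy actually does to a stream is left open here.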

    Method and apparatus for multi-sensory speech enhancement
    76.
    Granted invention patent
    Method and apparatus for multi-sensory speech enhancement (in force)

    Publication number: US07447630B2

    Publication date: 2008-11-04

    Application number: US10724008

    Filing date: 2003-11-26

    IPC classification: G10L21/02

    Abstract: A method and system use an alternative sensor signal received from a sensor other than an air conduction microphone to estimate a clean speech value. The estimation uses the alternative sensor signal either alone or in conjunction with the air conduction microphone signal. The clean speech value is estimated without using a model trained from noisy training data collected from an air conduction microphone. Under one embodiment, correction vectors are added to a vector formed from the alternative sensor signal in order to form a filter, which is applied to the air conduction microphone signal to produce the clean speech estimate. In other embodiments, the pitch of a speech signal is determined from the alternative sensor signal and is used to decompose an air conduction microphone signal. The decomposed signal is then used to determine a clean signal estimate.

    ENERGY-BASED SOUND SOURCE LOCALIZATION AND GAIN NORMALIZATION
    77.
    Invention patent application
    ENERGY-BASED SOUND SOURCE LOCALIZATION AND GAIN NORMALIZATION (in force)

    Publication number: US20080170717A1

    Publication date: 2008-07-17

    Application number: US11623643

    Filing date: 2007-01-16

    IPC classification: H04R3/00

    Abstract: An energy-based technique estimates the positions of people speaking, using an ad hoc network of microphones. The technique does not require accurate synchronization of the microphones. In addition, a technique is presented that normalizes the gains of the microphones based on people's speech, which allows the various audio channels from the ad hoc microphone network to be aggregated into a single stream for audio conferencing. The technique is invariant to the speakers' volumes, which makes the system easy to deploy in practice.
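
One way to see why an energy-based approach can be invariant to speaker volume is sketched below. The propagation model is an assumption made for illustration (received amplitude proportional to microphone gain times source volume divided by distance); the abstract does not state a model, and the function names are hypothetical. Under that model, the ratio of energies between any two microphones no longer depends on how loudly the person speaks.

```python
from typing import List, Sequence


def received_energy(gain: float, volume: float, distance: float) -> float:
    """Assumed model: amplitude = gain * volume / distance, energy = amplitude squared."""
    return (gain * volume / distance) ** 2


def energy_ratios(energies: Sequence[float]) -> List[float]:
    """Energy at each microphone relative to the first (reference) microphone."""
    reference = energies[0]
    return [e / reference for e in energies]


def ratios_for_volume(
    volume: float, gains: Sequence[float], distances: Sequence[float]
) -> List[float]:
    """Energy ratios observed across the microphones for one speaker volume."""
    energies = [received_energy(g, volume, d) for g, d in zip(gains, distances)]
    return energy_ratios(energies)
```

Calling `ratios_for_volume` with a quiet and a loud volume but the same gains and distances yields the same ratios up to floating-point error, because the volume term cancels in every ratio.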

    System and method for head size equalization in 360 degree panoramic images
    78.
    Granted invention patent
    System and method for head size equalization in 360 degree panoramic images (in force)

    Publication number: US07327899B2

    Publication date: 2008-02-05

    Application number: US11465703

    Filing date: 2006-08-18

    IPC classification: G06K9/36 G09G5/00 H04N7/00

    Abstract: A real-time approximately 360 degree image correction system and a method for alleviating distortion and perception problems in images captured by omni-directional cameras. In general, the real-time panoramic image correction method generates a warp table from pixel coordinates of a panoramic image and applies the warp table to the panoramic image to create a corrected panoramic image. The corrections are performed using a parametric class of warping functions that include Spatially Varying Uniform (SVU) scaling functions. The SVU scaling functions and scaling factors are used to perform vertical scaling and horizontal scaling on the panoramic image pixel coordinates. A horizontal distortion correction is performed using the SVU scaling functions with at least two different scaling factors. This processing generates a warp table that can be applied to the panoramic image to yield the corrected panoramic image. In one embodiment the warp table is concatenated with a stitching table used to create the panoramic image.
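
The warp-table mechanism described above (precompute a source coordinate for every output pixel, then apply the table to the image) can be sketched in a few lines of Python. This is a deliberately simplified illustration, not the SVU functions from the patent: it assumes a hypothetical per-column scale factor, performs vertical scaling only about the image's vertical center, and uses nearest-neighbor lookup.

```python
import math
from typing import Callable, List, Tuple

WarpTable = List[List[Tuple[int, int]]]  # table[y][x] = (source_x, source_y)


def build_warp_table(
    width: int, height: int, scale_for_column: Callable[[int], float]
) -> WarpTable:
    """For each output pixel, record which source pixel it should copy.
    scale_for_column(x) > 1 magnifies column x vertically about the center row."""
    center = (height - 1) / 2.0
    table: WarpTable = []
    for y in range(height):
        row = []
        for x in range(width):
            scale = scale_for_column(x)
            src_y = center + (y - center) / scale
            # Round half up, then clamp to the valid row range.
            src_row = min(max(math.floor(src_y + 0.5), 0), height - 1)
            row.append((x, src_row))
        table.append(row)
    return table


def apply_warp_table(image: List[List[int]], table: WarpTable) -> List[List[int]]:
    """Produce the corrected image by looking up each output pixel's source pixel."""
    return [[image[sy][sx] for (sx, sy) in row] for row in table]
```

Because the table depends only on pixel coordinates and the scaling function, it can be built once and reused for every frame, which is what makes a table-based correction attractive for real-time use.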

    Multi-modal device power/mode management
    79.
    Granted invention patent
    Multi-modal device power/mode management (in force)

    Publication number: US07319908B2

    Publication date: 2008-01-15

    Application number: US11261108

    Filing date: 2005-10-28

    IPC classification: G05B11/01 G05D3/12

    Abstract: A system facilitates managing resources (e.g., functionality, services) based at least in part upon an established context. More particularly, a context determination component can be employed to establish a context by processing sensor inputs or learning/inferring a user action/preference. Once the context is established via the context determination component, a power/mode management component can be employed to activate and/or mask resources in accordance with the established context. The power and mode management of the device can extend the life of a power source (e.g., battery) and mask functionality in accordance with a user and/or device state.
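
The two components named in the abstract above can be sketched as a pair of small functions: one that establishes a context from sensor inputs, and one that decides which resources to activate and which to mask. The sensor names, thresholds, contexts, and resource lists below are all hypothetical examples chosen for illustration; the abstract does not specify any of them.

```python
from typing import Dict, Set, Tuple

# Hypothetical device resources and the subset kept active in each context.
ALL_RESOURCES: Set[str] = {
    "display", "vibration", "gps", "speaker", "microphone", "ringer",
}
RESOURCES_BY_CONTEXT: Dict[str, Set[str]] = {
    "driving": {"gps", "speaker", "microphone"},
    "in_meeting": {"display", "vibration"},
    "idle": set(),
}


def determine_context(sensors: Dict[str, float]) -> str:
    """Context determination: map raw sensor inputs to a named context.
    The rules here are illustrative assumptions only."""
    if sensors.get("speed_kmh", 0.0) > 20.0:
        return "driving"
    if sensors.get("calendar_busy", 0.0) >= 1.0:
        return "in_meeting"
    return "idle"


def plan_resources(context: str) -> Tuple[Set[str], Set[str]]:
    """Power/mode management: return (resources to activate, resources to mask)."""
    active = RESOURCES_BY_CONTEXT.get(context, set())
    masked = ALL_RESOURCES - active
    return active, masked
```

Masking everything outside the active set is one simple policy for saving power; a real device would likely also weigh user preferences learned over time, which this sketch leaves out.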

    System and method for whiteboard and audio capture
    80.
    Granted invention patent
    System and method for whiteboard and audio capture (in force)

    Publication number: US07260257B2

    Publication date: 2007-08-21

    Application number: US10178443

    Filing date: 2002-06-19

    IPC classification: G06K9/00

    Abstract: A system captures both the whiteboard content and the audio signals of a meeting using a digital camera and a microphone. The system can be retrofitted to any existing whiteboard. It computes the time stamps of pen strokes on the whiteboard by analyzing the sequence of captured snapshots. It also automatically produces a set of key frames representing all the written content on the whiteboard before each erasure. The whiteboard content serves as a visual index for efficiently browsing the meeting audio. The system not only captures the whiteboard content, but also helps users view and manage the captured meeting content efficiently and securely.
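
The time-stamping step in the abstract above can be illustrated with a minimal sketch: scan the snapshots in capture order and record, for each whiteboard cell, the time of the first snapshot in which ink appears there. The grid-of-booleans representation and the function name are assumptions made for illustration, and the sketch ignores erasures and key-frame extraction entirely.

```python
from typing import Dict, List, Sequence, Tuple

Grid = Sequence[Sequence[bool]]  # True where a cell contains ink (assumed representation)


def stroke_timestamps(
    snapshots: Sequence[Tuple[float, Grid]],
) -> Dict[Tuple[int, int], float]:
    """Map each (row, column) cell to the capture time of the first snapshot
    in which that cell contains ink. Snapshots must be in capture order."""
    first_seen: Dict[Tuple[int, int], float] = {}
    for time, grid in snapshots:
        for r, row in enumerate(grid):
            for c, has_ink in enumerate(row):
                if has_ink and (r, c) not in first_seen:
                    first_seen[(r, c)] = time
    return first_seen
```

With such a mapping, selecting a stroke on the captured whiteboard image could be used to seek the recorded audio to the moment the stroke was written, which is the visual-index use the abstract mentions.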
