Automatic detection of panoramic camera position and orientation table parameters
    1.
    发明授权
    Automatic detection of panoramic camera position and orientation table parameters 有权
    自动检测全景相机的位置和方位表参数

    公开(公告)号:US07630571B2

    公开(公告)日:2009-12-08

    申请号:US11227046

    申请日:2005-09-15

    IPC分类号: G06K9/40 G06K9/48

    摘要: A panoramic camera is configured to automatically determine parameters of a table upon which the camera is situated as well as positional information of the camera relative to the table. In an initialization stage, table edges are detected to create an edge map. A Hough transformation-like symmetry voting operation is performed to clean up the edge map and to determine camera offset, camera orientation and camera tilt. The table is then fit to a table model to determine table parameters. In an operational stage, table edges are detected to create an edge map and the table model is fit to the edge map. The output can then be used for further panoramic image processing such as head size normalization, zooming, compensation for camera movement, etc.

    摘要翻译: 全景相机被配置为自动确定相机所在的表的参数以及相机相对于表的位置信息。 在初始化阶段,检测表边缘以创建边缘图。 执行霍夫变换对称投票操作来清理边缘图并确定相机偏移,相机方向和相机倾斜度。 然后将表适合于表模型以确定表参数。 在操作阶段,检测表边缘以创建边缘图,并且表模型适合边缘图。 然后可以将输出用于进一步的全景图像处理,例如头部尺寸归一化,变焦,相机移动补偿等。

    Audio transforms in connection with multiparty communication
    3.
    发明授权
    Audio transforms in connection with multiparty communication 有权
    与多方通信有关的音频转换

    公开(公告)号:US08340267B2

    公开(公告)日:2012-12-25

    申请号:US12365949

    申请日:2009-02-05

    IPC分类号: H04M3/42

    摘要: The claimed subject matter relates to an architecture that can preprocess audio portions of communications in order to enrich multiparty communication sessions or environments. In particular, the architecture can provide both a public channel for public communications that are received by substantially all connected parties and can further provide a private channel for private communications that are received by a selected subset of all connected parties. Most particularly, the architecture can apply an audio transform to communications that occur during the multiparty communication session based upon a target audience of the communication. By way of illustration, the architecture can apply a whisper transform to private communications, an emotion transform based upon relationships, an ambience or spatial transform based upon physical locations, or a pace transform based upon lack of presence.

    摘要翻译: 所要求保护的主题涉及可以预处理通信的音频部分以便丰富多方通信会话或环境的架构。 特别地,该架构可以提供公共通信的公共信道,其由基本上所有连接的各方接收,并且可以进一步提供由所有连接方的所选子集接收的专用通信的专用信道。 特别地,架构可以基于通信的目标受众对音频转换应用于在多方通信会话期间发生的通信。 作为说明,架构可以对私人通信应用耳语转换,基于关系,基于物理位置的氛围或空间变换或基于缺乏存在的步调变换的情感变换。

    AUDIO TRANSFORMS IN CONNECTION WITH MULTIPARTY COMMUNICATION
    4.
    发明申请
    AUDIO TRANSFORMS IN CONNECTION WITH MULTIPARTY COMMUNICATION 有权
    与多媒体通信相关的音频转换

    公开(公告)号:US20100195812A1

    公开(公告)日:2010-08-05

    申请号:US12365949

    申请日:2009-02-05

    IPC分类号: H04M3/42 G10L11/00

    摘要: The claimed subject matter relates to an architecture that can preprocess audio portions of communications in order to enrich multiparty communication sessions or environments. In particular, the architecture can provide both a public channel for public communications that are received by substantially all connected parties and can further provide a private channel for private communications that are received by a selected subset of all connected parties. Most particularly, the architecture can apply an audio transform to communications that occur during the multiparty communication session based upon a target audience of the communication. By way of illustration, the architecture can apply a whisper transform to private communications, an emotion transform based upon relationships, an ambience or spatial transform based upon physical locations, or a pace transform based upon lack of presence.

    摘要翻译: 所要求保护的主题涉及可以预处理通信的音频部分以便丰富多方通信会话或环境的架构。 特别地,该架构可以提供公共通信的公共信道,其由基本上所有连接的各方接收,并且可以进一步提供由所有连接方的所选子集接收的专用通信的专用信道。 特别地,架构可以基于通信的目标受众对音频转换应用于在多方通信会话期间发生的通信。 作为说明,架构可以对私人通信应用耳语转换,基于关系,基于物理位置的氛围或空间变换或基于缺乏存在的步调变换的情感变换。

    Capture device movement compensation for speaker indexing
    7.
    发明授权
    Capture device movement compensation for speaker indexing 有权
    用于扬声器索引的捕获装置移动补偿

    公开(公告)号:US08330787B2

    公开(公告)日:2012-12-11

    申请号:US11771786

    申请日:2007-06-29

    申请人: Ross G. Cutler

    发明人: Ross G. Cutler

    IPC分类号: H04N7/14 H04N5/232 H04N9/80

    摘要: Embodiments of the invention compensate for the movement of a meeting capture device during a live meeting when performing speaker indexing of a recorded meeting. In one example, a first position of a capture device is determined. A second position of the capture device is determined after the capture device has been moved from the first position to the second position. The movement data associated with movement of the capture device from the first position to the second position is determined. The movement data is outputted and used in speaker indexing of the recorded meeting.

    摘要翻译: 本发明的实施例在进行会议记录会议的说话者索引时,补偿会议捕获装置在实时会议期间的移动。 在一个示例中,确定捕获装置的第一位置。 在捕捉装置已经从第一位置移动到第二位置之后确定捕获装置的第二位置。 确定与捕捉装置从第一位置移动到第二位置相关联的移动数据。 运动数据被输出并用于记录的会议的讲话者索引。

    MUTE CONTROL IN AUDIO ENDPOINTS
    8.
    发明申请
    MUTE CONTROL IN AUDIO ENDPOINTS 有权
    音频终端中的静音控制

    公开(公告)号:US20100324891A1

    公开(公告)日:2010-12-23

    申请号:US12486761

    申请日:2009-06-18

    申请人: Ross G. Cutler

    发明人: Ross G. Cutler

    IPC分类号: G10L11/06

    CPC分类号: G10L25/78

    摘要: Architecture that uses near-end speech detection and far-end energy level detection to notify a user when a local microphone and/or speaker that the user is using, are muted. A voice activity detector is employed to detect the presence of near-end speech, sense the existing mute state of the near-end microphone, and then notify the user when the current microphone is muted. Separately or in combination therewith, received far-end voice signals are detected, the associated energy level computed, the existing mute state of the near-end audio speaker is sensed, and the user notified when the speaker is muted and/or at a reduced volume setting. These determinations enhance the user experience when the architecture is employed for communications sessions where participants connect via different communications modalities by automatically notifying the user of the audio device state, without attempting to contribute only to find that a microphone or speaker was muted.

    摘要翻译: 使用近端语音检测和远端能级检测的架构,当用户正在使用的本地麦克风和/或扬声器被静音时通知用户。 使用语音活动检测器来检测近端语音的存在,感测近端麦克风的现有静音状态,然后当当前麦克风静音时通知用户。 单独地或与其组合,检测到所接收的远端语音信号,计算相关联的能量水平,感测到近端音频扬声器的现有静音状态,并且当扬声器静音和/或减小时通知用户 音量设置。 当通过自动通知用户音频设备状态的参与者通过不同的通信模式进行连接的通信会话时,这些确定增强了用户体验,而不试图仅仅发现话筒或扬声器被静音。

    AUTOMATIC GAIN AND EXPOSURE CONTROL USING REGION OF INTEREST DETECTION
    9.
    发明申请
    AUTOMATIC GAIN AND EXPOSURE CONTROL USING REGION OF INTEREST DETECTION 有权
    利用感兴趣区域进行自动增益和接触控制

    公开(公告)号:US20090003678A1

    公开(公告)日:2009-01-01

    申请号:US11771802

    申请日:2007-06-29

    申请人: Ross G. Cutler

    发明人: Ross G. Cutler

    IPC分类号: G06K9/00

    摘要: A region of interest may be determined using any or all of sound source location, multi-person detection, and active speaker detection. An weighted mean may be determined using the region of interest and a set of backlight weight regions, or, only the set of backlight weight regions if a region of interest could not be found. The image mean is compared to a target value to determine if the image mean is greater than or less than the target value within a predetermined threshold. If the image mean is greater than the predetermined target value and predetermined threshold value, the gain and exposure are decreased. If the image mean is lesser than the predetermined target value minus the predetermined threshold value, the gain and exposure are decreased.

    摘要翻译: 可以使用声源位置,多人检测和主动扬声器检测中的任何一个或全部来确定感兴趣的区域。 可以使用感兴趣的区域和一组背光重量区域来确定加权平均值,或者如果不能找到感兴趣的区域,则仅限于该组背光重量区域。 将图像平均值与目标值进行比较,以确定图像均值是否大于或小于预定阈值内的目标值。 如果图像均值大于预定目标值和预定阈值,则增益和曝光量减小。 如果图像平均值小于预定目标值减去预定阈值,则增益和曝光减小。

    CAPTURE DEVICE MOVEMENT COMPENSATION FOR SPEAKER INDEXING
    10.
    发明申请
    CAPTURE DEVICE MOVEMENT COMPENSATION FOR SPEAKER INDEXING 有权
    用于扬声器索引的捕获设备运动补偿

    公开(公告)号:US20090002477A1

    公开(公告)日:2009-01-01

    申请号:US11771786

    申请日:2007-06-29

    申请人: Ross G. Cutler

    发明人: Ross G. Cutler

    IPC分类号: H04N7/14

    摘要: Embodiments of the invention compensate for the movement of a meeting capture device during a live meeting when performing speaker indexing of a recorded meeting. In one example, a first position of a capture device is determined. A second position of the capture device is determined after the capture device has been moved from the first position to the second position. The movement data associated with movement of the capture device from the first position to the second position is determined. The movement data is outputted and used in speaker indexing of the recorded meeting.

    摘要翻译: 本发明的实施例在进行会议记录会议的说话者索引时,补偿会议捕获装置在实时会议期间的移动。 在一个示例中,确定捕获装置的第一位置。 在捕捉装置已经从第一位置移动到第二位置之后确定捕获装置的第二位置。 确定与捕捉装置从第一位置移动到第二位置相关联的移动数据。 运动数据被输出并用于记录的会议的讲话者索引。