Integrated design for omni-directional camera and microphone array
    41.
    Granted patent
    Status: in force

    Publication No.: US07852369B2

    Publication date: 2010-12-14

    Application No.: US10184499

    Filing date: 2002-06-27

    IPC classification: H04N7/14

    Abstract: An omni-directional camera (a 360 degree camera) is proposed with an integrated microphone array. The primary application for such a camera is videoconferencing and meeting recording, and the device is designed to be placed on a meeting room table. The microphone array is in a planar configuration, and the microphones are located as close to the desktop as possible to eliminate sound reflections from the table. The camera is connected to the microphone array base with a thin cylindrical rod, which is acoustically invisible to the microphone array for the frequency range 50-4000 Hz. This provides a direct path from the person talking to all of the microphones in the array, so the array can be used for sound source localization (determining the location of the talker) and beam-forming (improving the sound quality of the talker by passing only sound from a particular direction). The camera array is elevated from the table to provide a near frontal viewpoint of the meeting participants.
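The sound source localization mentioned in the abstract can be illustrated with a delay-and-sum scan over candidate directions. The following is a minimal sketch, not the patented method: the array geometry (8 microphones on a 0.1 m circle), the 48 kHz sample rate, the whole-sample delays, and the white-noise test signal are all assumptions chosen for illustration.

```python
import math
import random

SPEED_OF_SOUND = 343.0  # m/s; assumed room-temperature value


def mic_positions(num_mics, radius_m):
    """Place microphones evenly on a circle in the table plane."""
    return [
        (radius_m * math.cos(2 * math.pi * k / num_mics),
         radius_m * math.sin(2 * math.pi * k / num_mics))
        for k in range(num_mics)
    ]


def arrival_delays(positions, azimuth_deg, sample_rate):
    """Far-field delay of each microphone, in whole samples, relative to the array centre."""
    ux = math.cos(math.radians(azimuth_deg))
    uy = math.sin(math.radians(azimuth_deg))
    # A microphone nearer the talker hears the wavefront earlier (negative delay).
    return [round(-(x * ux + y * uy) / SPEED_OF_SOUND * sample_rate)
            for x, y in positions]


def shift(signal, delay):
    """Delay a signal by an integer number of samples, zero-padding the ends."""
    n = len(signal)
    return [signal[i - delay] if 0 <= i - delay < n else 0.0 for i in range(n)]


def steered_power(channels, delays):
    """Delay-and-sum: undo the candidate delays, add the channels, measure power."""
    aligned = [shift(ch, -d) for ch, d in zip(channels, delays)]
    return sum(sum(column) ** 2 for column in zip(*aligned))


def localize(channels, positions, sample_rate, step_deg=5):
    """Return the candidate azimuth (degrees) with the highest steered power."""
    return max(
        range(0, 360, step_deg),
        key=lambda az: steered_power(
            channels, arrival_delays(positions, az, sample_rate)),
    )


# Simulated talker at 60 degrees; white noise stands in for speech.
rng = random.Random(0)
source = [rng.gauss(0.0, 1.0) for _ in range(1000)]
positions = mic_positions(num_mics=8, radius_m=0.1)
true_delays = arrival_delays(positions, 60, sample_rate=48000)
channels = [shift(source, d) for d in true_delays]
estimated_azimuth = localize(channels, positions, sample_rate=48000)
```

Only the steering direction that matches the talker lines up all eight channels, so the summed power peaks there. The same alignment step is what a delay-and-sum beamformer uses to favor sound from one direction.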

    Audio/video synchronization using audio hashing

    Publication No.: US20060291478A1

    Publication date: 2006-12-28

    Application No.: US11165946

    Filing date: 2005-06-24

    IPC classification: H04L12/56

    Abstract: Audio and video frames are synchronized by hashing an audio frame at a sender and combining the resultant hash value with the video frame. The audio frame is transmitted over an audio network, such as a telephone network, and the video frame is transmitted over a digital network, such as an intranet. The audio frame may be combined with additional audio signals from an audio bridge. The receiver receives the audio signal from the audio bridge and performs the same hash function on the mixed signal as was performed on the original signal. The receiver correlates the hash value on the mixed signal with the hash value included with the video frame (wherein the video frame is one of several video frames buffered by the receiver). The receiver can thus identify the video frame that corresponds to the audio frame and render them simultaneously.
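The matching step described in the abstract can be sketched as follows. The abstract does not say which hash function is used, so the energy-trend hash below is an assumption, as are the frame length, the buffer size, and the function names `audio_hash` and `match_video_frame`. The point illustrated is that the hash must survive mixing, which is why the receiver looks for the closest hash rather than an exact match.

```python
import random


def audio_hash(frame, num_bits=32):
    """Coarse hash: one bit per pair of adjacent blocks, set when energy rises."""
    block = len(frame) // (num_bits + 1)
    energies = [
        sum(s * s for s in frame[i * block:(i + 1) * block])
        for i in range(num_bits + 1)
    ]
    bits = 0
    for i in range(num_bits):
        bits = (bits << 1) | (1 if energies[i + 1] > energies[i] else 0)
    return bits


def hamming(a, b):
    """Number of differing bits between two hash values."""
    return bin(a ^ b).count("1")


def match_video_frame(received_audio, buffered_video):
    """Index of the buffered (hash, payload) entry closest to the received audio."""
    h = audio_hash(received_audio)
    return min(range(len(buffered_video)),
               key=lambda i: hamming(h, buffered_video[i][0]))


# Sender side: hash each audio frame and attach the value to its video frame.
rng = random.Random(0)
audio_frames = [[rng.gauss(0.0, 1.0) for _ in range(660)] for _ in range(5)]
video_buffer = [(audio_hash(a), "video-%d" % i) for i, a in enumerate(audio_frames)]

# Audio bridge: frame 3 is mixed with a quieter signal from another participant.
mixed = [s + 0.1 * rng.gauss(0.0, 1.0) for s in audio_frames[3]]

# Receiver side: hash the mixed audio and pick the best-matching video frame.
matched_index = match_video_frame(mixed, video_buffer)
```

A cryptographic hash would not work here, because any change to the mixed signal would alter every bit of the output.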

    User interface for a system and method for head size equalization in 360 degree panoramic images
    44.
    Granted patent
    Status: in force

    Publication No.: US07149367B2

    Publication date: 2006-12-12

    Application No.: US11106328

    Filing date: 2005-04-14

    Applicant: Ross Cutler

    Inventor: Ross Cutler

    IPC classification: G06K9/36 G09G5/00 H04N7/00

    Abstract: A User Interface (UI) for a real-time panoramic image correction system and method that simplifies the use of the system for the user. The UI includes a control panel that allows a user to enter meeting table size and shape, camera position and orientation, and the amount of normalization desired (e.g., 0 to 100%). A window can also be implemented on a display that displays the corrected panoramic image. In this window, the head (either normalized or non-normalized) of a meeting participant, preferably one who is speaking, is extracted and displayed in a separate window. Additionally, the corrected panoramic image, whose size will vary in conjunction with the amount of warping applied, can be displayed and transmitted with extra pixels around its perimeter in order to allow the corrected or normalized panoramic image to adapt to any of the standard display sizes and resolutions and to simplify network transmission. The corrected image can also be transmitted at standard resolutions using non-unity pixel aspect ratios to simplify network transmission.
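The two transmission options in the abstract, padding with extra pixels and using a non-unity pixel aspect ratio, come down to simple arithmetic. The sketch below is an illustration only: the function names and the example sizes (a 1200x200 panorama, 1280x720 and 640x480 frames) are assumptions, not values from the patent.

```python
from fractions import Fraction


def pad_to_standard(width, height, target_w, target_h):
    """Padding (left, top, right, bottom) that centres the image in the target frame."""
    if width > target_w or height > target_h:
        raise ValueError("image does not fit in the target frame")
    extra_w = target_w - width
    extra_h = target_h - height
    left = extra_w // 2
    top = extra_h // 2
    return (left, top, extra_w - left, extra_h - top)


def pixel_aspect_ratio(content_w, content_h, stored_w, stored_h):
    """Pixel aspect ratio that makes a stored frame display with the content's shape."""
    return Fraction(content_w, content_h) / Fraction(stored_w, stored_h)


# Option 1: surround a 1200x200 panorama with extra pixels to fill 1280x720.
padding = pad_to_standard(1200, 200, 1280, 720)

# Option 2: squeeze the same panorama into 640x480 and signal wide pixels.
par = pixel_aspect_ratio(1200, 200, 640, 480)
```

Here `padding` is `(40, 260, 40, 260)` and `par` is `9/2`, meaning each stored pixel would be displayed 4.5 times as wide as it is tall.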

    Multi-view integrated camera system
    45.
    Patent application
    Status: in force

    Publication No.: US20060023106A1

    Publication date: 2006-02-02

    Application No.: US10902650

    Filing date: 2004-07-28

    IPC classification: G02B13/16

    Abstract: A panoramic camera design that is lower cost and more robust, stable, and user friendly than prior-art designs. The camera design makes use of a unified molded structure of optical material to house a mirror, aligned sensor, and lens assembly. The unified molded structure of the camera keeps the sensed optical path enclosed to keep out dust and users' fingers and to maintain optical alignment.

    System and process for adding high frame-rate current speaker data to a low frame-rate video using audio watermarking techniques
    46.
    Patent application
    Status: in force

    Publication No.: US20050243168A1

    Publication date: 2005-11-03

    Application No.: US10837973

    Filing date: 2004-04-30

    Applicant: Ross Cutler

    Inventor: Ross Cutler

    IPC classification: H04N7/14

    CPC classification: H04N7/147 G10L19/018

    Abstract: A system and process for highlighting the current speaker on an on-going basis in each frame of a low frame-rate video of an event having multiple people in attendance, such as a video teleconference, is presented. In general, this is accomplished by periodically identifying an attendee that is currently speaking at a rate substantially faster than the video frame rate, and for each frame of the video updating the frame to highlight the attendee currently speaking. More particularly, an A/V source provides video and audio data streams to the client computing device, with current speaker data embedded into the audio stream via audio watermarking techniques. The client device extracts the current speaker data from the audio stream, and then renders and displays the video while using the current speaker data to periodically update the frame being displayed to highlight the current speaker.
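The idea of carrying current speaker data inside the audio stream can be sketched with least-significant-bit (LSB) embedding. This is a deliberately simple stand-in: the abstract does not name a watermarking technique, and LSB embedding would not survive lossy audio compression as a practical watermark must. The 8-bit speaker ID, the 160-sample block size, and the function names are assumptions.

```python
def embed_speaker_id(samples, speaker_id, bits=8):
    """Write speaker_id into the least significant bits of the first samples."""
    out = list(samples)
    for i in range(bits):
        bit = (speaker_id >> (bits - 1 - i)) & 1
        out[i] = (out[i] & ~1) | bit  # clear the LSB, then set it to the payload bit
    return out


def extract_speaker_id(samples, bits=8):
    """Read the speaker ID back from the least significant bits."""
    value = 0
    for i in range(bits):
        value = (value << 1) | (samples[i] & 1)
    return value


# One 160-sample block of 16-bit PCM audio (10 ms at 16 kHz, an assumed format).
block = [((i * 37) % 2001) - 1000 for i in range(160)]

# A/V source: mark the block with the ID of the attendee currently speaking.
marked = embed_speaker_id(block, speaker_id=5)

# Client: recover the ID from the audio alone, with no extra data channel.
recovered = extract_speaker_id(marked)
largest_change = max(abs(a - b) for a, b in zip(block, marked))
```

Because each short audio block carries its own ID, the client learns who is speaking many times per video frame, which is what allows the highlight to update faster than the video does.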

    System and process for adding high frame-rate current speaker data to a low frame-rate video
    47.
    Patent application
    Status: in force

    Publication No.: US20050243166A1

    Publication date: 2005-11-03

    Application No.: US10837138

    Filing date: 2004-04-30

    Applicant: Ross Cutler

    Inventor: Ross Cutler

    IPC classification: H04N7/14 H04N7/15

    CPC classification: H04N7/147 H04N7/15

    Abstract: A system and process for highlighting the current speaker on an on-going basis in each frame of a low frame-rate video of an event having multiple people in attendance, such as a video teleconference, is presented. In general, this is accomplished by periodically identifying an attendee that is currently speaking at a rate substantially faster than the video frame rate, and for each frame of the video updating the frame to highlight the attendee currently speaking. More particularly, an audio/visual (A/V) source provides separate video, audio, and current speaker data streams to a client computing device. The client device then uses these data streams to render and display the video and to periodically update the frame being displayed to highlight the current speaker depicted therein.
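With separate streams, the client can pair each display refresh with the most recent video frame and the most recent speaker update. The sketch below shows one way to do that lookup. The timestamped tuples, the function names, and the example rates are assumptions made for illustration.

```python
from bisect import bisect_right


def latest_at(timestamps, values, t):
    """Most recent value whose timestamp is at or before t, or None if there is none."""
    i = bisect_right(timestamps, t) - 1
    return values[i] if i >= 0 else None


def render_schedule(video, speaker, refresh_times):
    """For each refresh time, pick the frame to show and the speaker to highlight."""
    v_ts = [ts for ts, _ in video]
    v_val = [frame for _, frame in video]
    s_ts = [ts for ts, _ in speaker]
    s_val = [who for _, who in speaker]
    return [
        (t, latest_at(v_ts, v_val, t), latest_at(s_ts, s_val, t))
        for t in refresh_times
    ]


# Low frame-rate video stream: one frame per second (timestamps in seconds).
video = [(0.0, "frame0"), (1.0, "frame1")]

# Current speaker stream: updated more often than the video.
speaker = [(0.0, "A"), (0.4, "B"), (1.2, "A")]

schedule = render_schedule(video, speaker, refresh_times=[0.0, 0.5, 1.0, 1.5])
```

In this example `frame0` is first shown with speaker A highlighted and is then redrawn at 0.5 s with speaker B highlighted, even though no new video frame has arrived.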

    User interface for a system and method for head size equalization in 360 degree panoramic images
    48.
    Patent application
    Status: in force

    Publication No.: US20050206659A1

    Publication date: 2005-09-22

    Application No.: US11106328

    Filing date: 2005-04-14

    Applicant: Ross Cutler

    Inventor: Ross Cutler

    Abstract: A User Interface (UI) for a real-time panoramic image correction system and method that simplifies the use of the system for the user. The UI includes a control panel that allows a user to enter meeting table size and shape, camera position and orientation, and the amount of normalization desired (e.g., 0 to 100%). A window can also be implemented on a display that displays the corrected panoramic image. In this window, the head (either normalized or non-normalized) of a meeting participant, preferably one who is speaking, is extracted and displayed in a separate window. Additionally, the corrected panoramic image, whose size will vary in conjunction with the amount of warping applied, can be displayed and transmitted with extra pixels around its perimeter in order to allow the corrected or normalized panoramic image to adapt to any of the standard display sizes and resolutions and to simplify network transmission. The corrected image can also be transmitted at standard resolutions using non-unity pixel aspect ratios to simplify network transmission.

    Foveated panoramic camera system
    50.
    Patent application
    Status: under examination (published)

    Publication No.: US20050117015A1

    Publication date: 2005-06-02

    Application No.: US11027068

    Filing date: 2004-12-30

    Applicant: Ross Cutler

    Inventor: Ross Cutler

    IPC classification: H04N7/14 H04N7/00

    Abstract: A foveated panoramic camera system includes multiple cameras oriented so that individual images captured by the cameras can be combined to form a panoramic image. Each of the cameras includes a lens having a focal length that corresponds to a field of view for the camera. A field of view for a camera overlaps with the field(s) of view of each adjacent camera. At least one of the cameras has a field of view that differs from the fields of view of the other cameras, for capturing images of objects that are situated at a greater distance from the camera system than the objects captured by the other cameras. As a result, a more uniform resolution is achieved across all images captured by the multiple cameras. A mirror assembly is utilized to reflect object images into the multiple cameras to achieve a near center of projection for the camera system.
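The link between focal length, field of view, and resolution on the subject follows from the standard pinhole-camera relation FOV = 2 atan(w / 2f), where w is the sensor width and f the focal length. The sketch below works through one case. The sensor width, pixel count, focal lengths, and subject distances are illustrative assumptions, not figures from the patent.

```python
import math


def field_of_view_deg(sensor_width_mm, focal_length_mm):
    """Horizontal field of view of an ideal pinhole camera, in degrees."""
    return math.degrees(2 * math.atan(sensor_width_mm / (2 * focal_length_mm)))


def pixels_per_metre(pixels_across, sensor_width_mm, focal_length_mm, distance_m):
    """Pixels covering one metre of a fronto-parallel subject at the given distance."""
    scene_width_m = distance_m * sensor_width_mm / focal_length_mm
    return pixels_across / scene_width_m


# Camera facing nearby participants: short focal length, wide field of view.
near_fov = field_of_view_deg(sensor_width_mm=4.8, focal_length_mm=2.4)
near_res = pixels_per_metre(640, 4.8, 2.4, distance_m=1.0)

# Camera facing the far end of the table: longer focal length, narrower view.
far_fov = field_of_view_deg(sensor_width_mm=4.8, focal_length_mm=4.8)
far_res = pixels_per_metre(640, 4.8, 4.8, distance_m=2.0)
```

With these numbers the near camera covers 90 degrees and the far camera about 53 degrees, yet both deliver 320 pixels per metre on their subjects. Doubling the focal length offsets a doubling of the subject distance, which is the uniform-resolution effect the abstract describes.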
