Abstract:
A panoramic camera is configured to automatically determine the parameters of a table upon which the camera is situated, as well as the position of the camera relative to the table. In an initialization stage, table edges are detected to create an edge map. A Hough-transform-like symmetry-voting operation is performed to clean up the edge map and to determine the camera offset, camera orientation, and camera tilt. A table model is then fit to the cleaned edge map to determine the table parameters. In an operational stage, table edges are detected to create an edge map and the table model is fit to the edge map. The output can then be used for further panoramic image processing such as head-size normalization, zooming, and compensation for camera movement.
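The operational-stage loop lends itself to a short illustration. The following is a minimal sketch, not the patented method: it builds an edge map with OpenCV and fits a simple elliptical table model to the strongest contour; the function names, the lower-half masking heuristic, and the ellipse parameterization are all illustrative assumptions.

```python
# Minimal sketch (not the patented method): detect edge points in a panoramic
# frame and fit a simple elliptical "table model" to them with OpenCV.
import cv2
import numpy as np

def build_edge_map(frame_bgr):
    """Return a binary edge map of the lower half of a panoramic frame."""
    gray = cv2.cvtColor(frame_bgr, cv2.COLOR_BGR2GRAY)
    edges = cv2.Canny(gray, 50, 150)
    edges[: edges.shape[0] // 2, :] = 0   # assume table edges sit below the horizon
    return edges

def fit_table_model(edges):
    """Fit an ellipse to the longest edge contour as a stand-in table model."""
    contours, _ = cv2.findContours(edges, cv2.RETR_LIST, cv2.CHAIN_APPROX_NONE)
    contours = [c for c in contours if len(c) >= 5]   # fitEllipse needs 5+ points
    if not contours:
        return None
    longest = max(contours, key=len)
    (cx, cy), (major, minor), angle = cv2.fitEllipse(longest)
    # Center offset and angle roughly stand in for camera offset/orientation.
    return {"center": (cx, cy), "axes": (major, minor), "angle_deg": angle}

if __name__ == "__main__":
    frame = cv2.imread("panorama.jpg")          # hypothetical input frame
    if frame is not None:
        print(fit_table_model(build_edge_map(frame)))
```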
Abstract:
Gaze tracking or other interest indications are used during a video conference to determine one or more audio sources that are of interest to one or more participants, for example by identifying, from among multiple simultaneous conversations, the conversation that a subset of participants is participating in or listening to, so that the audio experience of one or more of the participants can be enhanced.
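As a rough illustration of the selection step, the sketch below assumes gaze targets have already been resolved to participant identifiers; it groups participants into conversations by who is looking at whom and boosts the audio sources in the listener's own conversation. All names and gain values are illustrative, not taken from the abstract.

```python
# Minimal sketch: derive conversation groups from gaze targets, then compute
# per-source mixer gains that favour the listener's conversation.
from collections import defaultdict

def conversations_from_gaze(gaze_targets):
    """gaze_targets: {participant_id: participant_id being looked at}."""
    groups = defaultdict(set)
    for viewer, target in gaze_targets.items():
        groups[target].add(viewer)
        groups[target].add(target)
    return list(groups.values())

def mixer_gains(listener, gaze_targets, all_sources, boost=1.0, attenuate=0.3):
    """Return per-source gains that favour the listener's conversation."""
    groups = conversations_from_gaze(gaze_targets)
    own = next((g for g in groups if listener in g), set())
    return {src: (boost if src in own else attenuate) for src in all_sources}

gaze = {"alice": "bob", "bob": "alice", "carol": "dave", "dave": "carol"}
print(mixer_gains("alice", gaze, ["alice", "bob", "carol", "dave"]))
# -> bob (and alice) boosted; carol and dave attenuated for alice
```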
Abstract:
The claimed subject matter relates to an architecture that can preprocess the audio portions of communications in order to enrich multiparty communication sessions or environments. In particular, the architecture can provide both a public channel for public communications that are received by substantially all connected parties and a private channel for private communications that are received by a selected subset of the connected parties. More specifically, the architecture can apply an audio transform to communications that occur during the multiparty communication session based upon the target audience of the communication. By way of illustration, the architecture can apply a whisper transform to private communications, an emotion transform based upon relationships, an ambience or spatial transform based upon physical locations, or a pace transform based upon lack of presence.
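The dispatch idea, selecting a transform per outgoing communication based on its target audience, can be sketched as follows. The transforms here are toy stand-ins built from simple gain, noise, and delay operations, not the whisper, emotion, ambience, or pace transforms described above.

```python
# Minimal sketch: choose an audio transform for an outgoing message based on
# whether its target audience is a private subset of the connected parties.
import numpy as np

def whisper_transform(samples):
    """Toy 'whisper': quiet the signal and add a little breath-like noise."""
    noise = np.random.normal(0.0, 0.005, samples.shape)
    return 0.3 * samples + noise

def ambience_transform(samples):
    """Toy 'room' effect: mix in a delayed copy of the signal."""
    delayed = np.concatenate([np.zeros(800), samples[:-800]])
    return 0.8 * samples + 0.2 * delayed

def preprocess(samples, target_audience, all_parties):
    if set(target_audience) < set(all_parties):      # private channel
        return whisper_transform(samples)
    return ambience_transform(samples)                # public channel

audio = np.zeros(16000)                               # 1 s of silence at 16 kHz
out = preprocess(audio, target_audience={"bob"}, all_parties={"alice", "bob", "carol"})
```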
Abstract:
Systems and methods for detecting people or speakers in an automated fashion are disclosed. A pool of features spanning more than one type of input (such as audio input and video input) may be identified and used with a learning algorithm to generate a classifier that identifies people or speakers. The resulting classifier may then be evaluated to detect people or speakers.
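A minimal sketch of the training step, assuming per-window audio and video features have already been extracted: the features are pooled into one vector per window and handed to a boosting learner, used here only as a stand-in for the unspecified learning algorithm; the feature dimensions and synthetic labels are illustrative.

```python
# Minimal sketch: pool multi-modal features and train a boosted classifier
# that flags windows in which a person/speaker is present.
import numpy as np
from sklearn.ensemble import AdaBoostClassifier

rng = np.random.default_rng(0)
n_windows = 200
audio_feats = rng.normal(size=(n_windows, 8))    # e.g. energy, sound-source angle bins
video_feats = rng.normal(size=(n_windows, 12))   # e.g. motion, skin-tone block responses
X = np.hstack([audio_feats, video_feats])        # pooled multi-modal feature vectors
y = rng.integers(0, 2, size=n_windows)           # 1 = speaker present (synthetic labels)

clf = AdaBoostClassifier(n_estimators=50).fit(X, y)
print("windows flagged as speaker:", int(clf.predict(X).sum()))
```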
Abstract:
Embodiments of the invention compensate for the movement of a meeting capture device during a live meeting when performing speaker indexing of a recorded meeting. In one example, a first position of the capture device is determined. A second position is determined after the capture device has been moved from the first position to the second position. Movement data associated with the move from the first position to the second position is determined, output, and used in speaker indexing of the recorded meeting.
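A minimal sketch of the compensation idea, under the simplifying assumption that each device position can be summarized as a single azimuth in degrees: the rotation between the two measured positions is computed and applied to speaker azimuths logged after the move, so they line up with the pre-move coordinate frame.

```python
# Minimal sketch: compute the device rotation between two positions and
# re-map post-move speaker azimuths back into the original frame.
def movement_data(first_pos_deg, second_pos_deg):
    """Rotation of the device between the two measured positions."""
    return (second_pos_deg - first_pos_deg) % 360.0

def compensate_segments(segments, move_time, rotation_deg):
    """segments: list of (timestamp_s, speaker_azimuth_deg) from speaker indexing."""
    out = []
    for t, azimuth in segments:
        if t >= move_time:                         # logged after the device moved
            azimuth = (azimuth - rotation_deg) % 360.0
        out.append((t, azimuth))
    return out

rot = movement_data(first_pos_deg=10.0, second_pos_deg=40.0)   # device turned 30 degrees
print(compensate_segments([(5.0, 90.0), (120.0, 120.0)], move_time=60.0, rotation_deg=rot))
# -> the post-move segment maps back to 90 degrees, i.e. the same physical speaker
```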
Abstract:
Architecture that uses near-end speech detection and far-end energy-level detection to notify a user when the local microphone and/or speaker the user is using is muted. A voice activity detector detects the presence of near-end speech, senses the existing mute state of the near-end microphone, and notifies the user when that microphone is muted. Separately or in combination, received far-end voice signals are detected, the associated energy level is computed, the existing mute state of the near-end audio speaker is sensed, and the user is notified when the speaker is muted and/or set to a reduced volume. These determinations enhance the user experience when the architecture is employed for communications sessions in which participants connect via different communications modalities, by automatically notifying the user of the audio device state so that the user does not attempt to contribute only to find that a microphone or speaker was muted.
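A minimal sketch of both checks, with a simple energy threshold standing in for the voice activity detector; the mute-state flags, thresholds, and notify() callback are illustrative assumptions.

```python
# Minimal sketch: warn the user when they speak into a muted microphone, or
# when the far end speaks while the local speaker is muted.
import numpy as np

SPEECH_RMS = 0.02     # assumed near-end speech threshold
FAR_END_RMS = 0.01    # assumed far-end activity threshold

def rms(frame):
    return float(np.sqrt(np.mean(np.square(frame))))

def check_near_end(frame, mic_muted, notify):
    """User is talking into a muted microphone."""
    if mic_muted and rms(frame) > SPEECH_RMS:
        notify("You are speaking, but your microphone is muted.")

def check_far_end(frame, speaker_muted, notify):
    """Far end is talking while the local speaker is muted."""
    if speaker_muted and rms(frame) > FAR_END_RMS:
        notify("The other party is speaking, but your speaker is muted.")

near = 0.1 * np.sin(np.linspace(0, 100, 1600))     # synthetic near-end speech frame
check_near_end(near, mic_muted=True, notify=print)
```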
Abstract:
A region of interest may be determined using any or all of sound source location, multi-person detection, and active speaker detection. A weighted image mean may be computed using the region of interest and a set of backlight weight regions, or using only the set of backlight weight regions if no region of interest could be found. The image mean is compared to a target value to determine whether it lies above or below the target value by more than a predetermined threshold. If the image mean is greater than the target value plus the threshold, the gain and exposure are decreased. If the image mean is less than the target value minus the threshold, the gain and exposure are increased.
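A minimal sketch of that control loop, assuming the weight map combining the region of interest with the backlight weight regions has already been computed; the target value, threshold, and step size are illustrative.

```python
# Minimal sketch: compare a weighted image mean against a target deadband and
# nudge gain and exposure in the appropriate direction.
import numpy as np

def weighted_image_mean(gray, weights):
    """Mean luminance, weighted toward the region of interest / backlight regions."""
    return float(np.sum(gray * weights) / np.sum(weights))

def adjust_gain_exposure(gray, weights, gain, exposure,
                         target=118.0, threshold=8.0, step=0.05):
    mean = weighted_image_mean(gray, weights)
    if mean > target + threshold:        # too bright: back off
        gain, exposure = gain * (1 - step), exposure * (1 - step)
    elif mean < target - threshold:      # too dark: push up
        gain, exposure = gain * (1 + step), exposure * (1 + step)
    return gain, exposure                # within the deadband: leave unchanged

gray = np.full((120, 160), 150.0)        # synthetic over-bright frame
weights = np.ones_like(gray)             # uniform weights if no ROI was found
print(adjust_gain_exposure(gray, weights, gain=1.0, exposure=1.0))
```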