-
Publication No.: US08531519B1
Publication date: 2013-09-10
Application No.: US13605194
Filing date: 2012-09-06
Applicant: Yifan Peng, Wei Hua, Hrishikesh Aradhye, Rodrigo Carceroni
Inventor: Yifan Peng, Wei Hua, Hrishikesh Aradhye, Rodrigo Carceroni
CPC classification: H04N7/18
Abstract: Implementations relate to a computer-implemented method and a device for determining a relative pose between devices. The method can include receiving data representing first keypoint features from a first image captured by a camera of a second mobile computing device; capturing, by a camera of a first mobile computing device, a second image, wherein the first image and the second image comprise a substantially common scene having an area of overlap; computing, by the first mobile computing device, data representing second keypoint features from the second image; determining, by the first mobile computing device, based at least in part on the data representing first keypoint features and the data representing second keypoint features, a relative pose of the first mobile computing device and the second mobile computing device; and communicating the relative pose to the second mobile computing device.
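Illustrative note (not part of the patent record): the abstract derives a relative pose from keypoint features matched between two overlapping images. The sketch below is a simplified 2D stand-in, not the patented method. It assumes the keypoints have already been matched into pairs and fits a least-squares in-plane rotation and translation (a 2D Procrustes alignment). The function name `estimate_rigid_2d` and the point format are hypothetical. A real relative pose between two cameras would be a 3D rotation and translation.

```python
import math

def estimate_rigid_2d(src, dst):
    """Least-squares rotation angle and translation mapping src points onto dst.

    src, dst: equal-length lists of (x, y) tuples; src[i] is assumed to be
    the keypoint matched with dst[i].
    """
    if len(src) != len(dst) or len(src) < 2:
        raise ValueError("need at least two matched point pairs")
    n = len(src)
    # Centroids of both point sets.
    sx = sum(p[0] for p in src) / n
    sy = sum(p[1] for p in src) / n
    dx = sum(q[0] for q in dst) / n
    dy = sum(q[1] for q in dst) / n
    # Accumulate dot and cross terms of the centered pairs.
    dot = 0.0
    cross = 0.0
    for (px, py), (qx, qy) in zip(src, dst):
        ax, ay = px - sx, py - sy
        bx, by = qx - dx, qy - dy
        dot += ax * bx + ay * by
        cross += ax * by - ay * bx
    theta = math.atan2(cross, dot)
    c, s = math.cos(theta), math.sin(theta)
    # Translation that carries the rotated source centroid onto the destination centroid.
    tx = dx - (c * sx - s * sy)
    ty = dy - (s * sx + c * sy)
    return theta, (tx, ty)
```

With at least two non-coincident matches the rotation is well defined. Outlier matches are not handled here, so a robust estimator would be needed on real data.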
-
Publication No.: US08542265B1
Publication date: 2013-09-24
Application No.: US13769188
Filing date: 2013-02-15
Applicant: Michael Dodd, Rodrigo Carceroni
Inventor: Michael Dodd, Rodrigo Carceroni
IPC classification: H04N7/15
CPC classification: H04N7/15, G06T3/4092, H04L65/403, H04L65/605, H04N9/67, H04N21/25825, H04N21/2662, H04N21/41407, H04N21/4788
Abstract: Implementations relate to a system for video encoding and conversion including an image resolution conversion component operable to convert a resolution of a source image frame from a first resolution to a second resolution to produce a first intermediate image frame at the second resolution; an image conversion component operable to receive the first intermediate image frame and convert an image size of the first intermediate image frame to another image frame size to produce a first viewable image frame; an image viewer component operable to display the first viewable image frame on a first display; a color space conversion component comprising a luminance conversion component and a chrominance conversion component operable to receive the first viewable image frame and convert a first luminance value and a first chrominance value of the first viewable image frame to produce a second intermediate image frame having a second luminance value and a second chrominance value.
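Illustrative note (not part of the patent record): the abstract chains a resolution conversion with separate luminance and chrominance conversions. The sketch below shows one way such steps could look. It assumes nearest-neighbor resampling and an 8-bit full-range to limited-range mapping (luma 16..235, chroma 16..240). The abstract does not specify either technique, and all function names are hypothetical.

```python
def resize_nearest(plane, new_w, new_h):
    """Resample a 2D plane (list of rows) to new_w x new_h by nearest neighbor."""
    old_h, old_w = len(plane), len(plane[0])
    return [
        [plane[y * old_h // new_h][x * old_w // new_w] for x in range(new_w)]
        for y in range(new_h)
    ]

def full_to_limited_luma(y):
    """Map an 8-bit full-range luma value (0..255) to limited range (16..235)."""
    return 16 + round(y * 219 / 255)

def full_to_limited_chroma(c):
    """Map an 8-bit full-range chroma value (0..255, centered on 128) to 16..240."""
    return 128 + round((c - 128) * 224 / 255)

# Example: halve the resolution of a 4x4 luma plane, then convert its range.
luma = [[0, 64, 128, 255]] * 4
small = resize_nearest(luma, 2, 2)
converted = [[full_to_limited_luma(v) for v in row] for row in small]
```

Keeping the luma and chroma mappings as separate functions mirrors the abstract's split into a luminance conversion component and a chrominance conversion component.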
-
Publication No.: US08913103B1
Publication date: 2014-12-16
Application No.: US13363948
Filing date: 2012-02-01
Applicant: Emre Sargin, Rodrigo Carceroni, Huazhong Ning, Wei Hua, Marius Renn, Hrishikesh Aradhye
Inventor: Emre Sargin, Rodrigo Carceroni, Huazhong Ning, Wei Hua, Marius Renn, Hrishikesh Aradhye
IPC classification: H04N7/14
CPC classification: G06K9/00221, G06K9/00335, G06K9/00711, H04N7/14, H04N21/42203, H04N21/4394, H04N21/44218
Abstract: Disclosed are methods for automatically generating commands to transform a video sequence based on information regarding speaking participants derived from the audio and video signals. The audio stream is analyzed to detect individual speakers and the video is optionally analyzed to detect lip movement to determine a probability that a detected participant is speaking. Commands are then generated to transform the video stream consistent with the identified speaker.
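Illustrative note (not part of the patent record): the abstract combines an audio-based speaker cue with an optional lip-movement cue into a speaking probability, then issues a command for the video stream. The sketch below assumes both cues arrive as scores in [0, 1] and blends them with a fixed weight. The weight, the threshold, the command dictionaries, and the function names are all hypothetical and are not taken from the patent.

```python
def speaking_probability(audio_score, lip_score=None, audio_weight=0.6):
    """Blend an audio activity score with an optional lip-movement score."""
    if lip_score is None:
        # Video analysis is optional, so fall back to audio alone.
        return audio_score
    return audio_weight * audio_score + (1.0 - audio_weight) * lip_score

def generate_command(participants, threshold=0.5):
    """Pick the most likely speaker and return a transform command.

    participants: dict mapping participant id to {"audio": float, "lips": float};
    the "lips" entry may be omitted.
    """
    best_id, best_p = None, 0.0
    for pid, scores in participants.items():
        p = speaking_probability(scores["audio"], scores.get("lips"))
        if p > best_p:
            best_id, best_p = pid, p
    if best_id is None or best_p < threshold:
        # Nobody is clearly speaking: keep the full view.
        return {"action": "show_all"}
    return {"action": "focus", "participant": best_id}
```

For example, `generate_command({"a": {"audio": 0.9, "lips": 0.8}, "b": {"audio": 0.2}})` selects participant `"a"`.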
-
Publication No.: US08903130B1
Publication date: 2014-12-02
Application No.: US13467442
Filing date: 2012-05-09
Applicant: Rodrigo Carceroni, Wei Hua
Inventor: Rodrigo Carceroni, Wei Hua
IPC classification: G06K9/36
CPC classification: G06K9/00765, H04N21/4394, H04N21/44218, H04N21/8106
Abstract: A method and apparatus for virtual camera operation is disclosed. Virtual camera operation may include identifying potential subjects of a video stream by identifying faces of participants in the input video stream. Virtual camera operation may include determining a speaking state of each participant in the input video stream based on their respective identified face. Virtual camera operation may include identifying a subject of the input video stream based on the speaking state. Virtual camera operation may include generating, using a processor, an output video stream including a portion of the input video stream based on the subject.
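Illustrative note (not part of the patent record): the abstract selects a subject from detected faces by speaking state and outputs a portion of the input video based on that subject. The sketch below assumes face detection and speaking-state detection have already produced a bounding box and a boolean per face, and only computes the crop rectangle. The data layout, the `margin` parameter, and the function name are hypothetical.

```python
def crop_for_subject(faces, frame_w, frame_h, margin=0.5):
    """Return a crop rectangle (x, y, w, h) around the first speaking face.

    faces: list of dicts with "box" = (x, y, w, h) in pixels and "speaking" = bool.
    margin: padding around the face, as a fraction of the face size.
    """
    speaking = [f for f in faces if f["speaking"]]
    if not speaking:
        # No subject identified: output the whole frame.
        return (0, 0, frame_w, frame_h)
    x, y, w, h = speaking[0]["box"]
    pad_x, pad_y = int(w * margin), int(h * margin)
    # Clamp the padded box to the frame bounds.
    left = max(0, x - pad_x)
    top = max(0, y - pad_y)
    right = min(frame_w, x + w + pad_x)
    bottom = min(frame_h, y + h + pad_y)
    return (left, top, right - left, bottom - top)
```

Applying the returned rectangle to each input frame would yield the output stream. Smoothing the rectangle over time is left out of this sketch.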
-
Publication No.: US08886576B1
Publication date: 2014-11-11
Application No.: US13585458
Filing date: 2012-08-14
CPC classification: G06N99/005, G06F17/30038, G06K9/00, H04M1/6008, H04M1/72569
Abstract: Methods and apparatus for suggesting image, video, and image album titles are presented. A machine-learning service executing on a mobile platform receives feature-related data. The feature-related data includes image-related data related to one or more images received from an application executing on the mobile platform and platform-related data received from the mobile platform. The image-related data and the platform-related data differ. The machine-learning service generates a title related to the one or more images by performing a machine-learning operation on the feature-related data. The machine-learning service sends the title related to the one or more images to the application.