-
11.
公开(公告)号:US11561621B2
公开(公告)日:2023-01-24
申请号:US16600830
申请日:2019-10-14
Applicant: Apple Inc.
Inventor: Feng Tang , Chong Chen , Haitao Guo , Xiaojin Shi , Thorsten Gernoth
Abstract: Intelligent systems are disclosed that respond to user intent and desires based upon activity that may or may not be expressly directed at the intelligent system. In some embodiments, the intelligent system acquires a depth image of a scene surrounding the system. A scene geometry may be extracted from the depth image and elements of the scene may be monitored. In certain embodiments, user activity in the scene is monitored and analyzed to infer user desires or intent with respect to the system. The interpretation of the user's intent as well as the system's response may be affected by the scene geometry surrounding the user and/or the system. In some embodiments, techniques and systems are disclosed for interpreting express user communication, e.g., expressed through hand gesture movements. In some embodiments, such gesture movements may be interpreted based on real-time depth information obtained from, e.g., optical or non-optical type depth sensors.
-
公开(公告)号:US20220014697A1
公开(公告)日:2022-01-13
申请号:US17377308
申请日:2021-07-15
Applicant: Apple Inc.
Inventor: Guy Cote , D. Amnon Silverstein , Suk Hwan Lim , Sheng Lin , Haitao Guo
IPC: H04N5/365
Abstract: The present disclosure generally relates to systems and methods for image data processing. In certain embodiments, an image processing pipeline may be configured to receive a frame of the image data having a plurality of pixels acquired using a digital image sensor. The image processing pipeline may then be configured to determine a first plurality of correction factors that may correct each pixel in the plurality of pixels for fixed pattern noise. The first plurality of correction factors may be determined based at least in part on fixed pattern noise statistics that correspond to the frame of the image data. After determining the first plurality of correction factors, the image processing pipeline may be configured to configured to apply the first plurality of correction factors to the plurality of pixels, thereby reducing the fixed pattern noise present in the plurality of pixels.
-
公开(公告)号:US10986345B2
公开(公告)日:2021-04-20
申请号:US16277892
申请日:2019-02-15
Applicant: Apple Inc.
Inventor: Haitao Guo , Hao Pan , Guy Cote , Andrew Bai
IPC: H04N11/02 , H04N19/14 , G06F3/14 , G09G5/02 , G09G5/10 , H04N1/64 , H04N5/20 , H04N5/235 , H04N19/172 , H04N19/102 , H04N19/137 , H04N19/177 , H04N19/124 , H04N19/154 , H04N19/17 , H04N19/98 , H04N21/422 , H04N21/431 , G06T5/00 , G09G3/20 , G09G5/00 , H04N1/60 , H04N5/50 , H04N9/64 , H04N9/67 , H04N19/136 , H04N19/184 , H04N19/186 , H04N19/169 , H04N19/44 , H04N19/30 , H04N19/33 , H04N19/176 , H04N19/182 , H04N19/52 , H04N19/86 , H04N19/463 , H04N21/4223 , H04N21/44 , H04N21/4402 , H04N21/442 , H04N21/485
Abstract: Video processing techniques and pipelines that support capture, distribution, and display of high dynamic range (HDR) image data to both HDR-enabled display devices and display devices that do not support HDR imaging. A sensor pipeline may generate standard dynamic range (SDR) data from HDR data captured by a sensor using tone mapping, for example local tone mapping. Information used to generate the SDR data may be provided to a display pipeline as metadata with the generated SDR data. If a target display does not support HDR imaging, the SDR data may be directly rendered by the display pipeline. If the target display does support HDR imaging, then an inverse mapping technique may be applied to the SDR data according to the metadata to render HDR data for display. Information used in performing color gamut mapping may also be provided in the metadata and used to recover clipped colors for display.
-
14.
公开(公告)号:US10757445B2
公开(公告)日:2020-08-25
申请号:US16173105
申请日:2018-10-29
Applicant: Apple Inc.
Inventor: Alexandros Tourapis , David W. Singer , Haitao Guo , Hsi-Jung Wu , Sorin C. Cismas , Xiaohua Yang , Yeping Su , Dazhong Zhang
IPC: H04N19/593 , H04N19/156 , H04N19/44 , H04N19/176 , H04N19/182 , H04N19/174
Abstract: Methods are described for encoding and decoding blocks of image data using intra block copying (IBC). A source block for intra block copying is selected from a source region of a current image that is closer to the current block than a threshold, wherein the source region does not include a portion of the current image that is further from the current block than the threshold.
-
15.
公开(公告)号:US20200042096A1
公开(公告)日:2020-02-06
申请号:US16600830
申请日:2019-10-14
Applicant: Apple Inc.
Inventor: Feng Tang , Chong Chen , Haitao Guo , Xiaojin Shi , Thorsten Gernoth
Abstract: Intelligent systems are disclosed that respond to user intent and desires based upon activity that may or may not be expressly directed at the intelligent system. In some embodiments, the intelligent system acquires a depth image of a scene surrounding the system. A scene geometry may be extracted from the depth image and elements of the scene may be monitored. In certain embodiments, user activity in the scene is monitored and analyzed to infer user desires or intent with respect to the system. The interpretation of the user's intent as well as the system's response may be affected by the scene geometry surrounding the user and/or the system. In some embodiments, techniques and systems are disclosed for interpreting express user communication, e.g., expressed through hand gesture movements. In some embodiments, such gesture movements may be interpreted based on real-time depth information obtained from, e.g., optical or non-optical type depth sensors.
-
16.
公开(公告)号:US20170090584A1
公开(公告)日:2017-03-30
申请号:US14865850
申请日:2015-09-25
Applicant: Apple Inc.
Inventor: Feng Tang , Chong Chen , Haitao Guo , Xiaojin Shi , Thorsten Gernoth
CPC classification number: G06F3/017 , G06F3/012 , G06F3/0304 , G06F3/16 , G06K9/00375 , G06K9/00389 , G06K9/6269 , G06K9/6282 , G06K2209/40
Abstract: Varying embodiments of intelligent systems are disclosed that respond to user intent and desires based upon activity that may or may not be expressly directed at the intelligent system. In some embodiments, the intelligent system acquires a depth image of a scene surrounding the system. A scene geometry may be extracted from the depth image and elements of the scene, such as walls, furniture, and humans may be evaluated and monitored. In certain embodiments, user activity in the scene is monitored and analyzed to infer user desires or intent with respect to the system. The interpretation of the user's intent or desire as well as the system's response may be affected by the scene geometry surrounding the user and/or the system. In some embodiments, techniques and systems are disclosed for interpreting express user communication, for example, expressed through fine hand gesture movements. In some embodiments, such gesture movements may be interpreted based on real-time depth information obtained from, for example, optical or non-optical type depth sensors. The depth information may be interpreted in “slices” (three-dimensional regions of space having a relatively small depth) until one or more candidate hand structures are detected. Once detected, each candidate hand structure may be confirmed or rejected based on its own unique physical properties (e.g., shape, size and continuity to an arm structure). Each confirmed hand structure may be submitted to a depth-aware filtering process before its own unique three-dimensional features are quantified into a high-dimensional feature vector. A two-step classification scheme may be applied to the feature vectors to identify a candidate gesture (step 1), and to reject candidate gestures that do not meet a gesture-specific identification operation (step-2). The identified gesture may be used to initiate some action controlled by a computer system.
-
公开(公告)号:US09432647B2
公开(公告)日:2016-08-30
申请号:US14269814
申请日:2014-05-05
Applicant: Apple Inc.
Inventor: Touraj Tajbakhsh , Yingjun Bai , Haitao Guo
CPC classification number: H04N9/735 , H04N1/60 , H04N1/6086 , H04N5/2353 , H04N5/3572
Abstract: This disclosure pertains to systems, methods, and computer readable media for extending the dynamic range of images using an operation referred to herein as “Adaptive Auto Exposure” (AAE). According to the embodiments disclosed herein, the AAE-enabled higher dynamic range capture operations are accomplished without blending multiple or bracketed exposure captures (as is the case with traditional high dynamic range (HDR) photography). AAE also enables high signal-to-noise ratio (SNR) rendering when scene content allows for it and/or certain highlight clipping is tolerable. Decisions with regard to preferred AE strategies may be based, at least in part, on one or more of the following: sensor characteristics; scene content; and pre-defined preferences under different scenarios.
Abstract translation: 本公开涉及用于使用本文称为“自适应自动曝光”(AAE)的操作来扩展图像的动态范围的系统,方法和计算机可读介质。 根据本文公开的实施例,在不混合多个或包围曝光捕获的情况下(如传统的高动态范围(HDR)拍摄的情况)),实现了启用AAE的较高动态范围捕获操作。 当场景内容允许时,AAE还能实现高信噪比(SNR)渲染,和/或某些高亮度剪辑是可以容忍的。 关于优选AE策略的决定可以至少部分地基于以下一个或多个:传感器特征; 场景内容; 和不同场景下的预定义偏好。
-
18.
公开(公告)号:US09332309B2
公开(公告)日:2016-05-03
申请号:US13631719
申请日:2012-09-28
Applicant: Apple Inc.
Inventor: Athanasios Leontaris , Haitao Guo , Xiaojin Shi
IPC: H04N19/65 , H04N19/89 , H04N21/442 , H04N21/6379 , H04N19/172 , H04N19/196 , H04N19/107 , H04N19/164
CPC classification number: H04N21/44209 , H04N19/107 , H04N19/164 , H04N19/172 , H04N19/196 , H04N19/65 , H04N19/89 , H04N21/6379
Abstract: An error recovery method may be engaged by an encoder to recover from misalignment between reference picture caches at the encoder and decoder. When a communication error is detected between a coder and a decoder, a number of non-acknowledged reference frames present in the decoder's reference picture cache may be estimated. Thereafter, frames may be coded as reference frames in a number greater or equal to the number of non-acknowledged reference frames that are estimated to be present in the decoder's reference picture cache. Thereafter, ordinary coding operations may resume. Typically, a final reference frame that is coded in the error recovery mode will be coded as a synchronization frame that has high coding quality. The coded reference frames that precede it may be coded at low quality (or may be coded as SKIP-coded frames). On reception and decoding, the preceding frames may cause the decoder to flush from its reference picture cache any non-acknowledged reference frames that otherwise might collide with the new synchronization frame. In this manner, alignment between the encoder and decoder may be restored.
Abstract translation: 编码器可以接合错误恢复方法以从编码器和解码器处的参考图像高速缓存之间的未对准中恢复。 当在编码器和解码器之间检测到通信错误时,可以估计存在于解码器的参考图像高速缓存中的多个未确认参考帧。 此后,帧可以被编码为大于或等于估计存在于解码器的参考图像缓存中的未确认参考帧的数量的参考帧。 此后,可以恢复普通编码操作。 通常,以错误恢复模式编码的最终参考帧将被编码为具有高编码质量的同步帧。 其之前的编码参考帧可以以低质量编码(或者可以被编码为SKIP编码的帧)。 在接收和解码时,前述帧可能导致解码器从其参考图像缓存中刷新否则可能与新的同步帧相冲突的任何未确认的参考帧。 以这种方式,可以恢复编码器和解码器之间的对准。
-
公开(公告)号:US08988563B2
公开(公告)日:2015-03-24
申请号:US13705122
申请日:2012-12-04
Applicant: Apple Inc.
Inventor: Andrew Yanowitz , Ting Chen , David Daming Kuo , Haitao Guo
CPC classification number: H04N5/367 , G06T5/00 , G06T2200/28 , H04N5/2173 , H04N5/23229 , H04N5/3572 , H04N9/045
Abstract: Systems, methods, and devices for dual processing of raw image data by main image processing and alternative image processing capabilities of an electronic device are provided. According to an embodiment, alternative image processing may analyze a first copy of a frame of raw image data before a second copy of the frame of raw image data is processed by main image processing. Thereafter, the main image processing may process the second copy of the frame of raw image. The main image processing may be calibrated based at least in part on the analysis of the first copy of the frame of raw image data.
Abstract translation: 提供了通过主要图像处理和替代图像处理能力对原始图像数据进行双重处理的系统,方法和装置。 根据实施例,替代图像处理可以在通过主图像处理处理原始图像数据的帧的第二副本之前分析原始图像数据的帧的第一副本。 此后,主图像处理可以处理原始图像的帧的第二副本。 可以至少部分地基于原始图像数据的帧的第一副本的分析来校准主图像处理。
-
公开(公告)号:US20130342739A1
公开(公告)日:2013-12-26
申请号:US13705122
申请日:2012-12-04
Applicant: APPLE INC.
Inventor: Andrew Yanowitz , Ting Chen , David Daming Kuo , Haitao Guo
IPC: H04N5/367
CPC classification number: H04N5/367 , G06T5/00 , G06T2200/28 , H04N5/2173 , H04N5/23229 , H04N5/3572 , H04N9/045
Abstract: Systems, methods, and devices for dual processing of raw image data by main image processing and alternative image processing capabilities of an electronic device are provided. According to an embodiment, alternative image processing may analyze a first copy of a frame of raw image data before a second copy of the frame of raw image data is processed by main image processing. Thereafter, the main image processing may process the second copy of the frame of raw image. The main image processing may be calibrated based at least in part on the analysis of the first copy of the frame of raw image data.
Abstract translation: 提供了通过主要图像处理和替代图像处理能力对原始图像数据进行双重处理的系统,方法和装置。 根据实施例,替代图像处理可以在通过主图像处理处理原始图像数据的帧的第二副本之前分析原始图像数据的帧的第一副本。 此后,主图像处理可以处理原始图像的帧的第二副本。 可以至少部分地基于原始图像数据的帧的第一副本的分析来校准主图像处理。
-
-
-
-
-
-
-
-
-