-
公开(公告)号:US11514261B2
公开(公告)日:2022-11-29
申请号:US17056737
申请日:2019-06-19
Applicant: Microsoft Technology Licensing, LLC
Inventor: Jing Liao , Lu Yuan , Dongdong Chen , Mingming He
Abstract: According to implementations of the subject matter described herein, there is provided an image colorization solution. The solution includes determining a similarity between contents presented in a grayscale source image and a color reference image and determining a col or target image corresponding to the source image based on the similarity. Specifically, a first and a second sets of blocks similar and dissimilar to the reference image are determined based on the similarity; a first color for the first set of blocks is determined based on a color of corresponding blocks in the reference image; a second color for the second set of blocks is determined independently of the reference image. Through this solution, it is possible to provide user controllability and customized effects in colorization, and there is no strict requirement on correspondence between the color image and grayscale image, achieving more robustness to selection of color reference images.
-
公开(公告)号:US11308576B2
公开(公告)日:2022-04-19
申请号:US16955601
申请日:2019-01-08
Applicant: Microsoft Technology Licensing, LLC
Inventor: Lu Yuan , Gang Hua , Jing Liao , Dongdong Chen
IPC: G06T3/00 , G06T7/593 , H04N13/156 , H04N13/00
Abstract: In accordance with implementations of the subject matter described herein, there is proposed a solution of visual stylization of stereoscopic images. In the solution, a first feature map for a first source image and a second feature map for a second source image are extracted. The first and second source images correspond to first and second views of a stereoscopic image, respectively. A first unidirectional disparity from the first source image to the second source image is determined based on the first and second source images. First and second target images having a specified visual style are generated by processing the first and second feature maps based on the first unidirectional disparity. Through the solution, a disparity between two source images of a stereoscopic image are taken into account when performing the visual style transfer, thereby maintaining a stereoscopic effect in the stereoscopic image consisting of the target images.
-
公开(公告)号:US12223412B2
公开(公告)日:2025-02-11
申请号:US17123697
申请日:2020-12-16
Applicant: Microsoft Technology Licensing, LLC
Inventor: Yinpeng Chen , Xiyang Dai , Mengchen Liu , Dongdong Chen , Lu Yuan , Zicheng Liu , Ye Yu , Mei Chen , Yunsheng Li
Abstract: A computer device for automatic feature detection comprises a processor, a communication device, and a memory configured to hold instructions executable by the processor to instantiate a dynamic convolution neural network, receive input data via the communication network, and execute the dynamic convolution neural network to automatically detect features in the input data. The dynamic convolution neural network compresses the input data from an input space having a dimensionality equal to a predetermined number of channels into an intermediate space having a dimensionality less than the number of channels. The dynamic convolution neural network dynamically fuses the channels into an intermediate representation within the intermediate space and expands the intermediate representation from the intermediate space to an expanded representation in an output space having a higher dimensionality than the dimensionality of the intermediate space. The features in the input data are automatically detected based on the expanded representation.
-
公开(公告)号:US12190588B2
公开(公告)日:2025-01-07
申请号:US17339413
申请日:2021-06-04
Applicant: Microsoft Technology Licensing, LLC
Inventor: Dongdong Chen , Qiankun Liu , Lu Yuan , Lei Zhang
Abstract: A system for tracking a target object across a plurality of image frames. The system comprises a logic machine and a storage machine. The storage machine holds instructions executable by the logic machine to calculate a trajectory for the target object over one or more previous frames occurring before a target frame. Responsive to assessing no detection of the target object in the target frame, the instructions are executable to predict an estimated region for the target object based on the trajectory, predict an occlusion center based on a set of candidate occluding locations for a set of other objects within a threshold distance of the estimated region, each location of the set of candidate occluding locations overlapping with the estimated region, and automatically estimate a bounding box for the target object in the target frame based on the occlusion center.
-
公开(公告)号:US11989956B2
公开(公告)日:2024-05-21
申请号:US17222879
申请日:2021-04-05
Applicant: Microsoft Technology Licensing, LLC
Inventor: Xiyang Dai , Yinpeng Chen , Bin Xiao , Dongdong Chen , Mengchen Liu , Lu Yuan , Lei Zhang
CPC classification number: G06V20/64 , G02B27/0172 , G06T3/06 , G06T3/40
Abstract: Systems and methods for object detection generate a feature pyramid corresponding to image data, and rescaling the feature pyramid to a scale corresponding to a median level of the feature pyramid, wherein the rescaled feature pyramid is a four-dimensional (4D) tensor. The 4D tensor is reshaped into a three-dimensional (3D) tensor having individual perspectives including scale features, spatial features, and task features corresponding to different dimensions of the 3D tensor. The 3D tensor is used with a plurality of attention layers to update a plurality of feature maps associated with the image data. Object detection is performed on the image data using the updated plurality of feature maps.
-
公开(公告)号:US11593615B2
公开(公告)日:2023-02-28
申请号:US16469512
申请日:2017-12-12
Applicant: MICROSOFT TECHNOLOGY LICENSING, LLC
Inventor: Gang Hua , Lu Yuan , Jing Liao , Dongdong Chen
Abstract: Image stylization is based on a learning network. A learning network is trained with a plurality of images and a reference image with a particular texture style. A plurality of different sub-networks of the learning network is trained, respectively. Specifically, one of the sub-networks is trained to extract one or more feature maps from the source image and transform the feature maps with the texture style applied thereon to a target image. Each of the feature maps indicates part of feature information of the source image. Another sub-network is trained to apply a specified texture style to the extracted feature maps, such that the target image generated based on the processed feature maps can embody the specified texture style.
-
公开(公告)号:US12148131B2
公开(公告)日:2024-11-19
申请号:US17733634
申请日:2022-04-29
Applicant: Microsoft Technology Licensing, LLC
Inventor: Dongdong Chen , Xiyang Dai , Yinpeng Chen , Mengchen Liu , Lu Yuan
Abstract: The disclosure herein describes generating an inpainted image from a masked image using a patch-based encoder and an unquantized transformer. An image including a masked region and an unmasked region is received, and the received image is divided into a plurality of patches including masked patches. The plurality of patches is encoded into a plurality of feature vectors, wherein each patch is encoded to a feature vector. Using a transformer, a predicted token is generated for each masked patch using a feature vector encoded from the masked patch, and a quantized vector of the masked patch is determined using generated predicted token and a masked patch-specific codebook. The determined quantized vector of the masked patch is included into a set of quantized vectors associated with the plurality of patches, and an output image is generated from the set of quantized vectors using a decoder.
-
公开(公告)号:US11810385B2
公开(公告)日:2023-11-07
申请号:US17135315
申请日:2020-12-28
Applicant: Microsoft Technology Licensing, LLC
Inventor: Dongdong Chen , Dengpan Fu , Jingdong Wang , Jianmin Bao
Abstract: A computer-vision method includes recognizing a feature representation of a query image depicting an unknown subject. A similarity score is computed between the representation of the query image and feature representations of a plurality of gallery images collectively depicting two or more different subjects with at least two or more gallery images for each subject, and each gallery image having a label identifying which of the subjects is depicted. One or more updated feature representations of the query image are sequentially iterated based on one or more of the computed similarity scores. For each of the one or more updated feature representations, an updated similarity score is computed between the updated feature representation and the feature representations of each of the gallery images. The unknown subject is identified based on a gallery image having a highest updated similarity score.
-
公开(公告)号:US20200342570A1
公开(公告)日:2020-10-29
申请号:US16955601
申请日:2019-01-08
Applicant: Microsoft Technology Licensing, LLC
Inventor: Lu Yuan , Gang Hua , Jing Liao , Dongdong Chen
IPC: G06T3/00 , H04N13/156 , G06T7/593
Abstract: In accordance with implementations of the subject matter described herein, there is proposed a solution of visual stylization of stereoscopic images. In the solution, a first feature map for a first source image and a second feature map for a second source image are extracted. The first and second source images correspond to first and second views of a stereoscopic image, respectively. A first unidirectional disparity from the first source image to the second source image is determined based on the first and second source images. First and second target images having a specified visual style are generated by processing the first and second feature maps based on the first unidirectional disparity. Through the solution, a disparity between two source images of a stereoscopic image are taken into account when performing the visual style transfer, thereby maintaining a stereoscopic effect in the stereoscopic image consisting of the target images.
-
-
-
-
-
-
-
-