-
公开(公告)号:US20190200078A1
公开(公告)日:2019-06-27
申请号:US16327674
申请日:2017-08-15
Applicant: NOKIA TECHNOLOGIES OY
Inventor: Sounak BHATTACHARYA , Lixin FAN , Yu YOU , Tinghuai WANG
IPC: H04N21/454 , G11B27/031 , G11B27/34 , G06F17/27 , H04N21/475 , H04N21/488
CPC classification number: H04N21/4542 , G06F17/2785 , G11B27/031 , G11B27/28 , G11B27/34 , H04N21/4307 , H04N21/4532 , H04N21/4755 , H04N21/4756 , H04N21/4882
Abstract: An apparatus configured to, in respect of a video provided to a user and wherein a plurality of comments are displayed such that they scroll across the video, based on the comments and one or more comment filtering rules, provide for comment filtering as each of the plurality of comments scrolls to meet a filter line, wherein the filter line extends across at least part of the video to define a first area overlaying the video and a non-overlapping second area overlaying the video, such that in the first area the plurality of comments are provided for display scrolling therethrough towards the filter line prior to comment filtering and in the second area the plurality of comments are provided for display with any comments of the plurality of comments that satisfy the one or more comment filtering rules removed from display.
-
公开(公告)号:US20210232848A1
公开(公告)日:2021-07-29
申请号:US15734785
申请日:2019-08-14
Applicant: Nokia Technologies OY
Inventor: Tinghuai WANG
Abstract: Apparatus for processing image data associated with at least one input image, including a convolutional neural network, CNN, -based encoder configured to provide a plurality of hierarchical feature maps based on the image data, a decoder configured to provide output data based on the plurality of feature maps, wherein the decoder includes a convolutional long short-term memory, Conv-LSTM, module configured to sequentially process at least some of the plurality of hierarchical feature maps.
-
公开(公告)号:US20220083866A1
公开(公告)日:2022-03-17
申请号:US17423314
申请日:2020-01-02
Applicant: Nokia Technologies Oy
Inventor: Tinghuai WANG , Lixin FAN
Abstract: There is provided an apparatus comprising means for performing: training a neural network by applying an optimization loss function, wherein the optimization loss function considers empirical errors and model redundancy (210); pruning a trained neural network by removing one or more filters that have insignificant contributions from a set of filters (220); and providing the pruned neural network for transmission (230).
-
公开(公告)号:US20190012804A1
公开(公告)日:2019-01-10
申请号:US16019349
申请日:2018-06-26
Applicant: Nokia Technologies Oy
Inventor: Tinghuai WANG , Yu YOU , Lixin FAN
IPC: G06T7/73 , H04N5/232 , H04N13/111 , G06T5/00 , G06T3/40 , G06T17/00 , G06T7/593 , H04N13/243
Abstract: This specification describes a method comprising generating, from a plurality of first images representing a scene, at least one stereoscopic panoramic image comprising a left-eye panoramic image and a right-eye panoramic image. Depth map images are generated corresponding to each of the left and right-eye panoramic images. Each of the left and right-eye panoramic images are re-projected to obtain a plurality of second images, each associated with a respective virtual camera. Each of the left and right-eye depth map images are re-projected to generate a re-projected depth map associated with each second image. A first three-dimensional model of the scene based on the plurality of second images is determined. A second three-dimensional model of the scene based on the plurality of re-projected depth map images is determined. One or more corresponding points of the first and second three-dimensional models is or are compared to determine a scaling factor.
-
公开(公告)号:US20180114071A1
公开(公告)日:2018-04-26
申请号:US15785711
申请日:2017-10-17
Applicant: Nokia Technologies Oy
Inventor: Tinghuai WANG
CPC classification number: G06K9/00765 , G06K9/00624 , G06K9/00744 , G06K9/4628 , G06K9/6256 , G06K9/6267 , G06K9/6271 , G06N3/04 , G06N3/0445 , G06N3/0454 , G06N3/08 , G06T3/40
Abstract: The invention relates to a method, an apparatus and a computer program product for analyzing media content. The method comprises receiving media content objects by a feature extractor for extracting a plurality of feature maps from said media content objects; processing the plurality of feature maps in a bidirectional Long-Short Term memory neural network, where the bidirectional Long-Short Term memory neural network is aligned along different directions of the feature maps to produce low resolution feature maps; upsampling the low resolution feature maps to the size of received media content; and assigning each pixel of the upsampled feature maps with a label of maximum likelihood for segmenting objects from the upsampled feature maps.
-
公开(公告)号:US20170345153A1
公开(公告)日:2017-11-30
申请号:US15597480
申请日:2017-05-17
Applicant: Nokia Technologies Oy
Inventor: Tinghuai WANG
CPC classification number: G06T7/11 , G06K9/4604 , G06T7/20 , G06T7/215 , G06T7/246 , G06T7/70 , G06T2207/10016 , G06T2207/20084
Abstract: The invention relates to a method and an apparatus implementing the method. The method comprises extracting region proposals from a media content; selecting a set of region proposals corresponding to an object in the media content; identifying objects of interest; determining an object-specific representation by an iterative tracking method; sampling positive examples from the set of tracked region proposal groups obtained from the iterative tracking method; and performing object segmentation.
-
公开(公告)号:US20190213769A1
公开(公告)日:2019-07-11
申请号:US16325771
申请日:2017-08-15
Applicant: NOKIA TECHNOLOGIES OY
Inventor: Tinghuai WANG , Lixin FAN , Yu YOU
IPC: G06T11/60 , G06F17/27 , H04N13/183
Abstract: An apparatus configured to, in respect of virtual reality content comprising video imagery configured to provide a virtual reality space wherein a virtual reality view presented to a user provides for viewing of the VR space; based on a comment made by and a virtual location of a commenting-user in the virtual reality space when the comment was made; provide for determination of a point of interest in the virtual reality space, the point of interest identified based on, at least, the virtual location of the commenting-user when the comment was made and semantic analysis of the comment to identify the point of interest surrounding the virtual location to which the comment refers, the point of interest associated with the comment thereby enabling the comment to be overlaid over the virtual reality view of the video imagery.
-
公开(公告)号:US20180314894A1
公开(公告)日:2018-11-01
申请号:US15956878
申请日:2018-04-19
Applicant: Nokia Technologies Oy
Inventor: Tinghuai WANG
Abstract: A method, an apparatus and a computer program product are provided, wherein the method comprises receiving a video comprising video frames as an input; generating set of object proposals from the video, the set of object proposals comprising positive object proposals and negative object proposals; generating object tracklets comprising regions appearing in consecutive frames of the video, said regions corresponding to object proposals with a high confidence; constructing a graph for the object proposals to rescore the object proposals in the generated object tracklets; and aggregating the rescored object proposals to produce an object detection.
-
9.
公开(公告)号:US20180293805A1
公开(公告)日:2018-10-11
申请号:US15948012
申请日:2018-04-09
Applicant: Nokia Technologies Oy
Inventor: Tinghuai WANG , Yu YOU , Lixin FAN
IPC: G06T19/20 , G03B37/04 , H04N13/243 , H04N5/232 , H04N5/225
Abstract: A method comprising performing image re-projection on each of a plurality of first images of a scene, thereby to generate a plurality of re-projected second images of the scene, wherein each first image of the scene is captured by a respective camera of a first multi-directional image capture apparatus and each second image of the scene is associated with a respective virtual camera; processing the plurality of second images based on a previously generated virtual three dimensional model of the scene, thereby to generate respective positions of the virtual cameras associated with the second images; and determining a position of the first multi-directional image capture apparatus based on one or more of the generated positions of the virtual cameras.
-
-
-
-
-
-
-
-