ESTIMATING MULTI-PERSON POSES USING GREEDY PART ASSIGNMENT

    公开(公告)号:US20180293753A1

    公开(公告)日:2018-10-11

    申请号:US15480204

    申请日:2017-04-05

    申请人: INTEL CORPORATION

    IPC分类号: G06T7/73

    摘要: An example apparatus for estimating poses includes a person estimator to estimate a number of people based on a significant head count of received refined part detections. The apparatus includes a detection clusterer to cluster the refined part detections based on the estimated number of people to generate clustered part detections. The apparatus includes a candidate selector to select candidate person clusters for each clustered part detection based on proximity to the clustered part detection. The apparatus includes a sequential assigner to calculate a cluster affinity score for each combination of candidate person cluster and clustered part detection, and greedily sequentially assign each clustered part detection to a candidate person cluster based on the cluster affinity score to generate person clusters. The apparatus includes a pose generator to generate a pose for each person cluster.

    Hybrid pixel-domain and compressed-domain video analytics framework

    公开(公告)号:US11166041B2

    公开(公告)日:2021-11-02

    申请号:US16457802

    申请日:2019-06-28

    申请人: Intel Corporation

    摘要: In one embodiment, an apparatus comprises processing circuitry to: receive, via a communication interface, a compressed video stream captured by a camera, wherein the compressed video stream comprises: a first compressed frame; and a second compressed frame, wherein the second compressed frame is compressed based at least in part on the first compressed frame, and wherein the second compressed frame comprises a plurality of motion vectors; decompress the first compressed frame into a first decompressed frame; perform pixel-domain object detection to detect an object at a first position in the first decompressed frame; and perform compressed-domain object detection to detect the object at a second position in the second compressed frame, wherein the object is detected at the second position in the second compressed frame based on: the first position of the object in the first decompressed frame; and the plurality of motion vectors from the second compressed frame.