GENERATING A COMPRESSED REPRESENTATION OF A NEURAL NETWORK WITH PROFICIENT INFERENCE SPEED AND POWER CONSUMPTION

    公开(公告)号:US20210125070A1

    公开(公告)日:2021-04-29

    申请号:US17139825

    申请日:2020-12-31

    Inventor: Wei Wang Wei Jiang

    Abstract: The disclosure relates to technology for generating a compressed neural network. A weight tensor is received from a neural network to be compressed, and it is reordered to be compressed to have an inner two-dimensional (2D) shape and a 2D sparse bitmap. A layered structure is generated that represents the reordered weight tensor, and the reordered weight tensor is divided into a group of coefficients (GOCs). An encoding mode is selected to generate a quantized reordered weight tensor using one of a codebook or direct quantization, and a column swapped quantized reordered weigh tensor is generated. A compressed neural network is formed by encoding and the compressed representation of the neural network is transmitted to a target system for decompression.

    User image verification
    2.
    发明授权

    公开(公告)号:US10769261B2

    公开(公告)日:2020-09-08

    申请号:US15975282

    申请日:2018-05-09

    Inventor: Wei Jiang Wei Wang

    Abstract: A computer-implemented method verifies an image based authentication via one or more processors performing operations including receiving image data corresponding to a face identified by a facial recognition system, processing the received raw image data via a deep neural network trained on training data that includes images of both verified and fake faces to perform a temporal facial analysis, and generating a verification signal in response to the temporal facial analysis to indicate whether the raw image data is fake.

    Robot navigation and object tracking

    公开(公告)号:US10695911B2

    公开(公告)日:2020-06-30

    申请号:US15870626

    申请日:2018-01-12

    Inventor: Wei Jiang Wei Wang

    Abstract: A system and method of tracking an object and navigating an object tracking robot includes receiving tracking sensor input representing the object and an environment at multiple times, responsive to the tracking sensor input, calculating positions of the robot and the object at the multiple times, and using a computer implemented deep reinforcement learning (DRL) network trained as a function of tracking quality rewards and robot navigation path quality rewards, the DRL network being responsive to the calculated positions of the robot and the object at the multiple times to determine possible actions specifying movement of the object tracking robot from a current position of the robot and target, determine quality values (Q-values) for the possible actions, and select an action as a function of the Q-values. A method of training the DRL network is also included.

    Three-dimensional (3D) reconstructions of dynamic scenes using a reconfigurable hybrid imaging system

    公开(公告)号:US10529086B2

    公开(公告)日:2020-01-07

    申请号:US15820935

    申请日:2017-11-22

    Inventor: Wei Jiang Wei Wang

    Abstract: A computer-implemented method for a three-dimensional (3D) reconstruction of a dynamic scene includes receiving a plurality of color image sequences from a plurality of color imaging sensors, and at least one depth image sequence from at least one depth imaging sensor, where a color imaging sensor quantity is larger than a depth imaging sensor quantity. A plurality of calibrated color image sequences and at least one calibrated depth image sequence are generated based on the plurality of color imaging sequences and the at least one depth image sequence. A plurality of initial 3D patches is constructed using the plurality of calibrated color image sequences and the at least one calibrated depth image sequence. A 3D patch cloud is generated by expanding the plurality of initial 3D patches.

    Fine-grained object recognition in robotic systems

    公开(公告)号:US10322510B2

    公开(公告)日:2019-06-18

    申请号:US15449541

    申请日:2017-03-03

    Inventor: Wei Jiang Wei Wang

    Abstract: A method for fine-grained object recognition in a robotic system is disclosed that includes obtaining an image of an object from an imaging device. Based on the image, a deep category-level detection neural network is used to detect pre-defined categories of objects. A feature map is generated for each pre-defined category of object detected by the deep category-level detection neural network. Embedded features are generated, based on the feature map, using a deep instance-level detection neural network corresponding to the pre-defined category of the object, wherein each pre-defined category of an object comprises a corresponding different instance-level detection neural network. An instance-level of the object is determined based on classification of the embedded features.

    ACTIVITY DETECTION BY JOINT HUMAN AND OBJECT DETECTION AND TRACKING

    公开(公告)号:US20190180090A1

    公开(公告)日:2019-06-13

    申请号:US15835195

    申请日:2017-12-07

    Abstract: A computing device includes a communication interface, a memory, and processing circuitry. The processing circuitry is coupled to the communication interface and to the memory and is configured to execute the operational instructions to perform various functions. The computing device is configured to process a video frame of a video segment on a per-frame basis and based on joint human-object interactive activity (HOIA) to generate a per-frame pairwise human-object interactive (HOI) feature based on a plurality of candidate HOI pairs. The computing device is also configured to process the per-frame pairwise HOI feature to identify a valid HOI pair among the plurality of candidate HOI pairs and to track the valid HOI pair through subsequent frames of the video segment to generate a contextual spatial-temporal feature for the valid HOI pair to be used in activity detection.

    Context Reduction Of Palette Run Type In High Efficiency Video Coding (HEVC) Screen Content Coding (SCC)
    8.
    发明申请
    Context Reduction Of Palette Run Type In High Efficiency Video Coding (HEVC) Screen Content Coding (SCC) 审中-公开
    高效率视频编码(HEVC)屏幕内容编码(SCC)中调色板运行类型的上下文减少

    公开(公告)号:US20170055003A1

    公开(公告)日:2017-02-23

    申请号:US14831408

    申请日:2015-08-20

    CPC classification number: H04N19/90 H04N19/13 H04N19/593 H04N19/70 H04N19/93

    Abstract: An encoding apparatus includes a processor configured to receive a video frame including screen content and generate a block containing an index map of colors for screen content in the video frame. The block includes a first string of index values and a second string of the index values immediately below the first string. The processor is also configured to encode a second string palette_run_type flag corresponding to the second string without referencing a first string palette_run_type flag corresponding to the first string and using a single available context. A transmitter operably coupled to the processor is configured to transmit the second string palette_run_type flag in a bitstream to a decoding apparatus.

    Abstract translation: 编码装置包括:处理器,被配置为接收包括屏幕内容的视频帧,并生成包含视频帧中的屏幕内容的颜色的索引图的块。 该块包括第一个索引值串和第一个字符串正下方的索引值的第二个字符串。 处理器还被配置为对与第二字符串相对应的第二字符串palette_run_type标志进行编码,而不引用与第一字符串相对应的第一字符串palette_run_type标志并使用单个可用上下文。 可操作地耦合到处理器的发射机被配置为将比特流中的第二串palette_run_type标志发送到解码装置。

    Advanced Coding Techniques For High Efficiency Video Coding (HEVC) Screen Content Coding (SCC) Extensions
    9.
    发明申请
    Advanced Coding Techniques For High Efficiency Video Coding (HEVC) Screen Content Coding (SCC) Extensions 审中-公开
    高效率视频编码(HEVC)屏幕内容编码(SCC)扩展的高级编码技术

    公开(公告)号:US20160373756A1

    公开(公告)日:2016-12-22

    申请号:US15178316

    申请日:2016-06-09

    CPC classification number: H04N19/513 H04N19/119 H04N19/176 H04N19/96

    Abstract: An encoding apparatus, decoding apparatus, and coding methods are provided. A method of decoding including receiving, by a decoder, a bitstream from an encoder, scanning, using the decoder, the bitstream to identify a first flag corresponding to a string of index values in a block other than a last string and a second flag corresponding to the last string of index values from the block, determining, by the decoder, that a context model used to encode the first flag is the same as the context model used to encode the second flag, and generating, by the decoder, a video frame using the context model

    Abstract translation: 提供了编码装置,解码装置和编码方法。 一种解码方法,包括由解码器从编码器接收比特流,使用解码器扫描比特流,以识别与除最后一个字符串以外的块中的索引值串相对应的第一标志和相应的第二标志 对于来自块的索引值的最后一串,由解码器确定用于对第一标记进行编码的上下文模型与用于编码第二标志的上下文模型相同,并且由解码器生成视频 框架使用上下文模型

    Multi-stage image recognition for a non-ideal environment

    公开(公告)号:US11037029B2

    公开(公告)日:2021-06-15

    申请号:US16157983

    申请日:2018-10-11

    Inventor: Wei Jiang Wei Wang

    Abstract: Provided are an apparatus and a method of multi-stage image recognition. For the multi-stage image recognition, categorized object data is received from a first deep neural network. A second deep neural network is trained on subcategory customization data that relates to a non-ideal environment when the second deep neural network produces invalid subcategorized object data from the categorized object data, and generates an image recognition result using the second deep neural network as trained.

Patent Agency Ranking