Abstract:
A method and an apparatus are provided. The method includes receiving a video with a first plurality of frames having a first resolution; generating a plurality of warped frames from the first plurality of frames based on a first type of motion compensation; generating a second plurality of frames having a second resolution, wherein the second resolution is of higher resolution than the first resolution, wherein each of the second plurality of frames having the second resolution is derived from a subset of the plurality of warped frames using a convolutional network; and generating a third plurality of frames having the second resolution based on a second type of motion compensation, wherein each of the third plurality of frames having the second resolution is derived from a fusing a subset of the second plurality of frames.
Abstract:
An apparatus, a method, a method of manufacturing and apparatus, and a method of constructing an integrated circuit are provided. The apparatus includes a teacher network; a student network; a plurality of knowledge bridges between the teacher network and the student network, where each of the plurality of knowledge bridges provides a hint about a function being learned, and where a hint includes a mean square error or a probability; and a loss function device connected to the plurality of knowledge bridges and the student network. The method includes training a teacher network; providing hints to a student network by a plurality of knowledge bridges between the teacher network and the student network; and determining a loss function from outputs of the plurality of knowledge bridges and the student network.
Abstract:
An apparatus and a method. The apparatus includes a plurality of long short term memory (LSTM) networks, wherein each of the plurality of LSTM networks is at a different network layer, wherein each of the plurality of LSTM networks is configured to determine a residual function, wherein each of the plurality of LSTM networks includes an output gate to control what is provided to a subsequent LSTM network, and wherein each of the plurality of LSTM networks includes at least one highway connection to compensate for the residual function of a previous LSTM network.
Abstract:
A system, method and device for object identification is provided. The method of identifying objects includes, but is not limited to, calculating feature vectors of the object, calculating feature vectors of the object's context and surroundings, combining feature vectors of the object, calculating likelihood metrics of combined feature vectors, calculating verification likelihood metrics against contact list entries, calculating a joint verification likelihood metric using the verification likelihood metrics, and identifying the object based on the joint verification likelihood metric.
Abstract:
A method and system for decoding a signal are provided. The method includes receiving a signal, where the signal includes at least one symbol; decoding the signal in stages, where each at least one symbol is decoded into at least one bit per stage, wherein a Log-Likelihood Ratio (LLR) and a path metric are determined for each possible path for each at least one bit at each stage; determining the magnitudes of the LLRs; identifying K bits of the signal with the smallest corresponding LLR magnitudes; identifying, for each of the K bits, L possible paths with the largest path metrics at each decoder stage for a user-definable number of decoder stages; performing forward and backward traces, for each of the L possible paths, to determine candidate codewords; performing a Cyclic Redundancy Check (CRC) on the candidate codewords, and stopping after a first candidate codeword passes the CRC.
Abstract:
A system and a method are disclosed for processing and combining feature maps using a hardware friendly multi-kernel convolution block (HFMCB). The method including splitting an input feature map into a plurality of feature maps, each of the plurality of feature maps having a reduced number of channels; processing each of the plurality of feature maps with a different series of kernels; and combining the processed plurality of feature maps.
Abstract:
A system and a method are disclosed for neural architecture search. In some embodiments, the method includes: processing a training data set with a neural network during a first epoch of training of the neural network; computing a training loss using a smooth maximum unit regularization value; and adjusting a plurality of multiplicative connection weights and a plurality of parametric connection weights of the neural network in a direction that reduces the training loss.
Abstract:
A server and method thereof are provided for use in a federated network. A method includes receiving local updates from client devices; updating a global model based on the received local updates; quantizing the updated global model; reconstructing feature maps based on the received local updates; refining the quantized, updated global model based on the reconstructed feature maps; and transmitting the refined, quantized, updated global model to the client devices.
Abstract:
A method and system for multi-frame contextual attention are provided. The method includes includes obtaining a reference frame to be processed, identifying context frames with respect to the reference frame, and producing a refined reference frame by processing the obtained reference frame based on the context frames.
Abstract:
A system and method for processing an input video while maintaining temporal consistency across video frames is provided. The method includes converting the input video from a first frame rate to a second frame rate, wherein the second frame rate is a faster frame rate than the first frame rate; generating processed frames of the input video at the second frame rate; and aggregating the processed frames using temporal sliding window aggregation to yield a processed output video at a third frame rate.