Abstract:
Systems and methods are disclosed to recognize human action from one or more video frames by performing 3D convolutions to capture motion information encoded in multiple adjacent frames and extracting features from spatial and temporal dimensions therefrom; generating multiple channels of information from the video frames, combining information from all channels to obtain a feature representation for a 3D CNN model; and applying the 3D CNN model to recognize human actions.
Abstract:
Systems and methods are disclosed to recognize clothing from videos by detecting and tracking a human; performing face alignment and occlusal detection; and performing age and gender estimation, skin area extraction, and clothing segmentation to a linear support vector machine (SVM) to recognize clothing worn by the human.
Abstract:
The invention relates to a method for the computer-assisted identification of a class of VoIP calls of a first type (spam) in a communication network (internet). Said communication network has a plurality (N) of first subscribers (Tn1-1, . . . , Tn1-5) and a plurality (M) of second subscribers (Tn2-1, . . . , Tn2-7), the first and the second subscribers being allocated a definite characteristic (IP address, telephone number, e-mail address) wherein, at least some of the first subscribers (Tn1-1, . . . , Tn1-5) are allocated, respectively, with at least one list (white list, black list) which contains at least one definite characteristic of the second subscriber. During a call of one of the second subscribers to one of the first subscribers, a control screens to see whether the characteristic of the second subscriber is on the list of the first subscriber and in the event that the second subscriber is not on the list of the called first subscribers, the lists of the additional first subscriber are used to make a decision whether the call is classified as a call of the first type (spam or trusted caller).
Abstract:
A compound of ferrous L-threonate with structure (I), its compostions and methods useful for iron supplementation for mammals, particularly for human body to improve and treat nutritional iron-deficiency anemia, blood loss anemia and hemolytic anemia.
Abstract:
Systems and methods process an image having a plurality of pixels includes an image sensor to capture an image; a first-layer to encode local patches on an image region; and a second layer to jointly encode patches from the same image region.
Abstract:
Method and apparatus for generating a set of generator polynomials for use as a tail biting convolutional code to operate on data transmitted over a channel comprises: (0) specifying a constraint and a low code rate for a tail biting convolutional code, where the low rate code is lower than 1/n (n being an integer greater than 4); (1) selecting valid combinations of generator polynomials to include in a pool of potential codes, each valid combination being a potential code of the low rate code; (2) determining first lines of a weight spectrum for each potential code in the pool and including potential codes of the pool having best first lines in a candidate set; (3) determining best codes of the candidate set based on the first L number of lines in the weight spectrum; (4) selecting an optimum code(s) from the best codes; and (5) configuring a circuit(s) of a data transceiver to implement the optimum code(s).
Abstract:
Systems and methods are disclosed to search for a query image, by detecting local invariant features and local descriptors; retrieving best matching images by quantizing the local descriptors with a vocabulary tree; and reordering retrieved images with results from the vocabulary tree quantization.
Abstract:
Systems and methods are disclosed for generating a recommendation by performing collaborative filtering using an infinite dimensional matrix factorization; generating one or more recommendations using the collaborative filtering; and displaying the recommendations to a user.
Abstract:
A method and system for training a neural network of a visual recognition computer system, extracts at least one feature of an image or video frame with a feature extractor; approximates the at least one feature of the image or video frame with an auxiliary output provided in the neural network; and measures a feature difference between the extracted at least one feature of the image or video frame and the approximated at least one feature of the image or video frame with an auxiliary error calculator. A joint learner of the method and system adjusts at least one parameter of the neural network to minimize the measured feature difference.
Abstract:
Portable communication devices are provided. A portable communication device includes a housing, an antenna, and a push button. The housing includes a first receiving portion and a second receiving portion. The antenna is detachably disposed in the first receiving portion. The antenna includes a first engaging portion. The battery is detachably disposed in the second receiving portion. The battery includes a second engaging portion. The push button is disposed in the housing and includes a third engaging portion and a fourth engaging portion. The third engaging portion movably engages with the first engaging portion to fix the antenna to the housing. The fourth engaging portion movably engages with the second engaging portion to fix the battery to the housing.