Abstract:
Embodiments provide an apparatus and method for detecting a color checker in an image including: an extracting unit to extract sliding window features of sliding windows at multiple positions from an input image by using a predefined sliding window; a selecting unit to select, by using multiple predefined sliding window templates, a sliding window having a minimum distance from each sliding window template from the sliding windows at the multiple positions as a matched sliding window of the sliding window template; and a determining unit to determine segmentation lines of colored squares of a color checker in the input image according to a matching result of each sliding window template.
Abstract:
The invention provides an apparatus and method for extracting a boundary of an object in an image and an electronic device. The apparatus includes: a position determining unit, configured to determine a start point and an end point of a boundary of an object in an image and to determine a position of a reference point relevant to the start point and the end point; a first direction determining unit, configured to determine a first direction of the boundary; a gradient map obtaining unit, configured to obtain a gradient map of a first region; a gradient attenuating unit, configured to attenuate in the gradient map the gradients of a second region; and an extracting unit, configured to extract a boundary of an object. The technology of the invention can improve the accuracy of boundary extracting, and can be applied in the field of image processing.
Abstract:
The disclosure provides a document processing apparatus, method and a scanner. The document processing apparatus includes: a text line extraction unit extracting a text line from an input document; a language classification unit determining whether an OCR process is necessary for a language of the input document; an OCR unit determining, by performing the OCR process, an OCR confidence in the case that it is determined that the OCR process is necessary; an graphic feature recognition unit determining an graphic feature recognition confidence; and a determination unit determining a combination confidence based on at least one of the determined graphic feature recognition confidences and the determined OCR confidences, and determining an orientation of the input document based on the combination confidences. This technical solution can determine better an orientation of the document, and is especially applicable when the quality of the image of the document is deteriorated.
Abstract:
The present invention relates to a convolutional-neural-network-based classifier, a classifying method by using a convolutional-neural-network-based classifier and a method for training the convolutional-neural-network-based classifier. The convolutional-neural-network-based classifier comprises: a plurality of feature map layers, at least one feature map in at least one of the plurality of feature map layers being divided into a plurality of regions; and a plurality of convolutional templates corresponding to the plurality of regions respectively, each of the convolutional templates being used for obtaining a response value of a neuron in the corresponding region.
Abstract:
A method of presenting prompt information by utilizing a neural network which includes a BERT model and a graph convolutional neural network (GCN), comprising: generating a first vector based on a combination of an entity, a context of the entity, a type of the entity and a part of speech of the context by using BERT model; generating a second vector based on each of predefined concepts by using BERT model; generating a third vector based on a graph which is generated based on the concepts and relationships thereamong, by using GCN; generating a fourth vector by concatenating the second and third vectors; calculating semantic similarity between the entity and each concept based on the first and fourth vectors; determining, based on the first vector and the semantic similarity, that the entity corresponds to one of the concepts; and generating the prompt information based on the determined concept.
Abstract:
A method and a device for simulating atomic dynamics includes setting initial positions for multiple specific atoms in a specific scene; calculating, based on the initial positions, positions of the multiple specific atoms at each time in a first time series by utilizing a Large-scale Atomic/Molecular Massively Parallel Simulator (LAMMPS) configured with respect to the specific scene, as real positions; calculating, based on the initial positions, positions of the multiple specific atoms at the same time in the first time series by utilizing a generative adversarial network (GAN), as predicted positions; improving a configuration of the GAN based on the real positions and the predicted positions at a same time. Initial positions are settable for multiple atoms to be simulated in a scene; positions of the multiple atoms to be simulated are calculated at each time in a second time series in the scene by utilizing the improved GAN.
Abstract:
A device and a method for improving a processing speed of a neural network and applications thereof in the neural network where the device includes a processor configured to perform: determining, according to a predetermined processing speed improvement target, a dimension reduction amount of each of one or more parameter matrixes in the neural network obtained through training; preprocessing each parameter matrix based on the dimension reduction amount of the parameter matrix; and retraining the neural network based on a result of the preprocessing to obtain one or more dimension reduced parameter matrixes so as to ensure performance of the neural network meets a predetermined requirement. According to the embodiments of the present disclosure, it is possible to significantly improve the processing speed of the neural network while ensuring the performance of the neural network meets the predetermined requirement.
Abstract:
A recognition apparatus based on a deep neural network, a training apparatus and methods thereof. The deep neural network is obtained by inputting training samples comprising positive samples and negative samples into an input layer of the deep neural network and training. The apparatus includes: a judging unit configured to judge that a sample to be recognized is a suspected abnormal sample when confidences of positive sample classes in a classification result outputted by an output layer of the deep neural network are all less than a predefined threshold value. Hence, reliability of a confidence of a classification result outputted by the deep neural network may be efficiently improved.
Abstract:
The present application relates to a method and a device for correcting a document image captured by an image pick-up device. The method includes: determining world coordinates of four vertices of the document image; calculating an original aspect ratio of the document image based on a correspondence between the world coordinates of the four vertices and projective coordinates of the four vertices in a projective space, and an intrinsic matrix and characteristics of an extrinsic matrix of the image pick-up device; determining a projective transformation matrix based on the world coordinates of the four vertices and the aspect ratio; and obtaining a corrected document image based on the determined projective transformation matrix and the document image. According to the application, perspective transformation can be corrected by using only one captured image and an original image can be recovered based on an original aspect ratio.
Abstract:
An image correction method and an image correction apparatus when the image correction method includes: an identifying step of identifying each pixel in an image as a foreground pixel or a background pixel; a background filling step of estimating brightness of a background corresponding to a foreground pixel based on brightness and gradient of the brightness of background pixels adjacent to the foreground pixel to fill the background located in a position of the foreground pixel, to obtain a background illumination map of the image according to filled backgrounds along with background pixels; and a correcting step of correcting the image based on the brightness of each pixel in the image and the background illumination map. A non-uniform illumination image can be corrected effectively.