摘要:
Multiple cameras are configured for use in video analytics. A single configuration tool is provided. The interrelationships between cameras are included within the configuration. Using a combination of text entry fields, registration of the cameras on a floor or other map, and marking on images from the cameras, an efficient workflow for configuration may be provided.
摘要:
Multiple cameras are configured for use in video analytics. A single configuration tool is provided. The interrelationships between cameras are included within the configuration. Using a combination of text entry fields, registration of the cameras on a floor or other map, and marking on images from the cameras, an efficient workflow for configuration may be provided.
摘要:
A method of detecting an object in image data that is deemed to be a threat includes annotating sections of at least one training image to indicate whether each section is a component of the object, encoding a pattern grammar describing the object using a plurality of first order logic based predicate rules, training distinct component detectors to each identify a corresponding one of the components based on the annotated training images, processing image data with the component detectors to identify at least one of the components, and executing the rules to detect the object based on the identified components.
摘要:
A method of detecting an object in image data that is deemed to be a threat includes annotating sections of at least one training image to indicate whether each section is a component of the object, encoding a pattern grammar describing the object using a plurality of first order logic based predicate rules, training distinct component detectors to each identify a corresponding one of the components based on the annotated training images, processing image data with the component detectors to identify at least one of the components, and executing the rules to detect the object based on the identified components.
摘要:
First order predicate logics are provided, extended with a bilattice based uncertainty handling formalism, as a means of formally encoding pattern grammars, to parse a set of image features, and detect the presence of different patterns of interest implemented on a processor. Information from different sources and uncertainties from detections, are integrated within the bilattice framework. Automated logical rule weight learning in the computer vision domain applies a rule weight optimization method which casts the instantiated inference tree as a knowledge-based neural network, to converge upon a set of rule weights that give optimal performance within the bilattice framework. Applications are in (a) detecting the presence of humans under partial occlusions and (b) detecting large complex man made structures in satellite imagery (c) detection of spatio-temporal human and vehicular activities in video and (c) parsing of Graphical User Interfaces.
摘要:
First order predicate logics are provided, extended with a bilattice based uncertainty handling formalism, as a means of formally encoding pattern grmmars, to parse a set of image features, and detect the presence of different patterns of interest implemented on a processor. Information from different sources and uncertainties from detections, are integrated within the bilattice framework. Automated logical rule weight learning in the computer vision domain applies a rule weight optimization method which casts the instantiated inference tree as a knowledge-based neural network, to converge upon a set of rule weights that give optimal performance within the bilattice framework. Applications are in (a) detecting the presence of humans under partial occlusions and (b) detecting large complex man made structures in satellite imagery (c) detection of spatio-temporal human and vehicular activities in video and (c) parsing of Graphical User Interfaces.
摘要:
A method of identifying an optimum treatment for a patient suffering from coronary artery disease, comprising: (i) providing patient information selected from: (a) status in the patient of one or more coronary disease associated biomarkers; (b) one or more items of medical history information selected from prior condition history, intervention history and medication history; (c) one or more items of diagnostic history, if the patient has a diagnostic history; and (d) one or more items of demographic data; (ii) aggregating the patient information in: (a) a Bayesian network; (b) a machine learning and neural network; (c) a rule-based system; and (d) a regression-based system; (iii) deriving a predicted probabilistic adverse event outcome for each intervention comprising percutaneous coronary intervention by placement of a bare metal stent, or a drug-coated stent; or by coronary artery bypass grafting; and (iv) determining the intervention having the lowest predicted probabilistic adverse outcome.
摘要:
A method and system for cooperative diversity visual cognition in a wireless sensor network is disclosed. The method and system are capable of solving distributed visual cognition tasks (for example, online simultaneous reconstruction of 3D models of a large area) by using multiple video streams and exploiting cooperative diversity video sensing information while ensuring an optimal tradeoff between energy consumption and video quality of images received from said multiple video streams.
摘要:
Systems and methods for automatic speech recognition by training a neural network to learn features from raw speech. The system comprises a neural network executing on a computer system and comprising a feature extractor, a label classifier, and a domain classifier. The feature extractor processes raw speech data and generates a first output data. The label classifier processes the first output data and generates a second output data. The domain classifier processes the first output data and generating a third output data. The neural network calculates first loss data based on the second output, and second loss data based on the third output. Further, the neural network is trained to minimize a cross-entropy cost of the label classifier and to maximize a cross-entropy cost of the domain classifier using the first loss data and the second loss data.
摘要:
A method and system for cooperative diversity visual cognition in a wireless sensor network is disclosed. The method and system are capable of solving distributed visual cognition tasks (for example, online simultaneous reconstruction of 3D models of a large area) by using multiple video streams and exploiting cooperative diversity video sensing information while ensuring an optimal tradeoff between energy consumption and video quality of images received from said multiple video streams.