摘要:
The invention relates to a system (5, 15) and a method for detecting and annotating faces on-the-fly in video data. The annotation (29) is performed by modifying the pixel content of the video and is thereby independent of file types, protocols and standards. The invention can also perform real-time face-recognition by comparing detected faces with known faces from storage, so that the annotation can contain personal information (38) relating to the face. The invention can be applied at either end of a transmission channel and is particularly applicable in videoconferences, Internet classrooms, etc.
摘要:
The present invention relates to a method of communication (113) between a user (101) and a system (103) where it is detected whether the user looks at the system or somewhere else, and based thereon adjusting the communication.
摘要:
As medical imaging becomes more affordable, and the diversity of diagnostic modalities and therapeutic treatments increase, the amount of data being stored increases, and the problem becomes even more critical. One approach to improve retrieval efficiency of images is to employ semantics to establish a defined set of search and classification terms. However, such semantic systems still require the user to make a selection of the most appropriate term or terms to classify a report or image, and the accuracy of the results are thus dependent on the skill and knowledge of the classifier. According to a first aspect of the invention, a retriever is provided for retrieving a medical image having a searchable attribute, the retriever being configured to interface with a semantic database and an image database, and wherein the searchable attribute is determined by segmenting the medical image, using the anatomical model.
摘要:
A system and method for identifying an abnormality of an anatomical structure. The system and method segments, using a processor, the anatomical structure imaged in a volumetric image of a plurality of control patients to produce a control segmentation of the anatomical structures of each of the control patients, obtains a normative dataset by extracting a statistical representation of a morphology of the control segmentations, segments the anatomical structure of a patient being analyzed for abnormalities to produce a patient segmentation and compares the patient segmentation to the normative dataset obtained from the control segmentations.
摘要:
A system and method for automatic segmentation, performed by selecting a deformable model of an anatomical structure of interest imaged in a volumetric image, the deformable model formed of a plurality of polygons including vertices and edges, displaying the deformable model on a display, detecting a feature point of the anatomical structure of interest corresponding to each of the plurality of polygons and adapting the deformable model by moving each of the vertices toward the corresponding feature points until the deformable model morphs to a boundary of the anatomical structure of interest, forming a segmentation of the anatomical structure of interest.
摘要:
As medical imaging becomes more affordable, and the diversity of diagnostic modalities and therapeutic treatments increase, the amount of data being stored increases, and the problem becomes even more critical. One approach to improve retrieval efficiency of images is to employ semantics to establish a defined set of search and classification terms. However, such semantic systems still require the user to make a selection of the most appropriate term or terms to classify a report or image, and the accuracy of the results are thus dependent on the skill and knowledge of the classifier. According to a first aspect of the invention, a retriever is provided for retrieving a medical image having a searchable attribute, the retriever being configured to interface with a semantic database and an image database, and wherein the searchable attribute is determined by segmenting the medical image, using the anatomical model.
摘要:
For the recognition of coherently spoken speech with a large vocabulary, language model values which take into account the probability of word sequences are considered at word transitions. Prior to the recognition, these language model values are derived on the basis of training speech signals. If the amount of training data is kept within sensible limits, not all word sequences will actually occur, so that the language model values for, for example an N-gram language model must be determined from word sequences of N-1 words actually occurring. In accordance with the invention, these reduced word sequences from each different, complete word sequence are counted only once, irrespective of the actual frequency of occurrence of the complete word sequence or only reduced training sequences which occur exactly once in the training data are taken into account.
摘要:
The present invention relates to the determination of the specific orientation of an object. In order to provide enhanced positioning information of an object to a user, a medical imaging system and a method for operating of a medical imaging system are proposed wherein 2D image data (14) of an object is acquired (12) with an imaging system, wherein the object is provided with at least three markers visible in the 2D image; and wherein (16) the markers are detected in the 2D image; and wherein the spatial positioning and rotation angle (20) of the object in relation to the system geometry is identified (18) on behalf of the markers; and wherein an object-indicator (24) is displayed (22) indicating the spatial positioning and rotation angle of the object.
摘要:
Language models which take into account the probabilities of word sequences are used in speech recognition, in particular in the recognition of fluently spoken language with a wide vocabulary, in order to increase the recognition reliability. These models are obtained from comparatively large quantities of text and accordingly represent values which were averaged over several texts. This means, however, that the language model is not well adapted to peculiarities of a special text. To achieve such an adaptation of a given language model to a special text on the basis of only a short text fragment, according to the invention, it is suggested that first the unigram language model is adapted with the short text and, in dependence thereon, the M-gram language model is subsequently adapted. A method is described for adapting the unigram language model values which automatically carries out a subdivision of the words into semantic classes.
摘要:
A user profile for a plurality of users may be built for speech recognition purposes and for acting as an agent of the user. In some embodiments, a speech processing device automatically receives an utterance from a user. The utterance may be analyzed using signal processing to identify data associated with the user. The utterance may also be analyzed using speech recognition to identify additional data associated with the user. The identified data may be stored in a profile of the user. Data in the user profile may be used to select an acoustic model and/or a language model for speech recognition or to take actions on behalf of the user.