Abstract:
A method for recommending video content to a viewer. The method includes the steps of: determining a user profile of the viewer, the user profile indicating the viewing preferences of the viewer; providing a plurality of user profiles; comparing the user profile of the viewer to each of the plurality of user profiles to determine whether each of the plurality of user profiles shares at least one common characteristic with the user profile of the viewer; and determining a recommendation for the video content based on the plurality of user profiles, wherein user profiles having the at least one common characteristic are assigned a greater recommendation weight than user profiles not having the at least one common characteristic.
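The weighting step can be illustrated with a minimal sketch. The abstract does not say how the weights are combined, so the weighted average, the weight values, and the names `recommend_score`, `traits`, and `rating` are all assumptions made for illustration.

```python
from typing import Dict, List, Set


def recommend_score(
    viewer_traits: Set[str],
    profiles: List[Dict],
    shared_weight: float = 2.0,   # assumed weight for profiles sharing a trait
    default_weight: float = 1.0,  # assumed weight for all other profiles
) -> float:
    """Weighted average of other users' ratings for one program."""
    total = 0.0
    weight_sum = 0.0
    for profile in profiles:
        # A profile counts as similar if it shares at least one
        # characteristic with the viewer's profile.
        shares_trait = bool(viewer_traits & profile["traits"])
        weight = shared_weight if shares_trait else default_weight
        total += weight * profile["rating"]
        weight_sum += weight
    return total / weight_sum if weight_sum else 0.0


# Hypothetical data: one similar profile liked the program, one dissimilar did not.
viewer = {"sci-fi", "documentary"}
others = [
    {"traits": {"sci-fi", "comedy"}, "rating": 1.0},
    {"traits": {"romance"}, "rating": 0.0},
]
score = recommend_score(viewer, others)
```

With these assumed weights, the similar profile counts twice as much as the dissimilar one, so the score leans toward the similar user's rating.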
Abstract:
A telephone user who is at a first telephone station and is placed on hold at a second station is prevented from listening to an objectionable audio signal while a call is in process between the user and the second telephone station. The objectionable audio signal is detected while the call is in process and the user is prevented from hearing the objectionable audio signal while the objectionable audio signal is being detected. In response to detection of the end of the objectionable audio signal, the user is coupled to the second telephone station.
Abstract:
A method for generating recommendations, the method including: prompting a user for feedback on at least one preference for generating a recommendation, the at least one preference having two or more categories associated therewith; displaying at least one visual cue corresponding to each of the two or more categories; selecting one of the two or more categories based at least in part on the corresponding at least one visual cue; and generating a recommendation based at least in part on the selecting. Preferably, the recommendation is for video content, such as a television program, and the at least one preference is the genre of the television program, such as action, drama, comedy-action, suspense-action, comedy, documentary, or romance.
Abstract:
A method and apparatus are disclosed for classifying objects using a hierarchical object classification scheme. The hierarchical object classification scheme provides candidate classes with an increasing degree of specificity as the hierarchy is traversed from the root node to the leaf nodes. Each node in the hierarchy has an associated classifier, such as a Radial Basis Function classifier, that determines a probability that an object is a member of the class associated with the node. The nodes of the hierarchical tree are individually trained by any learning technique, such as the exemplary Radial Basis Function Network, that uses appearance-based information of the objects under consideration to classify objects. A disclosed recognition scheme uses a decision criterion based upon recognition error to classify objects.
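The root-to-leaf traversal can be sketched as follows. This is an illustration only: the lambda classifiers stand in for trained Radial Basis Function classifiers, and the fixed probability threshold `min_prob` stands in for the recognition-error criterion, which the abstract does not specify. The names `Node` and `classify` are hypothetical.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class Node:
    label: str
    # Returns the probability that an object belongs to this node's class.
    classifier: Callable[[Dict], float]
    children: List["Node"] = field(default_factory=list)


def classify(root: Node, obj: Dict, min_prob: float = 0.5) -> List[str]:
    """Descend from the root, following the most probable child at each level."""
    path = [root.label]
    node = root
    while node.children:
        scored = [(child.classifier(obj), child) for child in node.children]
        best_prob, best_child = max(scored, key=lambda pair: pair[0])
        if best_prob < min_prob:
            break  # stop at the most specific class we are confident about
        path.append(best_child.label)
        node = best_child
    return path


# Hypothetical stand-in classifiers based on two simple features.
car = Node("car", lambda o: 0.8 if o["length_m"] < 6 else 0.2)
truck = Node("truck", lambda o: 0.8 if o["length_m"] >= 6 else 0.2)
vehicle = Node("vehicle", lambda o: 0.9 if o["wheels"] > 0 else 0.1, [car, truck])
person = Node("person", lambda o: 0.9 if o["wheels"] == 0 else 0.1)
root = Node("object", lambda o: 1.0, [person, vehicle])

result = classify(root, {"wheels": 4, "length_m": 4.2})
```

Each step down the tree yields a more specific class, which mirrors the increasing specificity from root to leaf described above.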
Abstract:
A video content analysis system extends the content analysis capability of one system to multiple channels by spatially multiplexing the multiple channels and appropriately analyzing the spatially multiplexed video signal. The resulting system may be lower in cost than present systems and can work with ancillary equipment such as video recorders. The system also preserves the real-time information inherent in the multiple source signals.
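One way to picture spatial multiplexing is to tile several frames into a single mosaic frame and keep track of which region belongs to which channel. The sketch below assumes four equally sized channels in a 2x2 grid and represents frames as nested lists of pixel values; the abstract does not fix the layout, and `multiplex_2x2` and `region_of` are hypothetical names.

```python
from typing import List, Tuple

Frame = List[List[int]]


def multiplex_2x2(frames: List[Frame]) -> Frame:
    """Tile four equally sized frames into one mosaic (assumed 2x2 layout)."""
    a, b, c, d = frames
    top = [row_a + row_b for row_a, row_b in zip(a, b)]
    bottom = [row_c + row_d for row_c, row_d in zip(c, d)]
    return top + bottom


def region_of(channel: int, height: int, width: int) -> Tuple[int, int, int, int]:
    """Return (top, left, bottom, right) of a channel's tile inside the mosaic."""
    row, col = divmod(channel, 2)
    return (row * height, col * width, (row + 1) * height, (col + 1) * width)


# Four hypothetical 2x2 frames, each filled with its channel number plus one.
frames = [[[n, n], [n, n]] for n in (1, 2, 3, 4)]
mosaic = multiplex_2x2(frames)

# An analyzer working on the mosaic can map a region back to its source channel.
top, left, bottom, right = region_of(3, height=2, width=2)
tile = [row[left:right] for row in mosaic[top:bottom]]
```

Because all channels share one frame, a single analyzer or recorder sees them at the same instant, which is how the timing relationship between sources is kept.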
Abstract:
A monitoring system for an infant, child, invalid, or other person requiring care uses computer vision and hearing, and inputs of other modalities, to analyze the status of a caretaker and/or cared-for person and the surrounding environment. The conditions are classified as normal or alarm conditions, and an informative alarm signal is generated, which may include records of the vision, audio, and other inputs. The system can also solicit responses from the occupants to stimulate a classifiable input and so reduce ambiguity in its state signal.
Abstract:
A camera system comprising a camera, a monitor, an image detector, a touch screen, and a remote control. The monitor displays a field of view of a lens of the camera. The image detector provides a viewer image to the remote control whereby the remote control can determine a desired image within the field of view that the viewer is gazing upon. The touch screen enables a viewer of the monitor to point to the desired image of the field of view and provides signal(s) indicative of the pointing. The remote control is operable to activate various drives to pan and/or tilt the camera, or to zoom and/or focus the lens to the desired image and to follow any movement of the image within the field of view.
Abstract:
A method for classification of objects in video image data. The method includes the steps of: detecting moving objects in the image data; extracting two or more features from each detected moving object in the image data; classifying each moving object for each of the two or more features according to a classification method; and deriving a classification for each moving object based on the classification method for each of the two or more features. Also provided is an apparatus for classification of objects in video image data.
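The per-feature classification and the final derivation step might look like the sketch below. The abstract names neither the features nor the rule for combining per-feature results, so the three features, their thresholds, and the majority vote are all assumptions; `classify_by_feature` and `derive_class` are hypothetical names.

```python
from collections import Counter
from typing import Dict


def classify_by_feature(name: str, value: float) -> str:
    """Classify one object using a single feature (assumed thresholds)."""
    if name == "aspect_ratio":   # height / width of the bounding box
        return "person" if value > 1.5 else "vehicle"
    if name == "speed":          # speed in arbitrary units
        return "vehicle" if value > 5.0 else "person"
    if name == "area":           # bounding-box area in pixels
        return "vehicle" if value > 2000 else "person"
    raise ValueError(f"unknown feature: {name}")


def derive_class(features: Dict[str, float]) -> str:
    """Combine the per-feature classes by majority vote (an assumption)."""
    votes = Counter(classify_by_feature(n, v) for n, v in features.items())
    return votes.most_common(1)[0][0]


# Two hypothetical moving objects with their extracted features.
walker = {"aspect_ratio": 2.4, "speed": 1.2, "area": 900}
small_car = {"aspect_ratio": 0.6, "speed": 12.0, "area": 900}

walker_class = derive_class(walker)
car_class = derive_class(small_car)
```

In the second object, the area feature alone would suggest a person, but the other two features outvote it, which shows why combining several features can be more robust than relying on one.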
Abstract:
Methods and apparatus are disclosed for tracking an object of interest in a video processing system, using clustering techniques. An area is partitioned into approximate regions, referred to as clusters, each associated with an object of interest. Each cluster has associated average pan, tilt and zoom values. Audio or video information, or both, is used to identify the cluster associated with a speaker (or another object of interest). Once the cluster of interest is identified, the camera is focused on the cluster, using the recorded pan, tilt and zoom values, if available. An event accumulator initially accumulates audio (and optionally video) events for a specified time, to allow several speakers to speak. The accumulated audio events are then used by a cluster generator to generate clusters associated with the various objects of interest. After initialization of the clusters, the illustrative event accumulator gathers events at periodic intervals. The mean of the pan and tilt values (and zoom value, if available) occurring in each time interval is then used by a similarity estimator to compute the distance to the various clusters in the database, and the distance is compared against an empirically set threshold. If the distance is greater than the established threshold, then a new cluster is formed, corresponding to a new speaker, and indexed into the database. Fuzzy clustering techniques allow the camera to be focused on more than one cluster at a given time, when the object of interest may be located in one or more clusters.
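The threshold test that decides between an existing cluster and a new one can be sketched as below. This is a simplified illustration that assumes Euclidean distance over (pan, tilt) means and a running-mean update for the matched cluster; it leaves out zoom, the initialization phase, and fuzzy membership. The name `assign_event` and the sample values are hypothetical.

```python
import math
from typing import Dict, List, Tuple


def assign_event(
    clusters: List[Dict],
    event: Tuple[float, float],
    threshold: float,
) -> int:
    """Assign a mean (pan, tilt) event to the nearest cluster, or create one."""
    best_idx, best_dist = None, math.inf
    for idx, cluster in enumerate(clusters):
        dist = math.dist(cluster["mean"], event)  # Euclidean distance (assumed)
        if dist < best_dist:
            best_idx, best_dist = idx, dist

    # Too far from every known cluster: treat the event as a new speaker.
    if best_idx is None or best_dist > threshold:
        clusters.append({"mean": tuple(float(v) for v in event), "count": 1})
        return len(clusters) - 1

    # Otherwise fold the event into the matched cluster's running mean.
    cluster = clusters[best_idx]
    n = cluster["count"] + 1
    cluster["mean"] = tuple(m + (e - m) / n for m, e in zip(cluster["mean"], event))
    cluster["count"] = n
    return best_idx


# Hypothetical interval means: two from one speaker, one from another.
clusters: List[Dict] = []
events = [(10.0, 5.0), (11.0, 5.0), (40.0, -2.0)]
ids = [assign_event(clusters, e, threshold=5.0) for e in events]
```

The stored cluster mean is what the camera would be pointed at once a cluster is identified, so updating it incrementally keeps the recorded pan and tilt values current.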
Abstract:
A security monitoring system including: an exit camera and an entrance camera located at the exit and entrance of a structure; a detector for detecting the exit or entry of an individual; an image recording system for storing images from the exit and entrance cameras; and a computer vision system for analyzing the stored images using predetermined criteria to determine if the exiting and entering individuals are the same. In a preferred implementation, the system also includes a database for storing face image data for each authorized individual of the structure; and a face recognition system for comparing the stored images from the exit and entrance cameras with the stored image data in the database and for determining if the exiting and entering individual is one of the authorized individuals.