摘要:
Systems and methods providing automated extraction of information contained in video data and uses thereof are described. In particular, systems and associated methods are described that provide techniques for extracting data embedded in video, for example measurement-value pairs of medical videos, for use in a variety of applications, for example video indexing, searching and decision support applications.
摘要:
A mechanism is provided for rendering tape file system information. The mechanism obtains a list of one or more files residing on a tape in the tape file system from a file directory. The mechanism obtains location information associated with at least one file of the one or more files. Finally, the mechanism renders a representation of the location information with the at least one file.
摘要:
A system and a corresponding method for temporal modification of audio signals, to increase or reduce the playback rates of an audio and/or a video file in a client-server environment. The system and method improve the efficiency of serving streaming media to a client so that the client can select an arbitrary time-speedup factor. The speedup system performs many of the pre-calculations once, at the server, so that the bandwidth needs are reduced and the client's computational load is minimized. The final time-scale-modification can be either done completely on the server, thus reducing the client's needs, or partly on the client's computer to minimize latency, and to reduce on-the-fly computational load from the server that serves multiple clients concurrently.
摘要:
A system and a corresponding method for temporal modification of audio signals, to increase or reduce the playback rates of an audio and/or a video file in a client-server environment. The system and method improve the efficiency of serving streaming media to a client so that the client can select an arbitrary time-speedup factor. The speedup system performs many of the pre-calculations once, at the server, so that the bandwidth needs are reduced and the client's computational load is minimized. The final time-scale-modification can be either done completely on the server, thus reducing the client's needs, or partly on the client's computer to minimize latency, and to reduce on-the-fly computational load from the server that serves multiple clients concurrently.
摘要:
A method of detecting tasks performed by users wherein a single task is a sequence of web URLs invocation. Task patterns are detected in web logs to identify tasks performed by users and analyze task trends over time, across corporate divisions and geographies. A grammar-based framework is used to model and detect tasks from web log patterns. The framework has two components: a declarative unit—to generate a task grammar, and a processing unit—to detect tasks from access logs by generating a state machine for applying the task grammar to the tokens associated with the access records. By analyzing user tasks, rather than just URLs, useful business information can be extracted.
摘要:
A system, program storage device, and method of buffering an electronic document received from a host computer, wherein the method comprises determining whether an original source code of the electronic document includes executable coding which when executed by a client computer, causes the client computer to perform undesired operations, and producing an alternate source code of the electronic document, which eliminates the coding, wherein the undesired operations are characterized as undesirable based on predetermined settings established by the client computer. The electronic document comprises any of a web page, electronic mail message, an electronic mail attachment, a note in a hypertext format, a text document, a text file, and an application-specific electronic document. Each of the original source code and the alternate source code comprises a hypertext transfer protocol (HTTP) source code.
摘要:
A system enables a user to query for key words and phrases a text document, such as a presentation slide file, and an associated audio stream, such as can be derived from an audio-video recording that is made of a presenter contemporaneously with the showing of the slides to an audience. A graphical user interface is presented in which query results for both the text document and the audio stream are displayed in a time-aligned format, to enable a user to easily and conveniently browse the text document and accompanying time-aligned audio stream based on the key words/phrases.
摘要:
A system and method for calibration-free tracking of a user's eye gaze vector and point of regard even if substantial head movement or rotation occurs. The preferred embodiment includes two synchronized interlaced cameras, each viewing the user's eye and having on-axis lighting that is alternately modulated. An image difference between lighted and unlighted images of the eye is used to identify a user's pupil. A plane containing the gaze vector is defined by rotating a base plane through the angle in a camera image plane between a pupil center, a first glint, and a second glint. The intersection of two such planes (one from each camera), defines the gaze vector. The gaze position is the intersection of the gaze vector with the object being viewed by the user. Alternate embodiments are also described.
摘要:
A method and apparatus determine when a subject is looking at a specific target area by estimating a divergence angle between (1) the direction in which the subject is looking and (2) the direction from the subject directly to the target area. This technique accesses whether the subject is looking at a particular area. The invention may further condition this determination according to the subject's distance from the target area, because there is less tolerance for divergent angles when the subject is farther away. In one embodiment, the divergence angle is estimated using the position of a glint of light in the subject's pupil. The glint is created by a light source located in the target area. If the glint is sufficiently central to the pupil, with the camera and light source being near the target area, the subject is looking at the target area. At long distances, when the glint is not sufficiently discernable from the pupil, another technique may be employed to estimate divergence angle. Namely, the plane of the subject's face is computed, and analyzed with respect to a vector between the subject's face and the target area. If the plane is substantially normal to the vector, the subject is looking at the target area.
摘要:
A computer-driven system aids operator positioning of a cursor by integrating eye gaze and manual operator input, thus reducing pointing time and operator fatigue. A gaze tracking apparatus monitors operator eye orientation while the operator views a video screen. Concurrently, the computer monitors an input device, such as a mouse, for mechanical activation by the operator. According to the operator's eye orientation, the computer calculates the operator's gaze position. Also computed is a gaze area, comprising a sub-region of the video screen that includes the gaze position. This region, for example, may be a circle of sufficient radius to include the point of actual gaze with a certain likelihood. When the computer detects mechanical activation of the operator input device, it determines an initial cursor display position within the current gaze area. This position may be a predetermined location with respect to the gaze area, such as a point on the bottom of the gaze area periphery. A different approach uses the initial mechanical activation of the input device to determine the direction of motion, and sets the initial display position on the opposite side of the gaze area from this motion so that continued movement of the input device brings the cursor to the gaze position in a seamless transition between gaze and manual input. After displaying the cursor on the video screen at the initial display position, the cursor is thereafter positioned manually according to the operator's use of the input device, without regard to gaze.