摘要:
Preferred implementations of the invention permit a user to seamlessly switch from a first media stream to a second media stream in a synchronized way, such that the second media stream picks up where the first media stream left off. In this way, the user experiences events chronologically but without interruption. In a preferred implementation, a user watching a skim video switches to a full length video when, for example, the skim video reaches a frame that is of particular interest to the user. The full length video begins at a point corresponding to the frame in the skim video that is of interest to the user, without skipping over video segments, so that the user does not experience any time gaps in the story line.
摘要:
A system enables a user to query for key words and phrases a text document, such as a presentation slide file, and an associated audio stream, such as can be derived from an audio-video recording that is made of a presenter contemporaneously with the showing of the slides to an audience. A graphical user interface is presented in which query results for both the text document and the audio stream are displayed in a time-aligned format, to enable a user to easily and conveniently browse the text document and accompanying time-aligned audio stream based on the key words/phrases.
摘要:
A system and method for indexing an audio stream for subsequent information retrieval and for skimming, gisting, and summarizing the audio stream includes using special audio prefiltering such that only relevant speech segments that are generated by a speech recognition engine are indexed. Specific indexing features are disclosed that improve the precision and recall of an information retrieval system used after indexing for word spotting. The invention includes rendering the audio stream into intervals, with each interval including one or more segments. For each segment of an interval it is determined whether the segment exhibits one or more predetermined audio features such as a particular range of zero crossing rates, a particular range of energy, and a particular range of spectral energy concentration. The audio features are heuristically determined to represent respective audio events including silence, music, speech, and speech on music. Also, it is determined whether a group of intervals matches a heuristically predefined meta pattern such as continuous uninterrupted speech, concluding ideas, hesitations and emphasis in speech, and so on, and the audio stream is then indexed based on the interval classification and meta pattern matching, with only relevant features being indexed to improve subsequent precision of information retrieval. Also, alternatives for longer terms generated by the speech recognition engine are indexed along with respective weights, to improve subsequent recall.
摘要:
A system and associated method automatically discover salient segments in a speech transcript and focus on the segmentation of an audio/video source into topically cohesive segments based on Automatic Speech Recognition (ASR) transcriptions. The word n-grams are extracted from the speech transcript using a three-phase segmentation algorithm based on the following sequence or combination of boundary-based and content-based methods: a boundary-based method; a rate of arrival of feature method; and a content-based method. In the first two segmentation passes, the temporal proximity and the rate of arrival of features are analyzed to compute an initial segmentation. In the third segmentation pass, changes in the set of content-bearing words used by adjacent segments are detected, to validate the initial segments for merging them, to prevent over-segmentation.
摘要:
A method of detecting tasks performed by users wherein a single task is a sequence of web URLs invocation. Task patterns are detected in web logs to identify tasks performed by users and analyze task trends over time, across corporate divisions and geographies. A grammar-based framework is used to model and detect tasks from web log patterns. The framework has two components: a declarative unit—to generate a task grammar, and a processing unit—to detect tasks from access logs by generating a state machine for applying the task grammar to the tokens associated with the access records. By analyzing user tasks, rather than just URLs, useful business information can be extracted.
摘要:
A method of detecting tasks performed by users wherein a single task is a sequence of web URLs invocation. Task patterns are detected in web logs to identify tasks performed by users and analyze task trends over time, across corporate divisions and geographies. A grammar-based framework is used to model and detect tasks from web log patterns. The framework has two components: a declarative unit—to generate a task grammar, and a processing unit—to detect tasks from access logs by generating a state machine for applying the task grammar to the tokens associated with the access records. By analyzing user tasks, rather than just URLs, useful business information can be extracted.
摘要:
A document management system applies relevant document analysis, metadata extraction, and business process association algorithms and methodology to automatically and dynamically classify documents for routing, processing, and executing customized business logic. The document management system accepts documents from one or more channels, classifies the document and extracts metadata, executes customized application profiles and triggers business logic associated with the process. The document management system comprises a rules engine to detect and classify unstructured forms as well as structured forms, where the locations of attributes and visual layout are not fixed. The document management system provides automatic linkage between disparate systems that manages documents for the complete execution of a business process.
摘要:
A validation matrix computation system automatically computes a validation matrix to analyze performance and efficiency of a global logistics process. The present system comprises identifying metrics and automatically computing the metrics to derive a validation matrix from the metrics. The system comprises a metric capture module installed at all participant locations in the global logistics process. The metric capture module captures, from logs generated at a node, metrics that measure performance at the node. The system generates the validation matrix from the captured metrics and uses the validation matrix to evaluate performance. The metrics comprise quantitative and qualitative metrics; quantitative metrics are automatically computed by correlating logs generated at the nodes and qualitative metrics are computed using surveys completed by participants in the global logistics process.
摘要:
A method and system for specifying, deploying and updating work flows. A request is received from a user of a system with components comprising a computing system and human representatives. The components facilitate routing and servicing the request. A work flow comprising manual and automatic steps is specified to perform the routing and servicing of the request. The automatic steps are automatically performed by the computing system and the manual steps are manually performed by the representatives. Both the automatic steps and the manual steps are uniformly defined in a single general-purpose markup language (e.g., XML). Further, a manual step is dynamically transformed into a new automatic step that is represented as a new XML action element. The new XML action element includes an identifier of a software routine to be executed to perform the new automatic step.
摘要:
A method and system for specifying, deploying and updating work flows. A request is received from a user of a system with components comprising a computing system and human representatives. The components facilitate routing and servicing the request. A work flow comprising manual and automatic steps is specified to perform the routing and servicing of the request. The automatic steps are automatically performed by the computing system and the manual steps are manually performed by the representatives. Both the automatic steps and the manual steps are uniformly defined in a single general-purpose markup language (e.g., XML). Further, a manual step is dynamically transformed into a new automatic step that is represented as a new XML action element. The new XML action element includes an identifier of a software routine to be executed to perform the new automatic step.