摘要:
Disclosed herein are systems, methods, and computer readable-media for selecting a multimedia presentation to accompany text. The method for selecting a multimedia presentation to accompany text comprises analyzing a body of text, selecting a multimedia presentation based on the body of text, and playing the selected multimedia presentation at an appropriate time simultaneous with presenting portions of the body of text. In one embodiment, the audio track comprises music, sound effects, silence, one or more ambient effect (such as dimming lights), and any combination thereof. In another embodiment, the audio track is based on content of the text, language, an associated still illustration or video clip, meta-data or a user profile. In yet another embodiment, an appropriate volume is determined for playing the selected audio track and that volume is used to adjust how loudly the selected audio track is played. Multiple multimedia presentations can be played back collaboratively and simultaneously.
摘要:
Systems and methods for generating an annotation guide. Speech data is organized and presented to a user. After the user selects some of the utterances in the speech data, the selected utterances are included in a class and/or call type. Additional utterances that belong to the class and/or call type can be found in the speech data using relevance feedback, data mining, data clustering, support vector machines, and the like. After a call type is complete, it is committed to the annotation guide. After all call types are completed, the annotation guide is generated.
摘要:
Systems and methods for monitoring labelers of speech data. To test or train labelers, a labeler is presented with utterances that have already been identified as belonging to a particular class or call type. The labeler is asked to assign a call type to the utterances. The performance of the labeler is measured by comparing the call types assigned by the labeler with the existing call types of the utterances. The performance of a labeler can also be monitored as the labeler labels speech data by occasionally having the labeler label an utterance that is already labeled and by storing the results.
摘要:
A method and apparatus for a service platform capable of providing device-based task completion is disclosed. A request for a task is received at a service platform from a customer. A worker device to complete the task is selected from a group of worker devices registered with the service platform based on a current attribute of the worker device. Data resulting from completion of the task is received from the selected worker device, validated, and presented to the customer. A reward or incentive can be provided to the worker device in response to the data being received from the worker device and validated.
摘要:
A method and apparatus for providing an opportunistic crowd based service platform is disclosed. A mobile sensor device is identified based on a current location and/or other qualities, such as intrinsic properties, previous sensor data, or demographic data of an associated user of the mobile sensor device. Data is collected from the mobile sensor device. The data collected from the mobile sensor device is aggregated with data collected from other sensor devices, and content generated based on the aggregated data is delivered to a user device.
摘要:
A method, apparatus, and computer readable medium for identifying a person in an image includes an image analyzer. The image analyzer determines the content of an image such as a person, location, and object shown in the image. A person in the image may be identified based on the content and event data stored in a database. Event data includes information concerning events and related people, locations, and objects determined from other images and information. Identification metadata is generated and linked to each analyzed image and comprises information determined during image analysis. Tags for images are generated based on identification metadata. The event database can be queried to identify particular people, locations, objects, and events depending on a user's request.
摘要:
A method, apparatus, and computer readable medium for displaying multimedia information synchronized with user activity includes a multimedia processing unit. The multimedia processing unit receives requests for multimedia information from a user and synchronizes the display of a multimedia presentation to a user based on user activities which are observed using one or more sensors. The multimedia processing unit acquires multimedia information from various sources via a network and segments the multimedia information based on content and additional information determined to be related to particular multimedia information acquired. The multimedia processing unit generates multimedia presentations using multimedia segments obtained from different sources. Multimedia segments are selected for a particular multimedia presentation based on a rating associated with the multimedia information from which the segment was derived.
摘要:
Disclosed herein are systems, computer-implemented methods, and tangible computer-readable media for synthesizing a virtual window. The method includes receiving an environment feed, selecting video elements of the environment feed, displaying the selected video elements on a virtual window in a window casing, selecting non-video elements of the environment feed, and outputting the selected non-video elements coordinated with the displayed video elements. Environment feeds can include synthetic and natural elements. The method can further toggle the virtual window between displaying the selected elements and being transparent. The method can track user motion and adapt the displayed selected elements on the virtual window based on the tracked user motion. The method can further detect a user in close proximity to the virtual window, receive an interaction from the detected user, and adapt the displayed selected elements on the virtual window based on the received interaction.
摘要:
Disclosed herein are systems, methods, and computer-readable media for transmedia video bookmarks, the method comprising receiving a first place marker and a second place marker for a segment of video media, extracting metadata from the video media between the first and second place markers, normalizing the extracted metadata, storing the normalized metadata, first place marker, and second place marker as a video bookmark, and retrieving the media represented by the video bookmark upon request from a user. One aspect further aggregates video bookmarks from multiple sources and refines the first place marker and second place marker based on the aggregated video bookmarks. Metadata can be extracted by analyzing text or audio annotations. Another aspect of normalizing the extracted metadata includes generating a video thumbnail representing the video media between the first place marker and the second place marker. Multiple video bookmarks may be searchable by metadata or by the video thumbnail visually. In one aspect a user profile stores video bookmarks on a per media and per user basis.
摘要:
The present disclosure generally provides a method for delivering targeted content to a user device based on environmental variables sensed at or near the user device. At least one sensed environmental variable associated with a user device is received. At least one environmental state is determined based on the at least one environmental variable. Content is then selected to be delivered to the user device based on the at least one environmental state, and the selected content is delivered to the user device.