摘要:
In general, the subject matter described in this specification can be embodied in methods, systems, and program products. A plurality of electronic training images that are each classified as displaying substantially pictures is obtained. A plurality of local image features in each of the plurality of electronic training images is identified. A plurality of weak classifiers are recursively applied to the local image features. During each iteration a weak classifier that accurately classifies the local images features is selected. After each selection of a weak classifier features that were misclassified by the selected weak classifier are given greater weight than features that were classified correctly by the selected weak classifier. For each selected weak classifier a hillclimbing algorithm is performed to attempt to improve the weak classifier. A strong classifier that is a weighted combination of the selected weak classifiers on which hillclimbing algorithms have been performed is produced.
摘要:
A system and method detects matches between portions of video content. A matching module receives an input video fingerprint representing an input video and a set of reference fingerprints representing reference videos in a reference database. The matching module compares the reference fingerprints and input fingerprints to generate a list of candidate segments from the reference video set. Each candidate segment comprises a time-localized portion of a reference video that potentially matches the input video. A classifier is applied to each of the candidate segments to classify the segment as a matching segment or a non-matching segment. A result is then outputted identifying a matching portion of a reference video from the reference video set based on the segments classified as matches.
摘要:
Methods and systems for servicing content for delivery to a client device are described. An item of content is identified during a session with the client device. A type of service to be performed on the item of content is identified. A provider is selected from a plurality of providers capable of performing the service. The session is transferred to the selected provider, which performs the service on the item of content.
摘要:
A system that enables communication and collaboration among individuals using rich media environments. A system according to the present techniques includes a set of rich media environments each having a corresponding arrangement of sensing and rendering components for sensing of and rendering to a corresponding set of individuals. A system according to the present techniques includes an interest thread detector that uses the sensing and rendering components to detect formation of multiple communication interactions among the individuals and that creates an interest thread for each detected communication interaction and further includes a communication provider that for each interest thread captures a set of media data from a corresponding subset of the sensing components and that combines the captured media data in response to the activities of the corresponding individuals and that communicates the combined media data to a corresponding subset of the rendering components.
摘要:
A data-compressed audio waveform is temporally modified without requiring complete decompression of the audio signal. Packets of compressed audio data are first unpacked, to remove scaling that was applied in the formation of the packets. The unpacked data is then temporally modified, using one of a number of different approaches. This modification takes place while the audio information remains in a data-compressed format. New packets are then assembled from the modified data, to produce a data-compressed output stream that can be subsequently processed in a conventional manner to reproduce the desired sound. The assembly of the new packets employs a technique for inferring an auditory model from the original packets, to requantize the data in the output packets.
摘要:
Digital mapping techniques are disclosed that provide visually-oriented information to the user, such as driving directions that include visual data points along the way of the driving route, thereby improving the user experience. The user may preview the route associated with the driving directions, where the preview is based on, for example, at least one of satellite images, storefront images, and heuristics and/or business listings. The visually-oriented information can be presented to the user in a textual, graphical, or verbal format, or some combination thereof.
摘要:
Embodiments of the present invention recite a method for enhancing the quality of visual prompts in and interactive media response system. In one embodiment, a video coder/decoder (codec) used by a thin device is determined. A visual prompt to be displayed on the thin device is accessed and the display parameters of the visual prompt are modified such that at least one character of the visual prompt is aligned with a blocking artifact generated by the video codec.
摘要:
Embodiments of the present invention recite a method and system for improving the fidelity of a dialog system. In one embodiment, a first input generated by a user of a first system operating in a first modality is accessed. In embodiments of the present invention, the first system also generates a first output corresponding to the first input. An second input from a second user, who is engaged in a conversation with the first user, is accessed by a second system. The second input is then utilized to modify the first output of the first system.
摘要:
A data-compressed audio waveform is temporally modified without requiring complete decompression of the audio signal. Packets of compressed audio data are first unpacked, to remove scaling that was applied in the formation of the packets. The unpacked data is then temporally modified, using one of a number of different approaches. This modification takes place while the audio information remains in a data-compressed format. New packets are then assembled from the modified data, to produce a data-compressed output stream that can be subsequently processed in a conventional manner to reproduce the desired sound. The assembly of the new packets employs a technique for inferring an auditory model from the original packets, to requantize the data in the output packets.
摘要:
One embodiment of the invention includes a method for managing a streaming media service. The method includes receiving a request for a streaming media service from a client. It is noted that the streaming media service includes a media service component. Additionally, the method includes selecting a service manager from a plurality of service managers to provide the request to. Furthermore, the method includes selecting a provider from a plurality of providers of a network to assign the media service component. Moreover, the method includes informing said provider assigned to perform the media service component, enabling the streaming media service to be performed on a streaming media.