摘要:
Technologies for recovering from dropped frames in the real-time transmission of video over an IP network are provided. A video streaming module receives a notification from a receiving module that a data packet has been lost. The video streaming module determines, based on the type of video frame conveyed in the lost packet and the timing of the lost packet in relation to the sequence of video frames transmitted to the receiving module, whether or not a replacement video frame should be sent to the receiving module. If the video streaming module determines a replacement video frame is warranted, then the video streaming module instructs a video encoding module to generate a replacement video frame and then transmits the replacement video frame to the receiving module.
摘要:
Techniques for managing visual compositions for a multimedia conference call are described. An apparatus may comprise a processor to allocate a display object bit rate for multiple display objects where a total display object bit rate for all display objects is equal to or less than a total input bit rate, and decode video information from multiple video streams each having different video layers with different levels of spatial resolution, temporal resolution and quality for two or more display objects. Other embodiments are described and claimed.
摘要:
Video send and receive capabilities of participants are determined by the respective machines determining available combinations, as well as preferences for the receivers. Receiver capabilities are forwarded to the source for computation of negotiated video capabilities through a logic intersection of the determined capabilities based on desired number of streams and resolutions. If a resolution of a send capability exists within the receive capability, the highest frame and/or bit rate may be selected for transmission.
摘要:
A decoder receives a field start code for an entry point key frame. The field start code indicates a second coded interlaced video field in the entry point key frame following a first coded interlaced video field in the entry point key frame and indicates a point to begin decoding of the second coded interlaced video field. The first coded interlaced video field is a predicted field, and the second coded interlaced video field is an intra-coded field. The decoder decodes the second field without decoding the first field. The field start code can be followed by a field header. The decoder can receive a frame header for the entry point key frame. The frame header may comprise a syntax element indicating a frame coding mode for the entry point key frame and/or a syntax element indicating field types for the first and second coded interlaced video fields.
摘要:
Background vs. foreground decisions for video frames to be compressed and transmitted in a real time video communication system are made based on a non-parametric approach using signs of pixel value changes in sequential frames. Pixel value changes are tracked as negative or positive. Cost functions may be assigned to rows and columns of predefined blocks and a decision made based on randomness of the signs within the block whether the block represents background (noise) or foreground. Recursive temporal filtering is then employed to reduce the background noise progressively resulting in increased compression and transmission efficiency. Offset tiling is used to increase accuracy of randomness determination when blocks include background and foreground combinations.
摘要:
The claimed subject matter provides systems and/or methods that facilitate yielding closed caption service associated with real time communication. For example, audio data and video data can be obtained from an active speaker in a real time teleconference. Moreover, the audio data can be converted into a set of characters (e.g., text data) that can be transmitted to other participants of the real time teleconference. Additionally, the real time teleconference can be a peer to peer conference (e.g., where a sending endpoint communicates with a receiving endpoint) and/or a multi-party conference (e.g., where an audio/video multi-point control unit (AVMCU) routes data such as the audio data, the video data, and the text data between endpoints).
摘要:
During remote communication session, there can be situations where information needs to be sent at a high resolution. Sending information at a high resolution allows for the capture of detail that can be lost without the use of a high resolution. A web camera can obtain information in both a higher resolution and standard resolution. A sending component can send this information encoded with markers that allow a receiving component to process and display the information.
摘要:
The techniques and mechanisms described herein are directed at transmitting elementary streams in a broadcast environment. The mechanisms provide a buffer controller and packet scheduler that allow a media format to be transmitted through the broadcasting environment in a manner resulting in a low channel switch delay. A buffer-fullness indicator allows the operation with various types of decoders. A lower bound and an upper bound are calculated for each frame within the elementary stream. The lower bound corresponds to an earliest time for sending the frame without causing an overflow condition within a decoder buffer. The upper bound corresponds to a latest time for sending the frame without causing an underflow condition within the decoder buffer. A send time is then scheduled based on the lower bound and the upper bound that determines when a packet associated with the frame is transmitted over a channel in a broadcast environment.
摘要:
Techniques for managing visual compositions for a multimedia conference call are described. An apparatus may comprise a processor to allocate a display object bit rate for multiple display objects where a total display object bit rate for all display objects is equal to or less than a total input bit rate, and decode video information from multiple video streams each having different video layers with different levels of spatial resolution, temporal resolution and quality for two or more display objects. Other embodiments are described and claimed.
摘要:
A closed captioning configuration system is described. The system receives parameters of a digital video presentation and computes closed captioning parameters to drive a closed captions encoder, creating closed captions which are compatible with the presentation. In various implementations, the configuration system may be integrated into a video encoder, a closed captions encoder, or both. The configuration system, through analysis of the presentation parameters, can drive captioning for presentations which may differ by frame rate, interlacing, or frame encoding mode, and account for repetition of fields or frames.