摘要:
A zoom method and apparatus utilizing object detection. For example, some embodiments allow a user to zoom in or out from the digital content being displayed by moving their head towards or away from the display screen.
摘要:
A low complexity packet loss concealment method for use in voice-over-IP speech transmission calculates a cross-correlation of previous speech data to estimate the pitch period of the previous speech when speech frames have been lost. A tap interval used to calculate the cross-correlation is dynamically adapted, thereby reducing the computational complexity of the process. In addition, the pitch period estimation is bypassed completely when it is determined not to be necessary, as a result of the speech being unvoiced or silence. A waveform “bending” operation is performed into the current frame without inserting any algorithmic delay into each frame.
摘要:
A method and apparatus in which a mobile network advantageously communicates data requests to neighboring data stores so that they may pre-fetch the data. In particular, in accordance with an illustrative embodiment of the present invention, a protocol is advantageously established whereby a local data store in a mobile network notifies neighboring data stores of data requests, and whereby the neighboring data stores advantageously pre-fetch the data that may be required, thereby advantageously avoiding cascading cache misses. Such notifications may advantageously reduce the number of cache misses, which in turn may advantageously reduce the latency to download data as the user moves around within the mobile network and changes data sources. Specifically, in accordance with an illustrative embodiment of the present invention, a protocol for communicating data requests between local storage centers in a network supporting mobile users is provided.
摘要:
A method and apparatus for displaying images for use during a video teleconference provides improved eye contact between the participants. A video camera mounted on a display (e.g., a monitor or laptop) is co-located with a first participant in the video teleconference. An image of a second participant in the video teleconference is received, and a location of one or more facial features (e.g., the eyes) contained in the image of the second participant is determined. Then, the image of the second participant is displayed on the screen such that the eyes of the second participant are displayed in close proximity to (e.g., directly below) the video camera. In this manner, improved eye contact between the participants is advantageously provided. Alternatively, metadata representing the location of such facial features (e.g., the eyes) contained in the image of the second participant is received along with the image of the second participant.
摘要:
A method and apparatus for enhancing voice intelligibility for network communications of speech such as, for example, VoIP (Voice-Over-Internet-Protocol), in the presence of packets which arrive too late for normal playout. When a late speech packet is received by a speech decoder, that packet and, if necessary, one or more additional packets subsequent thereto, are played out over a shorter than normal duration so that the decoder can “catch up” with the encoder. Since a voice frame is usually decoded in several sub-frames—typically two or three—this shortened playout may be achieved, for example, by skipping one sub-frame from each frame to be shortened.
摘要:
A method of operating a packet network for carrying voice traffic, wherein the packets carrying voice traffic include voice samples. The method identifies a replacement packet opportunity, creates a replacement packet based on a selected packet, and inserts the replacement packet in the replacement packet opportunity. The replacement packet includes samples based on samples of the selected packet, but in an order that differs from the order of the samples in the selected packet. The method may further comprise identifying another replacement packet opportunity directly following the replacement packet opportunity, creating another replacement packet based on the replacement packet, and inserting the another replacement packet directly after the replacement packet. The another replacement packet differs from the replacement packet.
摘要:
A method and apparatus for displaying images for use during a video teleconference provides improved eye contact between the participants. A video camera mounted on a display (e.g., a monitor or laptop) is co-located with a first participant in the video teleconference. An image of a second participant in the video teleconference is received, and a location of one or more facial features (e.g., the eyes) contained in the image of the second participant is determined. Then, the image of the second participant is displayed on the screen such that the eyes of the second participant are displayed in close proximity to (e.g., directly below) the video camera. In this manner, improved eye contact between the participants is advantageously provided. Alternatively, metadata representing the location of such facial features (e.g., the eyes) contained in the image of the second participant is received along with the image of the second participant.
摘要:
A method and apparatus for bundling packets together for transmission in a Voice over IP communications network based on packet location within a talk spurt. Illustratively, all frames other than the first and last frames of a talk spurt may be advantageously bundled up to a predetermined maximum bundle size. This results from the recognition that the first and last packets of the talk spurt are the packets that will most directly affect the conversational delay. Therefore, other packets can be advantageously considered to be “non-critical” (with respect to conversational delay), and thus, may be bundled together with one or more other packets. In this manner, bandwidth may be advantageously reduced without negatively impacting the perceived conversational delay.
摘要:
A method for performing recoverable image and video watermarking which survives the use of block-based image and video compression techniques. One or more of the lowest order bits of the first DCT coefficient (the “DC” coefficient) which is to be coded are used as a “data channel” by which information representing a recoverable watermark may be embedded into an image or into a video signal frame. Encoding is performed by replacing one or more low order bits of the luminance value of each pixel in a block with a number of bits of the watermark data, and decoding is performed by averaging one or more low order bits of the decoded luminance values of the pixels in a block to retrieve a corresponding number of bits of the watermark data.
摘要:
A method and apparatus for performing Quality-of-Service (QoS) calculations on packet-based communications networks using a QoS measure which is based on data included in non-lost packets, as well as on data included in lost packets, when the proper interpretation of the data in non-lost packets depends upon data in one of the lost packets. Two new QoS measures that address the limitations inherent in the prior art PLR (Packet Loss Rate) measure are introduced. The Packet Loss Distortion Rate (PLDR) measure determines both packets which are lost, as well as packets whose proper interpretation depends on one or more packets which have been lost. The Media Distortion Rate (MDR) measures the actual quantity of media material that is lost, regardless of how the material is grouped into individual packets.