Recovering from dropped frames in real-time transmission of video over IP networks
    1.
    发明授权
    Recovering from dropped frames in real-time transmission of video over IP networks 有权
    通过IP网络实时传输视频,从丢帧中恢复

    公开(公告)号:US08929443B2

    公开(公告)日:2015-01-06

    申请号:US12350975

    申请日:2009-01-09

    摘要: Technologies for recovering from dropped frames in the real-time transmission of video over an IP network are provided. A video streaming module receives a notification from a receiving module that a data packet has been lost. The video streaming module determines, based on the type of video frame conveyed in the lost packet and the timing of the lost packet in relation to the sequence of video frames transmitted to the receiving module, whether or not a replacement video frame should be sent to the receiving module. If the video streaming module determines a replacement video frame is warranted, then the video streaming module instructs a video encoding module to generate a replacement video frame and then transmits the replacement video frame to the receiving module.

    摘要翻译: 提供了通过IP网络实时传输视频从丢帧中恢复的技术。 视频流模块从接收模块接收到数据分组丢失的通知。 视频流模块基于丢失分组中传送的视频帧的类型和与发送到接收模块的视频帧序列有关的丢失分组的定时,确定替换视频帧是否应发送到 接收模块。 如果视频流模块确定替换视频帧是有保证的,则视频流模块指示视频编码模块生成替换视频帧,然后将替换视频帧发送到接收模块。

    Techniques for managing visual compositions for a multimedia conference call
    2.
    发明授权
    Techniques for managing visual compositions for a multimedia conference call 有权
    用于管理多媒体电话会议的视觉作品的技术

    公开(公告)号:US08773494B2

    公开(公告)日:2014-07-08

    申请号:US11511749

    申请日:2006-08-29

    IPC分类号: H04N7/14

    摘要: Techniques for managing visual compositions for a multimedia conference call are described. An apparatus may comprise a processor to allocate a display object bit rate for multiple display objects where a total display object bit rate for all display objects is equal to or less than a total input bit rate, and decode video information from multiple video streams each having different video layers with different levels of spatial resolution, temporal resolution and quality for two or more display objects. Other embodiments are described and claimed.

    摘要翻译: 描述用于管理多媒体电话会议的视觉作品的技术。 设备可以包括处理器,用于为多个显示对象分配显示对象比特率,其中所有显示对象的总显示对象比特率等于或小于总输入比特率,并且从多个视频流解码视频信息,每个视频流具有 具有不同级别的空间分辨率,时间分辨率和两个或多个显示对象的质量的不同视频层。 描述和要求保护其他实施例。

    Field start code for entry point frames with predicted first field
    4.
    发明授权
    Field start code for entry point frames with predicted first field 有权
    具有预测第一个字段的入口点帧的现场起始码

    公开(公告)号:US07852919B2

    公开(公告)日:2010-12-14

    申请号:US10989596

    申请日:2004-11-15

    IPC分类号: H04N7/32 H04N3/10

    CPC分类号: H04N19/44 H04N19/70

    摘要: A decoder receives a field start code for an entry point key frame. The field start code indicates a second coded interlaced video field in the entry point key frame following a first coded interlaced video field in the entry point key frame and indicates a point to begin decoding of the second coded interlaced video field. The first coded interlaced video field is a predicted field, and the second coded interlaced video field is an intra-coded field. The decoder decodes the second field without decoding the first field. The field start code can be followed by a field header. The decoder can receive a frame header for the entry point key frame. The frame header may comprise a syntax element indicating a frame coding mode for the entry point key frame and/or a syntax element indicating field types for the first and second coded interlaced video fields.

    摘要翻译: 解码器接收入口点关键帧的场起始码。 场起始码指示入口点关键帧中的第一编码隔行扫描视频字段之后的入口点关键帧中的第二编码隔行扫描视频字段,并且指示开始对第二编码交错视频字段进行解码的点。 第一编码隔行视频字段是预测字段,第二编码隔行视频字段是帧内编码字段。 解码器解码第二场而不解码第一场。 字段起始码可以后跟一个字段标题。 解码器可以接收入口点关键帧的帧头。 帧头可以包括指示用于入口点关键帧的帧编码模式的语法元素和/或指示第一和第二编码隔行视频字段的字段类型的语法元素。

    TEMPORAL VIDEO FILTERING FOR REAL TIME COMMUNICATION SYTEMS
    5.
    发明申请
    TEMPORAL VIDEO FILTERING FOR REAL TIME COMMUNICATION SYTEMS 有权
    用于实时通信的时域视频滤波

    公开(公告)号:US20090110078A1

    公开(公告)日:2009-04-30

    申请号:US11924286

    申请日:2007-10-25

    申请人: Regis J. Crinon

    发明人: Regis J. Crinon

    IPC分类号: H04B1/66

    CPC分类号: H04N5/21

    摘要: Background vs. foreground decisions for video frames to be compressed and transmitted in a real time video communication system are made based on a non-parametric approach using signs of pixel value changes in sequential frames. Pixel value changes are tracked as negative or positive. Cost functions may be assigned to rows and columns of predefined blocks and a decision made based on randomness of the signs within the block whether the block represents background (noise) or foreground. Recursive temporal filtering is then employed to reduce the background noise progressively resulting in increased compression and transmission efficiency. Offset tiling is used to increase accuracy of randomness determination when blocks include background and foreground combinations.

    摘要翻译: 基于使用顺序帧中的像素值变化的符号的非参数方法来进行在实时视频通信系统中压缩和传输的视频帧的背景与前景决定。 像素值变化被跟踪为负数或正数。 可以将成本函数分配给预定义块的行和列,并且基于块内的符号的随机性做出的决定,无论块代表背景(噪声)还是前景。 然后采用递归时间滤波来逐步减少背景噪声,从而提高压缩和传输效率。 当块包括背景和前景组合时,使用偏移平铺来提高随机性确定的准确性。

    CLOSED CAPTIONS FOR REAL TIME COMMUNICATION
    6.
    发明申请
    CLOSED CAPTIONS FOR REAL TIME COMMUNICATION 审中-公开
    实时通信的封闭标准

    公开(公告)号:US20080295040A1

    公开(公告)日:2008-11-27

    申请号:US11753277

    申请日:2007-05-24

    申请人: Regis J. Crinon

    发明人: Regis J. Crinon

    IPC分类号: G06F3/00 G06F15/16

    摘要: The claimed subject matter provides systems and/or methods that facilitate yielding closed caption service associated with real time communication. For example, audio data and video data can be obtained from an active speaker in a real time teleconference. Moreover, the audio data can be converted into a set of characters (e.g., text data) that can be transmitted to other participants of the real time teleconference. Additionally, the real time teleconference can be a peer to peer conference (e.g., where a sending endpoint communicates with a receiving endpoint) and/or a multi-party conference (e.g., where an audio/video multi-point control unit (AVMCU) routes data such as the audio data, the video data, and the text data between endpoints).

    摘要翻译: 所要求保护的主题提供有助于产生与实时通信相关联的隐藏字幕服务的系统和/或方法。 例如,可以在实时电话会议中从有源说话者获得音频数据和视频数据。 此外,音频数据可以被转换成可被发送到实时电话会议的其他参与者的一组字符(例如,文本数据)。 此外,实时电话会议可以是对等会议(例如,发送端点与接收端点通信)和/或多方会议(例如,其中音频/视频多点控制单元(AVMCU) 路由诸如音频数据,视频数据和端点之间的文本数据的数据)。

    MULTIPLE RESOLUTION CAPTURE IN REAL TIME COMMUNICATIONS
    7.
    发明申请
    MULTIPLE RESOLUTION CAPTURE IN REAL TIME COMMUNICATIONS 有权
    实时通信中的多个分辨率捕获

    公开(公告)号:US20080266411A1

    公开(公告)日:2008-10-30

    申请号:US11740081

    申请日:2007-04-25

    IPC分类号: H04N5/228

    摘要: During remote communication session, there can be situations where information needs to be sent at a high resolution. Sending information at a high resolution allows for the capture of detail that can be lost without the use of a high resolution. A web camera can obtain information in both a higher resolution and standard resolution. A sending component can send this information encoded with markers that allow a receiving component to process and display the information.

    摘要翻译: 在远程通信会话期间,可能会出现需要以高分辨率发送信息的情况。 以高分辨率发送信息允许在不使用高分辨率的情况下捕获可能丢失的细节。 网络摄像机可以获得更高分辨率和标准分辨率的信息。 发送组件可以发送使用允许接收组件处理和显示信息的标记编码的信息。

    Mechanism for transmitting elementary streams in a broadcast environment
    8.
    发明授权
    Mechanism for transmitting elementary streams in a broadcast environment 有权
    在广播环境中传输基本流的机制

    公开(公告)号:US07433946B2

    公开(公告)日:2008-10-07

    申请号:US10917243

    申请日:2004-08-12

    IPC分类号: G06F15/173 G06F15/16 G06F5/00

    摘要: The techniques and mechanisms described herein are directed at transmitting elementary streams in a broadcast environment. The mechanisms provide a buffer controller and packet scheduler that allow a media format to be transmitted through the broadcasting environment in a manner resulting in a low channel switch delay. A buffer-fullness indicator allows the operation with various types of decoders. A lower bound and an upper bound are calculated for each frame within the elementary stream. The lower bound corresponds to an earliest time for sending the frame without causing an overflow condition within a decoder buffer. The upper bound corresponds to a latest time for sending the frame without causing an underflow condition within the decoder buffer. A send time is then scheduled based on the lower bound and the upper bound that determines when a packet associated with the frame is transmitted over a channel in a broadcast environment.

    摘要翻译: 这里描述的技术和机制针对在广播环境中传输基本流。 这些机制提供了一种缓冲器控制器和分组调度器,其允许以导致低通道切换延迟的方式通过广播环境传输媒体格式。 缓冲器充满度指示器允许使用各种类型的解码器进行操作。 为基本流中的每个帧计算下限和上限。 下限对应于发送帧的最早时间,而不会导致解码器缓冲器内的溢出状况。 上限对应于在解码器缓冲器内不发生下溢条件的发送帧的最新时间。 然后基于下限和上限来调度发送时间,该下限和上限确定与广播环境中的信道相关联的分组何时发送。

    Techniques for managing visual compositions for a multimedia conference call
    9.
    发明申请
    Techniques for managing visual compositions for a multimedia conference call 有权
    用于管理多媒体电话会议的视觉作品的技术

    公开(公告)号:US20080068446A1

    公开(公告)日:2008-03-20

    申请号:US11511749

    申请日:2006-08-29

    IPC分类号: H04N7/14

    摘要: Techniques for managing visual compositions for a multimedia conference call are described. An apparatus may comprise a processor to allocate a display object bit rate for multiple display objects where a total display object bit rate for all display objects is equal to or less than a total input bit rate, and decode video information from multiple video streams each having different video layers with different levels of spatial resolution, temporal resolution and quality for two or more display objects. Other embodiments are described and claimed.

    摘要翻译: 描述用于管理多媒体电话会议的视觉作品的技术。 设备可以包括处理器,用于为多个显示对象分配显示对象比特率,其中所有显示对象的总显示对象比特率等于或小于总输入比特率,并且从多个视频流解码视频信息,每个视频流具有 具有不同级别的空间分辨率,时间分辨率和两个或多个显示对象的质量的不同视频层。 描述和要求保护其他实施例。

    Method and system for inserting closed captions in video
    10.
    发明授权
    Method and system for inserting closed captions in video 有权
    在视频中插入隐藏式字幕的方法和系统

    公开(公告)号:US07342613B2

    公开(公告)日:2008-03-11

    申请号:US10973996

    申请日:2004-10-25

    申请人: Regis J. Crinon

    发明人: Regis J. Crinon

    IPC分类号: H04N7/00 H04N11/00

    CPC分类号: H04N7/0885 H04N7/0112

    摘要: A closed captioning configuration system is described. The system receives parameters of a digital video presentation and computes closed captioning parameters to drive a closed captions encoder, creating closed captions which are compatible with the presentation. In various implementations, the configuration system may be integrated into a video encoder, a closed captions encoder, or both. The configuration system, through analysis of the presentation parameters, can drive captioning for presentations which may differ by frame rate, interlacing, or frame encoding mode, and account for repetition of fields or frames.

    摘要翻译: 描述了一个闭路字幕配置系统。 系统接收数字视频呈现的参数,并计算隐藏字幕参数以驱动闭路字幕编码器,创建与演示文稿兼容的隐藏字幕。 在各种实施方案中,配置系统可以集成到视频编码器,隐藏式字幕编码器或两者中。 通过分析演示参数,配置系统可以驱动可能由帧速率,隔行扫描或帧编码模式不同的演示文字的字幕,并且说明字段或帧的重复。