Method and apparatus for signaling spherical region information in ISOBMFF

    Publication No.: US10819907B2

    Publication Date: 2020-10-27

    Application No.: US16498397

    Filing Date: 2018-03-29

    Applicant: MEDIATEK INC.

    Abstract: A video processing method includes receiving virtual reality (VR) content, encoding visual data obtained from the VR content to generate a part of a coded bitstream, and encapsulating the part of the coded bitstream into ISO Base Media File Format (ISOBMFF) file(s). In one exemplary implementation, the ISOBMFF file(s) may include a timed metadata track associated with a sphere visual track, where the timed metadata track is set to signal that the associated sphere visual track contains at least one spherical region contributed from at least one region visual track. In another exemplary implementation, the ISOBMFF file(s) may include a timed metadata track associated with a region visual track, where the timed metadata track is set to signal that the associated region visual track contributes to at least one spherical region carried in at least one sphere visual track. Further, an associated video processing apparatus is provided.

    METHOD AND APPARATUS FOR SIGNALING SPHERICAL REGION INFORMATION IN ISOBMFF

    Publication No.: US20200053282A1

    Publication Date: 2020-02-13

    Application No.: US16498397

    Filing Date: 2018-03-29

    Applicant: MEDIATEK INC.

    Abstract: A video processing method includes receiving virtual reality (VR) content, encoding visual data obtained from the VR content to generate a part of a coded bitstream, and encapsulating the part of the coded bitstream into ISO Base Media File Format (ISOBMFF) file(s). In one exemplary implementation, the ISOBMFF file(s) may include a timed metadata track associated with a sphere visual track, where the timed metadata track is set to signal that the associated sphere visual track contains at least one spherical region contributed from at least one region visual track. In another exemplary implementation, the ISOBMFF file(s) may include a timed metadata track associated with a region visual track, where the timed metadata track is set to signal that the associated region visual track contributes to at least one spherical region carried in at least one sphere visual track. Further, an associated video processing apparatus is provided.

    Method and apparatus for requesting and receiving selected segment streams based on projection information

    Publication No.: US10313763B2

    Publication Date: 2019-06-04

    Application No.: US15660710

    Filing Date: 2017-07-26

    Applicant: MEDIATEK INC.

    Abstract: Aspects of the disclosure provide an apparatus having an interface circuit, a processing circuit, and a display device. The interface circuit is configured to receive media presentation description information of media data. The media data includes video content on a two-dimensional (2D) plane that is projected from video content of a sphere surface. The video content on the 2D plane includes a plurality of segment streams having different coverages on the 2D plane. The media presentation description information uses projection based spatial relationship description (P-SRD) to describe the different coverages of the video content on the 2D plane according to the projection. The processing circuit is configured to determine one or more segment streams based on a region of interest for image generation and the P-SRD, select segments in the one or more segment streams, and cause the interface circuit to request and receive the selected segments.

    METHOD AND APPARATUS FOR STREAMING VIDEO CONTENT

    Publication No.: US20180035172A1

    Publication Date: 2018-02-01

    Application No.: US15660710

    Filing Date: 2017-07-26

    Applicant: MEDIATEK INC.

    Abstract: Aspects of the disclosure provide an apparatus having an interface circuit, a processing circuit, and a display device. The interface circuit is configured to receive media presentation description information of media data. The media data includes video content on a two-dimensional (2D) plane that is projected from video content of a sphere surface according to a projection. The video content on the 2D plane includes a plurality of segment streams having different coverages of the video content on the 2D plane. The media presentation description information uses projection based spatial relationship description (P-SRD) to describe the different coverages of the video content on the 2D plane according to the projection. The processing circuit is configured to determine one or more segment streams based on a region of interest for image generation and the P-SRD, select segments in the one or more segment streams, and cause the interface circuit to request and receive the selected segments.
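
    The selection step in the abstract above (choosing segment streams whose coverage meets a region of interest) can be illustrated with a minimal sketch. It assumes each coverage is a plain axis-aligned rectangle (x, y, width, height) on the 2D plane; the names `overlaps`, `select_streams`, and the tile labels are hypothetical and are not taken from the patent or from any P-SRD syntax. Projection-specific handling, such as wrap-around at the left and right edges of an equirectangular frame, is deliberately left out.

```python
from typing import Dict, List, Tuple

Rect = Tuple[int, int, int, int]  # (x, y, width, height) on the 2D plane


def overlaps(a: Rect, b: Rect) -> bool:
    """Return True if two axis-aligned rectangles share any area."""
    ax, ay, aw, ah = a
    bx, by, bw, bh = b
    return ax < bx + bw and bx < ax + aw and ay < by + bh and by < ay + ah


def select_streams(coverages: Dict[str, Rect], roi: Rect) -> List[str]:
    """Pick every segment stream whose coverage intersects the region of interest."""
    return [name for name, rect in coverages.items() if overlaps(rect, roi)]


# Hypothetical 2x2 tiling of a 1920x1080 projected frame.
coverages = {
    "tile0": (0, 0, 960, 540),
    "tile1": (960, 0, 960, 540),
    "tile2": (0, 540, 960, 540),
    "tile3": (960, 540, 960, 540),
}
roi = (800, 100, 300, 200)  # straddles the boundary between tile0 and tile1
selected = select_streams(coverages, roi)
```

    With this tiling, a region of interest that straddles the top two tiles selects only those two streams, so a client would request segments from them and skip the bottom row.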

    METHODS AND APPARATUS FOR SIGNALING VIEWPORTS AND REGIONS OF INTEREST

    Publication No.: US20180199042A1

    Publication Date: 2018-07-12

    Application No.: US15861503

    Filing Date: 2018-01-03

    Applicant: MediaTek Inc.

    IPC Classification: H04N19/167, H04N19/172

    Abstract: The techniques described herein relate to methods, apparatus, and computer readable media configured to encode or decode a region of interest associated with video data. A spherical region structure is associated with the video data that specifies the region of interest on a sphere, the spherical region structure including a reference point of the region of interest on the sphere, and data indicative of a set of side points, comprising a side point for each side of the region of interest on the sphere. The region of interest in the video data is determined based on the reference point and the set of side points. The video data can be composite video data. The spherical region structure, and/or metadata based on the spherical region structure, can be implicitly or explicitly associated with the video data.

    Hierarchical Entity Grouping And Entity Reference In Media Content Delivery

    Publication No.: US20240357194A1

    Publication Date: 2024-10-24

    Application No.: US18638606

    Filing Date: 2024-04-17

    Applicant: MediaTek Inc.

    Inventors: Xin Wang, Lulin Chen

    Abstract: In a method of delivering media content, a streaming server provides media content to a streaming client. The media content includes multiple media tracks. A first group of entities is specified for the media content. Each entity in the first group of entities is a media track, an item, or a child group of entities. A first data structure is populated to specify one or more arrays of entity identifiers for the first group of entities. Each array of entity identifiers has one or more identifiers of the entities in the first group. Each array of entity identifiers is associated with one reference type that describes the entities identified by the array of entity identifiers. The first data structure is provided to the streaming client, which provides entities of the first group from the media content for playback according to the first data structure.
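
    The data structure described in the abstract above (a group holding arrays of entity identifiers, each array tagged with one reference type, where an entity may itself be a child group) can be sketched as follows. This is only an illustration under assumptions: the class names `ReferenceArray` and `EntityGroup`, the helper `leaf_entities`, and the reference-type labels are all hypothetical and do not reproduce the patent's syntax or any ISOBMFF box definition.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class ReferenceArray:
    reference_type: str      # hypothetical label describing the listed entities
    entity_ids: List[int]    # identifiers of tracks, items, or child groups


@dataclass
class EntityGroup:
    group_id: int
    arrays: List[ReferenceArray] = field(default_factory=list)


def leaf_entities(group_id: int, groups: Dict[int, EntityGroup]) -> List[int]:
    """Walk a group and its child groups, returning track/item identifiers in order."""
    leaves: List[int] = []
    for array in groups[group_id].arrays:
        for entity_id in array.entity_ids:
            if entity_id in groups:          # identifier names a child group
                leaves.extend(leaf_entities(entity_id, groups))
            else:                            # identifier names a track or item
                leaves.append(entity_id)
    return leaves


# Hypothetical hierarchy: group 100 lists two tracks and one child group (200).
groups = {
    100: EntityGroup(100, [ReferenceArray("video", [1, 2]),
                           ReferenceArray("group", [200])]),
    200: EntityGroup(200, [ReferenceArray("audio", [3])]),
}
resolved = leaf_entities(100, groups)
```

    Keeping each array as its own object, rather than merging arrays by reference type, mirrors the abstract's statement that each array carries exactly one reference type. The sketch assumes the hierarchy has no cycles.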

    Method and apparatus for deriving composite tracks

    Publication No.: US10805620B2

    Publication Date: 2020-10-13

    Application No.: US15865916

    Filing Date: 2018-01-09

    Applicant: MEDIATEK INC.

    Abstract: Aspects of the disclosure provide a method and an apparatus for deriving composite tracks. The disclosed apparatus includes processing circuitry. The processing circuitry is configured to generate a file that includes elementary track boxes respectively for elementary tracks. Each elementary track box indexes a sequence of media samples in time order that forms an elementary track. The processing circuitry is configured to construct a composite track box for a composite track. The composite track box identifies one or more elementary tracks, and a composite operation to form the composite track based on the one or more elementary tracks. The processing circuitry is further configured to generate a media presentation based on the composite track.

    Methods and apparatus for deriving composite tracks with track grouping

    Publication No.: US10778993B2

    Publication Date: 2020-09-15

    Application No.: US16014856

    Filing Date: 2018-06-21

    Applicant: MediaTek Inc.

    Abstract: The techniques described herein relate to methods, apparatus, and computer readable media configured to derive a composite track. Three-dimensional video data includes a plurality of two-dimensional sub-picture tracks associated with a viewport. A composite track derivation for composing the plurality of two-dimensional sub-picture tracks for the viewport includes data indicative of the plurality of two-dimensional sub-picture tracks belonging to a same group, placement information to compose sample images from the plurality of two-dimensional tracks into a canvas associated with the viewport, and a composition layout operation to adjust the composition if the canvas comprises a composition layout created by two or more of the plurality of two-dimensional sub-picture tracks composed on the canvas. The composite track derivation can be encoded and/or used to decode the three-dimensional video data.

    Method and apparatus for track composition

    Publication No.: US10602239B2

    Publication Date: 2020-03-24

    Application No.: US15928823

    Filing Date: 2018-03-22

    Applicant: MEDIATEK INC.

    Abstract: Aspects of the disclosure provide an apparatus that includes interface circuitry and processing circuitry. The interface circuitry is configured to receive signals carrying metadata for visual track composition from multiple visual tracks. The visual track composition includes alpha compositing, and can include spatial compositing and background compositing. The processing circuitry is configured to parse the metadata to extract configuration information for the visual track composition. Further, the processing circuitry receives a first sample from a first visual track and a second sample from a second visual track, and combines the first sample with the second sample to generate a composite sample based on the configuration information for the visual track composition.
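
    The alpha compositing mentioned in the abstract above can be illustrated with the standard Porter-Duff "over" operator applied to a single pixel. This is a generic sketch, not the patented method: it assumes non-premultiplied (r, g, b, a) values in the range 0 to 1, and the function name `alpha_over` is hypothetical. How the configuration metadata selects or parameterizes the operation is not specified in the abstract and is not modeled here.

```python
from typing import Tuple

Pixel = Tuple[float, float, float, float]  # (r, g, b, a), each in [0, 1]


def alpha_over(fg: Pixel, bg: Pixel) -> Pixel:
    """Composite a foreground pixel over a background pixel (Porter-Duff 'over')."""
    fa, ba = fg[3], bg[3]
    out_a = fa + ba * (1.0 - fa)
    if out_a == 0.0:
        # Both inputs fully transparent: the color is undefined, return clear black.
        return (0.0, 0.0, 0.0, 0.0)
    r, g, b = (
        (fc * fa + bc * ba * (1.0 - fa)) / out_a
        for fc, bc in zip(fg[:3], bg[:3])
    )
    return (r, g, b, out_a)


# A half-transparent red sample over an opaque blue sample.
composite = alpha_over((1.0, 0.0, 0.0, 0.5), (0.0, 0.0, 1.0, 1.0))
```

    Applying the same function to every pixel pair of two decoded samples would yield one composite sample; a real implementation would typically operate on whole image arrays rather than single tuples.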