-
公开(公告)号:WO2021058878A2
公开(公告)日:2021-04-01
申请号:PCT/FI2020/050633
申请日:2020-09-25
Applicant: NOKIA TECHNOLOGIES OY
Inventor: AFLAKI BENI, Payman , SCHWARZ, Sebastian
IPC: H04N19/597 , H04N13/161 , H04N19/119 , H04N19/174 , H04N19/85 , H04N19/70 , G06K9/00765 , G06T9/00 , H04N19/167 , H04N19/172
Abstract: Apparatuses, methods, and computer programs are disclosed for fractional/arbitrary tile grouping. An apparatus includes means for: receiving a video presentation frame, wherein the video presentation frame represents three-dimensional data; dividing the video presentation frame into a plurality of tiles, wherein one or more of the plurality of tiles may be a fractional tile, as part of a process of encoding the video presentation frame, wherein each tile represents a part of the three- dimensional data of the video presentation frame; grouping the tiles into one or more groups, wherein individual tiles of the video presentation frame have a capability of not belonging to any of the one or more groups; in response to fractional tiling being present, transmitting a signal of fractional tiling related syntax or semantics; and providing an encoded video presentation frame to a decoder, the encoded video presentation frame comprising the grouping of the tiles.
-
公开(公告)号:WO2021202221A1
公开(公告)日:2021-10-07
申请号:PCT/US2021/024053
申请日:2021-03-25
Applicant: AMAZON TECHNOLOGIES, INC.
Inventor: EFFINGER, Charles , DALL, Ryan, Barlow , SIAGIAN, Christian, Garcia , ITO, Jonathan, Y , VOLOVIK, Vadim , TSURUTANI, Brady, Court
IPC: H04N21/00 , G06K9/00718 , G06K9/00744 , G06K9/00765 , G06K9/627 , G06N20/00 , G06N3/0454 , G06N3/08 , G11B27/036 , H04N21/233 , H04N21/23418 , H04N21/812 , H04N21/8455 , H04N21/8456 , H04N21/8547
Abstract: Technologies are provided for generation of points of insertion of directed content into a video asset. In some embodiments, multiple time offsets within an interval spanned by the video asset can be determined using audio data corresponding to the video asset. A time offset defines a boundary between first and second segments of the video asset. Using image data corresponding to the video asset, respective pairs of video clips for the multiple time offsets can be generated. Visual features, aural features, and language features pertaining to the respective pairs of video clips can then be generated. Scores for the multiple time offsets can be generated using the visual features, the aural features, and the language features. A score represents an assessment of suitability to insert directed content into the video asset at a time offset. A file that contains specific time offsets can be generated.
-
公开(公告)号:WO2021202300A1
公开(公告)日:2021-10-07
申请号:PCT/US2021/024450
申请日:2021-03-26
Applicant: ON TIME STAFFING INC.
Inventor: OLSHANSKY, Roman
IPC: G06F16/60 , G06F16/70 , G06K9/00718 , G06K9/00744 , G06K9/00765 , G06Q10/1053 , G11B27/10 , H04N5/247 , H04N5/77 , H04N9/8715 , H04R1/08
Abstract: A system and method are presented to create custom versions for users of recorded sessions of individuals. Individuals are recorded at a booth responding to prompts. Audio and visual data recorded at the booth are divided into time segments according to the timing of the prompts. Depth sensors at the booth are used to assign score values to time segments. Prompts are related to criteria that were selected as being relevant to an objective. Users are associated with subsets of criteria in order to identify subsets of prompts whose responses are relevant to the users. Time segments of audio and visual data created by the identified subset of prompts are selected. The selected time segments are ordered according to herd behavior analysis. Lesser weighted time segments may be redacted. The remaining portions of ordered time segments are presented to the user as a custom version.
-
公开(公告)号:WO2021132802A1
公开(公告)日:2021-07-01
申请号:PCT/KR2020/003316
申请日:2020-03-10
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: PHILLIPS, Caleb , MOHOMED, Iqbal , FAZLY, Afsaneh , JEPSON, Allan
IPC: G06F16/783 , H04N21/84 , G06F16/735 , G06F16/7844 , G06K2209/01 , G06K9/00718 , G06K9/00765 , G11B27/19
Abstract: An apparatus for video searching, includes a memory storing instructions, and a processor configured to execute the instructions to split a video into scenes, obtain, from the scenes into which the video is split, one or more textual descriptors describing each of the scenes, encode the obtained one or more textual descriptors describing each of the scenes into a video scene vector of each of the scenes, encode a user query into a query vector having a same semantic representation as that of the video scene vector of each of the scenes into which the one or more textual descriptors describing each of the scenes are encoded, and identify whether the video scene vector of at least one among the scenes corresponds to the query vector into which the user query is encoded.
-
公开(公告)号:WO2021202017A1
公开(公告)日:2021-10-07
申请号:PCT/US2021/019529
申请日:2021-02-25
Applicant: MICRON TECHNOLOGY, INC.
Inventor: GOLOV, Gil
IPC: G06N5/02 , G06N20/00 , G06K9/00765 , G06K9/6256 , H04L47/127 , H04L47/24 , H04L47/823
Abstract: Methods, systems, and apparatuses related to reducing network congestion by analyzing data using a lightweight artificial intelligence (AI) layer prior to transmission are described. An AI model may predictively select data that need not be transmitted and, in some embodiments, further process data to be transmitted. As a result, the total size and amount of data transmitted over the network can be reduced, while the data needs of the receiving device can still be met. For example, data generated by a source application may be received and input into a predictive model, which may generate a prediction output for the data. The data may be pre-processed using a strategy selected based on the prediction output, and the pre-processed data may be transmitted over a network to a server.
-
公开(公告)号:WO2021145715A1
公开(公告)日:2021-07-22
申请号:PCT/KR2021/000580
申请日:2021-01-15
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: RUSTAGI, Shubham , CHERUKURI, Sathish , PARWANI, Shashi Kumar , BABU, Mineni Niswanth , CHAKRABORTY, Prasenjit
IPC: G06K9/00 , G06K9/20 , G06K9/62 , G06K9/00765 , G06T2200/24 , G06T2207/10016 , G06T5/002 , G11B27/031
Abstract: A computer-implemented method includes obtaining, from a storage, a video to be enhanced, based on a selection of a user; determining corresponding sets of object scores for a plurality of objects identified in the video, respectively, based on a set of predetermined factors; identifying a primary object and one or more secondary objects in the video, among the plurality of objects, based on the corresponding sets of object scores; and applying at least one visual effect to the primary object and at least one secondary object, from the one or more secondary objects, in at least a portion of the video, for obtaining an enhanced video with the at least one visual effect applied at least to the portion of the video.
-
公开(公告)号:WO2021202293A1
公开(公告)日:2021-10-07
申请号:PCT/US2021/024423
申请日:2021-03-26
Applicant: ON TIME STAFFING INC.
Inventor: OLSHANSKY, Roman
IPC: G06F16/60 , G06F16/70 , G06K9/00718 , G06K9/00744 , G06K9/00765 , G06Q10/1053 , G11B27/10 , H04N5/247 , H04N5/77 , H04N9/8715 , H04R1/08
Abstract: A system and method are presented for recording audio and video of an individual within a kiosk on separate audio and video computers that are locally connected to the kiosk. Instructions are provided to the individual through a locally connected controller computer. A remote user computer requests recorded data from the kiosk. The controller computer prompts the audio and video computers to separately stream audio and video to the remote user computer. The controller computer divides the audio and video data into time segments, and then presents different versions of the session to different users, with each different version comprising a different set of time segments. A central system server provides searching capabilities to the user computer to search and request data from a plurality of remotely located kiosks, each having separate controller, audio, and video computers.
-
公开(公告)号:WO2021202096A1
公开(公告)日:2021-10-07
申请号:PCT/US2021/022509
申请日:2021-03-16
Applicant: ALIBABA GROUP HOLDING LIMITED
Inventor: YANG, Zhe
IPC: H04N7/18 , G06K9/46 , G06T7/00 , G06K9/00765 , G11B27/031 , G11B27/34 , H04N5/272
Abstract: A method includes: obtaining video data; receiving a target object; processing the video data to obtain one or more first images of the video data that contain an image of the target object and one or more second images of the video data that do not contain the image of the target object; discarding the one or more second images from the video data; generating a background image based on the one or more second images; synthesizing the one or more first images of the video data with the background image to generate one or more synthesized images; obtaining a target image sequence with the one or more synthesized images based on a time order of the one or more first images in the video data; and displaying each of the one or more synthesized images in the target image sequence in a fade-in and fade-out manner.
-
公开(公告)号:WO2020072972A1
公开(公告)日:2020-04-09
申请号:PCT/US2019/054819
申请日:2019-10-04
Applicant: MAGIC LEAP, INC. , MOHAN, Anush , TAYLOR, Robert, Blake , MIRANDA, Jeremy, Dwayne , TORRES, Rafael, Domingos , OLSHANSKY, Daniel , SHAROKNI, Ali , GUENDELMAN, Eran , KRAMER, Nick , TOSSELL, Ken , MILLER, Samuel A. , TAJIK, Jehangir , SWAMINATHAN, Ashwin , AGARWAL, Lomesh , SINGHAL, Prateek , HOLDER, Joel, David , ZHAO, Xuan , CHOUDHARY, Siddharth , SUZUKI, Helder, Toshiro , BAROT, Hiral, Honar
Inventor: MOHAN, Anush , TAYLOR, Robert, Blake , MIRANDA, Jeremy, Dwayne , TORRES, Rafael, Domingos , OLSHANSKY, Daniel , SHAHROKNI, Ali , GUENDELMAN, Eran , KRAMER, Nick , TOSSELL, Ken , MILLER, Samuel A. , TAJIK, Jehangir , SWAMINATHAN, Ashwin , AGARWAL, Lomesh , SINGHAL, Prateek , HOLDER, Joel, David , ZHAO, Xuan , CHOUDHARY, Siddharth , SUZUKI, Helder, Toshiro , BAROT, Hiral, Honar , MOORE, Christian Ivan Robert
IPC: G06T19/00 , G06F3/0482 , G06F3/0486 , G06K9/46 , G06K9/52 , G06F3/04815 , G06K9/00671 , G06K9/00765 , G06K9/4671 , G06K9/629 , G06T19/006
Abstract: A cross reality system that provides an immersive user experience by storing persistent spatial information about the physical world that one or multiple user devices can access to determine position within the physical world and that applications can access to specify the position of virtual objects within the physical world. Persistent spatial information enables users to have a shared virtual, as well as physical, experience when interacting with the cross reality system. Further, persistent spatial information may be used in maps of the physical world, enabling one or multiple devices to access and localize into previously stored maps, reducing the need to map a physical space before using the cross reality system in it. Persistent spatial information may be stored as persistent coordinate frames, which may include a transformation relative to a reference orientation and information derived from images in a location corresponding to the persistent coordinate frame.
-
-
-
-
-
-
-
-