-
公开(公告)号:US12072845B2
公开(公告)日:2024-08-27
申请号:US18086415
申请日:2022-12-21
发明人: Xiaowen Wang , Lei Zhang , Paul Martin Messing , Chittibabu Pacharu , Weijun Guo , Kevin Alan Erickson , Shenghao Li , David Gregory Grant
IPC分类号: G06F16/174 , H03M7/30
CPC分类号: G06F16/1744 , H03M7/3071 , H03M7/6011
摘要: Some disclosed embodiments are directed to methods and systems for performing pair-wise delta compression. For example, systems obtain a set of files to be compressed into a single compressed file. The system identifies different attributes related to the set of files. For each file in the set of files, the system predicts an optimized set of candidate compression files and calculates a delta between each file in the optimized set and the target file corresponding to the optimized set. After identifying the smallest delta, the system compresses the selected pair of files associated with the smallest delta in order to generate the single compressed file for the set of files.
-
公开(公告)号:US11967120B2
公开(公告)日:2024-04-23
申请号:US17368133
申请日:2021-07-06
申请人: TENCENT AMERICA LLC
发明人: Xiang Zhang , Wen Gao , Shan Liu
CPC分类号: G06T9/001 , H03M7/3071 , H03M7/3077 , H03M7/6005 , H03M7/6011
摘要: A method, computer program, and computer system is provided for point cloud coding. The method includes receiving, from a bitstream, data corresponding to a point cloud; reconstructing, based on the data, a first attribute value of a first duplicate point from among a plurality of duplicate points corresponding to a single geometry position; obtaining at least one prediction residual corresponding to at least one remaining attribute value of at least one remaining duplicate point from among the plurality of duplicate points; reconstructing the at least one remaining attribute value based on the reconstructed first attribute and the at least one prediction residual; and decoding the data corresponding to the point cloud based on the reconstructed first attribute value and the reconstructed at least one remaining attribute value.
-
公开(公告)号:US20240046026A1
公开(公告)日:2024-02-08
申请号:US18488275
申请日:2023-10-17
发明人: Ronny LEMPEL , Chenyan XIONG
IPC分类号: G06F40/126 , G06F40/279 , H03M7/30
CPC分类号: G06F40/126 , G06F40/279 , H03M7/3071 , H03M7/6011
摘要: A method for text compression comprises recognizing a prefix string of one or more text characters preceding a target string of a plurality of text characters to be compressed. The prefix string is provided to a natural language generation (NLG) model configured to output one or more predicted continuations each having an associated rank. If the one or more predicted continuations include a matching predicted continuation relative to the next one or more text characters of the target string, the next one or more text characters are compressed as an NLG-type compressed representation. If no predicted continuations match the next one or more text characters of the target string, a longest matching entry in a compression dictionary is identified. The next one or more text characters of the target string are compressed as a dictionary-type compressed representation that includes the dictionary index value of the longest matching entry.
-
公开(公告)号:US11803693B2
公开(公告)日:2023-10-31
申请号:US17351531
申请日:2021-06-18
发明人: Ronny Lempel , Chenyan Xiong
IPC分类号: H03M7/00 , G06F40/126 , G06F40/279 , H03M7/30
CPC分类号: G06F40/126 , G06F40/279 , H03M7/3071 , H03M7/6011
摘要: A method for text compression comprises recognizing a prefix string of one or more text characters preceding a target string of a plurality of text characters to be compressed. The prefix string is provided to a natural language generation (NLG) model configured to output one or more predicted continuations each having an associated rank. If the one or more predicted continuations include a matching predicted continuation relative to the next one or more text characters of the target string, the next one or more text characters are compressed as an NLG-type compressed representation. If no predicted continuations match the next one or more text characters of the target string, a longest matching entry in a compression dictionary is identified. The next one or more text characters of the target string are compressed as a dictionary-type compressed representation that includes the dictionary index value of the longest matching entry.
-
公开(公告)号:US20190044533A1
公开(公告)日:2019-02-07
申请号:US16076347
申请日:2017-02-08
CPC分类号: H03M7/3071 , G06F16/285 , G06F17/18 , H03M7/3059 , H03M7/4012
摘要: A device (100) for and method of determining clusters of sequences of instances of a first type of data for compacting a data set comprising sequences of instances of the first type of data is provided. Also a method of compacting a data set, a method of transmitting compacted data and a computer program product are provided. In a sequence clustering unit (110) of the device, sequences of a first set of data are clustered on basis of conditional probabilities. Each unique sequence of the first set of data is associated with one or more conditional probabilities that an instance of the second set of data has a specific value given the unique sequence. In the clustering a significant part of the mutual information between the first set of data and the second set of data is maintained.
-
公开(公告)号:US20170302944A1
公开(公告)日:2017-10-19
申请号:US15639259
申请日:2017-06-30
IPC分类号: H04N19/44 , H04N19/174 , H04N19/13 , H04N19/167 , H04N19/51 , H04N19/436
CPC分类号: H04N19/44 , H03M7/3071 , H03M7/4037 , H04N19/13 , H04N19/167 , H04N19/174 , H04N19/436 , H04N19/503 , H04N19/51 , H04N19/91
摘要: The entropy coding of a current part of a predetermined entropy slice is based on, not only, the respective probability estimations of the predetermined entropy slice as adapted using the previously coded part of the predetermined entropy slice, but also probability estimations as used in the entropy coding of a spatially neighboring, in entropy slice order preceding entropy slice at a neighboring part thereof. Thereby, the probability estimations used in entropy coding are adapted to the actual symbol statistics more closely, thereby lowering the coding efficiency decrease normally caused by lower-delay concepts. Temporal interrelationships are exploited additionally or alternatively.
-
公开(公告)号:US09596469B2
公开(公告)日:2017-03-14
申请号:US14141374
申请日:2013-12-26
IPC分类号: H04N11/02 , H04N19/91 , H04N19/503 , H04N19/13 , H04N19/436 , H03M7/40 , H03M7/30
CPC分类号: H04N19/44 , H03M7/3071 , H03M7/4037 , H04N19/13 , H04N19/167 , H04N19/174 , H04N19/436 , H04N19/503 , H04N19/51 , H04N19/91
摘要: The entropy coding of a current part of a predetermined entropy slice is based on, not only, the respective probability estimations of the predetermined entropy slice as adapted using the previously coded part of the predetermined entropy slice, but also probability estimations as used in the entropy coding of a spatially neighboring, in entropy slice order preceding entropy slice at a neighboring part thereof. Thereby, the probability estimations used in entropy coding are adapted to the actual symbol statistics more closely, thereby lowering the coding efficiency decrease normally caused by lower-delay concepts. Temporal interrelationships are exploited additionally or alternatively.
摘要翻译: 预定熵片的当前部分的熵编码不仅基于使用预定熵片的先前编码部分进行适应的预定熵片的各自的概率估计,而且还基于在熵中使用的概率估计 在相邻部分处于熵切片之前的熵片序列中的空间相邻的编码。 由此,在熵编码中使用的概率估计更适合于实际符号统计,从而降低通常由较低延迟概念引起的编码效率降低。 时间相互关系被额外地或替代地利用。
-
公开(公告)号:US12080385B2
公开(公告)日:2024-09-03
申请号:US18237187
申请日:2023-08-23
申请人: Illumina, Inc.
CPC分类号: G16B50/50 , H03M7/3071 , H03M7/6011
摘要: Methods, systems, and computer programs for compressing nucleic acid sequence data. A method can include obtaining nucleic acid sequence data representing: (i) a read sequence, and (ii) a plurality of quality scores, determining whether the read sequence includes at least one “N” base, based on a determination that the read sequence includes at least one “N” base, generating, by one or more computers, a first encoding data set by using a first encoding process to encode each set of four quality scores of the read sequence into a single byte of memory, and using a second encoding process to encode the first encoded data set, thereby compressing the data to be compressed.
-
公开(公告)号:US20240062853A1
公开(公告)日:2024-02-22
申请号:US18237187
申请日:2023-08-23
申请人: Illumina, Inc.
CPC分类号: G16B50/50 , H03M7/3071 , H03M7/6011
摘要: Methods, systems, and computer programs for compressing nucleic acid sequence data. A method can include obtaining nucleic acid sequence data representing: (i) a read sequence, and (ii) a plurality of quality scores, determining whether the read sequence includes at least one “N” base, based on a determination that the read sequence includes at least one “N” base, generating, by one or more computers, a first encoding data set by using a first encoding process to encode each set of four quality scores of the read sequence into a single byte of memory, and using a second encoding process to encode the first encoded data set, thereby compressing the data to be compressed.
-
公开(公告)号:US20180026649A1
公开(公告)日:2018-01-25
申请号:US15654632
申请日:2017-07-19
申请人: Georges Harik
发明人: Georges Harik
CPC分类号: H03M7/30 , G06N3/0454 , G06N3/063 , G06N3/08 , G06N20/00 , H03M7/3071 , H03M7/3082
摘要: A data compression system includes: (a) a data compression module that receives a sequence of input vectors and that provides a sequence of compressed vectors; (b) a data decompression module that receives the compressed vectors to provide a sequence of output vectors; and (c) a parameter update module that receives the sequence of input vectors and the sequence of output vectors, and which learns the data compression module and data decompression module based on evaluating a loss function of the input vectors, the output vectors, and the parameters controlling the compression module and the decompression module. Each input vector and its corresponding output vector may represent digitized time-domain signals (e.g., speech, audio or video signals) over a predetermined time period. The loss function may be evaluated for each of a sequence of predetermined time periods.
-
-
-
-
-
-
-
-
-