Abstract:
In the present invention, the divider circuit divides two integer numbers using a novel hardware and circuit implementation technique to get the resultant quotient and the residue or remainder, based on using Modified Divisor (MDr) and variable dividend grouping based on the number of zeros canceled to achieve variable conversion time or latency and area efficiency. The number of iterations depends on the number of group dividends are formed based on the count of the number of zeros canceled while deriving Modified Divisor (MDr).
Abstract:
Systems and methods relate to division of a dividend by a divisor, with fast result formatting. Counts of leading sign bits of the dividend and the divisor are determined. The dividend and the divisor are normalized based on their respective counts of leading sign bits to obtain a normalized dividend and a normalized divisor, respectively. An exact number of significant quotient bits of a quotient of the division, based on the normalized dividend, the normalized divisor, and the counts of leading sign bits of the dividend and divisor and used to determine a correct position of a leading bit of the quotient based on this exact number. The quotient is developed by placing the leading bit at or near the correct position and appending less significant bits to the right of the leading bit. Thus, left-shifts in each iteration and large final shifts are avoided in formatting the result.
Abstract:
The present invention provides apparatus, methods, and computer program products for non-iterative division and non-iterative reciprocal generation. In one embodiment, the present invention uses a logic network that determines the bits of the quotient of a divisor and dividend by using a non-iterative, (i.e., non trial and error) method. Further, in another embodiment, the present invention may determine the reciprocal of a number M by separating the number M into at least two numbers X, Y...Z so that M = X + Y +...+ Z. The reciprocal of M is computed according to an equation 1/M = F(X, Y...Z) or an approximation 1/M APPROX G(X, Y...Z), where the approximation gives the correct value of the inverse of M to a predetermined accuracy. In some embodiments, the apparatus uses an equation that exactly describes the reciprocal or instead, it may include one or more memories for storing look-up tables containing pre-calculated parts of the equation.
Abstract translation:本发明提供用于非迭代分割和非迭代相互产生的装置,方法和计算机程序产品。 在一个实施例中,本发明使用逻辑网络,其通过使用非迭代(即,非试验和错误)方法来确定除数和除数的商的比特。 此外,在另一个实施例中,本发明可以通过将数M分成至少两个数X,Y ... Z来确定数M的倒数,使得M = X + Y + ... + Z。 根据等式1 / M = F(X,Y ... Z)或逼近1 / M APPROX G(X,Y ... Z)计算M的值,其中近似给出正确的逆的值 的M到预定的精度。 在一些实施例中,该装置使用精确描述倒数的方程式,或者替代地,其可以包括用于存储包含方程式的预先计算部分的查找表的一个或多个存储器。
Abstract:
A method for calculating the initial value in the Newton-Raphson method by synthesization using values retrieved from a small table stored in a memory by means of an arithmetic unit. The method of the invention is free from problems of conventional dividers such as a long operating time and a large circuit scale. The iterative calculation circuit of the Newton-Raphson method is mounted as an arithmetic circuit. A reciprocal calculating circuit having a short operating time, usable as a pipeline system, and exhibiting an improved throughput and a dividing circuit comprising the reciprocal calculating circuit are mounted. Part of the multiplying circuit included in the iterative calculation circuit is omitted by regarding the calculation precision, thus making it a compact iterative calculation circuit. Further by using the interim results of the calculation, the calculation precision is readily enhanced. Thus, a method for calculating the minimum size required for the table used for calculating the initial value is provided, and means for designing a circuit of a requisite minimum while fulfilling the required calculation precision is also provided.
Abstract:
Aspects of the disclosure provide methods and apparatuses for point cloud compression and decompression. In some examples, an apparatus for point cloud compression/decompression includes processing circuitry. The processing circuitry determines to use a prediction mode for coding (encoding/ decoding) information associated with a current point in a point cloud. In the prediction mode, the information associated with the current point is predicted based on one or more neighbor points of the current point. The processing circuitry calculates, using integer operations, a distance-based weighted average value based on distances of the one or more neighbor points to the current point, and determines the information associated with the current point based on the distance-based weighted average value.
Abstract:
The disclosure describes a method for performing a fixed point calculation of a floating point operation (A // B) in a coding device, wherein A // B represents integer division of A divided by B rounded to a nearest integer. The method may comprise selecting an entry from a lookup table (LUT) having entries generated as an inverse function of an index B, wherein B defines a range of values that includes every DC scalar value and every quantization parameter associated with a coding standard, and calculating A // B for coding according to the coding standard based on values A, B1 and B2, wherein B1 and B2 comprise high and low portions of the selected entry of the LUT. The techniques may simplify digital signal processor (DSP) implementations of video coders, and are specifically useful for MPEG-4 coders and possibly others.
Abstract:
Methods and apparatus for quickly dividing multiple-bit operands using bit-serial processors include strategies for eliminating the number of steps required to execute conventional division operations. According to an exemplary embodiment, a conditional subtraction step, based on a quotient bit computed during a given pass, is combined with a compare step which is used to compute a next quotient bit and which, according to conventional techniques, is ordinarily computed during a subsequent pass. Additionally, exemplary embodiments provide a zero/non-zero mask for denominator bits which extend beyond current most signficant remainder bit during a given pass. As a result, not all denominator bits need be considered during every pass. Advantageously, the methods and apparatus of the invention can provide approximately a 3 to 1 speed improvement as compared to conventional techniques.
Abstract:
A computation unit computes a division operation Y/X by determining the value of a divisir reciprocal l/X and multiplying the reciprocal by a numerator Y. The reciprocal l/X value is determined using a quadratic approximation having a form: Ax +Bx+C, where coefficients A, B, and C are constants that are stored in a storage or memory such as a read-only memory (ROM). The bit length of the coefficient determines the error in a final result. Storage size is reduced through use of "least mean square error" techniques in the determination of the coefficients that are stored in the coefficient storage. During the generation of partial products x , Ax , and Bx, the process of rounding is eliminated, thereby reducing the computational logic to implement the division functionality.
Abstract:
A single-/double-precision selector (106), a CPA (107) for performing a carry-propagation of a partial remainder, a quotient digit selection circuit (108) and a divisor selector (109) are added to the low-order side of a double-precision-bit-width SRT floating-point divider, a selector for selecting a carry propagation is provided between CSAs on the high-order and low-order sides, and a start position selector is provided within a quotient generating circuit, thereby it is possible to simultaneously execute two computations of a single-precision division without increasing the width of a computing unit. Since, during a double-precision computation, the same inputs are given to the quotient digit selection circuit to double the drive force of a selected quotient digit, a faster circuit can be implemented during a double-precision computation. As for a square-root computation, a single-precision two-computation parallel execution is possible just as is the case with division by adding a partial square root circuit.