Traditional rounding 'bias'. I don't get it.

septimus · June 12, 2020, 11:10am

I mentioned the Jpeg tricks because they seemed interesting. We’re not talking of banks that misplace a few pennies, but of lossy compression methods (not just Jpeg) which get a big performance boost by understanding rounding bias.

It’s safe to say that I have more than a passing familiarity with Jpeg.

The compression “trick” I mentioned was designed to operate with any ordinary decompressor. (And obviously, reduction of the frequency of ±1 would immediately yield a Huffman code optimization.) The decompression “trick” I mentioned operates with any ordinary compressor. (Jpeg files can include custom markers for compressor-decompressor cooperation to get further improvement, but the tricks I described don’t require them.)

The tricks are so obvious that it may be a mystery why Jpeg did NOT explicitly incorporate them. I am aware of one company which kept its knowledge of these ideas secret to obtain a competitive advantage over “vanilla” implementations. (I had a hunch what they were doing and confirmed it with a code disassembler.)

People on the Jpeg-2000 committee (some of whom were aware of my own paper on this topic) did add a new quantization parameter:

Chronos · June 12, 2020, 1:13pm

If the company was keeping secret that they were doing this, then it couldn’t have been protected under patent, and anyone who managed to reverse-engineer their trick would be legally entitled to do whatever they wished with it (as long as they did so without help from any employees of that company, who are probably bound by their contracts).

Pleonast · June 12, 2020, 1:35pm

I found your post very interesting, but I can’t reply to it other than to say that. I don’t like replying to GQ posts without adding to the discussion.

septimus · June 12, 2020, 2:29pm

Thank you, Pleonast! Your kind words are appreciated.

I have a bad habit of tongue-in-cheekiness and this paragraph is a case in point.

A few decades ago, there was a craze to patent software algorithms. Were they even patentable? Some patent attorneys insisted that the improper patent claim “I claim the algorithm which blah blah blah” could be rendered proper by simply adding four words — “I claim an apparatus which implements the algorithm which blah blah blah.” (Even though the apparatus wasn’t the inventor’s at all — just an off-the-shelf Pentium or whatever.) The U.S. PTO would eventually get tired of arguing and issue the patent, but would it stand up in court if challenged?

This software-patenting craze did generate income for patent attorneys, so I guess it was at least as good as breaking windows. :smack:

Chronos · June 12, 2020, 4:01pm

Patents for methods or techniques, as opposed to physical objects, were well-established long before the age of computers. I don’t see why a method patent couldn’t apply to an algorithm.

DPRK · June 12, 2020, 4:19pm

Many patent jurisdictions specifically exclude computer programs, among other things. So one could argue that an algorithm is a description of a computer program and therefore not patentable, if it is not already excluded as an “abstract idea”. On the other hand, certain “practical applications” or “implementations” of a mathematical formula or algorithm are theoretically patentable…

septimus · June 12, 2020, 4:46pm

There are dozens of court cases on the subject of patenting mathematical algorithms, several of which went to the Supreme Court. Consider Parker v. Flook (1978):

Justice John Paul Stevens joined by 5 other Justices:

" ‘A principle, in the abstract, is a fundamental truth; an original cause; a motive; these cannot be patented, as no one can claim in either of them an exclusive right.’ Le Roy v. Tatham, 14 How. 156, 175, 14 L.Ed. 367. Phenomena of nature, though just discovered, mental processes, and abstract intellectual concepts are not patentable, as they are the basic tools of scientific and technological work." 409 U.S., at 67, 93 S.Ct., at 255.
…
Respondent correctly points out that this language does not apply to his claims. He does not seek to “wholly preempt the mathematical formula,” since there are uses of his formula outside the petrochemical and oil-refining industries that remain in the public domain. And he argues that the presence of specific “post-solution” activity—the adjustment of the alarm limit to the figure computed according to the formula—distinguishes this case from Benson and makes his process patentable. We cannot agree.

The notion that post-solution activity, no matter how conventional or obvious in itself, can transform an unpatentable principle into a patentable process exalts form over substance. A competent draftsman could attach some form of post-solution activity to almost any mathematical formula; the Pythagorean theorem would not have been patentable, or partially patentable, because a patent application contained a final step indicating that the formula, when solved, could be usefully applied to existing surveying techniques. The concept of patentable subject matter under § 101 is not “like a nose of wax which may be turned and twisted in any direction . . . .” White v. Dunbar, 119 U.S. 47, 51, 7 S.Ct. 72, 74, 30 L.Ed. 303.

One test, I think, is to eliminate any details of a mathematical formula and see if the result is vacuous. In our example, “I claim a process wherein said intermediate arithmetic value is altered in some way …” Sounds almost vacuous to me.

But the Supreme Court has reversed itself and then reversed again since Parker v. Flook. Contact an attorney!

DPRK · June 13, 2020, 8:42am

Leaving aside JPEG and JPEG2k, there are many newer standards and proposed standards which include steps along the lines of computing the discrete cosine transform or wavelet transform of a block, followed by quantization and entropy coding. How do they handle the inevitable rounding?

septimus · June 15, 2020, 4:06pm

JPEG2k was after my time, let alone anything newer, but since no one else answered I’ll summarize the basic difference in quantization rounding between Jpeg and Jpeg2k. Recall that ‘x’ in the following has already been multiplied by a scaling constant which depends on psychometrics and user-desired compression ratio.

Jpeg:
…
-1.5 < x < -0.5 --> x = -1
-0.5 < x < +0.5 --> x = 0
+0.5 < x < +1.5 --> x = +1
+1.5 < x < +2.5 --> x = +2
+2.5 < x < +3.5 --> x = +3
…
The spec offers no flexibility beyond this. (One implementation increases the x=0 domain to -1.5 < x < +1.5 — i.e. eliminates all ±1’s which would be encoded — in image regions the user specifies as being of low interest.)

Jpeg2k, default:
…
-2.0 < x < -1.0 --> x = -1.5
-1.0 < x < +1.0 --> x = 0
+1.0 < x < +2.0 --> x = +1.5
+2.0 < x < +3.0 --> x = +2.5
+3.0 < x < +4.0 --> x = +3.5
…
This policy recovers most of the inefficiency that applied to Jpeg (cf. my ‘tricks’ above); further meddling would yield only diminishing returns. However, IIUC, two optional parameters are provided to further tune this quantization: the width of the zero region can be adjusted, and the reconstruction values can be moved slightly toward zero to reduce average squared error. (Never mind more complicated modes of Jpeg2k.)
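
For anyone who wants to play with the two tables above, here is a rough Python sketch of both schemes. The function names and the `r` parameter are my own illustrative choices, not anything from either spec, and the behaviour exactly on a bin boundary is an assumption (the tables only give strict inequalities). It covers only the default Jpeg2k table, not the adjustable zero-region width.

```python
import math

def jpeg_quantize(x: float) -> int:
    """Round-to-nearest: bins of width 1 centred on each integer.
    Assumption: exact halves round away from zero."""
    q = math.floor(abs(x) + 0.5)
    return q if x >= 0 else -q

def jpeg_reconstruct(q: int) -> float:
    """Jpeg reconstructs at the centre of the bin, i.e. the index itself."""
    return float(q)

def jpeg2k_quantize(x: float) -> int:
    """Dead-zone quantizer: index is sign(x) * floor(|x|),
    so everything in (-1, +1) maps to index 0."""
    q = math.floor(abs(x))
    return q if x >= 0 else -q

def jpeg2k_reconstruct(q: int, r: float = 0.5) -> float:
    """Reconstruct at |q| + r inside the bin. r = 0.5 reproduces the
    table above; a hypothetical r < 0.5 moves values toward zero."""
    if q == 0:
        return 0.0
    return math.copysign(abs(q) + r, q)

for x in (0.9, 1.3, -1.7, 3.2):
    print(x, jpeg_reconstruct(jpeg_quantize(x)),
          jpeg2k_reconstruct(jpeg2k_quantize(x)))
```

Note how 0.9 survives as 1 under the Jpeg rule but falls into the wide zero bin under the Jpeg2k rule.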

RaftPeople · June 16, 2020, 4:49am

You’re right that float isn’t appropriate for almost any business/accounting software, but neither is integer for currency and catch weight values.

Fixed precision decimal/numeric data types are what is used.

Chronos · June 16, 2020, 12:46pm

Fixed precision is the same thing as integer, just with different units.

DPRK · June 16, 2020, 12:55pm

It’s not like the financial programmer can just ignore rounding issues just because she uses decimal and/or fixed-point numbers. All the issues (possibility of incorrect rounding, etc) discussed in this thread still apply.

RaftPeople · June 16, 2020, 7:52pm

I was assuming you were using the term “integer” to mean the software data type of integer because the context was software running on the cash register. Maybe you were using the term more generically.

My point was that there are data types that are designed for this type of stuff as opposed to the integer data type which just requires a lot more work to achieve the same result as the decimal/numeric types.

Chronos · June 16, 2020, 9:58pm

No, I mean that a decimal type is the same thing as an integer type. Like, measuring in dollars to two places after the decimal point is equivalent to measuring in integer cents.

DPRK · June 17, 2020, 2:37am

Not exactly the same; you need to store the number of digits after the decimal point also, 2 in your example, and keep track of it when multiplying, etc.

Pleonast · June 17, 2020, 4:00am

If you’re storing the place of the decimal point, it’s floating point, not fixed point.

DPRK · June 17, 2020, 4:10am

I meant fixed point.

Say you have one decimal digit, and the operation is 1.2 x 3.4. The intermediate value is 4.08, and it must be correctly rounded to 4.1. On the other hand, 12 x 34 = 408. So, arithmetic is not quite the same.
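
A minimal sketch of that example with plain integers, assuming values are stored as tenths and are non-negative; `fixed_mul` and `scale` are hypothetical names for illustration. The raw integer product carries the scale factor twice, so it has to be divided by the scale once, and that division is where the rounding decision comes in.

```python
def fixed_mul(a: int, b: int, scale: int = 10) -> int:
    """Multiply two fixed-point values stored as integers (value * scale).

    The raw product is scaled by scale**2, so divide by scale once,
    rounding half up. Assumption: both operands are non-negative.
    """
    raw = a * b                          # 12 * 34 = 408, i.e. 4.08 in hundredths
    return (raw + scale // 2) // scale   # (408 + 5) // 10 = 41, i.e. 4.1

print(12 * 34)            # raw integer product, in hundredths
print(fixed_mul(12, 34))  # rescaled back to tenths
```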

Moris · June 17, 2020, 1:07pm

There’s something else in rounding that bugs me.

2.3217 could be rounded up to 2.322. And ultimately to 2, if we need an integer.

But 4.4449 could be rounded up to 4.445, and then 4.445 to 4.45, then to 4.5, then to 5.

I’m not saying that’s happening very often, but if multiple people are working on the same spreadsheet, it’s possible. I did it myself a few times, absentmindedly.

I’m just saying - rounding is a convention, and we should be careful following the rules of that convention.
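
The 4.4449 case can be checked with Python's `decimal` module. This is only a sketch, assuming round-half-up at every step; `round_to` and `staged_round` are made-up helper names.

```python
from decimal import Decimal, ROUND_HALF_UP

def round_to(value: Decimal, places: int) -> Decimal:
    """Round half up to the given number of decimal places."""
    exponent = Decimal(1).scaleb(-places)   # places=2 gives Decimal('0.01')
    return value.quantize(exponent, rounding=ROUND_HALF_UP)

def staged_round(value: Decimal, steps) -> Decimal:
    """Round repeatedly, once for each entry in steps (the mistake described above)."""
    for places in steps:
        value = round_to(value, places)
    return value

x = Decimal("4.4449")
print(round_to(x, 0))                    # one rounding step from the original
print(staged_round(x, (3, 2, 1, 0)))     # 4.445, then 4.45, then 4.5, then the last step
```

Rounding once from the original value and rounding in stages give different integers, which is why the rounding should be done in a single step from the unrounded number.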

Chronos · June 17, 2020, 3:49pm

Under what circumstances would you ever be multiplying two amounts of money?

And even if you do, if you have two amounts of money expressed in cents, and you multiply them, you’ll end up with a perfectly valid answer in square cents. Why are square cents any less valid of a unit than square dollars?

RaftPeople · June 24, 2020, 12:40am

It’s not typically two amounts of money, it’s typically money (e.g. unit price, invoice subtotal) multiplied by things like shipped weight (typically food uses catch weight systems), or yards (e.g. 3.07 yards consumed) or currency conversion (typically 7 decimal positions), or tax rates, or discounts, or prorating costs across multiple invoices or across lines, etc.

Manually tracking the decimal position during all of these calculations requires extra work and is more error prone than just using the software (and sometimes hardware) that already handles it.
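
As a rough illustration of letting the decimal type track the position, here is a Python sketch using the standard `decimal` module. The price, the `line_total` name, and the choice of round-half-up to the cent are assumptions for the example; the real rounding rule is whatever the business or tax authority requires.

```python
from decimal import Decimal, ROUND_HALF_UP

CENT = Decimal("0.01")

def line_total(unit_price: Decimal, quantity: Decimal) -> Decimal:
    """Multiply price by quantity exactly, then round once to cents.

    Assumption: round-half-up to the cent is the required rule.
    """
    exact = unit_price * quantity        # Decimal keeps all four decimal places here
    return exact.quantize(CENT, rounding=ROUND_HALF_UP)

price = Decimal("4.99")    # hypothetical price per yard
yards = Decimal("3.07")    # quantity with its own decimal places
print(price * yards)       # exact product, 4 decimal places
print(line_total(price, yards))
```

The intermediate product has four decimal places without anyone tracking that by hand, and the rounding to cents happens in exactly one explicit place.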
