Nice article, but I suspect the implementation will not work. I did essentially the same thing for an AI-class exercise, and was thrilled to see that you could write a working Bayes classifier in 60 short lines of Python code. But later I landed a freelance job that required writing a classifier that could be applied to real-world data, and I soon realized that repeated multiplication of numbers between 0 and 1 sends you to zero too fast for the implementation to actually work. I might have missed it in the code, but I think he's making the same mistake: you need to normalize or move to logarithms for the probability estimates to work on medium or large datasets.
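To illustrate (toy numbers, not from the article): with a made-up per-feature likelihood of 0.01, the running product hits exactly 0.0 in double precision after a couple hundred features.

    # Toy demo of the underflow: repeatedly multiply by a small
    # (made-up) per-feature likelihood and watch the product die.
    prob = 1.0
    for i in range(200):
        prob *= 0.01
        if prob == 0.0:
            print(f"underflowed to zero after {i + 1} features")
            break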
Yes, you are right: it's better to convert the whole thing to a sum of logs, otherwise you end up with floating-point underflow.
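Roughly like this (a minimal sketch, not the article's code; log_prior and likelihoods are hypothetical names for whatever the classifier already computes):

    import math

    def log_score(log_prior, likelihoods):
        # Sum of logs replaces the product of probabilities; assumes
        # every likelihood is strictly positive (math.log(0) raises
        # a ValueError), so smooth zero counts first.
        return log_prior + sum(math.log(p) for p in likelihoods)

One wrinkle: smoothing becomes mandatory, since a zero likelihood can no longer just quietly zero out the product.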
The article was already getting too long, but I'll add a note about it, because it is an important optimization that affects both speed and correctness (given the underlying limitations of floating point).
Is the sum-of-logs method mathematically equivalent to the multiplication of probabilities (i.e., will it always produce the same ordering of class predictions)?
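In symbols, the question is whether

    \arg\max_c \, P(c) \prod_i P(x_i \mid c) \;=\; \arg\max_c \Big[ \log P(c) + \sum_i \log P(x_i \mid c) \Big]

holds for every input.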
I think this should also have the added effect of being faster overall, as doing lots of additions should be quicker than doing lots of multiplications (the cost of the log itself notwithstanding).
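Easy enough to check with a quick, unscientific micro-benchmark (toy numbers; note that the logs can be precomputed once at training time, so the cost of computing them never shows up at prediction time):

    import math
    import timeit

    probs = [0.5] * 1000
    logs = [math.log(p) for p in probs]  # precomputed once, up front

    mul = timeit.timeit(lambda: math.prod(probs), number=10_000)
    add = timeit.timeit(lambda: sum(logs), number=10_000)
    print(f"product: {mul:.3f}s  sum of logs: {add:.3f}s")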