Fast Integer Log10

Mathematical function log10can be quite time consuming. If it is not available, for some languages such as VBScript, you have to compute it base on the fact.

$tex_fdf2555e90f933115922e1f70e0dd5ed Fast Integer Log10 algorithms beginner c # code code library floating point implementation math optimization programming languages tricks$

and therefore, we can compute log10 base on natural log.

$tex_3a9767cbcef17076e1b5bfdf1fc3c1f4 Fast Integer Log10 algorithms beginner c # code code library floating point implementation math optimization programming languages tricks$

and since $tex_01e1189a17955520d46d58c717454477 Fast Integer Log10 algorithms beginner c # code code library floating point implementation math optimization programming languages tricks$ can only be computed once and stored for next computation, we can replace it with the constant. Since multiplication is faster than division in most cases, we can replace the whole division thing with the constant 0.4342944819.

Sometimes, we just want the integer part of the log10, so we can use a faster function base on the fact that it is only 10 possible integer values for unsigned 32-bit integers. The maximum value that can be represented by unsigned 32-bit integers (e.g. uint in C#, LongWord in Delphi) is 4294967295, the log10 value of which is 9. i.e. 4294967295 is larger than $tex_bd476d396103d03da143d369ef63bc3e Fast Integer Log10 algorithms beginner c # code code library floating point implementation math optimization programming languages tricks$ but less than $tex_b886ce78b0c6f3e9a0d7f030d3613573 Fast Integer Log10 algorithms beginner c # code code library floating point implementation math optimization programming languages tricks$ .

So instead of computing directly the log10 value, we can easily check which range the given number falls into.

static uint log10(uint v)
{
    return (v >= 1000000000u) ? 9 : (v >= 100000000u) ? 8 : 
        (v >= 10000000u) ? 7 : (v >= 1000000u) ? 6 : 
        (v >= 100000u) ? 5 : (v >= 10000u) ? 4 :
        (v >= 1000u) ? 3 : (v >= 100u) ? 2 : (v >= 10u) ? 1u : 0u; 
}

static uint log10(uint v)
{
    return (v >= 1000000000u) ? 9 : (v >= 100000000u) ? 8 : 
        (v >= 10000000u) ? 7 : (v >= 1000000u) ? 6 : 
        (v >= 100000u) ? 5 : (v >= 10000u) ? 4 :
        (v >= 1000u) ? 3 : (v >= 100u) ? 2 : (v >= 10u) ? 1u : 0u; 
}

As shown in above C# code, the function log10 takes the unsigned 32-bit integer v and quickly checks which range it falls into, from a larger set of numbers to a smaller one. So if the input is uniform equally distributed, it will return values for the first few (e.g. 3) checks. i.e. The return value is zero when the input is less than 10 (10 possible inputs); the return value is one when the input is less than 100 (90 possible inputs) and so on. So we should check the largest group set first.

How does this perform? We time the performance in the C# .NET environment.

using System;
using System.Diagnostics;
 
namespace ConsoleApplication2
{
    class Program
    {
        static uint log10(uint v)
        {
            return (v >= 1000000000u) ? 9 : (v >= 100000000u) ? 8 : 
                (v >= 10000000u) ? 7 : (v >= 1000000u) ? 6 : 
                (v >= 100000u) ? 5 : (v >= 10000u) ? 4 :
                (v >= 1000u) ? 3 : (v >= 100u) ? 2 : (v >= 10u) ? 1u : 0u; 
        }
 
        static void Main(string[] args)
        {
            Stopwatch w = new Stopwatch();
            w.Start();
            for (uint i = 0; i <= 200000000; i++)
            {
                uint x = log10(i);
            }
            w.Stop();
            Console.WriteLine(w.ElapsedMilliseconds);
 
            w.Restart();
            for (uint i = 0; i <= 200000000; i++)
            {
                uint y = (uint)Math.Log10(i);
            }
            w.Stop();
            Console.WriteLine(w.ElapsedMilliseconds);
            Console.ReadLine();
        }
    }
}

using System;
using System.Diagnostics;

namespace ConsoleApplication2
{
    class Program
    {
        static uint log10(uint v)
        {
            return (v >= 1000000000u) ? 9 : (v >= 100000000u) ? 8 : 
                (v >= 10000000u) ? 7 : (v >= 1000000u) ? 6 : 
                (v >= 100000u) ? 5 : (v >= 10000u) ? 4 :
                (v >= 1000u) ? 3 : (v >= 100u) ? 2 : (v >= 10u) ? 1u : 0u; 
        }

        static void Main(string[] args)
        {
            Stopwatch w = new Stopwatch();
            w.Start();
            for (uint i = 0; i <= 200000000; i++)
            {
                uint x = log10(i);
            }
            w.Stop();
            Console.WriteLine(w.ElapsedMilliseconds);

            w.Restart();
            for (uint i = 0; i <= 200000000; i++)
            {
                uint y = (uint)Math.Log10(i);
            }
            w.Stop();
            Console.WriteLine(w.ElapsedMilliseconds);
            Console.ReadLine();
        }
    }
}

It is no surprise that the log10 is roughly three times faster. 3507 milliseconds versus 10327 milliseconds (the traditional System.Math.Log10).

References:

1. http://graphics.stanford.edu/~seander/bithacks.html#IntegerLog10Obvious

–EOF (The Ultimate Computing & Technology Blog) —

GD Star Rating
loading...

709 words
Last Post: Access Violation Bug Migrating to 64-bit
Next Post: Add Formatted Text to Word from VBScript

The Permanent URL is: Fast Integer Log10

9 Comments

asimonassi

my bench, compiled in release mode (thus probably the compiler is inlining the function) are 638 millisec for your version versus 6917 for the Math.Log version, that’s over 10x faster.

- ACMer
  
  Which compiler are you using?
  
  - asimonassi
    
    The default compiler that ships with Visual studio 2015 professional.
    
Luca Bruno

I’d rather use binary search at least 🙂

- asimonassi
  
  I read your comment and i too thought it was correct, but actually a binary search would be slower: integer having log10(x)>=9 are (4,294,967,296 – 999,999,999) = 3,294,967,297 that’s about 76% of the total, having log10(x)>=8 are 20%, having log10(x)>=7 are 2%, having 6 are 0,2%… so it make sense to do the test in that exact order as the autor.
  
  - ACMer
    
    Binary Search is only good for large N, for small N, it is faster without if-checks.
    
    - John C.
      
      > faster
      
      You can’t say that for sure without benchmarking. Both the binary search and chained-if methods suffer from branch misprediction, which is very far from ideal.
      
      - Ryan C.
        
        It’s exactly 1 branch misprediction, which is circa 16 cycles on x86_64. Hardly a big deal.
        
  - Javoid1
    
    Untrue! Integers in a real program do not occur with a uniform distribution, and smaller numbers are more common.

Algorithms, Blockchain and Cloud

Fast Integer Log10

References:

9 Comments

Leave a Reply

References:

Related posts:

9 Comments

Leave a Reply