I have a program and I want to measure its execution (wallclock) time for different input sizes.
In some other, similar questions I read that using `clock_gettime` in the source code wouldn't be reliable because of the CPU's branch predictor, register renaming, speculative execution, out-of-order execution, etc., and that sometimes even the optimizer can move the `clock_gettime` call somewhere other than where I put it.
But those questions were about measuring the time of a specific function. Would these problems still exist if I'm measuring the whole program (i.e. the `main` function)? I'm looking for relative measurements (how the execution time changes for different input sizes), not absolute values.
How would I get better results? Using timing functions in the code:

```c
start = clock_gettime(); do_stuff(); end = clock_gettime(); execution_time = end - start;
```

or with the `time` command in bash:

```sh
time ./program
```
Answer
Measuring in the program will give you a more accurate answer. Sure, in theory, in some cases the compiler can move the `clock_gettime` calls somewhere you don't expect. In practice, it will not happen if you have only a function call in between. (If in doubt, check the resulting assembly code.)
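As a minimal sketch of the in-program approach (the `do_stuff()` body here is a hypothetical stand-in for your workload), using `CLOCK_MONOTONIC`, which is not affected by system clock adjustments:

```c
/* expose clock_gettime when compiling with -std=c99 or similar */
#define _POSIX_C_SOURCE 199309L
#include <stdio.h>
#include <time.h>

/* hypothetical stand-in for the code you want to measure */
static void do_stuff(void)
{
    volatile unsigned long sink = 0;
    for (unsigned long i = 0; i < 100000000UL; i++)
        sink += i;
}

int main(void)
{
    struct timespec start, end;

    clock_gettime(CLOCK_MONOTONIC, &start);
    do_stuff();
    clock_gettime(CLOCK_MONOTONIC, &end);

    /* convert the two timespecs into elapsed seconds */
    double elapsed = (double)(end.tv_sec - start.tv_sec)
                   + (double)(end.tv_nsec - start.tv_nsec) / 1e9;
    printf("do_stuff: %.6f s\n", elapsed);
    return 0;
}
```

On older glibc you may also need to link with `-lrt` for `clock_gettime`.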
Calling `time` in the shell will include things you don't care about, like the time it takes to load your executable and get to the interesting point. On the other hand, if your `do_stuff` takes a few seconds, then it doesn't really matter.
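For reference, bash's built-in `time` reports three figures, of which `real` is the wall-clock time you are after (the numbers below are purely illustrative):

```sh
$ time ./program

real    0m2.314s
user    0m2.250s
sys     0m0.060s
```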
I’d go with the following recommendation:
- If you can isolate your function easily and make it take a few seconds (you can also loop it, but measure an empty loop for comparison as well; see the sketch after this list), then either `clock_gettime` or `time` will do just fine.
- If you cannot isolate it easily, but your function consistently takes hundreds of milliseconds, use `clock_gettime`.
- If you cannot isolate it and you're optimising something tiny, have a look at rdtsc timing for measuring a function, which talks about measuring actual executed cycles.
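If the function is too quick to time directly, the loop-and-subtract idea from the first bullet can look like the following sketch. The `do_stuff()` body and the iteration count `N` are hypothetical placeholders; tune `N` so each timed loop runs for at least a second or so:

```c
#define _POSIX_C_SOURCE 199309L
#include <stdio.h>
#include <time.h>

#define N 100000000UL  /* hypothetical iteration count */

/* hypothetical tiny workload */
static void do_stuff(void)
{
    volatile int x = 0;
    x++;
}

static double elapsed_s(struct timespec a, struct timespec b)
{
    return (double)(b.tv_sec - a.tv_sec)
         + (double)(b.tv_nsec - a.tv_nsec) / 1e9;
}

int main(void)
{
    struct timespec t0, t1;

    /* baseline: an empty loop of the same length;
       the volatile counter keeps it from being optimised away */
    clock_gettime(CLOCK_MONOTONIC, &t0);
    for (volatile unsigned long i = 0; i < N; i++)
        ;
    clock_gettime(CLOCK_MONOTONIC, &t1);
    double baseline = elapsed_s(t0, t1);

    /* measured: the same loop, but calling do_stuff() */
    clock_gettime(CLOCK_MONOTONIC, &t0);
    for (volatile unsigned long i = 0; i < N; i++)
        do_stuff();
    clock_gettime(CLOCK_MONOTONIC, &t1);
    double measured = elapsed_s(t0, t1);

    printf("per-call estimate: %.3f ns\n",
           (measured - baseline) / (double)N * 1e9);
    return 0;
}
```

Subtracting the empty-loop baseline removes the loop overhead itself, so the difference is a rough per-call figure; it is still only an estimate, since the compiler may inline or partially optimise such a tiny function.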