structure Tak =
struct
fun tak1 (x, y, z) =
let
fun tak2 (x, y, z) =
if y >= x
then z
else
tak1 (tak2 (x - 1, y, z),
tak2 (y - 1, z, x),
tak2 (z - 1, x, y))
in
if y >= x
then z
else
tak1 (tak2 (x - 1, y, z),
tak2 (y - 1, z, x),
tak2 (z - 1, x, y))
end
end
val rec f =
fn 0 => ()
| ~1 => print "this branch is not taken\n"
| n => (Tak.tak1 (18, 12, 6) ; f (n-1))
val _ = f 5000
fun uncalled () = ()
Compile with time profiling and run the program.
% mlton -profile time tak.sml % ./tak
Display the profiling data.
% mlprof tak mlmon.out 6.00 seconds of CPU time (0.00 seconds GC) function cur ------------- ----- Tak.tak1.tak2 75.8% Tak.tak1 24.2%
This example shows how mlprof indicates lexical nesting: as a sequence of period-separated names indicating the structures and functions in which a function definition is nested. The profiling data shows that roughly three-quarters of the time is spent in the Tak.tak1.tak2 function, while the rest is spent in Tak.tak1.
Display raw counts in addition to percentages with -raw true.
% mlprof -raw true tak mlmon.out 6.00 seconds of CPU time (0.00 seconds GC) function cur raw ------------- ----- ------- Tak.tak1.tak2 75.8% (4.55s) Tak.tak1 24.2% (1.45s)
Display the file name and line number for each function in addition to its name with -show-line true.
% mlprof -show-line true tak mlmon.out
6.00 seconds of CPU time (0.00 seconds GC)
function cur
------------------------- -----
Tak.tak1.tak2 tak.sml: 5 75.8%
Tak.tak1 tak.sml: 3 24.2%
Time profiling is designed to have a very small performance impact. However, in some cases there will be a run-time performance cost, which may perturb the results. There is more likely to be an impact with -codegen c than -codegen native.
You can also compile with -profile time -profile-branch true to find out how much time is spent in each branch of a function; see ProfilingCounts for more details on -profile-branch.
Caveats
With -profile time, use of the following in your program will cause a run-time error, since they would interfere with the profiler signal handler.
-
MLton.Itimer.set (MLton.Itimer.Prof, ...)
-
MLton.Signal.setHandler (MLton.Signal.prof, ...)
Also, because of the random sampling used to implement -profile time, it is best to have a long running program (at least tens of seconds) in order to get reasonable time