For all forms of Profiling, you can gather counts for all functions on the stack, not just the currently executing function. To do so, compile your program with -profile-stack true. For example, suppose that list-rev.sml contains the following.
fun append (l1, l2) =
case l1 of
[] => l2
| x :: l1 => x :: append (l1, l2)
fun rev l =
case l of
[] => []
| x :: l => append (rev l, [x])
val l = List.tabulate (1000, fn i => i)
val _ = 1 + hd (rev l)
Compile with stack profiling and then run the program.
% mlton -profile alloc -profile-stack true list-rev.sml % ./list-rev
Display the profiling data.
% mlprof -show-line true list-rev mlmon.out 6,030,136 bytes allocated (108,336 bytes by GC) function cur stack GC ----------------------- ----- ----- ---- append list-rev.sml: 1 97.6% 97.6% 1.4% <gc> 1.8% 0.0% 1.8% <main> 0.4% 98.2% 1.8% rev list-rev.sml: 6 0.2% 97.6% 1.8%
In the above table, we see that rev, defined on line 6 of list-rev.sml, is only responsible for 0.2% of the allocation, but is on the stack while 97.6% of the allocation is done by the user program and while 1.8% of the allocation is done by the garbage collector.
The run-time performance impact of -profile-stack true can be noticeable since there is some extra bookkeeping at every nontail call and return.