Matthew Fluet
Sat, 17 May 2003 15:40:33 -0400 (EDT)
Here is a proposal for a MLton.Callback structure to replace MLton.FFI.
On the ML side we have:

signature MLTON_CALLBACK =
   sig
      structure Type :
         sig
            type ('a, 'b) arg
            type 'a res

            val C: (char, 'b) arg
            val B: (bool, 'b) arg
            val I: (int, 'b) arg
            val U: (unit, 'b) arg
            val W: (word, 'b) arg
            val --> : ('a, 'b) arg * 'b res -> ('a -> 'b) res
            val C' : char res
            val B' : bool res
            val I' : int res
            val U' : unit res
            val W' : word res
         end

      val register: string * ('a -> 'b) Type.res -> ('a -> 'b) -> unit
      val unregister: string -> unit
      val isRegistered: string -> bool
   end

The MLton.Callback.Type structure takes as inspiration the Printf example.
It serves two purposes. First, it provides a type-safe interface to
Callbacks, in the sense that a user will only be able to register a
function whose type is compatible with the callback mechanism. Second,
the value of type ('a -> 'b) Type.res is a _real_ value that builds up
auxiliary functions for working with the callback mechanism behind the
scenes; i.e., ('a, 'b) Type.arg and 'b Type.res are _not_ phantom types
whose only purpose is type checking, although this is a case where phantom
types of that form would be applicable.
Example client code is as follows:

(* z.sml *)
val f = _ffi "f": unit -> unit;
val f = fn () =>
   (print "calling f\n"
    ; f ()
    ; print "done calling f\n")

open MLton.Callback
open Type
infixr -->

val _ =
   register ("A", I --> I --> I')
   (fn i => fn j =>
    let val r = i + j in
       print (concat ["A(", Int.toString i,
                      ",", Int.toString j,
                      ") = ", Int.toString r, "\n"])
       ; r
    end)

val _ =
   register ("B", I --> I --> I')
   (fn i => fn j =>
    let val r = i * j in
       print (concat ["B(", Int.toString i,
                      ",", Int.toString j,
                      ") = ", Int.toString r, "\n"])
       ; r
    end)

val _ = f ()

/* f.c */
#include <stdio.h>
#include <mlton-basis.h>

void f () {
  int x;

  fprintf (stderr, "f calling SML: A(1,2)\n");
  MLton_Callback_setI(0, 1);
  MLton_Callback_setI(1, 2);
  x = MLton_Callback_getI();
  fprintf (stderr, "f done calling SML: A(1,2) = %i\n", x);

  fprintf (stderr, "f calling SML: B(1,2)\n");
  MLton_Callback_setI(0, 1);
  MLton_Callback_setI(1, 2);
  x = MLton_Callback_getI();
  fprintf (stderr, "f done calling SML: B(1,2) = %i\n", x);
}

Compiling and running yields:
[fluet@localhost test]$ mlton.cvs.HEAD z.sml f.c
[fluet@localhost test]$ ./z
calling f
f calling SML: A(1,2)
A(1,2) = 3
f done calling SML: A(1,2) = 3
f calling SML: B(1,2)
B(1,2) = 2
f done calling SML: B(1,2) = 2
done calling f
Some work could be done on the C-side, as the set-up and execution of an
ML call is a little burdensome. Furthermore, implementation details leak
through. As can probably be ascertained from the C-code above, the
implementation is essentially a collection of global locations used to
pass parameters back and forth between C and ML. On the C-side, we set up
the arguments with
   MLton_Callback_setI(index, arg);
make a call with
and fetch return results with
   x = MLton_Callback_getI();
We could either provide a collection of common calls or maybe it's
possible to use varargs to get a general solution.
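To make the varargs idea concrete, here is a rough, self-contained C sketch. It is not the MLton runtime: slot_setI, slot_getI, stub_A, stub_B and call_ints are all hypothetical names, the global arrays stand in for the real argument locations, and the "call into ML" is replaced by a plain C function pointer. It only covers int arguments and an int result.

```c
#include <stdarg.h>

#define MAX_INT_ARGS 8

/* Hypothetical stand-ins for the global int slots; not MLton's runtime. */
static int int_args[MAX_INT_ARGS];   /* written by C, read by ML */
static int int_result;               /* written by ML, read by C */

static void slot_setI(int index, int arg) { int_args[index] = arg; }
static int slot_getI(void) { return int_result; }

/* Stubs playing the role of the registered ML functions "A" and "B". */
static void stub_A(void) { int_result = int_args[0] + int_args[1]; }
static void stub_B(void) { int_result = int_args[0] * int_args[1]; }

/* Set nargs int arguments in order, run the callback, fetch the result.
   Returns 0 on success, -1 if nargs does not fit the slots. */
static int call_ints(void (*callback)(void), int *result, int nargs, ...)
{
    va_list ap;
    int i;

    if (nargs < 0 || nargs > MAX_INT_ARGS)
        return -1;

    va_start(ap, nargs);
    for (i = 0; i < nargs; i++)
        slot_setI(i, va_arg(ap, int));
    va_end(ap);

    callback();               /* the real wrapper would call into ML here */
    *result = slot_getI();
    return 0;
}
```

With something like this, the body of f above would shrink to one call_ints line per callback. A mixed-type version would need a format string or similar to say which slot family each argument belongs to.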
On the ML-side, we have a collection of _ffi primitives:

(* basis-library/misc/primitive.sml *)
structure Callback =
   struct
      val fetchI = _ffi "MLton_Callback_fetchI": int -> int;
      val retI = _ffi "MLton_Callback_retI": int -> unit;
      val callbackName = _ffi "MLton_Callback_callbackName": unit -> cstring;
   end

For the time being, I've elected to go with really trivial registration;
obviously, we could use fancier data structures for faster lookups.

(* basis-library/mlton/callback.sml *)
structure MLtonCallback :> MLTON_CALLBACK_EXTRA =
   struct
      structure Prim = Primitive.MLton.Callback

      structure Type =
         struct ... end

      val registered : (string * (unit -> unit)) list ref = ref []
      fun pred n = (fn (n',_) => n = n')
      fun isRegistered n =
         List.exists (pred n) (!registered)
      fun unregister n =
         registered := List.filter (not o (pred n)) (!registered)
      val register' =
         let
            (* body of the CallFromCHandler: fetch the requested name,
               look it up, and run the registered function *)
            val _ =
               fn () =>
               let
                  val cs = Prim.callbackName ()
                  val n = if Primitive.Cpointer.isNull cs
                             then raise Fail ("null callback function")
                          else C.CS.toString cs
               in
                  case List.find (pred n) (!registered) of
                     SOME (_,f) => f ()
                   | NONE => raise Fail ("unregistered callback function: " ^ n)
               end
         in
            fn (n,f) =>
            (unregister n
             ; registered := (n,f)::(!registered))
         end
      fun register (n,ty) f =
         register' (n, Type.make ty f)
   end

This portion of the implementation is straightforward; the
CallFromCHandler is set to a function that fetches the requested function
name, looks it up in the list of registered functions, and executes it.
I've added a boolean argument to MLtonThread.setCallFromCHandler; when
false, setCallFromCHandler won't leave the critical section entered by the
MLton_callFromC. This is important here, because in order to make
callbacks thread-safe, we must fetch all the C-side arguments _before_
leaving the critical section (else, some other thread might start running,
enter C, and callback, clobbering our arguments). Likewise, we need to
enter a critical section when returning a result to C. Therefore, I've
modified the MLton_callFromC functions to leave a critical section after
the call to SML returns. MLtonThread.setCallFromCHandler inserts a call
to atomicBegin before Thread_returnToC, unless the installed handler
requests to handle the critical sections itself.
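For readers more at home on the C side, the lookup half of the handler (fetch the name, find it in the list, run it) can be pictured roughly as follows. This is only a C analogue of the ML code above, with hypothetical names throughout (register_cb, unregister_cb, is_registered, dispatch); it uses a fixed-size array and status codes where the ML code uses a list and raises Fail.

```c
#include <stddef.h>
#include <string.h>

#define MAX_CALLBACKS 16

struct entry {
    const char *name;      /* caller keeps this string alive */
    void (*fn)(void);
};

static struct entry registered[MAX_CALLBACKS];
static int num_registered = 0;

/* Linear search, like List.find over the registered list. */
static int find(const char *name)
{
    int i;
    for (i = 0; i < num_registered; i++)
        if (strcmp(registered[i].name, name) == 0)
            return i;
    return -1;
}

static int is_registered(const char *name)
{
    return name != NULL && find(name) >= 0;
}

/* Remove the entry if present; the last entry fills the hole. */
static void unregister_cb(const char *name)
{
    int i = (name != NULL) ? find(name) : -1;
    if (i >= 0)
        registered[i] = registered[--num_registered];
}

/* Re-registering a name replaces the old entry. Returns -1 when full. */
static int register_cb(const char *name, void (*fn)(void))
{
    if (name == NULL || fn == NULL)
        return -1;
    unregister_cb(name);
    if (num_registered == MAX_CALLBACKS)
        return -1;
    registered[num_registered].name = name;
    registered[num_registered].fn = fn;
    num_registered++;
    return 0;
}

enum dispatch_status { DISPATCH_OK, DISPATCH_NULL_NAME, DISPATCH_UNREGISTERED };

/* Mirror of the handler: null name and unknown name are both errors. */
static enum dispatch_status dispatch(const char *name)
{
    int i;
    if (name == NULL)
        return DISPATCH_NULL_NAME;
    i = find(name);
    if (i < 0)
        return DISPATCH_UNREGISTERED;
    registered[i].fn();
    return DISPATCH_OK;
}

/* Example callback used to observe that dispatch ran something. */
static int hits = 0;
static void bump(void) { hits++; }
```

Swapping the linear find for a hash table would be the "fancier" variant; the register/unregister/dispatch interface would not need to change.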
You can see that a lot of burden falls on Callback.Type. The function
   val Type.make : ('a -> 'b) Type.res -> ('a -> 'b) -> (unit -> unit)
takes a type description and a function, and returns a unit -> unit function
that fetches all the C-side arguments, leaves the critical section,
executes the function on the fetched arguments, enters a critical section,
and sets the C-side return value.
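The ordering matters enough that a small simulation may help. The C sketch below is a single-threaded stand-in with hypothetical names (atomic_begin, atomic_end, clobber_slots, handler, call_from_c): the critical section is just a depth counter, and "another thread calling back" is simulated by overwriting the shared slots while the user function runs. It is meant to show the five steps in order, not how the runtime implements them.

```c
/* Hypothetical shared state: two int argument slots and one result slot. */
static int int_args[2];
static int int_result;
static int atomic_depth = 0;   /* > 0 means "inside a critical section" */

static void atomic_begin(void) { atomic_depth++; }
static void atomic_end(void)   { atomic_depth--; }

/* Simulates some other thread entering C and calling back while we are
   outside the critical section: it overwrites the shared slots. */
static void clobber_slots(void) { int_args[0] = 100; int_args[1] = 200; }

/* The user's function; the clobbering happens while it runs. */
static int add(int i, int j)
{
    clobber_slots();
    return i + j;
}

/* The unit -> unit function: fetch, leave, run, enter, return. */
static void handler(void)
{
    int i, j, r;
    i = int_args[0];        /* 1. fetch all arguments while protected */
    j = int_args[1];
    atomic_end();           /* 2. leave the critical section */
    r = add(i, j);          /* 3. run on the local copies */
    atomic_begin();         /* 4. enter a critical section again */
    int_result = r;         /* 5. set the C-side return value */
}

/* Stand-in for the C entry point: enters before the handler and leaves
   after it returns. */
static int call_from_c(int a, int b)
{
    int r;
    int_args[0] = a;
    int_args[1] = b;
    atomic_begin();
    handler();
    r = int_result;
    atomic_end();
    return r;
}
```

Here call_from_c(1, 2) still yields 3 even though the slots hold 100 and 200 by the time add returns, because the arguments were copied out in step 1.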
Here is the solution I came up with:

structure Type =
   struct
      type indices = int vector
      val numIndices : int = 4
      val indexB : int = 0
      val indexC : int = 1
      val indexI : int = 2
      val indexW : int = 3

      (* Build a fetch function (read the slot at the current index for
         this type) and an inc function (bump that type's index). *)
      fun mkFetchInc (fetch, i) =
         (fn (z:indices) =>
          fetch (Vector.sub (z,i)),
          fn (z:indices) =>
          Vector.tabulate
          (numIndices,
           fn j => let val v = Vector.sub (z,j)
                   in if i = j then v + 1 else v
                   end))
      val (fetchI,incI) = mkFetchInc (Prim.fetchI, indexI)
      val retI = Prim.retI

      val zeroIndices : indices =
         Vector.tabulate(numIndices, fn _ => 0)

      type ('a, 'b) arg =
         indices -> (('a -> 'b) -> (unit -> (unit -> 'b))) * indices
      type 'b res =
         indices -> ((unit -> 'b) -> (unit -> (unit -> unit)))

      val I : (int, 'b) arg =
         fn (z:indices) =>
         (fn f => fn () =>
          let val i = fetchI z
          in fn () => f i end,
          incI z)

      infixr -->
      fun (X: ('a, 'b) arg) --> (Y: 'b res) : ('a -> 'b) res =
         fn (z:indices) =>
         let
            val (X: ('a -> 'b) -> (unit -> (unit -> 'b)), z:indices) = X z
         in
            fn (F : unit -> ('a -> 'b)) =>
            let
               val f: unit -> (unit -> 'b) = X (F ())
            in
               fn () =>
               Y z (f ()) ()
            end
         end

      val I' : int res =
         fn (z:indices) =>
         fn f => fn () => fn () =>
         let val v = f () in
            MLtonThread.atomicBegin ()
            ; retI v
         end

      fun make (ty: ('a -> 'b) res) : ('a -> 'b) -> (unit -> unit) =
         fn (f: 'a -> 'b) =>
         let val f = ty zeroIndices (fn () => f) in
            fn () =>
            let val f = f () in
               MLtonThread.atomicEnd ()
               ; f ()
            end
         end
   end

The missing Type.arg and Type.res values are all similar to I and I'; just
change the fetch?, inc?, and ret? functions. Thunks are used like mad to
keep the real function suspended while we fetch all the C-side arguments,
pause for an atomicEnd (), and then run the function composed with an
atomicBegin () and setting the C-side return. We track the C-side
arguments with an int vector, where the appropriate slot is incremented
each time we encounter an argument. A tuple would work equally well,
which is why I've purposely left the indices type fairly abstract after
defining all the fetch? and inc? functions. Adding a new type and/or
changing the indices representation should be very straightforward.
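As a small illustration of the index bookkeeping only, here is a C sketch that walks a string of one-letter argument codes and works out which slot each argument would be fetched from, keeping one counter per type. The codes mirror the B, C, I, W names above; kind_of and assign_slots are hypothetical helpers, and unit arguments (which have no counter in the code above) are left out.

```c
#include <stddef.h>

/* One counter per argument type, in the same order as indexB..indexW. */
enum { INDEX_B, INDEX_C, INDEX_I, INDEX_W, NUM_INDICES };

/* Map a one-letter type code to its counter, or -1 if unknown. */
static int kind_of(char code)
{
    switch (code) {
    case 'B': return INDEX_B;
    case 'C': return INDEX_C;
    case 'I': return INDEX_I;
    case 'W': return INDEX_W;
    default:  return -1;
    }
}

/* For each code in sig (e.g. "IIW"), store in slots[] the per-type slot
   that argument uses, then bump that type's counter. Returns the number
   of arguments, or -1 on an unknown code or if slots[] is too small. */
static int assign_slots(const char *sig, int *slots, size_t cap)
{
    int next[NUM_INDICES] = {0};   /* plays the role of zeroIndices */
    size_t n;

    for (n = 0; sig[n] != '\0'; n++) {
        int k = kind_of(sig[n]);
        if (k < 0 || n >= cap)
            return -1;
        slots[n] = next[k]++;
    }
    return (int)n;
}
```

For "II" this gives slots 0 and 1, matching the setI(0, ...) and setI(1, ...) calls in f.c; for "IIW" the word argument gets slot 0 of its own family.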
The make function seeds the type with the all-zero indices and a
suspended function. Then the Type.arg and Type.res values and --> take
over to build the right function.
Thoughts? Comments?
