[MLton] Unicode... again
Matthew Fluet
fluet at tti-c.org
Fri Feb 9 13:14:41 PST 2007
Gergely Buday wrote:
> On 08/02/07, Wesley W. Terpstra <terpstra at gkec.tu-darmstadt.de> wrote:
>> Once again I find myself needing Unicode in MLton.
> [...]
>> Agreed? Can I just whip this up and check it in? ;-)
>
> I'm getting into using ml-ulex, which is a unicode-able lexer for
> sml/nj. As far as I understood, it uses 4-byte chars:
> ml-lpt/ml-ulex/BackEnds/SML/template-ml-ulex.sml contains
>
> structure W = Word32
> type wchar = W.word
>
> It would be nice to be able to use ml-ulex with your proposed unicode
> library. For the first step I'll try to compile ml-ulex with mlton.
The lastest version of SML/NJ (ver 110.62) includes
signature UTF8
structure UTF8 : UTF8
in the SML/NJ Library (added to support ml-ulex). The signature there
does specify
type wchar = Word32.word
I got an older version of ml-ulex to work with MLton, and it wasn't too
hard.
More information about the MLton
mailing list