[MLton] WideChar
Stephen Weeks
MLton@mlton.org
Sat, 11 Dec 2004 16:57:50 -0800
> By convention, a Char.char is an "interpreted" 8bit value. I would
> argue that what one really wants is a Word8.word reader to a
> WideChar.char reader. A Word8.word is an "uniterpreted" 8bit value.
> When one wants to recover a 1.2 style decoding converter from a
> Char.char reader, it should first be sent through the Byte
> structure, which explicitly relinquishes the bit interpretation.
I like this way of thinking about it.
> The problem with both NONE and exceptions for 1.2 style converters is that
> the invalidity of the input stream is not discovered until sufficient
> input is read; i.e., not at the point where the conversion is applied.
Why does this matter, since these are streams? I.E. who can tell that
we've looked a few characters ahead?
Can someone explain the difference between LargeChar and WideChar?