Ah, sorting by codepoints is just what I want: something NOT locale dependent. (Actually, there is still the problem of alternate representations of the same codepoint in UTF-8, right? I think it has to do with marks.)