UTF-8 and filenames

Alan Cox alan at redhat.com
Thu Mar 15 10:34:25 UTC 2007


On Wed, Mar 14, 2007 at 11:24:18PM +0100, Nicolas Mailhot wrote:
> Except userspace has no way to guess the filename encoding: the filename
> itself is too short for any sort of heuristic, and Linux filesystems
> won't provide any other hint.

Filenames should be in UTF-8. That is what we've said since
forever.
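
A minimal sketch, in Python, of what checking this from userspace might
look like: list a directory as raw bytes and test whether each name decodes
as strict UTF-8. The helper names is_utf8_name and non_utf8_names are
hypothetical. Passing the check only shows the bytes are well-formed UTF-8,
not that whoever created the file intended that encoding.

```python
import os


def is_utf8_name(name: bytes) -> bool:
    """Return True if the raw filename bytes decode as strict UTF-8."""
    try:
        name.decode("utf-8", errors="strict")
    except UnicodeDecodeError:
        return False
    return True


def non_utf8_names(directory: bytes = b".") -> list[bytes]:
    """List directory entries whose names are not well-formed UTF-8."""
    # Passing a bytes path makes os.listdir return the names as raw bytes,
    # so no decoding is applied before we inspect them.
    return [n for n in os.listdir(directory) if not is_utf8_name(n)]
```

A name stored in an 8-bit encoding such as Latin-1 will usually fail this
check as soon as it contains a non-ASCII byte, which is about as much as
userspace can tell without a hint from elsewhere.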

> 8-bit encoding in a UTF-8 system is asking for trouble (and Linus
> refused to enforce UTF-8 safety kernel-side)

And Unix behaviour pretty much says you can't and shouldn't do UTF-8 filtering
kernel side if you don't want to break stuff.




More information about the Fedora-maintainers mailing list