utf8 vs iso8859-1 speed/responsiveness

Hideki Hiura hiura at openi18n.org
Wed Jun 30 01:32:31 UTC 2004


> From: Pedro Fernandes Macedo <webmaster at margo.bijoux.nom.br>
> Not exactly...  The issue is the input file.
> In RH 9 , the input file probably was iso8859-1 and then it was 
> processed without any conversion , as the whole system was using 
> iso8859-1. Now in fedora  , the input file has to be converted by the 
> app to UTF8  , so this extra step means that FC will be a bit slower 

In general this is true, but in this case, not exactly :-). 

Glibc internal encoding is UTF32/UCS4, and modern toolkits, thus
major desktop apps as well, OOo, all internal encoding is UTF-8, 
on RH9 as well.

Even you set the locale to *.iso8859-1, and keep file contents in
iso8859-1 encoding on RH9, whenever apps open it, it is converted to
UTF-8 for processing upon reading, and convert back to iso8859-1 when
apps saves it, even though it appeared as the whole system was using 
iso8859-1.

This architecture is unchanged between RH9 to FC2, so encoding 
conversion happens everywhere on the fly.

So regardless of RH9 or FC2, and regardless with the locale
you set(*.UTF-8 or *.iso8859-1 or whatever), at least the files 
encoded in UTF-8 takes less for I/O than other encodings.

Regards,
--
hiura@{freestandards.org,OpenI18N.org,li18nux.org,unicode.org,sun.com} 
Chair, OpenI18N.org/The Free Standards Group          http://www.OpenI18N.org
Architect/Sr. Staff Engineer, Sun Microsystems, Inc, USA   eFAX: 509-693-8356





More information about the fedora-list mailing list