[Bug 455981] Missing locl romanian magic

bugzilla at redhat.com bugzilla at redhat.com
Mon Jul 21 12:05:56 UTC 2008


Please do not reply directly to this email. All additional
comments should be made in the comments box of this bug report.

Summary: Missing locl romanian magic


https://bugzilla.redhat.com/show_bug.cgi?id=455981





------- Additional Comments From bl.bugs at gmail.com  2008-07-21 08:05 EST -------
Ugh, Unicode seems to have made an even bigger mess out of this than I 
originally thought...

So, apparently both U+015E-U+015F, U+0162-U+0163, and U+0218-U+021B can still 
all be used for Romanian. With the extra string attached to U+0218-U+021B that 
they should be used when a distinct shape with comma below is needed. So 
you're still allowed the U+015E-U+015F, U+0162-U+0163 glyphs to write Romanian 
apparently.

And since Unicode only cares about code points, it didn't give any clue on how 
fonts or renderers are supposed to know when distinct glyphs are needed. Yet 
Unicode expects them to clean up the mess they've made.

> It should be done because locl is an *optional* font feature.

I thought it was obligated if a language was passed to the renderer (but I may 
be wrong on this).

> Adobe introduced ROM/locl because they (and 99% of commercial fonts) remap
> "t with cedilla" to "t with comma" regardless of locale

That's just bad, t with cedilla _is_ used sometimes. I think it was even 
proposed a long time ago to be used in French for when a t sounds like /s/, 
like "relaţion" (didn't catch on unfortunately :-) ). Unicode itself mentions 
Semitic transliteration (but I guess that needs a lot of other glyphs those 
fonts don't have).

So far I've only found three Adobe fonts with Romanian glyphs and two didn't 
have the locl rule, so it looks like Adobe doesn't do it often either. They 
all have indeed t with comma below in the place of t with cedilla. If you have 
documents with mixed diacritics you can blame it on that practice, _not_ the 
absence of locl rules in the font.

I've also checked the MS Vista fonts once (usually they make the de facto 
standard rules since their fonts are most widely spread). Segoe UI and the new 
versions of Arial, Times New Roman etc. don't have locl rules or anything else 
and have t with cedilla at U+0162-U+0163 (I think the old versions known as 
the corefonts were pre-Unicode 3.0). The C-fonts which were made by another 
foundry have t with comma below at U+0162-U+0163 like Adobe fonts, and have a 
salt (stylistic alternate) _and_ a locl feature for s with cedilla glyphs to s 
with comma below for Romanian.

Also, one thing I'm asking myself is: why doesn't Gentium have locl rules (or 
ccmp rules)? It's a more recent font compared to Doulos and Charis, so the SIL 
people seem to have changed their minds about it, and I'd like to know their 
reasons before changing anything in DejaVu.

So, short conclusion: how it's dealt with it seems to just depend on the 
foundry that made the fonts, and it also seems to depend on who you ask. So 
far, I haven't seen enough yet to be sure that a locl rule is needed.

Also, don't always assume commercial fonts have it right. As said above, the 
same fonts have t with comma below in place of t with cedilla, together with a 
s with cedilla, which is the worst thing you can do here.


-- 
Configure bugmail: https://bugzilla.redhat.com/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are on the CC list for the bug, or are watching someone who is.




More information about the Fedora-fonts-bugs-list mailing list