Discussion:
ZWNJ/ZWJ and ligation of combining marks
Peter Kirk
2003-08-26 17:09:14 UTC
Permalink
This is an issue which has come up on the Unicode Hebrew list but has
not been resolved there...

There is a requirement in some cases, depending on the typographical
convention in use, to form a ligature between the combining marks U+05BD
HEBREW POINT METEG and any one of the three "hataf" vowel points U+05B1,
U+05B2 and U+05B3. But there are also some exceptions i.e. cases in
which the ligature should not be formed when the default is for it to be
formed, and possibly cases when it should be formed when by default it
is not to be formed.

The Unicode standard, 4.0 section 15.2, specifies ZWJ and ZWNJ as the
characters to use for promoting or inhibiting ligatures. But the text
seems to assume that the characters being ligated are base characters. A
problem arises when ZWJ or ZWNJ is inserted between combining marks to
promote or inhibit ligation: as non-combining control characters these
characters break the combining sequence, such that the following
combining mark forms a defective combining sequence whose rendering is
undefined.

An alternative that has been suggested has been to use CGJ between the
combining marks. This has the advantage that CGJ is itself a combining
mark and so does not break the combining sequence; but the disadvantage
that this is one character, not two, and so cannot be used separately to
promote or inhibit ligation. The more serious problem is that this use
does not accord with the defined use of CGJ which is explicitly not to
affect ligation.

Can anyone on this list advise what should be done in this case?
--
Peter Kirk
***@qaya.org (personal)
***@qaya.org (work)
http://www.qaya.org/




------------------------ Yahoo! Groups Sponsor ---------------------~-->
KnowledgeStorm has over 22,000 B2B technology solutions. The most comprehensive IT buyers' information available. Research, compare, decide. E-Commerce | Application Dev | Accounting-Finance | Healthcare | Project Mgt | Sales-Marketing | More
http://us.click.yahoo.com/IMai8D/UYQGAA/cIoLAA/8FfwlB/TM
---------------------------------------------------------------------~->

To Unsubscribe, send a blank message to: unicode-***@yahooGroups.com

This mailing list is just an archive. The instructions to join the true Unicode List are on http://www.unicode.org/unicode/consortium/distlist.html


Your use of Yahoo! Groups is subject to http://docs.yahoo.com/info/terms/
Kent Karlsson
2003-08-28 13:03:27 UTC
Permalink
Post by Peter Kirk
The Unicode standard, 4.0 section 15.2, specifies ZWJ and ZWNJ as the
characters to use for promoting or inhibiting ligatures. But the text
seems to assume that the characters being ligated are base
characters. A
problem arises when ZWJ or ZWNJ is inserted between combining
marks to
promote or inhibit ligation: as non-combining control
characters these
characters break the combining sequence, such that the following
combining mark forms a defective combining sequence whose
rendering is undefined.
I have suggested that ZWJ and ZWNJ be made combining characters
of class 0 (which would be permissibly stable), so that ZWJ and
ZWNJ can occur between a base character and combining
characters (needed for Khmer) or between Hebrew combining
characters (that are in canonical order without the ZWJ/ZWNJ).
Current use (between combining sequences) would fall under
the special case of ZWJ/ZWNJ being at the end of a combining
sequence (rather than after it).

/kent k



------------------------ Yahoo! Groups Sponsor ---------------------~-->
KnowledgeStorm has over 22,000 B2B technology solutions. The most comprehensive IT buyers' information available. Research, compare, decide. E-Commerce | Application Dev | Accounting-Finance | Healthcare | Project Mgt | Sales-Marketing | More
http://us.click.yahoo.com/IMai8D/UYQGAA/cIoLAA/8FfwlB/TM
---------------------------------------------------------------------~->

To Unsubscribe, send a blank message to: unicode-***@yahooGroups.com

This mailing list is just an archive. The instructions to join the true Unicode List are on http://www.unicode.org/unicode/consortium/distlist.html


Your use of Yahoo! Groups is subject to http://docs.yahoo.com/info/terms/
Loading...