| From | Sent On | Attachments |
|---|---|---|
| Amir E. Aharoni | Oct 17, 2011 6:10 am | |
| Patrick Durusau | Oct 18, 2011 3:56 am | |
| Amir E. Aharoni | Oct 18, 2011 6:58 am | |
| Patrick Durusau | Oct 18, 2011 8:10 am | |
| Patrick Durusau | Oct 18, 2011 8:32 am | |
| John B | Oct 18, 2011 8:56 am | |
| Frode Severin Hatlevik | Oct 19, 2011 10:22 am | .odt |
| Amir E. Aharoni | Oct 19, 2011 10:41 am | .html |
| Andreas J. Guelzow | Oct 19, 2011 11:12 am | |
| Andreas J. Guelzow | Oct 19, 2011 11:22 am | |
| Frode Severin Hatlevik | Oct 19, 2011 12:08 pm | |
| Andreas J. Guelzow | Oct 19, 2011 12:29 pm | |
| robe...@us.ibm.com | Oct 19, 2011 12:48 pm | |
| Amir E. Aharoni | Oct 22, 2011 9:58 am | |
| Thorsten Behrens | Oct 23, 2011 3:34 am | |
| Caolán McNamara | Oct 24, 2011 3:20 am | |
| Subject: | Re: [office-comment] marking directionality of text inside a paragraph | |
|---|---|---|
| From: | Caolán McNamara (caol...@redhat.com) | |
| Date: | Oct 24, 2011 3:20:12 am | |
| List: | org.oasis-open.lists.office-comment | |
On Sun, 2011-10-23 at 12:35 +0200, Thorsten Behrens wrote:
> robe...@us.ibm.com wrote:
> The ODF 1.2 (this was in 1.1 as well) spec, Appendix E1 of Part 1 covers Bidi text.
> Amir E. Aharoni wrote:
> Nothing that involves Unicode control characters can be described as "easily".
> I tend to agree with Amir - albeit sufficient in expressiveness, that's still an area in ODF that's a bit inconsistent, and involves extra effort for 'simple' applications.
> Beyond that, getting lower impedance towards/from HTML is desirable - if it doesn't break anything.
At the risk of confusing things, focusing on directionality alone might be a bit of a red herring. There are three script categories in ODF IIRC: Latin, Asian and Complex. All text, in turn, is divided into four categories: Latin, Asian, Complex and Weak.
Problems arise when encountering Weak characters, e.g. spaces, punctuation and mathematical symbols. They generally get assigned to one of the other three categories depending on the context of the surrounding text. There isn't a way to override the script category they get assigned to, I think. Or is there?
So, one example scenario is a document consisting of a single paragraph that contains only weak characters, something like .:?". There isn't a way to state that these weak characters should be biased towards one script category or another. If you open that in a version of LibreOffice/OpenOffice.org then the final fallback is to bias towards the locale the user is in, i.e. a Japanese user gets .:?" shown in their CJK font, and a European user gets .:?" shown in their Western font, so the same document is rendered differently in different locales.
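To make the fallback concrete, here is a minimal Python sketch of that kind of resolution. Everything in it is an assumption for illustration: the `strong_category` heuristic based on Unicode character names, the "previous strong neighbour, then next strong neighbour" rule, and the `locale_default` parameter are stand-ins, not the actual LibreOffice/OpenOffice.org logic.

```python
import unicodedata

LATIN, ASIAN, COMPLEX, WEAK = "latin", "asian", "complex", "weak"


def strong_category(ch):
    """Rough, illustrative script-category guess for a single character."""
    if not ch.isalpha():
        # Spaces, punctuation, digits and symbols are treated as weak here.
        return WEAK
    name = unicodedata.name(ch, "")
    if name.startswith(("CJK", "HIRAGANA", "KATAKANA", "HANGUL")):
        return ASIAN
    if name.startswith(("HEBREW", "ARABIC", "THAI", "DEVANAGARI")):
        return COMPLEX
    return LATIN


def resolve(text, locale_default):
    """Assign every character a non-weak category.

    Assumed rule: a weak character takes the category of the nearest
    preceding strong character, else the nearest following one, else
    the locale default (the 'final fallback' described above).
    """
    cats = [strong_category(ch) for ch in text]
    resolved = []
    last_strong = None
    for i, cat in enumerate(cats):
        if cat != WEAK:
            last_strong = cat
            resolved.append(cat)
            continue
        following = next((c for c in cats[i + 1:] if c != WEAK), None)
        resolved.append(last_strong or following or locale_default)
    return resolved


# A paragraph of only weak characters has no strong neighbour at all,
# so the result depends entirely on the locale default passed in.
japanese_view = resolve('.:?"', ASIAN)
european_view = resolve('.:?"', LATIN)
```

With no strong character anywhere in the paragraph, `japanese_view` and `european_view` differ for the same input, which is the locale-dependent rendering described above.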
One way to check: if you select your problematic text and change e.g. the Western font size to 50 and the CTL font size to 25, what size is the misplaced ! drawn in? That shows whether the ! has been categorized as a 25pt CTL character or a 50pt Latin character.
We don't have a "script bias" or "idcthint" feature to force weak characters into one script or another. I believe that OpenXML has something of that nature.
i.e. given a weak character such as a bare ! or space, how do we specify that it should be rendered using the Asian font, the Complex font or the Western font, if we want to override the script of the neighbouring strong characters?
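As a sketch of what such an override could look like in an implementation, the Python below models a run of weak text with an optional hint. The `script_hint` field is purely hypothetical (nothing like it exists in ODF as far as this message knows), and the `WeakRun` type and `FONT_SIZES` table are invented for illustration; the 50 and 25 values are the Western and CTL sizes from the test described earlier, while the Asian size is made up.

```python
from dataclasses import dataclass
from typing import Optional

# Per-category font sizes. 50 (Western) and 25 (CTL) come from the
# test scenario in the message; the Asian value is an arbitrary assumption.
FONT_SIZES = {"latin": 50, "asian": 36, "complex": 25}


@dataclass
class WeakRun:
    text: str
    neighbour_category: str            # category of the adjacent strong text
    script_hint: Optional[str] = None  # hypothetical override, not an ODF feature


def effective_category(run):
    """The hint, when present, wins over the neighbouring strong script."""
    return run.script_hint or run.neighbour_category


def font_size(run):
    """Size the weak run would be drawn in under this model."""
    return FONT_SIZES[effective_category(run)]


# Without a hint, the ! follows its CTL neighbour.
unhinted = WeakRun("!", neighbour_category="complex")
# With the hypothetical hint, the same ! is forced into the Western font.
hinted = WeakRun("!", neighbour_category="complex", script_hint="latin")
```

Under these assumptions, `font_size(unhinted)` is the CTL size and `font_size(hinted)` is the Western size, so the size the ! is drawn in reveals which category it ended up in.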
C.
--
This publicly archived list offers a means to provide input to the OASIS Open Document Format for Office Applications (OpenDocument) TC.
In order to verify user consent to the Feedback License terms and to minimize spam in the list archive, subscription is required before posting.

Subscribe: offi...@lists.oasis-open.org
Unsubscribe: offi...@lists.oasis-open.org
List help: offi...@lists.oasis-open.org
List archive: http://lists.oasis-open.org/archives/office-comment/
Feedback License: http://www.oasis-open.org/who/ipr/feedback_license.pdf
List Guidelines: http://www.oasis-open.org/maillists/guidelines.php
Committee: http://www.oasis-open.org/committees/tc_home.php?wg_abbrev=office
Join OASIS: http://www.oasis-open.org/join/