Apache OpenOffice (AOO) Bugzilla – Issue 12617
Greek characters are lost when saving to MS Word 6/95 format
Last modified: 2013-08-07 14:41:36 UTC
Open the attached document (it contains some Russian and Greek text) and try to save it as an MS Word 6 or 95 document. The Russian text is saved correctly (although OOo still can't read it back correctly), but the Greek text is simply replaced with question marks.
Created attachment 5215 [details] Test document containing some Greek and Russian text.
Thank you for using and supporting OOo. Does this problem still exist in 1.1 Beta 2?
Yes, the problem still appears in 1.1 Beta 2, so I've changed the "version" field for this issue.
I think this is a font issue. You may need a localized build in order to support non-English fonts. Does it happen with the localized builds of OOo?
First of all, there is no localized build of 1.1 Beta 2, so I experimented with the English version. When I save a file in Winword 6.0/95 format, Cyrillic characters are converted correctly, but Greek ones are not. I think there should be no difference between the two non-Latin codepages (windows-1251 and windows-1253) in this respect. And it is not a font issue: my fonts contain Greek characters, I can see them in my documents, and I can type them from the keyboard. Of course, I can save my files in any Unicode-based format (such as rtf, Winword 97, or native sxw). However, when I save a file in Winword 6.0/95 format, my Greek characters are lost (replaced with question marks). Such behaviour is to be expected for Unicode characters that are not present in any windows-125* codepage, but Basic Greek characters should be converted to windows-1253, just as Cyrillic ones are converted to windows-1251.
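The symptom described above can be reproduced outside OOo. The following is a minimal Python sketch, not OOo's export code: the helper name `encode_or_question_marks` is hypothetical, and it only assumes that unmappable characters are replaced with "?" on export, as the report describes.

```python
# Illustration only: shows why Greek text survives in windows-1253
# but turns into question marks when forced into windows-1251.

def encode_or_question_marks(text: str, codepage: str) -> bytes:
    """Encode text to an 8-bit codepage, replacing unmappable characters with '?'."""
    return text.encode(codepage, errors="replace")

cyrillic = "абв"
greek = "αβγ"

# Cyrillic maps cleanly to windows-1251.
print(encode_or_question_marks(cyrillic, "cp1251"))  # b'\xe0\xe1\xe2'

# Greek maps cleanly to windows-1253.
print(encode_or_question_marks(greek, "cp1253"))     # b'\xe1\xe2\xe3'

# Greek has no representation in windows-1251, so every character is lost.
print(encode_or_question_marks(greek, "cp1251"))     # b'???'
```

The last line matches the reported output, which suggests the exporter is choosing a codepage that does not cover Greek rather than failing to find a font.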
Reassigned to MRU
This is a well-known issue and not so easy to solve. As long as the Word 6/95 format is not capable of storing Unicode, there is no way to save different encodings in one document. See issue #12445 for more details. This will be closed as a duplicate. *** This issue has been marked as a duplicate of 12445 ***
Closed.
Disagree. First of all, precisely because the Word 6/95 format is not capable of storing Unicode, it *does* store text in different 8-bit codepages in one document. The only thing we need is the conversion of our characters to the correct windows-125* codepage. OOo is already capable of performing such a conversion, since Cyrillic characters are saved correctly. So the problem is not with saving national characters in general (as in issue 12445), but only with Greek characters, which should be converted to windows-1253. This means that this issue may be marked as depending on 12445, but not as a duplicate of it.
Reassigned to Caolan McNamara.
With the fix for issue 12445 this should work in 2.0. It's pretty experimental, because the codepage to export to has to be figured out from the Unicode range the characters are in.
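As a rough illustration of picking the export codepage from the Unicode range, here is a minimal Python sketch. It is not the actual OOo filter code: the range table, the `cp1252` default, and the names `codepage_for` and `split_runs` are assumptions made for this example, and a real exporter would need to cover many more ranges and handle characters shared between codepages.

```python
from itertools import groupby

# Hypothetical range table: (first code point, last code point, codepage).
RANGES = [
    (0x0370, 0x03FF, "cp1253"),  # Greek and Coptic block -> windows-1253
    (0x0400, 0x04FF, "cp1251"),  # Cyrillic block -> windows-1251
]
DEFAULT_CODEPAGE = "cp1252"      # assumed fallback for everything else

def codepage_for(ch: str) -> str:
    """Return the codepage guessed from the Unicode block of one character."""
    code_point = ord(ch)
    for first, last, codepage in RANGES:
        if first <= code_point <= last:
            return codepage
    return DEFAULT_CODEPAGE

def split_runs(text: str) -> list[tuple[str, bytes]]:
    """Split text into consecutive runs that share a codepage and encode each run."""
    return [
        (codepage, "".join(chars).encode(codepage, errors="replace"))
        for codepage, chars in groupby(text, key=codepage_for)
    ]

for codepage, data in split_runs("abc αβγ абв"):
    print(codepage, data)
```

Each run carries its own codepage, which is how a single 8-bit document can hold Latin, Greek, and Cyrillic text side by side.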
reopen to reassign
cmc->mru: Working limerickfilterteam08
Checked fix with internal CWS filterteam08.
Fix verified. Will be part of OO 2.0.
Fix good in OO 2.0 snapshot src680m13.