Apache OpenOffice (AOO) Bugzilla – Issue 14416
Import fails on MS 6/7 DOC created from WordPerfect 8.0-Linux
Last modified: 2013-08-07 14:43:45 UTC
I am trying to get a bunch of WordPerfect documents converted to WORD, HTML, and PDF. OO 1.1 beta looked like a great choice, except I had to export them from WP8 as MS Word. When I try to import the DOC file into OO 1.1 beta, I get a text import and garbage. I tested this with a very simple, 1 line wpd/doc file. Word 2002 has no problem opening the .doc file. OO fails. 1. create test_1.wpd using WP 8.0 on Linux (with export fix applied) default font: courier, 10 Pt type "This is a test" 2. save and then export as word 6/7 -> test_1.doc 3. try to open in OO 1.1 beta, RH 8, Linux -> imports as text 4. try to open in Word 2002 on Windows 2K in VMware -> imports properly I can provide the docs. Please mail me and I'll forward them.
Hi reporter, by the way something completely different: we have someone who is working on an import-filter for Wordperfect-documents , and now he needs testfiles. Can you pls. attach some testfiles for issue 4731 ? Now something concerning your problem: Pls. attach one of your .doc - files so that we can reproduce your problem! Thanks Rainer
reassigned to mru
wadeh, please attach such a file to this issue, so that we can evaluate the problem. Feel free to re-open this issue when done. Thanks for supporting us!
Got the files from Wade. Problem was, that the filter wasn't able to recognize this as a Word document, because WP's export is kind of poor. OO Writer's filter has now been enhanced, so that it also recognizes these format.
Checked with OO 1.1 RC4, fix is integrated.
I am wrong, this does not work with RC4 :-(
MRU->CMC: SO is able to open this document, but OO isn't. Is there something different in the filter detection?
Created attachment 9245 [details] .doc produced by WP
Michael, Caolan, would you please reconsider the target milestone? Thanks, Stefan
I can open wpd_simple_test_doc1_msw6-7.doc without problem with 1.1RC3 German version WIN98SE: 645m15(Build8669) [CWS:ooo11rc3] but the content is destroyed. Before the text "This is a very simple document ..." starts, I see: Ü¥e#-À #####e###############œ###m ##################œ#######################################ú#######ú###ú ######ú ######ú ######ú ######ú ####### ####### ## . . . Seems to be some misinterpretation of the document header. Rainer
Quote: "MRU->CMC: SO is able to open this document, but OO isn't. Is there something different in the filter detection? " I hope we can get the same between OOo and SO I hope we can get this max 1.1.1
Please check out issue 19641 as a possible duplicate of this issue. I had guessed that the offending document had been created by word and now I see that perhaps it was WP.
Issue not found There does not seem to be an issue numbered 19641. I think it is Internal number again
If you need additional samples (e.g., more complex formatting), I can generate them for you. Several might be good test cases once you get the initial problem solved? If you get a patch against RC4, I can test it (please e-mail me).
retarget to 1.1.1 since no reason given to be a stopper.
There an entry missing from the OOo TypeDetection.xcu (well a filter appears to be misclassified as being an third party filter, when its not). See (duplicate) issue 12175 for a workaround.
Setting as duplicate. *** This issue has been marked as a duplicate of 12175 ***
This bug is identical with i12175. Therefore I close it now.