Issue 472 - Large file (80+pgs) with HTML in Endnotes Crashes
Summary: Large file (80+pgs) with HTML in Endnotes Crashes
Status: CLOSED FIXED
Alias: None
Product: Writer
Classification: Application
Component: code (show other issues)
Version: 614
Hardware: PC Linux, all
: P2 Trivial (vote)
Target Milestone: ---
Assignee: michael.brauer
QA Contact: issues@www
URL: http://www.nihonlinks.com/OpenOffice/...
Keywords:
Depends on:
Blocks:
 
Reported: 2001-02-25 03:03 UTC by issues@www
Modified: 2007-09-23 15:14 UTC (History)
1 user (show)

See Also:
Issue Type: DEFECT
Latest Confirmation in: ---
Developer Difficulty: ---


Attachments

Note You need to log in before you can comment on or make changes to this issue.
Description issues@www 2001-02-25 03:03:29 UTC
The file at http://www.nihonlinks.com/OpenOffice/CURRENT.sdw will consistently
crash all versions of Open Office and StarOffice.

The file is fairly big (80+ pages).  I don't think its size related.

The endnotes contain HTML.  I think it's not parsing correctly.

In any event it'll crash anything.

The file is copyrighted.. preliminary draft of a paper.  It is not to be
distributed except for testing.. ;)
Comment 1 stefan.baltzer 2001-02-26 10:46:46 UTC
Not even Storage viewing tools were able to open the file. Therefore the storage
is broken. 

This file was created with a StarOffice 5.2 (569), that much was visible, but we
can't do more than anyone else (saving text contents by using an editor).

If you could tell us EXACTLY how to reproduce this (without a head crash or an
electricity blackout at the right time while saving or copying the file ;-) then
we might try to find out what went wrong. Otherwise this file is simply lost.

PLease comment.

Comment 2 issues@www 2001-02-27 07:25:52 UTC
Well, I will try to reproduce again but I'm pretty sure pasting HTML text with 
extended characters did it.  I'll try with the HTML I think I was using in a 
blank document.

Last time it was Icelandic.  I think this time it was French.  Always the 
French.. ;)

Anyway, I'll try it with the same HTML.  I'm pretty sure there's an interesting 
bug somewhere.  I can get an HTML exported and displaying it will not.  Saving 
the HTML out as a StarOffice format will still make a file that crashes on 
load.  Would you like to see the HTML 
(http://www.nihonlinks.com/OpenOffice/CURRENT.html)  (BTW, the endnotes get all 
destroyed from the HTML otherwise I'd be totally happy recovered camper.)

BTW, my editor has "advised" me to start using Word for the rest of the 
document.. ug.. ;)
Comment 3 jp 2001-03-02 16:18:56 UTC
The problem is, that the textencoding of the document is RTL_TEXTENCODING_EUC_JP 
and the ByteString -> UnicodeString conversion routine remove characters from 
the string. This does never happens!
Comment 4 issues@www 2001-03-14 19:19:06 UTC
Jeurgen,

Thanks for the info!  I'm not sure I understand what's going on though.

The strings are encoded internally as EUC_JP according to the environment 
variable LANG's value? or are they internally Unicode?

Is the ByteString to Unicode string conversion barfing because the strings 
being passed are malformed in some way?

If the text is being entered from the clipboard there needs to be a way to 
apply conversion on the clipboard text, no?
Comment 5 Unknown 2001-07-13 14:11:41 UTC
TH->JP: See Interface-Announcement where you get a function to ignore 
MultiByte-Charsets. Plese discuss/inform also other developer (Malte, Niklas, 
...) which could have the same problem when they save Old SO FileFormats.
Comment 6 jp 2001-07-30 08:34:19 UTC
Please change the code of binary im-/export filter like it is describe in the 
changes mail.
Comment 7 michael.brauer 2001-08-02 09:19:11 UTC
The bug document isn't existing any longer, therfor I can't verify whether it
loads correctly using my changes.
Comment 8 p9w.vu.31122010 2007-09-23 15:14:16 UTC
Closing.