Issue 18675

Summary: When saving a Hebrew file to HTML, punctuation is incorrectly marked as English
Product: Internationalization
Reporter: sforbes <xslf>
Component: BiDi
Assignee: AOO issues mailing list <issues>
Status: ACCEPTED ---
QA Contact:
Severity: Trivial
Priority: P3
CC: issues, mehlng
Version: OOo 1.1 RC3
Target Milestone: ---
Hardware: PC
OS: Windows XP
Issue Type: DEFECT
Latest Confirmation in: ---
Developer Difficulty: ---
Issue Depends on: 16354
Issue Blocks:
Attachments:
Description | Flags
original write file | none
exported html | none
test file created directly in the html editor | none
simple test file created with OOo 680m26 | none
HTML created by OOo 680m26 - notice that the punctuation is in an English span | none

Description sforbes 2003-08-26 09:08:51 UTC
OOo 1.1 RC3, running on Windows XP, Hebrew locale. HTML export encoding set to windows-1255.

I created a simple document using styles and no ad-hoc formatting.
Saved it as HTML.

Looking at the generated HTML code, it seems that all punctuation is enclosed
within <SPAN LANG="en-US">, which is incorrect: the punctuation is not in
English, and marking it as English could lead to incorrect browser rendering
(the browser choosing a different font for the punctuation).

I got the same result with the HTML editor.
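
The symptom described above can be checked mechanically. The following is a minimal sketch in Python, not part of OOo: it scans an exported HTML string for <SPAN LANG="en-US"> elements whose text consists only of punctuation. The names PunctuationSpanFinder and find_punctuation_spans are made up for this illustration, and nested spans are handled only in the simplest way (inner text is not counted towards the outer span).

```python
from html.parser import HTMLParser
import unicodedata


class PunctuationSpanFinder(HTMLParser):
    """Collect the text of <span lang="en-US"> elements that hold only punctuation."""

    def __init__(self):
        super().__init__()
        self._stack = []      # one entry per open <span>: [is_en_us, text_parts]
        self.suspicious = []  # punctuation-only text found inside en-US spans

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names, so <SPAN LANG=...> matches.
        if tag == "span":
            lang = dict(attrs).get("lang") or ""
            self._stack.append([lang.lower() == "en-us", []])

    def handle_data(self, data):
        # Text is credited to the innermost open span only.
        if self._stack:
            self._stack[-1][1].append(data)

    def handle_endtag(self, tag):
        if tag == "span" and self._stack:
            is_en_us, parts = self._stack.pop()
            text = "".join(parts).strip()
            # Unicode general categories starting with "P" are punctuation.
            if is_en_us and text and all(
                unicodedata.category(ch).startswith("P") or ch.isspace()
                for ch in text
            ):
                self.suspicious.append(text)


def find_punctuation_spans(html_text):
    """Return the punctuation-only contents of all en-US spans in html_text."""
    finder = PunctuationSpanFinder()
    finder.feed(html_text)
    finder.close()
    return finder.suspicious
```

For a file exported with the windows-1255 encoding, the text could be read with open(path, encoding="cp1255") before being passed to find_punctuation_spans; a non-empty result indicates the behaviour reported here.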
Comment 1 sforbes 2003-08-26 09:09:28 UTC
Created attachment 8764 [details]
original write file
Comment 2 sforbes 2003-08-26 09:10:01 UTC
Created attachment 8765 [details]
exported html
Comment 3 sforbes 2003-08-26 09:10:57 UTC
Created attachment 8766 [details]
test file created directly in the html editor
Comment 4 mehlng 2003-08-26 09:32:20 UTC
This is the expected behaviour, since for some reason all punctuation
is currently considered Western.
When that is fixed, this not-really-a-problem will be fixed as well.

*** This issue has been marked as a duplicate of 16354 ***
Comment 5 sforbes 2003-08-26 09:50:07 UTC
They are related, but not duplicates: even if issue 16354 is fixed,
there is no guarantee that this will be fixed as well.
Comment 6 mehlng 2003-08-26 10:03:49 UTC
Well, I won't argue, but I think that changing all punctuation to
Hebrew fonts will definitely solve the problem.
Anyhow, suit yourself: if you think there's anything to fix, let's
leave it open.
I hope that the punctuation issue will be solved quickly.
Comment 7 Dieter.Loeschky 2003-08-27 11:38:46 UTC
DL->MIB: Would you please take over?
Comment 8 michael.brauer 2003-08-27 11:55:24 UTC
The HTML export uses the same code as the UI to figure out the font
and language for all characters, so if 16354 is fixed, this bug will
be fixed as well.

*** This issue has been marked as a duplicate of 16354 ***
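
To illustrate the kind of per-character classification being discussed, here is a rough Python sketch. It is not OOo's actual code; the function names, the "CTL"/"Western" labels as return values, and the inheritance rule are assumptions made for this example. It contrasts treating punctuation as always Western with letting punctuation inherit the script of the preceding strong character.

```python
def strong_script(ch):
    """Return 'CTL' for characters in the Hebrew block, 'Western' for other
    letters and digits, and None for weak characters such as punctuation."""
    if "\u0590" <= ch <= "\u05FF":  # Unicode Hebrew block
        return "CTL"
    if ch.isalnum():
        return "Western"
    return None


def assign_scripts(text, inherit=True, weak_default="Western"):
    """Pair every character with a script label.

    With inherit=False, weak characters always get weak_default, which
    mirrors the behaviour reported in this issue. With inherit=True, a weak
    character takes the script of the closest preceding strong character.
    """
    result = []
    previous = None  # script of the last strong character seen so far
    for ch in text:
        script = strong_script(ch)
        if script is None:
            script = previous if (inherit and previous) else weak_default
        else:
            previous = script
        result.append((ch, script))
    return result
```

Under these assumptions, the final period of a Hebrew sentence is labelled "Western" with inherit=False and "CTL" with inherit=True. An exporter built on the second rule would have no reason to wrap that period in an English span.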
Comment 9 sforbes 2004-02-23 11:05:22 UTC
Tested with 680M26, which according to the release notes
( http://development.openoffice.org/releases/680_m26_snapshot.html ) has the fix
for issue #16354, which this bug was marked as a duplicate of.

This problem still occurs and doesn't seem to be fixed by the fix for issue #16354.

See the new sample files.
Comment 10 sforbes 2004-02-23 11:06:40 UTC
Created attachment 13355 [details]
simple test file created with OOo 680m26
Comment 11 sforbes 2004-02-23 11:07:25 UTC
Created attachment 13356 [details]
HTML created by OOo 680m26 - notice that the punctuation is in an English span
Comment 12 michael.brauer 2004-06-04 16:21:04 UTC
To my surprise, #i16354# did not solve the issue.
Comment 13 sforbes 2004-06-21 14:44:36 UTC
Adding dina as CC.