Issue 116560 - problem with the arabic characters لا لأ ﻹ while reading pdf with Draw
Summary: problem with the arabic characters لا لأ ﻹ while reading pdf with Draw
Status: UNCONFIRMED
Alias: None
Product: extensions
Classification: Extensions
Component: pdfimport
Version: current
Hardware: PC Linux, all
Importance: P3 Trivial
Target Milestone: ---
Assignee: AOO issues mailing list
QA Contact: wolframgarten
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2011-01-22 08:50 UTC by ammine007
Modified: 2013-04-02 19:30 UTC

See Also:
Issue Type: DEFECT
Latest Confirmation in: ---
Developer Difficulty: ---


Attachments
This is the odt file with the correct text in Arabic that should appear once the exported pdf is reopened with Draw (86.60 KB, image/png)
2011-01-22 08:52 UTC, ammine007
Here is my version info of OoO (595.91 KB, image/png)
2011-01-22 08:54 UTC, ammine007
Here is my exported pdf reopened with Draw, and the أ ا إ chars that are combined in لا لأ لإ are missing (107.68 KB, image/png)
2011-01-22 08:56 UTC, ammine007
Here is my exported pdf reopened with Evince (a pdf viewer for Linux), and the chars are correct; that is the normal behavior expected from Draw + the PDFImport extension (51.39 KB, image/png)
2011-01-22 08:59 UTC, ammine007
This is my original Arabic text in odt format (68.80 KB, image/png)
2011-01-22 19:29 UTC, farouk85
When I converted it to pdf everything was OK (31.99 KB, image/png)
2011-01-22 19:31 UTC, farouk85
But when I opened my pdf in OpenOffice Draw to change my text, the characters لا لأ لإ disappeared (56.29 KB, image/png)
2011-01-22 19:35 UTC, farouk85
arabic chars for testing لا لأ لإ لآ problems (8.22 KB, text/plain)
2011-01-24 10:23 UTC, ammine007
arabic chars for testing لا لأ لإ لآ problems - the exported pdf file (10.08 KB, application/pdf)
2011-01-24 10:24 UTC, ammine007

Description ammine007 2011-01-22 08:50:13 UTC
Salam,

In our company we fill out pdf forms in three languages (Arabic,
English and French).

We use OoO+PDFImport to annotate (fill out) our pdf forms.

Using English characters, the pdf opens correctly in Draw.

Using French characters, the pdf still opens correctly in Draw (except for
some special characters like à and ç).

Using Arabic characters, the pdf still opens correctly in Draw (except for the
characters لا لأ لإ).

Please have a look at the attached screenshots.

I really do not know whether it's a Draw problem or a PDFImport extension problem.


I'm facing this issue using OoO-3.2.1.4 + PDFImport extension-1.0.4 on both a
Linux/Ubuntu-10.10-32bits machine and a WinXP machine.

Thank you for your time!
Comment 1 ammine007 2011-01-22 08:52:30 UTC
Created attachment 75606 [details]
This is the odt file with the correct text in Arabic that should appear once the exported pdf is reopened with Draw
Comment 2 ammine007 2011-01-22 08:54:00 UTC
Created attachment 75607 [details]
Here is my version info of OoO
Comment 3 ammine007 2011-01-22 08:56:31 UTC
Created attachment 75608 [details]
Here is my exported pdf reopened with Draw, and the أ ا إ chars that are combined in لا لأ لإ are missing
Comment 4 ammine007 2011-01-22 08:59:08 UTC
Created attachment 75609 [details]
Here is my exported pdf reopened with Evince (a pdf viewer for Linux), and the chars are correct; that is the normal behavior expected from Draw + the PDFImport extension
Comment 5 aznag 2011-01-22 09:58:36 UTC
I recommend that you and your company use this software, for many reasons,
especially because it's free and efficient, and it's a competitor to Office 2007.

thanks
Comment 6 farouk85 2011-01-22 12:51:14 UTC
Hi, I tested the same case and I get the same problem as described by this user.
How can this be fixed?
Comment 7 farouk85 2011-01-22 19:29:47 UTC
Created attachment 75611 [details]
This is my original Arabic text in odt format
Comment 8 farouk85 2011-01-22 19:31:31 UTC
Created attachment 75612 [details]
When I converted it to pdf everything was OK
Comment 9 farouk85 2011-01-22 19:35:18 UTC
Created attachment 75613 [details]
But when I opened my pdf in OpenOffice Draw to change my text, the characters لا لأ لإ disappeared
Comment 10 wolframgarten 2011-01-24 09:40:01 UTC
Please attach a document for testing, not only screenshots. Thanks.
Comment 11 ammine007 2011-01-24 10:23:26 UTC
Created attachment 75621 [details]
arabic chars for testing لا لأ لإ لآ problems
Comment 12 ammine007 2011-01-24 10:24:59 UTC
Created attachment 75622 [details]
arabic chars for testing لا لأ لإ لآ problems - the exported pdf file
Comment 13 ammine007 2011-01-24 11:07:03 UTC
Salam,


thank you for your time !

Here are the odt file and its exported pdf file, filled in correctly.

In case it helps:

1- In all cases, the characters لا لأ لإ لآ are not handled correctly when
opening the PDF file with OoO-Draw + the PDFImport extension.

2- Their UTF-16 codes range from FEF5 to FEFC, in the Arabic Presentation Forms-B block.

3- To write the لأ character in Arabic, we first type the ل char, then the أ char.

     With some fonts, when writing in OoO-writer ():

--- the أ character in لأ is removed and we just see the ل character.
e.g. لأ becomes ل (DejaVu Sans)

--- the أ character in لأ is substituted with the م character.
e.g. 

--- the أ character in لأ is removed and the space after it is also removed.
e.g. لأ ق becomes لق (Courier 10 Pitch, Bitstream Charter)


Liberation Serif is the only font that, once the PDF is opened with
Draw, shows the لأ character, but it still removes any space after the لأ char or
after the word that contains the لأ char, if any.


Thank you for your efforts ! and good luck to you !
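
Points 2 and 3 above can be illustrated with a short Python sketch. It is not taken from the PDFImport extension. It only uses the standard unicodedata module to show that each code point in FEF5..FEFC is a compatibility ligature that maps back to ل followed by an alef variant, which is the sequence that is actually typed. The names LAM_ALEF_LIGATURES and to_logical are made up for this example, and using NFKC normalization is an assumption about how one might map the ligatures back, not a description of what Draw does.

```python
import unicodedata

# Lam-alef ligatures in Arabic Presentation Forms-B: U+FEF5..U+FEFC
# (isolated and final forms of lam + four alef variants).
LAM_ALEF_LIGATURES = [chr(cp) for cp in range(0xFEF5, 0xFEFD)]


def to_logical(text: str) -> str:
    """Map presentation-form characters back to base letters via NFKC.

    Hypothetical helper for illustration only.
    """
    return unicodedata.normalize("NFKC", text)


if __name__ == "__main__":
    for ch in LAM_ALEF_LIGATURES:
        parts = " ".join(f"U+{ord(c):04X}" for c in to_logical(ch))
        print(f"U+{ord(ch):04X} {unicodedata.name(ch)} -> {parts}")
```

Note that NFKC also rewrites other compatibility characters, so a real importer would probably want a narrower mapping than this.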
Comment 14 ammine007 2011-01-24 17:36:07 UTC
Salam,

I think that this is the "unicode block" to add/support in the PDF import
extension source code


http://www.fontspace.com/unicode/block/Arabic+Presentation+Forms-B
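
As a rough way to check whether text extracted from a PDF contains code points from that block, here is a small Python sketch. The block range U+FE70..U+FEFF comes from the Unicode standard. The function name find_forms_b is hypothetical and has nothing to do with the extension's source code.

```python
# Arabic Presentation Forms-B occupies U+FE70..U+FEFF.
FORMS_B_START, FORMS_B_END = 0xFE70, 0xFEFF


def find_forms_b(text: str) -> list[tuple[int, str]]:
    """Return (index, 'U+XXXX') for each character that lies in Forms-B.

    Hypothetical diagnostic helper, for illustration only.
    """
    return [
        (i, f"U+{ord(ch):04X}")
        for i, ch in enumerate(text)
        if FORMS_B_START <= ord(ch) <= FORMS_B_END
    ]
```

An empty result means the text uses only the base Arabic letters (for example U+0644 followed by U+0627) rather than the precomposed ligatures.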
Comment 15 ammine007 2011-01-24 17:37:10 UTC
This is a list of fonts that support characters in the Arabic Presentation
Forms-B Unicode block.

http://www.fileformat.info/info/unicode/block/arabic_presentation_forms_b/fontsupport.htm
Comment 16 faouzi 2011-01-29 20:57:19 UTC
Hi everyone,

We are trying to migrate to OoO. We use the Arabic language in our enterprise, but
we found a problem when importing pdf forms that contain the لا لأ لإ لآ characters,
as described above.

We need this functionality in order to migrate from MS to Linux.

Has any developer been assigned to this issue?
Comment 17 Rob Weir 2013-02-02 02:58:08 UTC
This Issue requires more information ('needmoreinfo'), but has not been updated
within the last year. Please provide feedback as requested and re-test with the latest version of OpenOffice - the problem(s) may already be addressed.

You can download Apache OpenOffice 3.4.1 from http://www.openoffice.org/download

Please report back the outcome of your testing, so this Issue may be closed or
progressed as necessary - otherwise the issue may be Resolved as Invalid in the
future.