Apache OpenOffice (AOO) Bugzilla – Full Text Issue Listing |
Summary: | Index from concordance file to ignore optional hyphens in doc | ||||||
---|---|---|---|---|---|---|---|
Product: | Writer | Reporter: | othr <jnk1> | ||||
Component: | code | Assignee: | AOO issues mailing list <issues> | ||||
Status: | CONFIRMED --- | QA Contact: | |||||
Severity: | Trivial | ||||||
Priority: | P3 | CC: | issues | ||||
Version: | OOo 1.1 RC5 | ||||||
Target Milestone: | --- | ||||||
Hardware: | Other | ||||||
OS: | Windows XP | ||||||
Issue Type: | ENHANCEMENT | Latest Confirmation in: | --- | ||||
Developer Difficulty: | --- | ||||||
Issue Depends on: | |||||||
Issue Blocks: | 128492 | ||||||
Attachments: |
|
Description
othr
2003-12-14 07:58:10 UTC
Reassigned to BH If one uses optional hyphens [-] in index entries they are also treated differently. For example the following would generate four entries in the index: concordance con[-]cordance concor[-]dance con[-]cor[-]dance It would be better if all optional hyphens [-]could be ignored when comparing index entries. Perhaps ordinary hyphens should be ignored too, though this is debatable. Perhaps the following should be treated as a single entry? (anti-semitism) anti-semitism antisemitism anti[-]semitism antisemit[-]ism anti-semit[-]ism Created attachment 53695 [details]
Test Case with Generated Index
This issue still affects release 2.4 Optional hyphens should be ignored for indexing purposes, though it would be useful to include them in the index so that long words still break in the desired place in the index. Non-breaking hyphens and regular hyphens should be treated as the same. Optionally, hyphenated and unhyphenated terms that are otherwise identical could be combined under a single index entry, i.e. for indexing purposes anti-semitism = antisemitism = Anti-semitism. Whichever spelling was used first would take precedence as the index entry. To grep the issues easier via "requirements" I put the issues currently lying on my owner to the owner "requirements". |