Dear all,
Others on this list have expressed interest in using Pathway to
export/print from FLEx, and we've been wanting to do so with our
project as well. Last time, we exported to SFM, converted to MDF, used
Toolbox's MDF-to-RTF exporter, and published a limited number of pocket
dictionaries using MS Word (and PDF995--for some reason PDFCreator
couldn't handle our page configuration). This time, I attempted to get
to
MS Word more directly by installing the Pathway plugin and running it
from within FLEx. (I haven't tried the third option, publishing via
Lexique Pro, because I
was told that its output is too "loose", too draft-like for book
publications.)
Overall, I think that Pathway is probably now a better way to go than
Toolbox, especially
since the data's sort order, separators, etc, can be configured within
FLEx itself. Most of the work I had to do after exporting from Pathway
was
formatting, rather than massaging the data. I did have some issues to
overcome, however, so I've reported them at
http://code.google.com/p/typeset-dictionary/issues
I've also copied them in full below for those of you who want to read
through the gory details.
BTW, our dictionary is trilingual, so Pathway couldn't export it in one
pass. I actually did three passes: exported the dictionary, exported
the reversals, switched the default analysis language, and exported the
reversals again. Then I merged the three documents together with the
book's front and back matter to get a single file I could print to a
single PDF file.
You may want to make note of the fact that sometimes styles can
mysteriously disappear when you're copying data from OpenOffice into MS
Word. I was surprised to find that the part of speech style (and a
couple of others) had disappeared
completely from my data, even though most of the styles came through
just fine. This is a serious problem that you want to catch early on in
the process. (I didn't, so I had to get creative later.)
Likewise, you can't assume that headers will convert over properly.
After I saved as .DOC from OpenOffice and opened the dictionary in
Word, the header labels were missing from the top of the page. (They
were in the file, but formatted as hidden text and mixed in with the
lexical data. I'm not sure why OpenOffice did this.) So if you're going
to use Word anyways, it's better not to bother leaving the "Insert
first and last word in header" option
checked in Pathway, and no need
to bother with the macro.
Maybe next time I'll be brave enough to do the whole thing in
OpenOffice.
-Jon (working on a root-based trilingual dictionary for an Indonesian
language)
Pathway 0.5.6-2010-03-27
Fieldworks 6.0.1
WinXP SP2
SUMMARY:
- Pathway export crashes OpenOffice
(I found a workaround)
- Character and paragraph styles have identical names
(with workaround)
- Exported file performs very slowly
- Homograph numbers not subscripted for
reversals
- Duplicate homograph numbers on subentries
- Visual appearance doesn't match FLEx
Dictionary
view
- Too many section breaks
(with workaround)
- Entries and subentries need to stay together
DETAILS:
Pathway export crashes OpenOffice
1. Export from FLEx to OpenOffice (via Pathway of course)
2. Leave both options checked: "Configured Dictionary" and "Reversal
Indexes"
(3. When prompted by OO, Enable Macros)
Instead of the exported ODM file opening properly in OO, it crashes OO
every time.
Character and paragraph styles have identical names
1. Export from Pathway to OpenOffice
2. Save as .DOC
3. Open in Word and try to edit a character style for which there is a
paragraph style with the same name
I would expect to be able to edit the style. Instead, I get an error
from
Word: "This style name already exists or is reserved for a built-in
style."
Also, it is not possible to search for either style in Word as long as
both
exist with identical names.
Deleting or renaming the identically-named paragraph style seems to
solve
the problem. Renaming would seem the safer option.
Some styles that I found to have duplicates:
headword_.klw_sense_senses_entry_letData
mainentryref_.klw_complexformcomponents_complexform_primaryrefs_sense_senses_entry_letData
mainentryref_.klw_xitem_complexformcomponents_complexform_primaryrefs_sense_senses_entry_letData
complexform_primaryrefs_sense_senses_entry_letData
Exported file performs very slowly
1. Export to OpenOffice
2a. Enable macros and then try to edit in OpenOffice
OR
2b. Use OpenOffice to save as .DOC and then try to edit in MS Word.
In OpenOffice, the macro that updates the page headings is EXTREMELY
slow,
and it runs every time I open the document. Is there perhaps some other
way
to generate these headings? (In MS Word, a computed field based on the
lexeme form's style can do this.)
In MS Word, something about the file (perhaps the number and complexity
of
styles?) causes repagination to take a long time (about 3 minutes for my
dictionary of about 4,000 entries). In Print Layout mode, various edits
can
trigger repagination.
MS Word does not have this issue when working with the document (and
styles) exported by Toolbox's MDF-to-RTF exporter. The same amount of
data
repaginates in about 3 seconds.
Homograph numbers not subscripted for reversals
1. Export reversals from FLEx
Homographs ought to be marked with subscripted numbers. Instead, they
used
full-sized numbers, which is especially problematic because "1"
resembles
the letter "l".
Duplicate homograph numbers on subentries
1. Configure FLEx Dictionary view to root-based.
2. Export data containing subentries that are homographs.
I'd expect those subentries to be marked with just a subscripted number,
like a main entry with homographs is, but these are marked with both a
subscripted number and a duplicate, full-sized number. (The numbers are
separated by a space. Screenshot attached.)
Visual appearance doesn't match FLEx Dictionary view
1. Format the Dictionary view in FLEx as desired.
2. Export to OpenOffice via Pathway.
Instead of using the fonts and sizes defined in FLEx, Pathway exports
the
data using its own formatting. This is true for both the default layout
(TwoColumn) and the FieldworksStyles layout.
Fortunately, the data itself (including its order and any separators) is
preserved through the export, and the formatting can be redone via
styles.
It's just that it's tedious to do so.
The default formatting is legible, but having the headword in a smaller
point size than the definitions seems a little odd to me. Also, I'm not
sure that Charis SIL is the best default font. It uses up a lot of
vertical
space compared to, say, Times New Roman. I guess this wouldn't be a big
deal if most styles could avoid defining the font and just inherit them
from a parent style. Changing the parent style's font would then ripple
down to its children.
Too many section breaks
I see that you're using a similar approach to letter headings (e.g.
"--- A
---") as the Toolbox/MDF-to-RTF exporter does, creating section breaks
between letter sections so that the letter headings can span both
columns.
The problem with this is that it makes it difficult to format or
cut-and-paste the document as a whole, since each section is somewhat
autonomous.
For example, I saved from OO into a Word doc and then wanted to
cut/paste
our dictionary data into our pocket-sized half-A5 template, but I
needed to
first carefully delete all of the section breaks, so that page and
column
formatting wouldn't be disrupted.
You might want to consider using a simpler solution instead, such as an
in-column heading.
Entries and subentries need to stay together
1. Export a root-based dictionary
Currently some pages end with a root, but all of that root's subentries
are
on the next page. I need to set some main-entry paragraphs to "keep with
next" so that they're on the same page as at least one of their
subentries.
We cannot apply this option to all main-entry paragraphs (that is, to
the
entry_letData style) because this creates lots of white space: not every
main entry has subentries, and if it doesn't it shouldn't be attached to
the following entry.
Maybe "parent" entries (main entries that have subentries) need to be a
special subtype of entry_letData so they can be set to "keep with next".
You received this message because you are subscribed to the discussion group "FLEx list". This group is hosted by Google Groups and is open for anyone to browse.