Hello everyone,
I would like to share an example related to the HTML subfilter (okf_html).
Example sentence in the XLS file:
A <p1>replacement unit</p2> will be shipped <p3>at no additional cost</p4> once the issue is confirmed.
In this example, the tags are meant to mark inline content only. The sentence should remain a single segment and should not be split because of these tags.
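To make the expected behavior concrete, here is a minimal Python sketch of the result I am after. It is not Okapi code and says nothing about how the filter works internally. The regex and the helper name `to_single_segment` are my own illustration, and they assume the tags always look like `<pN>` or `</pN>`. Each tag becomes a numbered placeholder and the sentence stays in one piece.

```python
import re

# Assumed shape of the custom tags: <p1>, </p2>, <p3>, ...
# The opening and closing numbers differ in the example, so the tags
# are matched individually rather than as named pairs.
INLINE_TAG = re.compile(r"</?p\d+>")


def to_single_segment(cell_text):
    """Return (segment, codes): the text with numbered placeholders, plus the original tags."""
    codes = []

    def stash(match):
        # Remember the original tag and leave a placeholder such as {1} in its place.
        codes.append(match.group(0))
        return "{%d}" % len(codes)

    return INLINE_TAG.sub(stash, cell_text), codes


sentence = ("A <p1>replacement unit</p2> will be shipped "
            "<p3>at no additional cost</p4> once the issue is confirmed.")
segment, codes = to_single_segment(sentence)
print(segment)
print(codes)
```

The point is only that the whole cell yields one translatable unit with four inline codes, rather than one unit per text fragment between tags.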
I am using the following filter parameters:
#v1
maxAttributeSize.i=4194304
bPreferenceTranslateDocProperties.b=true
translatePowerpointDocProperties.b=true
reorderPowerpointDocProperties.b=false
reorderPowerpointRelationships.b=false
translatePowerpointDiagramData.b=true
reorderPowerpointDiagramData.b=false
translatePowerpointCharts.b=true
reorderPowerpointCharts.b=false
bPreferenceTranslatePowerpointNotes.b=true
bPreferenceReorderPowerpointNotes.b=false
bPreferenceTranslateComments.b=true
translatePowerpointComments.b=true
reorderPowerpointComments.b=false
bPreferenceTranslatePowerpointMasters.b=true
bPreferenceIgnorePlaceholdersInPowerpointMasters.b=false
bPreferenceTranslateWordHeadersFooters.b=true
translateWordNumberingLevelText.b=false
bPreferenceTranslateWordHidden.b=false
bPreferenceTranslateWordExcludeGraphicMetaData.b=false
translatePowerpointGraphicMetadata.b=false
bPreferenceTranslatePowerpointHidden.b=false
bPreferenceTranslateExcelHidden.b=false
bPreferenceTranslateExcelExcludeColors.b=false
bPreferenceTranslateExcelSheetNames.b=false
translateExcelCellsCopied.b=true
bPreferenceAddLineSeparatorAsCharacter.b=false
sPreferenceLineSeparatorReplacement=$0a$
bPreferenceReplaceNoBreakHyphenTag.b=false
bPreferenceIgnoreSoftHyphenTag.b=false
bPreferenceAddTabAsCharacter.b=false
bPreferenceAggressiveCleanup.b=false
ignoreWhitespaceStyles.b=false
bPreferenceAutomaticallyAcceptRevisions.b=true
bExtractExternalHyperlinks.b=false
bPreferencePowerpointIncludedSlideNumbersOnly.b=false
bPreferenceTranslateExcelDiagramData.b=false
bPreferenceTranslateExcelDrawings.b=false
bInExcludeMode.b=true
bInExcludeHighlightMode.b=false
bPreferenceTranslateWordExcludeColors.b=false
bReorderPowerpointNotesAndComments.b=false
ignoreWordFontColors.b=false
allowWordStyleOptimisation.b=true
preserveExcelStylesInTargetColumns.b=false
extractExcelSourceAndTargetColumnsJoined.b=true
bPreferenceAllowEmptyTargets.b=false
tsComplexFieldDefinitionsToExtract.i=1
cfd0=HYPERLINK
tsExcelExcludedColors.i=0
tsExcludeWordStyles.i=0
tsWordHighlightColors.i=0
tsWordExcludedColors.i=0
tsPowerpointIncludedSlideNumbers.i=0
worksheetConfigurations.0.namePattern=.*
worksheetConfigurations.0.sourceColumns=D
worksheetConfigurations.0.targetColumns=E
worksheetConfigurations.0.excludedRows=1
worksheetConfigurations.0.excludedColumns=A,B,C
worksheetConfigurations.number.i=1
subfilter=okf_html
and I am getting something like this in the extracted XLIFF:
<group id="P2442A051-sg2" resname="2">
<group id="P147242AB-tu1_ssf1" resname="sub-filter:task-2396718!E2">
<trans-unit id="P147242AB-tu1_sf1_tu1" resname="task-2396718!E2_1">
<source xml:lang="en">A</source>
<seg-source><mrk mid="0" mtype="seg">A</mrk></seg-source>
<target xml:lang="fr"><mrk mid="0" mtype="seg">A</mrk></target>
</trans-unit>
<trans-unit id="P147242AB-tu1_sf1_tu2" resname="task-2396718!E2_3">
<source xml:lang="en">complimentary reshipment</source>
<seg-source><mrk mid="0" mtype="seg">complimentary reshipment</mrk></seg-source>
<target xml:lang="fr"><mrk mid="0" mtype="seg">complimentary reshipment</mrk></target>
</trans-unit>
<trans-unit id="P147242AB-tu1_sf1_tu3" resname="task-2396718!E2_5">
<source xml:lang="en">can be arranged</source>
<seg-source><mrk mid="0" mtype="seg">can be arranged</mrk></seg-source>
<target xml:lang="fr"><mrk mid="0" mtype="seg">can be arranged</mrk></target>
</trans-unit>
<trans-unit id="P147242AB-tu1_sf1_tu4" resname="task-2396718!E2_7">
<source xml:lang="en">without the need for a return</source>
<seg-source><mrk mid="0" mtype="seg">without the need for a return</mrk></seg-source>
<target xml:lang="fr"><mrk mid="0" mtype="seg">without the need for a return</mrk></target>
</trans-unit>
<trans-unit id="P147242AB-tu1_sf1_tu5" resname="task-2396718!E2_9">
<source xml:lang="en">.</source>
<seg-source><mrk mid="0" mtype="seg">.</mrk></seg-source>
<target xml:lang="fr"><mrk mid="0" mtype="seg">.</mrk></target>
</trans-unit>
I am interested in approaches or best practices to ensure that such tags are handled as inline codes, so that the sentence is extracted as a single segment instead of being broken into multiple translation units during processing.
Thank you in advance for your help.