Am 31.03.21 um 17:23 schrieb Michael Lackhoff:
> Am 31.03.2021 um 15:20 schrieb Demian Katz:
>> Thanks for these helpful links; there’s also been some more discussion on the VuFind PR.
>>
>> From all of this, it seems that there are two different things which have changed, either of which might explain the change in behavior (and again, both of which could be tested experimentally if time permitted – but mine currently does not):
>>
>>
>> 1. The links below talk about XML; perhaps this is a quirk of the Solr XML output handler. VuFind has subsequently switched to use JSON instead of XML, so the XML quirks would no longer be relevant.
>> 2. SolrMarc used to default to writing to Solr in binary mode through the SolrJ API, but that support was subsequently removed in favor of writing via HTTP transactions. If the strange encoding was related to write mode, that could also be a factor.
>>
>> I think option 1 is perhaps more probable, but like I said, I haven’t had a chance to prove it. 😊
>
> I am quite sure it is option 1 and it would be very helpful if it would
> keep working that way.
Yes, I also seem to remember something with the XML Solr input or output
and MARC control characters.
I just tested this: Post a Solr JSON document with a ISO-MARC record
without replacing the control characters to a Solr collection in Solr
8.6.3, then retrieve it through the XML response writer. The ISOMARC
comes back with the control characters replaced by #29;, #30;, #31; in
the XMLish output (see below, relevant snippets of original XML and JSON
responses). While the JSON output has the original MARC control
characters. I have no idea, when Solr has learned this...
I'd still vote for keeping the replacement of the control characters at
least as an option in solrmarc for backwards compatibility (because it
seems, that is was once necessary, and there might be software setups
out there that expect/need it).
Till
Experiment with Solr 8.6.3, posted a MARC record without touching the
control characters and retrieved it through XML and JSON writers
respectively:
JSON writer result:
"fullrecord":"01734cam a22004932
4500001001000000003000700010005001700017007000300034008004100037015002100078016002200099020004800121035002200169035002500191035002000216040003100236041000800267041000800275044001000283084001300293100010100306240002300407245006400430250001200494264005000506300001700556336002600573337004600599338002500645655009800670689007600768689007500844689008600919689001101005700002001016700004601036912001501082912001401097912001201111951000701123980006201130984002501192985002301217\u001e124282148\u001eDE-627\u001e20190119145426.0\u001etu\u001e930526s1993
gw ||||| 00| ||ger c\u001e \u001fa93,A17,2161\u001f2dnb\u001e7
\u001fa930523938\u001f2DE-101\u001e \u001fa3458161937\u001fc: DM 38.00
(Pp.)\u001f93-458-16193-7\u001e \u001fa(DE-627)124282148\u001e
\u001fa(DE-599)GBV124282148\u001e \u001fa(OCoLC)75316021\u001e
\u001faDE-627\u001fbger\u001fcDE-627\u001ferakwb\u001e \u001fager\u001e
\u001fhheb\u001e \u001fcXA-DE\u001e \u001fa59\u001f2sdnb\u001e1
\u001faʿOz,
Amos\u001fd1939-2018\u001feverfasserin\u001f0(DE-588)118855379\u001f0(DE-627)079631088\u001f0(DE-576)165099275\u001f4aut\u001e10\u001faLada'at
ischa <dt>\u001e15\u001faEine Frau erkennen\u001fcAmos Oz. Aus dem Hebr.
von Ruth Achlama\u001e \u001fa2. Aufl\u001e 1\u001faFrankfurt am
Main\u001faLeipzig\u001fbInsel-Verl.\u001fc1993\u001e \u001fa318
S\u001fc21 cm\u001e \u001faText\u001fbtxt\u001f2rdacontent\u001e
\u001faohne Hilfsmittel zu benutzen\u001fbn\u001f2rdamedia\u001e
\u001faBand\u001fbnc\u001f2rdacarrier\u001e 7\u001faFiktionale
Darstellung\u001f0(DE-588)1071854844\u001f0(DE-627)82648378X\u001f0(DE-576)43337439X\u001f2gnd-content\u001e00\u001fDg\u001f0(DE-588)4027808-6\u001f0(DE-627)104767804\u001f0(DE-576)208972358\u001faIsrael\u001f2gnd\u001e01\u001fDs\u001f0(DE-588)4182341-2\u001f0(DE-627)105311219\u001f0(DE-576)210010959\u001faSpion\u001f2gnd\u001e02\u001fDs\u001f0(DE-588)4123184-3\u001f0(DE-627)104577681\u001f0(DE-576)209557559\u001faFamilienkonflikt\u001f2gnd\u001e0
\u001f5DE-101\u001e1 \u001f00590333232\u001f4oth\u001e12\u001faʿOz,
Amos\u001fd1939-2018\u001ftLada'at ischa <dt>\u001e
\u001faGBV_ILN_31\u001e \u001faSYSFLAG_1\u001e \u001faGBV_KXP\u001e
\u001faBO\u001e \u001f231\u001f101\u001fb220884811\u001ffMag\u001fd95 A
4813/1\u001feu\u001fx0027\u001fyn\u001fz14-07-00\u001e
\u001f231\u001f101\u001fa27$006869785\u001e
\u001f231\u001f101\u001faB93/R/4249\u001e\u001d",
XML writer result:
<str name="fullrecord">01734cam a22004932
4500001001000000003000700010005001700017007000300034008004100037015002100078016002200099020004800121035002200169035002500191035002000216040003100236041000800267041000800275044001000283084001300293100010100306240002300407245006400430250001200494264005000506300001700556336002600573337004600599338002500645655009800670689007600768689007500844689008600919689001101005700002001016700004601036912001501082912001401097912001201111951000701123980006201130984002501192985002301217#30;124282148#30;DE-627#30;20190119145426.0#30;tu#30;930526s1993
gw ||||| 00| ||ger c#30; #31;a93,A17,2161#31;2dnb#30;7
#31;a930523938#31;2DE-101#30; #31;a3458161937#31;c: DM 38.00
(Pp.)#31;93-458-16193-7#30; #31;a(DE-627)124282148#30;
#31;a(DE-599)GBV124282148#30; #31;a(OCoLC)75316021#30;
#31;aDE-627#31;bger#31;cDE-627#31;erakwb#30; #31;ager#30; #31;hheb#30;
#31;cXA-DE#30; #31;a59#31;2sdnb#30;1 #31;aʿOz,
Amos#31;d1939-2018#31;everfasserin#31;0(DE-588)118855379#31;0(DE-627)079631088#31;0(DE-576)165099275#31;4aut#30;10#31;aLada'at
ischa <dt>#30;15#31;aEine Frau erkennen#31;cAmos Oz. Aus dem Hebr. von
Ruth Achlama#30; #31;a2. Aufl#30; 1#31;aFrankfurt am
Main#31;aLeipzig#31;bInsel-Verl.#31;c1993#30; #31;a318 S#31;c21 cm#30;
#31;aText#31;btxt#31;2rdacontent#30; #31;aohne Hilfsmittel zu
benutzen#31;bn#31;2rdamedia#30; #31;aBand#31;bnc#31;2rdacarrier#30;
7#31;aFiktionale
Darstellung#31;0(DE-588)1071854844#31;0(DE-627)82648378X#31;0(DE-576)43337439X#31;2gnd-content#30;00#31;Dg#31;0(DE-588)4027808-6#31;0(DE-627)104767804#31;0(DE-576)208972358#31;aIsrael#31;2gnd#30;01#31;Ds#31;0(DE-588)4182341-2#31;0(DE-627)105311219#31;0(DE-576)210010959#31;aSpion#31;2gnd#30;02#31;Ds#31;0(DE-588)4123184-3#31;0(DE-627)104577681#31;0(DE-576)209557559#31;aFamilienkonflikt#31;2gnd#30;0
#31;5DE-101#30;1 #31;00590333232#31;4oth#30;12#31;aʿOz,
Amos#31;d1939-2018#31;tLada'at ischa <dt>#30; #31;aGBV_ILN_31#30;
#31;aSYSFLAG_1#30; #31;aGBV_KXP#30; #31;aBO#30;
#31;231#31;101#31;b220884811#31;fMag#31;d95 A
4813/1#31;eu#31;x0027#31;yn#31;z14-07-00#30;
#31;231#31;101#31;a27$006869785#30;
#31;231#31;101#31;aB93/R/4249#30;#29;</str>
--
Till Kinstler
Verbundzentrale des Gemeinsamen Bibliotheksverbundes (VZG)
Platz der Göttinger Sieben 1, D 37073 Göttingen
kins...@gbv.de,
+49 (0) 551 39-31414,
http://www.gbv.de/