Account Options

  1. Sign in
The old Google Groups will be going away soon, but your browser is incompatible with the new version.
Google Groups Home
« Groups Home
More on SolrMarc and uppercase subfields
There are currently too many topics in this group that display first. To make this topic appear first, remove this option from another topic.
There was an error processing your request. Please try again.
flag
  6 messages - Collapse all  -  Translate all to Translated (View all originals)
The group you are posting to is a Usenet group. Messages posted to this group will make your email address visible to anyone on the Internet.
Your reply message has not been sent.
Your post was successful
 
From:
To:
Cc:
Followup To:
Add Cc | Add Followup-to | Edit Subject
Subject:
Validation:
For verification purposes please type the characters you see in the picture below or the numbers you hear by clicking the accessibility icon. Listen and type the numbers you hear
 
Demian Katz  
View profile  
 More options Sep 28 2012, 7:56 am
From: Demian Katz <demian.k...@villanova.edu>
Date: Fri, 28 Sep 2012 11:56:38 +0000
Local: Fri, Sep 28 2012 7:56 am
Subject: More on SolrMarc and uppercase subfields

Another message I'm copying to solrmarc-tech...

This definitely sounds like a case of SolrMarc making assumptions that it really shouldn't; I don't think it's SolrMarc's job to validate the subfields, especially given that the MARC standard is constantly evolving.

If I had to guess, though, I'd speculate that this is actually not a problem specific to SolrMarc, but rather something related to the underlying Marc4j library.  Can somebody with more Marc4j experience confirm or deny that?

Assuming this is related to Marc4j, is there a way to loosen the restrictions, perhaps through a new SolrMarc configuration option?

- Demian
________________________________
From: Hannah Ullrich [hannah.ullr...@ub.uni-freiburg.de]
Sent: Friday, September 28, 2012 7:46 AM
To: vufind-t...@lists.sourceforge.net
Subject: Re: [VuFind-Tech] SolrMarc 2.4 upgrade

Hallo Stefan,

we have the same problem with the 689 fields.

I get the following info in our marc_error field:

<arr name="marc_error">
<str>Minor Error  : Subfield tag is an invalid uppercase character, changing it to lower case. --- [ 689 : D ]</str>
<str>Major Error  : Subfield tag is an invalid character, using first character of field as subfield tag. --- [ 689 : A ]</str>
</arr>

using vufind 1.3

Hannah

Am 28.09.2012 12:26, schrieb Winkler, Stefan:
Hi Demian,

I didn't try your upgrade in the 1.3 trunk but used the solrmarc 2.4 vufind binary directly from http://code.google.com/p/solrmarc/downloads/list

After indexing I seems like the case sensitivity while reading the subfield-tags was lost.

MarcEdit Marc21:
=689  01$Af$2gnd$aKongress

1. Vufind FullRecord (v2.3.1):
689 01 |A f  |2 gnd  |a Kongress

2. Vufind FullRecord (v2.4):
689 01 |a f  |2 gnd  |a Kongress

marc.properties properties parses this Field using "topic_facet = 689a"

The result is, that  in the second case I get the $A subfield with the value "f" als topic_facet.

Can anybody confirm this for his/her index? I suppose uppercase subfields like $A or $D   is used mainly in  germany.

Best wishes
Stefan

--
Stefan Winkler
Bibliotheksservice-Zentrum Baden-Württemberg (BSZ)
78457 Konstanz / Germany
Phone: +49 7531 88 2364
E-Mail: stefan.wink...@bsz-bw.de<mailto:stefan.wink...@bsz-bw.de>
http://www.bsz-bw.de

Von: Demian Katz [mailto:demian.k...@villanova.edu]
Gesendet: Mittwoch, 26. September 2012 19:11
An: vufind-t...@lists.sourceforge.net<mailto:vufind-t...@lists.sourceforge.net>
Betreff: [VuFind-Tech] SolrMarc 2.4 upgrade

I have just upgraded both the VuFind 1.x trunk and the VuFind 2.x master branch to use SolrMarc 2.4, the latest release.  (VuFind 1.x was previously on 2.3.1, while 2.x was previously using a custom-built version very close to 2.4).  Hopefully this will make Monday's 2.0beta release just a little bit easier to manage thanks to the inclusion of a known SolrMarc version.  It also allows me to close a couple of JIRA tickets related to fixes/improvements in 2.4.

I don't anticipate any problems related to this upgrade, but please let me know if you run into any indexing troubles using the latest-and-greatest code.

thanks,
Demian

--------------------------------------------------------------------------- ---
Got visibility?
Most devs has no idea what their production app looks like.
Find out how fast your code is with AppDynamics Lite.
http://ad.doubleclick.net/clk;262219671;13503038;y?
http://info.appdynamics.com/FreeJavaPerformanceDownload.html

_______________________________________________
Vufind-tech mailing list
Vufind-t...@lists.sourceforge.net<mailto:Vufind-t...@lists.sourceforge.net>
https://lists.sourceforge.net/lists/listinfo/vufind-tech

--
Hannah Ullrich
Fachinformatikerin

Universitaetsbibliothek Freiburg
EDV Dezernat
Rempartstr. 10-16
79098 Freiburg
Tel: +49-761 / 203-3877


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Simon Spero  
View profile  
 More options Sep 29 2012, 11:47 am
From: Simon Spero <sesunc...@gmail.com>
Date: Sat, 29 Sep 2012 11:47:45 -0400
Local: Sat, Sep 29 2012 11:47 am
Subject: Re: [solrmarc-tech] More on SolrMarc and uppercase subfields

Upper case letters are not allowed in MARC21 records (though they *are*
allowed in generic marc.

I thought that the DNB had completed the conversion to MARC21?

Simon

On Fri, Sep 28, 2012 at 7:56 AM, Demian Katz <demian.k...@villanova.edu>wrote:


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Demian Katz  
View profile  
 More options Oct 1 2012, 9:28 am
From: Demian Katz <demian.k...@villanova.edu>
Date: Mon, 1 Oct 2012 13:28:14 +0000
Local: Mon, Oct 1 2012 9:28 am
Subject: RE: [solrmarc-tech] More on SolrMarc and uppercase subfields

I am copying this message back to Hannah and Stefan in case they did not see your reply on the solrmarc-tech list.

Is SolrMarc meant to enforce the rules of MARC21?  If so, should there be an option to relax them?  It seems that there is value in being able to work with non-standard records as long as they are structurally correct.

- Demian

From: solrmarc-tech@googlegroups.com [mailto:solrmarc-tech@googlegroups.com] On Behalf Of Simon Spero
Sent: Saturday, September 29, 2012 11:48 AM
To: solrmarc-tech@googlegroups.com
Subject: Re: [solrmarc-tech] More on SolrMarc and uppercase subfields

Upper case letters are not allowed in MARC21 records (though they *are* allowed in generic marc.

I thought that the DNB had completed the conversion to MARC21?

Simon
On Fri, Sep 28, 2012 at 7:56 AM, Demian Katz <demian.k...@villanova.edu<mailto:demian.k...@villanova.edu>> wrote:
Another message I'm copying to solrmarc-tech...

This definitely sounds like a case of SolrMarc making assumptions that it really shouldn't; I don't think it's SolrMarc's job to validate the subfields, especially given that the MARC standard is constantly evolving.

If I had to guess, though, I'd speculate that this is actually not a problem specific to SolrMarc, but rather something related to the underlying Marc4j library.  Can somebody with more Marc4j experience confirm or deny that?

Assuming this is related to Marc4j, is there a way to loosen the restrictions, perhaps through a new SolrMarc configuration option?

- Demian
________________________________
From: Hannah Ullrich [hannah.ullr...@ub.uni-freiburg.de<mailto:hannah.ullr...@ub.uni-freiburg.de>]
Sent: Friday, September 28, 2012 7:46 AM
To: vufind-t...@lists.sourceforge.net<mailto:vufind-t...@lists.sourceforge.net>
Subject: Re: [VuFind-Tech] SolrMarc 2.4 upgrade
Hallo Stefan,

we have the same problem with the 689 fields.

I get the following info in our marc_error field:

<arr name="marc_error">
<str>Minor Error  : Subfield tag is an invalid uppercase character, changing it to lower case. --- [ 689 : D ]</str>
<str>Major Error  : Subfield tag is an invalid character, using first character of field as subfield tag. --- [ 689 : A ]</str>
</arr>

using vufind 1.3

Hannah

Am 28.09.2012 12:26, schrieb Winkler, Stefan:
Hi Demian,

I didn't try your upgrade in the 1.3 trunk but used the solrmarc 2.4 vufind binary directly from http://code.google.com/p/solrmarc/downloads/list

After indexing I seems like the case sensitivity while reading the subfield-tags was lost.

MarcEdit Marc21:
=689  01$Af$2gnd$aKongress

1. Vufind FullRecord (v2.3.1):
689 01 |A f  |2 gnd  |a Kongress

2. Vufind FullRecord (v2.4):
689 01 |a f  |2 gnd  |a Kongress

marc.properties properties parses this Field using "topic_facet = 689a"

The result is, that  in the second case I get the $A subfield with the value "f" als topic_facet.

Can anybody confirm this for his/her index? I suppose uppercase subfields like $A or $D   is used mainly in  germany.

Best wishes
Stefan

--
Stefan Winkler
Bibliotheksservice-Zentrum Baden-Württemberg (BSZ)
78457 Konstanz / Germany
Phone: +49 7531 88 2364<tel:%2B49%207531%2088%202364>
E-Mail: stefan.wink...@bsz-bw.de<mailto:stefan.wink...@bsz-bw.de>
http://www.bsz-bw.de

Von: Demian Katz [mailto:demian.k...@villanova.edu]
Gesendet: Mittwoch, 26. September 2012 19:11
An: vufind-t...@lists.sourceforge.net<mailto:vufind-t...@lists.sourceforge.net>
Betreff: [VuFind-Tech] SolrMarc 2.4 upgrade

I have just upgraded both the VuFind 1.x trunk and the VuFind 2.x master branch to use SolrMarc 2.4, the latest release.  (VuFind 1.x was previously on 2.3.1, while 2.x was previously using a custom-built version very close to 2.4).  Hopefully this will make Monday's 2.0beta release just a little bit easier to manage thanks to the inclusion of a known SolrMarc version.  It also allows me to close a couple of JIRA tickets related to fixes/improvements in 2.4.

I don't anticipate any problems related to this upgrade, but please let me know if you run into any indexing troubles using the latest-and-greatest code.

thanks,
Demian

--------------------------------------------------------------------------- ---

Got visibility?

Most devs has no idea what their production app looks like.

Find out how fast your code is with AppDynamics Lite.

http://ad.doubleclick.net/clk;262219671;13503038;y?

http://info.appdynamics.com/FreeJavaPerformanceDownload.html

_______________________________________________

Vufind-tech mailing list

Vufind-t...@lists.sourceforge.net<mailto:Vufind-t...@lists.sourceforge.net>

https://lists.sourceforge.net/lists/listinfo/vufind-tech

--

Hannah Ullrich

Fachinformatikerin

Universitaetsbibliothek Freiburg

EDV Dezernat

Rempartstr. 10-16

79098 Freiburg

Tel: +49-761 / 203-3877<tel:%2B49-761%20%2F%20203-3877>
--
You received this message because you are subscribed to the Google Groups "solrmarc-tech" group.
To post to this group, send email to solrmarc-tech@googlegroups.com<mailto:solrmarc-tech@googlegroups.com>.
To unsubscribe from this group, send email to solrmarc-tech+unsubscribe@googlegroups.com<mailto:solrmarc-tech%2Bunsubscri be@googlegroups.com>.
For more options, visit this group at http://groups.google.com/group/solrmarc-tech?hl=en.

--
You received this message because you are subscribed to the Google Groups "solrmarc-tech" group.
To post to this group, send email to solrmarc-tech@googlegroups.com.
To unsubscribe from this group, send email to solrmarc-tech+unsubscribe@googlegroups.com.
For more options, visit this group at http://groups.google.com/group/solrmarc-tech?hl=en.


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Jonathan Rochkind  
View profile  
 More options Oct 1 2012, 10:42 am
From: Jonathan Rochkind <rochk...@jhu.edu>
Date: Mon, 01 Oct 2012 10:42:42 -0400
Local: Mon, Oct 1 2012 10:42 am
Subject: Re: [solrmarc-tech] More on SolrMarc and uppercase subfields
Marc4J has always been very Marc agnostic -- it doesn't even assume
Marc21, but works fine with UNIMARC and other European MARC variants.
Meaning it makes absolutely no assumptions about field/subfield
semantics, or what subfields or legal, or anything like that.

With one important principled exception: Marc4J _does_ follow the actual
MARC specification (not Marc21, the generic MARC spec they are all based
on, I forget the number), to the letter.  If the MARC spec says that
subfields are not case sensitive, then it would not surprise me to see
Marc4J implementing that as written in the spec.

On 9/28/2012 7:56 AM, Demian Katz wrote:


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Robert Haschart  
View profile  
 More options Oct 1 2012, 3:39 pm
From: Robert Haschart <rh...@virginia.edu>
Date: Mon, 01 Oct 2012 15:39:30 -0400
Local: Mon, Oct 1 2012 3:39 pm
Subject: Re: [solrmarc-tech] More on SolrMarc and uppercase subfields

It is something I added.  I had come across several records in our
collection that had erroneous upper case subfield tags, and then found
documentation that seemed to state unequivocally that subfield tags must
be numeric 0-9 or lowercase alphabetic a-z.   I thought that this
restriction was true for MARC in general, rather than a specific
implementation of MARC.

I'm pretty sure that this change is in the MarcPermissiveStreamReader
portion of Marc4j.   Since it appears to be causing a problem I think
that it should be able to enabled/disabled, perhaps via a property.    
I'll try to put together a minor release either of marc4j or solrmarc
with this fix as soon as possible.  My current thinking is that is
should be disabled by default, and be able to be enabled with a
property.  Perhaps this is true for some other pieces of the error
detecting/correcting code in the Permissive reader.

-Bob Haschart

On 10/1/2012 9:28 AM, Demian Katz wrote:


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Demian Katz  
View profile  
 More options Oct 1 2012, 3:59 pm
From: Demian Katz <demian.k...@villanova.edu>
Date: Mon, 1 Oct 2012 19:59:38 +0000
Local: Mon, Oct 1 2012 3:59 pm
Subject: RE: [solrmarc-tech] More on SolrMarc and uppercase subfields

That makes sense to me.  Thanks for the update!

From: solrmarc-tech@googlegroups.com [mailto:solrmarc-tech@googlegroups.com] On Behalf Of Robert Haschart
Sent: Monday, October 01, 2012 3:40 PM
To: solrmarc-tech@googlegroups.com
Subject: Re: [solrmarc-tech] More on SolrMarc and uppercase subfields

It is something I added.  I had come across several records in our collection that had erroneous upper case subfield tags, and then found documentation that seemed to state unequivocally that subfield tags must be numeric 0-9 or lowercase alphabetic a-z.   I thought that this restriction was true for MARC in general, rather than a specific implementation of MARC.

I'm pretty sure that this change is in the MarcPermissiveStreamReader portion of Marc4j.   Since it appears to be causing a problem I think that it should be able to enabled/disabled, perhaps via a property.    I'll try to put together a minor release either of marc4j or solrmarc with this fix as soon as possible.  My current thinking is that is should be disabled by default, and be able to be enabled with a property.  Perhaps this is true for some other pieces of the error detecting/correcting code in the Permissive reader.

-Bob Haschart

On 10/1/2012 9:28 AM, Demian Katz wrote:
I am copying this message back to Hannah and Stefan in case they did not see your reply on the solrmarc-tech list.

Is SolrMarc meant to enforce the rules of MARC21?  If so, should there be an option to relax them?  It seems that there is value in being able to work with non-standard records as long as they are structurally correct.

- Demian

From: solrmarc-tech@googlegroups.com<mailto:solrmarc-tech@googlegroups.com> [mailto:solrmarc-tech@googlegroups.com] On Behalf Of Simon Spero
Sent: Saturday, September 29, 2012 11:48 AM
To: solrmarc-tech@googlegroups.com<mailto:solrmarc-tech@googlegroups.com>
Subject: Re: [solrmarc-tech] More on SolrMarc and uppercase subfields

Upper case letters are not allowed in MARC21 records (though they *are* allowed in generic marc.

I thought that the DNB had completed the conversion to MARC21?

Simon
On Fri, Sep 28, 2012 at 7:56 AM, Demian Katz <demian.k...@villanova.edu<mailto:demian.k...@villanova.edu>> wrote:
Another message I'm copying to solrmarc-tech...

This definitely sounds like a case of SolrMarc making assumptions that it really shouldn't; I don't think it's SolrMarc's job to validate the subfields, especially given that the MARC standard is constantly evolving.

If I had to guess, though, I'd speculate that this is actually not a problem specific to SolrMarc, but rather something related to the underlying Marc4j library.  Can somebody with more Marc4j experience confirm or deny that?

Assuming this is related to Marc4j, is there a way to loosen the restrictions, perhaps through a new SolrMarc configuration option?

- Demian
________________________________
From: Hannah Ullrich [hannah.ullr...@ub.uni-freiburg.de<mailto:hannah.ullr...@ub.uni-freiburg.de>]
Sent: Friday, September 28, 2012 7:46 AM
To: vufind-t...@lists.sourceforge.net<mailto:vufind-t...@lists.sourceforge.net>
Subject: Re: [VuFind-Tech] SolrMarc 2.4 upgrade
Hallo Stefan,

we have the same problem with the 689 fields.

I get the following info in our marc_error field:

<arr name="marc_error">
<str>Minor Error  : Subfield tag is an invalid uppercase character, changing it to lower case. --- [ 689 : D ]</str>
<str>Major Error  : Subfield tag is an invalid character, using first character of field as subfield tag. --- [ 689 : A ]</str>
</arr>

using vufind 1.3

Hannah

Am 28.09.2012 12:26, schrieb Winkler, Stefan:
Hi Demian,

I didn't try your upgrade in the 1.3 trunk but used the solrmarc 2.4 vufind binary directly from http://code.google.com/p/solrmarc/downloads/list

After indexing I seems like the case sensitivity while reading the subfield-tags was lost.

MarcEdit Marc21:
=689  01$Af$2gnd$aKongress

1. Vufind FullRecord (v2.3.1):
689 01 |A f  |2 gnd  |a Kongress

2. Vufind FullRecord (v2.4):
689 01 |a f  |2 gnd  |a Kongress

marc.properties properties parses this Field using "topic_facet = 689a"

The result is, that  in the second case I get the $A subfield with the value "f" als topic_facet.

Can anybody confirm this for his/her index? I suppose uppercase subfields like $A or $D   is used mainly in  germany.

Best wishes
Stefan

--
Stefan Winkler
Bibliotheksservice-Zentrum Baden-Württemberg (BSZ)
78457 Konstanz / Germany
Phone: +49 7531 88 2364<tel:%2B49%207531%2088%202364>
E-Mail: stefan.wink...@bsz-bw.de<mailto:stefan.wink...@bsz-bw.de>
http://www.bsz-bw.de

Von: Demian Katz [mailto:demian.k...@villanova.edu]
Gesendet: Mittwoch, 26. September 2012 19:11
An: vufind-t...@lists.sourceforge.net<mailto:vufind-t...@lists.sourceforge.net>
Betreff: [VuFind-Tech] SolrMarc 2.4 upgrade

I have just upgraded both the VuFind 1.x trunk and the VuFind 2.x master branch to use SolrMarc 2.4, the latest release.  (VuFind 1.x was previously on 2.3.1, while 2.x was previously using a custom-built version very close to 2.4).  Hopefully this will make Monday's 2.0beta release just a little bit easier to manage thanks to the inclusion of a known SolrMarc version.  It also allows me to close a couple of JIRA tickets related to fixes/improvements in 2.4.

I don't anticipate any problems related to this upgrade, but please let me know if you run into any indexing troubles using the latest-and-greatest code.

thanks,
Demian

--------------------------------------------------------------------------- ---

Got visibility?

Most devs has no idea what their production app looks like.

Find out how fast your code is with AppDynamics Lite.

http://ad.doubleclick.net/clk;262219671;13503038;y?

http://info.appdynamics.com/FreeJavaPerformanceDownload.html

_______________________________________________

Vufind-tech mailing list

Vufind-t...@lists.sourceforge.net<mailto:Vufind-t...@lists.sourceforge.net>

https://lists.sourceforge.net/lists/listinfo/vufind-tech

--

Hannah Ullrich

Fachinformatikerin

Universitaetsbibliothek Freiburg

EDV Dezernat

Rempartstr. 10-16

79098 Freiburg

Tel: +49-761 / 203-3877<tel:%2B49-761%20%2F%20203-3877>
--
You received this message because you are subscribed to the Google Groups "solrmarc-tech" group.
To post to this group, send email to solrmarc-tech@googlegroups.com<mailto:solrmarc-tech@googlegroups.com>.
To unsubscribe from this group, send email to solrmarc-tech+unsubscribe@googlegroups.com<mailto:solrmarc-tech%2Bunsubscri be@googlegroups.com>.
For more options, visit this group at http://groups.google.com/group/solrmarc-tech?hl=en.

--
You received this message because you are subscribed to the Google Groups "solrmarc-tech" group.
To post to this group, send email to solrmarc-tech@googlegroups.com<mailto:solrmarc-tech@googlegroups.com>.
To unsubscribe from this group, send email to solrmarc-tech+unsubscribe@googlegroups.com<mailto:solrmarc-tech+unsubscribe @googlegroups.com>.
For more options, visit this group at http://groups.google.com/group/solrmarc-tech?hl=en.
--
You received this message because you are subscribed to the Google Groups "solrmarc-tech" group.
To post to this group, send email to solrmarc-tech@googlegroups.com<mailto:solrmarc-tech@googlegroups.com>.
To unsubscribe from this group, send email to solrmarc-tech+unsubscribe@googlegroups.com<mailto:solrmarc-tech+unsubscribe @googlegroups.com>.
For more options, visit this group at http://groups.google.com/group/solrmarc-tech?hl=en.

--
You received this message because you are subscribed to the Google Groups "solrmarc-tech" group.
To post to this group, send email to solrmarc-tech@googlegroups.com.
To unsubscribe from this group, send email to solrmarc-tech+unsubscribe@googlegroups.com.
For more options, visit this group at http://groups.google.com/group/solrmarc-tech?hl=en.


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
End of messages
« Back to Discussions « Newer topic     Older topic »