Slower processing of institutional holdings file

14 views
Skip to first unread message

Tim Lehmann

unread,
Mar 7, 2012, 2:16:22 PM3/7/12
to xerxes...@googlegroups.com
We applied SFX v4 KB updates 20120400 and 20120500 on 1/31/2012 (we were at SFX 4.2.0.1 at that time, which we had applied on 1/10/2012).

I noticed today that after 1/31/2012 for Xerxes instances using 1.8.2, the processing of the institutional holdings file takes five to 30 minutes (depending on institutional holdings file size) and uses noticeable CPU.  Previously all instances completed in under 90 seconds with insignificant CPU usage.  Xerxes instances still using 1.7.2 still process quickly.  The additional processing time occurs in the "Processing file" step of the "php -f ${ContentDir}/index.php action=populate base=availability" command.

Walker, David

unread,
Mar 7, 2012, 2:24:31 PM3/7/12
to xerxes...@googlegroups.com

Yeah, I’m noticing that too.

 

Tim, do you know if there has been a recent change to the Google scholar export in SFX?

 

I’ll ask our SFX admin here that same question.

 

--Dave

 

 

-----------------

David Walker

Library Web Services Manager

California State University

--
You received this message because you are subscribed to the Google Groups "xerxes-portal" group.
To post to this group, send email to xerxes...@googlegroups.com.
To unsubscribe from this group, send email to xerxes-porta...@googlegroups.com.
For more options, visit this group at http://groups.google.com/group/xerxes-portal?hl=en.

Walker, David

unread,
Mar 7, 2012, 2:36:09 PM3/7/12
to xerxes...@googlegroups.com

Okay, just got this from our admin, from the SFX release notes.

 

“The requirements for the Google Scholar holdings files changed recently, and as a result in service pack 4.1.4 there were changes to the Google Scholar export process in SFX. By default now a single XML file, split XML files, a single gzip file, and split gzip files will be created. This is to accommodate a policy on Google's end where the institutional holdings file must either be less than 1MB or split into parts that are smaller than 1MB.

 

“These 10 "prt gzip" type files (or xml versions as a fall back) you're seeing will be gathered by Google Scholar and used the way the single .xml or .gzip file was previously. They should also be overwritten each time the export process runs.”

 

I’ll have to investigate this a little more closely.

 

--Dave

 

-----------------

David Walker

Library Web Services Manager

California State University

 

From: Walker, David
Sent: Wednesday, March 07, 2012 11:25 AM
To: xerxes...@googlegroups.com
Subject: RE: [xerxes-portal] Slower processing of institutional holdings file

 

Yeah, I’m noticing that too.

 

Tim, do you know if there has been a recent change to the Google scholar export in SFX?

 

I’ll ask our SFX admin here that same question.

 

--Dave

 

 

-----------------

David Walker

Library Web Services Manager

California State University

 

From: xerxes...@googlegroups.com [mailto:xerxes...@googlegroups.com] On Behalf Of Tim Lehmann
Sent: Wednesday, March 07, 2012 11:16 AM
To: xerxes...@googlegroups.com
Subject: [xerxes-portal] Slower processing of institutional holdings file

 

We applied SFX v4 KB updates 20120400 and 20120500 on 1/31/2012 (we were at SFX 4.2.0.1 at that time, which we had applied on 1/10/2012).



I noticed today that after 1/31/2012 for Xerxes instances using 1.8.2, the processing of the institutional holdings file takes five to 30 minutes (depending on institutional holdings file size) and uses noticeable CPU.  Previously all instances completed in under 90 seconds with insignificant CPU usage.  Xerxes instances still using 1.7.2 still process quickly.  The additional processing time occurs in the "Processing file" step of the "php -f ${ContentDir}/index.php action=populate base=availability" command.

--

Walker, David

unread,
Apr 4, 2012, 10:58:36 AM4/4/12
to xerxes...@googlegroups.com

Hi all,

 

Sorry for the delay on this.  I just haven’t had any time to look at it.

 

Just so you all know, my boss recently (and very unexpectedly) passed away.  I’ve assumed much of his duties and projects, in addition to my own, and so have been absolutely swamped lately.

 

I’m hoping to clear some time to work on Xerxes here soon, and I’ll address this issue at that time.

 

--Dave

 

-----------------

David Walker

Interim Director of Systemwide Digital Library Services

California State University

Walker, David

unread,
May 17, 2012, 10:30:36 AM5/17/12
to xerxes...@googlegroups.com

This should now be fixed in trunk, for those who want to try it out.

 

  http://xerxes-portal.googlecode.com/svn/trunk/commands/availability/PopulateFulltext.php

 

--Dave

 

-------------------------

David Walker

Interim Director, Systemwide Digital Library Services

California State University

562-355-4845

Reply all
Reply to author
Forward
0 new messages