Issue in JGit 3.5.1.201410131835-r (used in 2.10-rc1)

Lundh, Gustaf

Dec 17, 2014, 5:28:16 AM

to repo-discuss, Pursehouse, David (Sony Mobile), Selberg, Sven, Saša Živkov

We patched Gerrit 2.8.6.1 with the JGit version that can be found in v2.10-rc1 (3.5.1.201410131835-r), since we wanted a few fixes that were introduced since 3.2.x.

Sadly, we had to roll back our production system quite quickly due to a pretty bad JGit issue in this version (at least, that is our assumption).

Symptoms:

In certain circumstances Gerrit cannot load refs/meta/config, hence the projects become invisible since Gerrit cannot calculate the ACL for that Git.

The logs are _full_ of these:

[2014-12-15 05:35:14,276] WARN com.google.gerrit.server.project.ProjectCacheImpl : Cannot read project platform/git/project
java.util.concurrent.ExecutionException: org.eclipse.jgit.errors.MissingObjectException: Missing unknown 40bcb7aa806d2790bbf0f7d2ee6a42cd9ee2f03a
at com.google.common.util.concurrent.AbstractFuture$Sync.getValue(AbstractFuture.java:299)
at com.google.common.util.concurrent.AbstractFuture$Sync.get(AbstractFuture.java:286)
at com.google.common.util.concurrent.AbstractFuture.get(AbstractFuture.java:116)
at com.google.common.util.concurrent.Uninterruptibles.getUninterruptibly(Uninterruptibles.java:135)
at com.google.common.cache.LocalCache$Segment.getAndRecordStats(LocalCache.java:2344)
at com.google.common.cache.LocalCache$Segment.loadSync(LocalCache.java:2316)
at com.google.common.cache.LocalCache$Segment.lockedGetOrLoad(LocalCache.java:2278)
at com.google.common.cache.LocalCache$Segment.get(LocalCache.java:2193)
at com.google.common.cache.LocalCache.get(LocalCache.java:3932)
at com.google.common.cache.LocalCache.getOrLoad(LocalCache.java:3936)
at com.google.common.cache.LocalCache$LocalLoadingCache.get(LocalCache.java:4806)
at com.google.gerrit.server.project.ProjectCacheImpl.checkedGet(ProjectCacheImpl.java:122)
at com.google.gerrit.server.project.ProjectControl$GenericFactory.controlFor(ProjectControl.java:80)
at com.google.gerrit.server.project.ChangeControl$GenericFactory.controlFor(ChangeControl.java:74)
at com.google.gerrit.server.query.change.IsVisibleToPredicate.match(IsVisibleToPredicate.java:62)
at com.google.gerrit.server.query.change.IsVisibleToPredicate.match(IsVisibleToPredicate.java:27)
at com.google.gerrit.server.query.AndPredicate.match(AndPredicate.java:75)
at com.google.gerrit.server.query.change.AndSource.readImpl(AndSource.java:130)
at com.google.gerrit.server.query.change.AndSource.read(AndSource.java:94)
at com.google.gerrit.server.query.change.QueryProcessor.queryChanges(QueryProcessor.java:261)
at com.google.gerrit.server.query.change.QueryChanges.query0(QueryChanges.java:153)
at com.google.gerrit.server.query.change.QueryChanges.query(QueryChanges.java:141)
at com.google.gerrit.server.query.change.QueryChanges.apply(QueryChanges.java:108)
at com.google.gerrit.server.query.change.QueryChanges.apply(QueryChanges.java:41)
at com.google.gerrit.httpd.restapi.RestApiServlet.service(RestApiServlet.java:306)
at javax.servlet.http.HttpServlet.service(HttpServlet.java:722)
at com.google.inject.servlet.ServletDefinition.doServiceImpl(ServletDefinition.java:278)
at com.google.inject.servlet.ServletDefinition.doService(ServletDefinition.java:268)
at com.google.inject.servlet.ServletDefinition.service(ServletDefinition.java:180)
at com.google.inject.servlet.ManagedServletPipeline.service(ManagedServletPipeline.java:93)
at com.google.inject.servlet.FilterChainInvocation.doFilter(FilterChainInvocation.java:85)
[Removed some of the stack trace here]
Caused by: org.eclipse.jgit.errors.MissingObjectException: Missing unknown 40bcb7aa806d2790bbf0f7d2ee6a42cd9ee2f03a
at org.eclipse.jgit.internal.storage.file.WindowCursor.open(WindowCursor.java:148)
at org.eclipse.jgit.lib.ObjectReader.open(ObjectReader.java:229)
at org.eclipse.jgit.revwalk.RevWalk.parseAny(RevWalk.java:839)
at org.eclipse.jgit.revwalk.RevWalk.parseCommit(RevWalk.java:752)
at com.google.gerrit.server.git.VersionedMetaData.load(VersionedMetaData.java:122)
at com.google.gerrit.server.git.VersionedMetaData.load(VersionedMetaData.java:98)
at com.google.gerrit.server.project.ProjectCacheImpl$Loader.load(ProjectCacheImpl.java:274)
at com.google.gerrit.server.project.ProjectCacheImpl$Loader.load(ProjectCacheImpl.java:258)
at com.google.common.cache.LocalCache$LoadingValueReference.loadFuture(LocalCache.java:3522)
at com.google.common.cache.LocalCache$Segment.loadSync(LocalCache.java:2315)
... 56 more

It is always a commit in refs/meta/config that is missing (but not necessarily the tip). The issue does not seem to appear on refs/heads/*.

When the first "Missing unknown" happens in a Git, it will continue to happen every time you access the Git. No clearing of caches helps.

As time passes, more Gits get affected. After just two days we had 15 inaccessible Gits.

Since we cannot recover from the issue, we think an IOException happens in
openPackedObject(WindowCursor curs, AnyObjectId objectId), and the packFile is removed from future access.

Interestingly enough, this happened on a Git with just an empty initial commit, and it had not been touched for months. So it does _not_ seem to be a race condition where we are trying to read a pack file while writing to the git directory.

I cannot tie the initial MissingObjectException to a certain user interaction; it just suddenly happens when Gerrit is trying to read objects from refs/meta/config. I cannot reproduce it in our staging environment, which makes the issue very hard to debug. So I'm asking for help. This issue makes me very hesitant to update to 2.10.

Rolling back to 3.2.0.201312181205-r solved all issues.

/Gustaf Lundh

Bassem Rabil

Dec 17, 2014, 8:01:57 AM

to repo-d...@googlegroups.com, David.Pu...@sonymobile.com, Sven.S...@sonymobile.com, ziv...@gmail.com

We experienced similar behavior last month using a custom JGit based on JGit 3.4 with Gerrit 2.9.1. However, this coincided with a storage migration, and we assumed that the migration was the root cause.

To fix this, we restored refs/meta/config from slaves and since then things are stable.

Regards

Bassem

Saša Živkov

Dec 17, 2014, 8:10:44 AM

to Lundh, Gustaf, repo-discuss, Pursehouse, David (Sony Mobile), Selberg, Sven

Does a restart of Gerrit help?

> As time passes, more Gits get affected. After just two days we had 15 inaccessible Gits.
>
> Since we cannot recover from the issue, we think an IOException happens in
> openPackedObject(WindowCursor curs, AnyObjectId objectId), and the packFile is removed from future access.
>
> Interestingly enough, this happened on a Git with just an empty initial commit, and it had not been touched for months.

If a restart doesn't solve the issue then we would like to have that empty repository from your system and debug JGit on it.

Gustaf Lundh

Dec 17, 2014, 8:55:44 AM

to repo-d...@googlegroups.com, Gustaf...@sonymobile.com, David.Pu...@sonymobile.com, Sven.S...@sonymobile.com

> Does a restart of Gerrit help?

A restart temporarily removes the issue until it starts happening again on more and more gits (consistent with pack files marked as bad).

We also saw the issue on this release:

version 3.4.1.201406201815-r.112-g94c4d7e

Also only in production. We cannot get it to show up in our staging environment; it seems to be related to load or a race condition.

Best regards

Gustaf Lundh

Alex Blewitt

Dec 17, 2014, 9:07:19 AM

to Bassem Rabil, repo-d...@googlegroups.com, David.Pu...@sonymobile.com, Sven.S...@sonymobile.com, ziv...@gmail.com

If you do a git fsck, it will tell you whether the repository data has problems. If it doesn't report any, you can do a git gc, which will repack the files, meaning JGit may be able to load them.

Alex

Sent from my iPhat 6


Bassem Rabil

Dec 17, 2014, 9:12:36 AM

to repo-d...@googlegroups.com, bassem.ra...@ericsson.com, David.Pu...@sonymobile.com, Sven.S...@sonymobile.com, ziv...@gmail.com

In our case, neither git gc, gerrit gc, nor flushing caches helped to restore repository visibility; fetching refs/meta/config from the slaves was what restored the visibility of these repositories.

Regards

Bassem

Kevin Sage

Dec 17, 2014, 12:48:05 PM

to repo-d...@googlegroups.com, bassem.ra...@ericsson.com, David.Pu...@sonymobile.com, Sven.S...@sonymobile.com, ziv...@gmail.com

We're seeing this same behavior - though we hadn't tracked it down to refs/meta/config - on Gerrit 2.9.1 (JGit 3.4.0.201405051725-m7). In every case we've seen so far, running a "git gc --aggressive" fixes the problem immediately. This is happening for repos of varying sizes and complexity. In every case we've tried, "git fsck" does not report any errors. C Git seems totally happy with the repos; it's just JGit/Gerrit that are making them invisible.

Flushing caches does not help and "gerrit gc" refuses to run because Gerrit has made the repo invisible and therefore thinks it doesn't exist.

Anyway, just wanted to add a data point that it's happening on 2.9.1 and not only the 2.10 rc.

Thanks,
Kevin

Saša Živkov

Dec 18, 2014, 5:05:55 AM

to Lundh, Gustaf, repo-discuss, Pursehouse, David (Sony Mobile), Selberg, Sven

On Wed, Dec 17, 2014 at 11:27 AM, Lundh, Gustaf <Gustaf...@sonymobile.com> wrote:

Can you find this IOException in your error_log and post it here?

Gustaf Lundh

Dec 18, 2014, 7:46:03 AM

to repo-d...@googlegroups.com, Gustaf...@sonymobile.com, David.Pu...@sonymobile.com, Sven.S...@sonymobile.com

> Can you find this IOException in your error_log and post it here?

No. Annoyingly the IOException is silenced when the PackFile is marked as bad[1]. And a lot of stuff can trigger an IOException when JGit tries to read the object from the PackFile. Due to this, it is pretty much impossible for me to track down the issue any further.

I was thinking about writing a patch, to allow us to re-throw the Exception. But that would of course mean we would not continue looking in other PackFiles if we stumble upon a "broken" one. Which I guess we want.

[1]

    ObjectLoader openPackedObject(WindowCursor curs, AnyObjectId objectId) {
        PackList pList;
        do {
            SEARCH: for (;;) {
                pList = packList.get();
                for (PackFile p : pList.packs) {
                    try {
                        ObjectLoader ldr = p.get(curs, objectId);
                        if (ldr != null)
                            return ldr;
                    } catch (PackMismatchException e) {
                        // Pack was modified; refresh the entire pack list.
                        if (searchPacksAgain(pList))
                            continue SEARCH;
                    } catch (IOException e) {
                        // Assume the pack is corrupted.
                        removePack(p);
                    }
                }
                break SEARCH;
            }
        } while (searchPacksAgain(pList));
        return null;
    }

Sven Selberg

Dec 22, 2014, 3:52:03 AM

to repo-d...@googlegroups.com, Gustaf...@sonymobile.com, David.Pu...@sonymobile.com, Sven.S...@sonymobile.com

The code is from org.eclipse.jgit/src/org/eclipse/jgit/internal/storage/file/ObjectDirectory.java

The "facts":

1. If it happens once to a Git repository, it keeps on happening until the Gerrit server is restarted.

2. A restart solves the issue.

Points towards:

First occurrence:

1. An IOException is thrown somewhere in the try-clause.

2. The pack file is assumed corrupted and is removed.

3. No more pack files => both for loops terminate.

4. Null is returned, which results in the MissingObjectException and the stack trace above.

Following occurrences:

1. pList.packs is now empty => both for loops terminate.

2. Null is returned, which results in the MissingObjectException and the stack trace above.

Looking at the code executed in the try clause, a number of things could cause the IOException that gets this vicious cycle going. And as Gustaf stated earlier (and as is apparent from the snippet of code in his latest post), this IOException is simply swallowed.
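To make the suspected cycle concrete, here is a minimal toy model in plain Java. It is only a sketch: FakePack and SwallowedExceptionDemo are made-up names, not JGit classes, and the model leaves out the searchPacksAgain() rescan from the quoted loop. It shows how a single transient IOException could leave the in-memory pack list empty, so that every later lookup returns null without the original error ever being reported.

```java
import java.io.IOException;
import java.util.ArrayList;
import java.util.List;

/** Toy model of the lookup loop; none of these types are JGit classes. */
final class SwallowedExceptionDemo {

    /** Hypothetical stand-in for a pack file whose first read may fail. */
    static final class FakePack {
        private boolean failNextRead;

        FakePack(boolean failNextRead) {
            this.failNextRead = failNextRead;
        }

        String get(String objectId) throws IOException {
            if (failNextRead) {
                failNextRead = false; // the fault is transient: only one read fails
                throw new IOException("transient read failure");
            }
            return "contents of " + objectId;
        }
    }

    private final List<FakePack> packs = new ArrayList<>();

    void addPack(FakePack p) {
        packs.add(p);
    }

    int packCount() {
        return packs.size();
    }

    /** Mirrors the quoted loop: an IOException silently drops the pack. */
    String open(String objectId) {
        for (FakePack p : new ArrayList<>(packs)) {
            try {
                String loaded = p.get(objectId);
                if (loaded != null)
                    return loaded;
            } catch (IOException e) {
                packs.remove(p); // "assume the pack is corrupted"; e is never reported
            }
        }
        return null; // the caller turns this into a "missing object" error
    }

    public static void main(String[] args) {
        SwallowedExceptionDemo db = new SwallowedExceptionDemo();
        db.addPack(new FakePack(true));
        System.out.println(db.open("40bcb7aa")); // first lookup hits the transient failure
        System.out.println(db.open("40bcb7aa")); // the pack is already gone from the list
        System.out.println(db.packCount());
    }
}
```

In this model the pack itself would have been readable on the second attempt, but it is never asked again, which matches the observation that only a restart (which rebuilds the list) helps.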

Since the problem only seems to manifest in a high-load production environment, we would have to break our production environment to reproduce it, unless someone can come up with a brilliant idea about what the root cause could be.

Has anyone been able to recreate this issue in a test/dev environment?

or

Is there anyone who might have an idea how you might be able to recreate it?

Will investigate this further during the holidays.

/Sven

Matthias Sohn

Dec 22, 2014, 9:27:17 AM

to Gustaf Lundh, Repo and Gerrit Discussion, David Pursehouse, Sven.S...@sonymobile.com

you may build a custom jgit version using the attached patch to get the exception logged to System.err,

apply it on top of jgit v3.4.2.201412180340-r, run "mvn clean install" to build it and replace it in your gerrit version.

--

Matthias

0001-Log-reason-for-ignoring-pack-when-IOException-occurr.patch

Christian Halstrick

Dec 22, 2014, 10:30:06 AM

to repo-d...@googlegroups.com, Gustaf...@sonymobile.com, David.Pu...@sonymobile.com, Sven.S...@sonymobile.com

My proposal is to modify JGit to at least re-throw the exception if the object couldn't be found in another packfile. When we want to load object X from any of our pack files and we hit an exception when trying to load X from packfile p(n) but we succeed to load X from packfile p(n+1) then the operation should succeed. Only if we hit the IOException when reading from p(n+1) and we can't read it from any other pack then we should rethrow the exception. If reading X from p(n) leads to an exception and also reading it from p(n+1) I think it's ok to rethrow that latest exception.

What I am not sure about: when reading from p(n) leads to an exception but reading from p(n+1) succeeds, we want to ignore p(n) from now on. But I think that should definitely also go into one of the Gerrit logs. We have a successful JGit API call which should lead to a new entry in the Gerrit error log. Not sure how best to do that.
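The proposal above could look roughly like the following plain-Java sketch. It is not the JGit change itself: the Pack interface, the healthy()/broken() helpers, and the use of java.util.logging are assumptions made for illustration. The idea is to remember the last IOException, log every pack that gets ignored, and rethrow only if no other pack could supply the object.

```java
import java.io.IOException;
import java.util.ArrayList;
import java.util.List;
import java.util.logging.Level;
import java.util.logging.Logger;

/** Sketch of "rethrow only if no other pack has the object"; not JGit code. */
final class RethrowIfNotFoundDemo {
    private static final Logger LOG = Logger.getLogger("RethrowIfNotFoundDemo");

    /** Hypothetical pack abstraction; get() returns null for "not in this pack". */
    interface Pack {
        String name();

        String get(String objectId) throws IOException;
    }

    /** Hypothetical helper: a pack that always returns the given content. */
    static Pack healthy(final String name, final String content) {
        return new Pack() {
            public String name() { return name; }
            public String get(String objectId) { return content; }
        };
    }

    /** Hypothetical helper: a pack whose reads always fail. */
    static Pack broken(final String name) {
        return new Pack() {
            public String name() { return name; }
            public String get(String objectId) throws IOException {
                throw new IOException("cannot read " + name);
            }
        };
    }

    private final List<Pack> packs = new ArrayList<>();

    void addPack(Pack p) {
        packs.add(p);
    }

    int packCount() {
        return packs.size();
    }

    String open(String objectId) throws IOException {
        IOException last = null;
        for (Pack p : new ArrayList<>(packs)) {
            try {
                String loaded = p.get(objectId);
                if (loaded != null)
                    return loaded; // success, even if an earlier pack failed
            } catch (IOException e) {
                // Make the ignored pack visible in the log instead of hiding it.
                LOG.log(Level.WARNING, "Ignoring pack " + p.name(), e);
                packs.remove(p);
                last = e;
            }
        }
        if (last != null)
            throw last; // no pack had the object and at least one read failed
        return null; // genuinely not found
    }

    public static void main(String[] args) throws IOException {
        RethrowIfNotFoundDemo db = new RethrowIfNotFoundDemo();
        db.addPack(broken("p1"));
        db.addPack(healthy("p2", "commit data"));
        System.out.println(db.open("40bcb7aa")); // served from p2; p1 is logged and dropped
    }
}
```

With only broken packs in the list, open() throws the last IOException instead of returning null, so the caller sees the real reason rather than a bare "missing object".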

I personally had bad experiences with JGit dealing with packfiles stored on NFS shares, and I had to introduce a new config param for that (see commit 0fc8b05 in JGit). Are your repos on NFS?

Sven Selberg

Dec 23, 2014, 5:05:19 AM

to repo-d...@googlegroups.com, Gustaf...@sonymobile.com, David.Pu...@sonymobile.com, Sven.S...@sonymobile.com

On Monday, December 22, 2014 at 16:30:06 UTC+1, Christian Halstrick wrote:

> My proposal is to modify JGit to at least re-throw the exception if the object couldn't be found in another packfile. When we want to load object X from any of our pack files and we hit an exception when trying to load X from packfile p(n) but we succeed to load X from packfile p(n+1) then the operation should succeed. Only if we hit the IOException when reading from p(n+1) and we can't read it from any other pack then we should rethrow the exception. If reading X from p(n) leads to an exception and also reading it from p(n+1) I think it's ok to rethrow that latest exception.

I think this idea is excellent.

> What I am not sure about: when reading from p(n) leads to an exception but reading from p(n+1) succeeds, we want to ignore p(n) from now on. But I think that should definitely also go into one of the Gerrit logs. We have a successful JGit API call which should lead to a new entry in the Gerrit error log. Not sure how best to do that.

I'm guessing you meant "un-successful". It would be nice if JGit could percolate some notice about the fact that a pack file is corrupt, even if it also finds a working pack file.

> I personally had bad experiences with JGit dealing with packfiles stored on NFS shares, and I had to introduce a new config param for that (see commit 0fc8b05 in JGit). Are your repos on NFS?

Our repos reside on a SAN, so no. SCSI, I guess.

/Sven

Christian Halstrick

Jan 21, 2015, 9:17:53 AM

to repo-d...@googlegroups.com, Gustaf...@sonymobile.com, David.Pu...@sonymobile.com, Sven.S...@sonymobile.com

On Tuesday, December 23, 2014 at 11:05:19 AM UTC+1, Sven Selberg wrote:

> On Monday, December 22, 2014 at 16:30:06 UTC+1, Christian Halstrick wrote:
> > My proposal is to modify JGit to at least re-throw the exception if the object couldn't be found in another packfile. When we want to load object X from any of our pack files and we hit an exception when trying to load X from packfile p(n) but we succeed to load X from packfile p(n+1) then the operation should succeed. Only if we hit the IOException when reading from p(n+1) and we can't read it from any other pack then we should rethrow the exception. If reading X from p(n) leads to an exception and also reading it from p(n+1) I think it's ok to rethrow that latest exception.
>
> I think this idea is excellent.

See https://git.eclipse.org/r/#/c/39685 . Something similar is in now.

> > What I am not sure about: when reading from p(n) leads to an exception but reading from p(n+1) succeeds, we want to ignore p(n) from now on. But I think that should definitely also go into one of the Gerrit logs. We have a successful JGit API call which should lead to a new entry in the Gerrit error log. Not sure how best to do that.
>
> I'm guessing you meant "un-successful". It would be nice if JGit could percolate some notice about the fact that a pack file is corrupt, even if it also finds a working pack file.

No, I meant "successful". A request to Gerrit (e.g. a push) can be successful even though we detected corrupt packfiles during request processing. Because it turned out in the end that these corrupt packfiles were not required to process the request, we want to return success and additionally log the existence of the corrupt packfiles.

Sven Selberg

Feb 5, 2015, 8:26:08 AM

to repo-d...@googlegroups.com, Gustaf...@sonymobile.com, David.Pu...@sonymobile.com, Sven.S...@sonymobile.com

Background:

* We run $ git gc on all repositories every night (that is, C Git's gc, not JGit's gc). I have a feeling that this is the crux...

* This happened after the first gc after deployment.

We patched our Gerrit v2.8.6.1 with JGit v3.6.2 and ran it in production. The logging added by [1] produced [2].

This only led us to org.eclipse.jgit.internal.storage.file.PackFile.java:599

    private void doOpen() throws IOException {
        try {
            if (invalid)
                throw new PackInvalidException(packFile); // Right here!!!!
            synchronized (readLock) {
                fd = new RandomAccessFile(packFile, "r"); //$NON-NLS-1$
                length = fd.length();
                onOpenPack();
            }
        } catch (IOException ioe) {
            openFail();
            throw ioe;
        } catch (RuntimeException re) {
            openFail();
            throw re;
        } catch (Error re) {
            openFail();
            throw re;
        }
    }

So apparently something marked this PackFile as invalid... back to square one.

The invalid flag is set in three places.

1. org.eclipse.jgit.internal.storage.file.PackFile.idx() (line:179) [3]

2. org.eclipse.jgit.internal.storage.file.PackFile.setInvalid() (line:550)

3. org.eclipse.jgit.internal.storage.file.PackFile.openFail() (line: 620)

All of which feels more or less like Rome (all roads lead to...)
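One way to get out of this loop when debugging would be to remember why the flag was set, and attach that reason to the later "pack invalid" error. The sketch below is plain Java under stated assumptions: TrackedPack, markInvalid() and open() are hypothetical names, not the PackFile API, and a plain IOException stands in for PackInvalidException.

```java
import java.io.FileNotFoundException;
import java.io.IOException;

/** Sketch: keep the first reason a pack was marked invalid; not JGit code. */
final class InvalidCauseDemo {

    /** Hypothetical pack handle; field and method names are illustrative only. */
    static final class TrackedPack {
        private final String path;
        private IOException invalidatedBy; // null while the pack is considered valid

        TrackedPack(String path) {
            this.path = path;
        }

        /** Records the first cause only, so the original trigger is not overwritten. */
        synchronized void markInvalid(IOException cause) {
            if (invalidatedBy == null)
                invalidatedBy = cause;
        }

        /** Fails like the invalid check in doOpen(), but chains the original cause. */
        synchronized void open() throws IOException {
            if (invalidatedBy != null)
                throw new IOException("Pack file invalid: " + path, invalidatedBy);
            // A real implementation would open the file here.
        }
    }

    public static void main(String[] args) {
        TrackedPack pack = new TrackedPack("pack-example.pack");
        pack.markInvalid(new FileNotFoundException("pack-example.idx (No such file or directory)"));
        try {
            pack.open();
        } catch (IOException e) {
            // The log line now carries the root cause, not just "invalid".
            System.out.println(e.getMessage() + " <- " + e.getCause());
        }
    }
}
```

With the cause chained like this, a stack trace such as [2] would end in a "Caused by:" section pointing at whichever of the three places set the flag.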

/Sven

[1] https://git.eclipse.org/r/#/c/39661

[2] ERROR: Exception caught while accessing pack file /some/path/problem-repository.git/objects/pack/pack-84fc226d0fb0f0c0b97c4f4d3ab8a1d1c2553b93.pack, the pack file might be corrupt
org.eclipse.jgit.errors.PackInvalidException: Pack file invalid: /some/path/problem-repository.git/objects/pack/pack-84fc226d0fb0f0c0b97c4f4d3ab8a1d1c2553b93.pack
at org.eclipse.jgit.internal.storage.file.PackFile.doOpen(PackFile.java:599)
at org.eclipse.jgit.internal.storage.file.PackFile.beginWindowCache(PackFile.java:583)
at org.eclipse.jgit.internal.storage.file.WindowCache.load(WindowCache.java:284)
at org.eclipse.jgit.internal.storage.file.WindowCache.getOrLoad(WindowCache.java:368)
at org.eclipse.jgit.internal.storage.file.WindowCache.get(WindowCache.java:179)
at org.eclipse.jgit.internal.storage.file.WindowCursor.pin(WindowCursor.java:354)
at org.eclipse.jgit.internal.storage.file.WindowCursor.copy(WindowCursor.java:226)
at org.eclipse.jgit.internal.storage.file.PackFile.readFully(PackFile.java:556)
at org.eclipse.jgit.internal.storage.file.PackFile.load(PackFile.java:714)
at org.eclipse.jgit.internal.storage.file.PackFile.get(PackFile.java:257)
at org.eclipse.jgit.internal.storage.file.ObjectDirectory.openPackedObject(ObjectDirectory.java:416)
at org.eclipse.jgit.internal.storage.file.ObjectDirectory.openPackedFromSelfOrAlternate(ObjectDirectory.java:385)
at org.eclipse.jgit.internal.storage.file.ObjectDirectory.openObject(ObjectDirectory.java:377)
at org.eclipse.jgit.internal.storage.file.WindowCursor.open(WindowCursor.java:145)
at org.eclipse.jgit.lib.ObjectReader.open(ObjectReader.java:229)
at org.eclipse.jgit.revwalk.RevWalk.parseAny(RevWalk.java:840)
at org.eclipse.jgit.revwalk.RevWalk.parseCommit(RevWalk.java:753)
at com.google.gerrit.server.git.VersionedMetaData.load(VersionedMetaData.java:122)
at com.google.gerrit.server.git.VersionedMetaData.load(VersionedMetaData.java:98)
at com.google.gerrit.server.project.ProjectCacheImpl$Loader.load(ProjectCacheImpl.java:274)
at com.google.gerrit.server.project.ProjectCacheImpl$Loader.load(ProjectCacheImpl.java:258)
at com.google.common.cache.LocalCache$LoadingValueReference.loadFuture(LocalCache.java:3522)
at com.google.common.cache.LocalCache$Segment.loadSync(LocalCache.java:2315)
at com.google.common.cache.LocalCache$Segment.lockedGetOrLoad(LocalCache.java:2278)
at com.google.common.cache.LocalCache$Segment.get(LocalCache.java:2193)
at com.google.common.cache.LocalCache.get(LocalCache.java:3932)
at com.google.common.cache.LocalCache.getOrLoad(LocalCache.java:3936)
at com.google.common.cache.LocalCache$LocalLoadingCache.get(LocalCache.java:4806)
at com.google.gerrit.server.project.ProjectCacheImpl.checkedGet(ProjectCacheImpl.java:122)
at com.google.gerrit.server.project.ProjectControl$GenericFactory.controlFor(ProjectControl.java:80)
at com.google.gerrit.server.project.ChangeControl$GenericFactory.controlFor(ChangeControl.java:74)
at com.google.gerrit.server.query.change.IsVisibleToPredicate.match(IsVisibleToPredicate.java:62)
at com.google.gerrit.server.query.change.IsVisibleToPredicate.match(IsVisibleToPredicate.java:27)
at com.google.gerrit.server.query.AndPredicate.match(AndPredicate.java:75)
at com.google.gerrit.server.query.change.AndSource.readImpl(AndSource.java:111)
at com.google.gerrit.server.query.change.AndSource.read(AndSource.java:94)
at com.google.gerrit.server.query.change.QueryProcessor.queryChanges(QueryProcessor.java:261)
at com.google.gerrit.server.query.change.QueryChanges.query0(QueryChanges.java:153)
at com.google.gerrit.server.query.change.QueryChanges.query(QueryChanges.java:141)
at com.google.gerrit.server.query.change.QueryChanges.apply(QueryChanges.java:108)
at com.google.gerrit.server.query.change.QueryChanges.apply(QueryChanges.java:41)
at com.google.gerrit.httpd.restapi.RestApiServlet.service(RestApiServlet.java:306)

[3]

    private synchronized PackIndex idx() throws IOException {
        if (loadedIdx == null) {
            if (invalid)
                throw new PackInvalidException(packFile);
            try {
                final PackIndex idx = PackIndex.open(extFile(INDEX));
                if (packChecksum == null)
                    packChecksum = idx.packChecksum;
                else if (!Arrays.equals(packChecksum, idx.packChecksum))
                    throw new PackMismatchException(JGitText.get().packChecksumMismatch);
                loadedIdx = idx;
            } catch (IOException e) {
                invalid = true;
                throw e;
            }
        }
        return loadedIdx;
    }

Matthias Sohn

Feb 10, 2015, 12:29:09 PM

to Sven Selberg, Repo and Gerrit Discussion, Gustaf...@sonymobile.com, David Pursehouse

do you think this patch would help to understand what's going wrong ?

https://git.eclipse.org/r/#/c/41545/

-Matthias

Sven Selberg

Feb 17, 2015, 9:13:34 AM

to repo-d...@googlegroups.com, sven.s...@sonymobile.com, Gustaf...@sonymobile.com, David.Pu...@sonymobile.com

> do you think this patch would help to understand what's going wrong ?
> https://git.eclipse.org/r/#/c/41545/
>
> -Matthias

We built our Gerrit with a JGit that includes your patch, and added logging at every point in PackFile.java where the PackFile is marked invalid.

And found this:

1. FileNotFoundException while trying to read the index file in a git => corresponding packfile is set invalid=true; [1]

#Four seconds delay and then an incoming git clone

2. For each object in git: [3]

2.1 Try to read the packfile in ObjectToPack => PackInvalidException [2] (because PackFile.invalid == true)

2.2 Try to find object in one of the packfiles => success.

The affected Git has about 3.5M objects, and due to our extra logging we ended up with more than 2G of logs before we aborted and switched to a gerrit.war with a friendlier JGit.

We suspect that this isn't the root cause of our issue, but it looks like a flaw. There are a whole lot of exceptions thrown and swallowed.

One could perhaps check whether the packfile is valid in some way, instead of throwing and catching millions of exceptions just because the Git was gc'ed.
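A rough sketch of that idea in plain Java, under the assumption that the pack exposes its invalid state through a cheap getter: the Pack class, isInvalid() and find() below are hypothetical, not the PackFile or ObjectDirectory API. The lookup tests the flag first and skips the stale pack, so no exception is created per object.

```java
import java.io.IOException;
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** Sketch: skip packs already known to be invalid instead of throwing; not JGit code. */
final class SkipInvalidPackDemo {

    /** Hypothetical in-memory pack with an "invalid" flag. */
    static final class Pack {
        private final Map<String, String> objects = new HashMap<>();
        private volatile boolean invalid;
        private int failedOpens; // counts how often the expensive failure path ran

        Pack put(String id, String data) {
            objects.put(id, data);
            return this;
        }

        void markInvalid() {
            invalid = true;
        }

        boolean isInvalid() {
            return invalid;
        }

        int failedOpens() {
            return failedOpens;
        }

        String get(String id) throws IOException {
            if (invalid) {
                failedOpens++;
                throw new IOException("Pack file invalid");
            }
            return objects.get(id);
        }
    }

    /** Checks the flag before reading, so invalid packs cost one boolean test. */
    static String find(List<Pack> packs, String id) throws IOException {
        for (Pack p : packs) {
            if (p.isInvalid())
                continue;
            String data = p.get(id);
            if (data != null)
                return data;
        }
        return null;
    }

    public static void main(String[] args) throws IOException {
        Pack stale = new Pack().put("a", "old copy of a");
        stale.markInvalid(); // e.g. its index vanished after a gc
        Pack fresh = new Pack().put("a", "object a").put("b", "object b");
        List<Pack> packs = Arrays.asList(stale, fresh);
        System.out.println(find(packs, "a"));
        System.out.println(find(packs, "b"));
        System.out.println(stale.failedOpens()); // stays 0: no exception per object
    }
}
```

For a clone touching millions of objects, this would replace millions of throw/catch round trips (and log lines) with a flag check per pack.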

We made sure we wouldn't end up in the same log-bonanza and deployed again today. No luck yet...

/Sven

[1]

[2015-02-13 15:12:01,707] ERROR org.eclipse.jgit.internal.storage.file.ObjectDirectory : Pack file /gerrit/location/git/trouble-project.git/objects/pack/pack-1bbbc0e22d96ec6bb40ac4e69110e659e4676772.pack was deleted, removing it from pack list
java.io.FileNotFoundException: /gerrit/location/git/trouble-project.git/objects/pack/pack-1bbbc0e22d96ec6bb40ac4e69110e659e4676772.idx (No such file or directory)
at java.io.FileInputStream.open(Native Method)
at java.io.FileInputStream.<init>(FileInputStream.java:146)
at org.eclipse.jgit.internal.storage.file.PackIndex.open(PackIndex.java:94)
at org.eclipse.jgit.internal.storage.file.PackFile.idx(PackFile.java:175)
at org.eclipse.jgit.internal.storage.file.PackFile.get(PackFile.java:266)
at org.eclipse.jgit.internal.storage.file.ObjectDirectory.openPackedObject(ObjectDirectory.java:417)
at org.eclipse.jgit.internal.storage.file.ObjectDirectory.openPackedFromSelfOrAlternate(ObjectDirectory.java:386)
at org.eclipse.jgit.internal.storage.file.ObjectDirectory.openObject(ObjectDirectory.java:378)
at org.eclipse.jgit.internal.storage.file.WindowCursor.open(WindowCursor.java:145)
at org.eclipse.jgit.internal.storage.pack.PackWriter.writeWholeObjectDeflate(PackWriter.java:1563)
at org.eclipse.jgit.internal.storage.pack.PackWriter.writeObjectImpl(PackWriter.java:1549)
at org.eclipse.jgit.internal.storage.pack.PackWriter.writeObject(PackWriter.java:1492)
at org.eclipse.jgit.internal.storage.pack.PackOutputStream.writeObject(PackOutputStream.java:164)
at org.eclipse.jgit.internal.storage.file.WindowCursor.writeObjects(WindowCursor.java:196)
at org.eclipse.jgit.internal.storage.pack.PackWriter.writeObjects(PackWriter.java:1480)
at org.eclipse.jgit.internal.storage.pack.PackWriter.writeObjects(PackWriter.java:1467)
at org.eclipse.jgit.internal.storage.pack.PackWriter.writePack(PackWriter.java:1036)
at org.eclipse.jgit.transport.UploadPack.sendPack(UploadPack.java:1417)
at org.eclipse.jgit.transport.UploadPack.sendPack(UploadPack.java:1271)
at org.eclipse.jgit.transport.UploadPack.service(UploadPack.java:717)
at org.eclipse.jgit.transport.UploadPack.upload(UploadPack.java:628)
at com.google.gerrit.sshd.commands.Upload.runImpl(Upload.java:57)
at com.google.gerrit.sshd.AbstractGitCommand.service(AbstractGitCommand.java:101)
at com.google.gerrit.sshd.AbstractGitCommand.access$000(AbstractGitCommand.java:32)
at com.google.gerrit.sshd.AbstractGitCommand$1.run(AbstractGitCommand.java:70)
at com.google.gerrit.sshd.BaseCommand$TaskThunk.run(BaseCommand.java:442)
at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471)
at java.util.concurrent.FutureTask.run(FutureTask.java:262)
at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$201(ScheduledThreadPoolExecutor.java:178)
at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:292)
at com.google.gerrit.server.git.WorkQueue$Task.run(WorkQueue.java:364)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
at java.lang.Thread.run(Thread.java:724)

[2]

[2015-02-13 15:12:04,723] WARN org.eclipse.jgit.internal.storage.file.PackFile : Exception while opening packfile /gerrit/location/git/trouble-project.git/objects/pack/pack-1bbbc0e22d96ec6bb40ac4e69110e659e4676772.pack
org.eclipse.jgit.errors.PackInvalidException: Pack file invalid: /gerrit/location/git/trouble-project.git/objects/pack/pack-1bbbc0e22d96ec6bb40ac4e69110e659e4676772.pack
at org.eclipse.jgit.internal.storage.file.PackFile.doOpen(PackFile.java:612)
at org.eclipse.jgit.internal.storage.file.PackFile.beginCopyAsIs(PackFile.java:577)
at org.eclipse.jgit.internal.storage.file.PackFile.copyAsIs(PackFile.java:365)
at org.eclipse.jgit.internal.storage.file.WindowCursor.copyObjectAsIs(WindowCursor.java:190)
at org.eclipse.jgit.internal.storage.pack.PackWriter.writeObjectImpl(PackWriter.java:1515)