Re: Issue 80 in msysgit: git-clone fails when repo contains UTF-8 filepath

msy...@googlecode.com

Nov 30, 2009, 5:52:41 PM
to msy...@googlegroups.com

Comment #44 on issue 80 by kusmabite: git-clone fails when repo contains
UTF-8 filepath
http://code.google.com/p/msysgit/issues/detail?id=80

Issue 376 has been merged into this issue.

msy...@googlecode.com

Dec 6, 2009, 3:33:50 PM
to msy...@googlegroups.com

Comment #45 on issue 80 by schuettecarsten: git-clone fails when repo
This issue is really annoying and makes msysGit unusable for my projects.
Any idea when this is going to be fixed?

msy...@googlecode.com

Dec 6, 2009, 4:31:41 PM
to msy...@googlegroups.com

Comment #46 on issue 80 by johannes.schindelin: git-clone fails when repo
schuettecarsten: you are mistaking Open Source for something you paid for.
Either fix it yourself, or compensate someone to do it for you.

Oh, and I will delete all comments that are not helpful at all, like yours
(and this reply).

msy...@googlecode.com

Dec 10, 2009, 11:56:31 AM
to msy...@googlegroups.com

Comment #47 on issue 80 by Joerg.Rosenkranz: git-clone fails when repo
I think Thorvald's commit to the mob branch (assuming git's internal
representation as UTF-8 and using all file system calls in their Unicode
form) was a step in the right direction. I can't find it anymore. Was it
completely dropped?
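
As a rough illustration of that approach (not Thorvald's actual patch, which is not shown in this thread), the sketch below keeps a path as UTF-8 bytes internally and converts it to the NUL-terminated UTF-16LE form that the wide-character variants of the Windows file system calls expect. The helper names `utf8_to_wide` and `wide_to_utf8` are hypothetical.

```python
def utf8_to_wide(path_utf8: bytes) -> bytes:
    """Convert an internal UTF-8 path to a NUL-terminated UTF-16LE buffer."""
    return path_utf8.decode("utf-8").encode("utf-16-le") + b"\x00\x00"


def wide_to_utf8(path_wide: bytes) -> bytes:
    """Convert a NUL-terminated UTF-16LE buffer back to UTF-8 bytes."""
    text = path_wide.decode("utf-16-le")
    return text.rstrip("\x00").encode("utf-8")


# Round trip: the internal UTF-8 form survives the conversion unchanged.
internal = "dir/表.txt".encode("utf-8")
wide = utf8_to_wide(internal)
assert wide_to_utf8(wide) == internal
```

The point of the design is that conversion happens only at the boundary to the operating system, so the rest of the program never needs to know the local code page.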

msy...@googlecode.com

Jan 5, 2010, 2:57:57 AM
to msy...@googlegroups.com

Comment #48 on issue 80 by KJackie: git-clone fails when repo contains
UTF-8 filepath
http://code.google.com/p/msysgit/issues/detail?id=80

I think if someone really needs to work with Unicode file names, he should
use cygwin 1.7 instead. cygwin 1.7 supports the UTF-8 locale like *nix
does. I don't think we should hack mingw-runtime to do the charset
conversion like cygwin, since ...
1. There is too much work to do.
2. It might reduce the performance.
3. We are creating another cygwin! noooo ...

Another approach is to rewrite Git using the native Windows API, and do
the charset conversion in Git itself. I think this is the ideal solution,
but who will do this ...

msy...@googlecode.com

Jan 5, 2010, 4:44:38 AM
to msy...@googlegroups.com

Comment #49 on issue 80 by johannes.schindelin: git-clone fails when repo

KJackie: your comment would have been welcome on the mailing list, but not
on the issue tracker, where we only want comments that add substantial
information about the issue at hand.

msy...@googlecode.com

Jul 16, 2010, 10:13:36 AM
to msy...@googlegroups.com

Comment #50 on issue 80 by johannes.schindelin: git-clone fails when repo

Issue 508 has been merged into this issue.

msy...@googlecode.com

Jul 25, 2010, 8:50:57 PM
to msy...@googlegroups.com

Comment #51 on issue 80 by takuya.murakami: git-clone fails when repo

I've tried to fix this problem, and now it seems to work well.
I pushed a patch to the mob branch. The commit ID is
83d90724607aec6b479e5e5574d39ef68ed2285d

msy...@googlecode.com

Jul 27, 2010, 2:38:24 AM
to msy...@googlegroups.com

Comment #52 on issue 80 by carl.hoerberg: git-clone fails when repo

Thank you so much Takuya! I really hope the patch gets accepted.

msy...@googlecode.com

Jul 27, 2010, 6:05:47 AM
to msy...@googlegroups.com

Comment #53 on issue 80 by johannes.schindelin: git-clone fails when repo

Oh wow, we're using the issue tracker as a replacement for the mailing
list now.

And nobody told me.

msy...@googlecode.com

Oct 1, 2010, 9:55:43 AM
to msy...@googlegroups.com

Comment #54 on issue 80 by Aleksey.Vasenev: git-clone fails when repo

What about takuya.murakami's fix?

git checkout 83d90724607aec6b479e5e5574d39ef68ed2285d works, so the commit
exists, but it is no longer on the mob branch.

msy...@googlecode.com

Oct 1, 2010, 10:14:57 AM
to msy...@googlegroups.com

Comment #55 on issue 80 by patthoyts: git-clone fails when repo contains
UTF-8 filepath
http://code.google.com/p/msysgit/issues/detail?id=80

As posted on the mailing list 23 Sep 2010:

"I have reset the git mob branch in 4msysgit.git to devel.
There were 2 pieces of outstanding work on that branch and I've moved
them to branches as follows:

work/bm/file_copy_content Bastian Moldenhauer: copy content of config
file into old - 1 patch
work/tm/utf8 Takuya Murakami: utf8 support (2 patches and a merge)

The tm/utf8 one doesn't apply to 'devel' but I didn't look very hard.
There seem to be a few work/*/utf8 branches around."


msy...@googlecode.com

Oct 7, 2010, 6:39:36 PM
to msy...@googlegroups.com

Comment #56 on issue 80 by palesz: git-clone fails when repo contains UTF-8
filepath
http://code.google.com/p/msysgit/issues/detail?id=80

Guys, I think this is one of the biggest issues, so please change its
priority to High!

msy...@googlecode.com

Oct 7, 2010, 6:43:43 PM
to msy...@googlegroups.com

Comment #57 on issue 80 by kusmabite: git-clone fails when repo contains
UTF-8 filepath
http://code.google.com/p/msysgit/issues/detail?id=80

That requires someone to prioritize actually WORKING on it, not just
talking about it. Are you volunteering?

msy...@googlecode.com

Oct 30, 2010, 11:44:52 PM
to msy...@googlegroups.com

Comment #58 on issue 80 by tifish: git-clone fails when repo contains UTF-8
filepath
http://code.google.com/p/msysgit/issues/detail?id=80

Did anyone see this patch?
http://github.com/tmurakam/4msysgit-utf8-filepath/commit/64f15332c154a067911e7730b5e5529a37b41cf3
I tried Git-1.7.0.2-utf8-20100725.exe from http://tmurakam.org/git, and it
did fix this problem.

msy...@googlecode.com

Oct 30, 2010, 11:48:55 PM
to msy...@googlegroups.com

Comment #59 on issue 80 by tifish: git-clone fails when repo contains UTF-8
filepath
http://code.google.com/p/msysgit/issues/detail?id=80

I tried Git-1.7.0.2-utf8-20100725.exe from http://tmurakam.org/git/, and it
did fix this problem.

msy...@googlecode.com

Nov 23, 2010, 5:54:25 AM
to msy...@googlegroups.com

Comment #60 on issue 80 by djt301: git-clone fails when repo contains UTF-8
filepath
http://code.google.com/p/msysgit/issues/detail?id=80

It seems there are quite a few people who are keen to see this issue
resolved. It also seems that a patch is available, and there are some
isolated reports of it working for certain people.

Kusmabite - can you (or anyone else) elaborate on what further work needs
to be done? How can people who want to see this issue fixed help move it
towards the official release?

msy...@googlecode.com

Nov 23, 2010, 6:18:39 AM
to msy...@googlegroups.com

Comment #61 on issue 80 by kusmabite: git-clone fails when repo contains
UTF-8 filepath
http://code.google.com/p/msysgit/issues/detail?id=80

The patch(es) should be submitted to the msysGit mailing list for
discussion. You can forward patches from other people if you're willing to
polish them until they are ready for inclusion (as long as you have a
sign-off from the original author).

Read Documentation/SubmittingPatches for details on the submission process.
Such a series should probably be sent to the msysGit mailing list rather
than the Git mailing list, though.

msy...@googlecode.com

Nov 23, 2010, 11:12:38 AM
to msy...@googlegroups.com

Comment #62 on issue 80 by kirill.smelkov: git-clone fails when repo

The most recent re-unified patch series on this topic was submitted to the
mailing list almost a month ago by Karsten Blees:

http://groups.google.com/group/msysgit/browse_thread/thread/e7887444ec8f4cf5

without any reply at all!

msy...@googlecode.com

Nov 23, 2010, 8:37:20 PM
to msy...@googlegroups.com

Comment #63 on issue 80 by johannes.schindelin: git-clone fails when repo

> It seems there are quite a few people who are keen to see this
> issue resolved.
> [...]
> Last time, re-unified patch series on this topic was submitted [...]
> without any reply at all!

Something does not quite compute: If it is really true that quite a few
people are keen to see this issue resolved, how come they do not bother to
review the patch series, let alone comment on it? If there are really
interested parties, I would expect them to take care.

If people are not investing time and effort in seeing an issue getting
resolved, they are clearly not interested enough in that particular issue
(it is no excuse if those people are not able to review patches; they could
make the issue interesting to those who can; if they don't, they are really
not interested enough in seeing the issue getting resolved).

msy...@googlecode.com

Nov 24, 2010, 3:53:16 AM
to msy...@googlegroups.com

Comment #64 on issue 80 by kirill.smelkov: git-clone fails when repo

I'm taking care - will report soon about testing it under wine and real
windows.

msy...@googlecode.com

Nov 24, 2010, 4:09:20 AM
to msy...@googlegroups.com

Comment #65 on issue 80 by cr...@mcqueen.id.au: git-clone fails when repo

> Something does not quite compute: If it is really true that quite a few
> people are keen to see this issue resolved, how come they do not bother
> to review the patch series, let alone comment on it? If there are really
> interested parties, I would expect them to take care.

Johannes, perhaps it's unrealistic to expect that people must contribute
directly to this project to prove their interest or commitment. I'm sure
the average software person uses dozens of software tools, yet it is
unrealistic to expect them to ramp up on the learning curve of all of them
to contribute. So they may give back to the OSS community, but on other
projects they are focussing their effort on.

msy...@googlecode.com

Nov 24, 2010, 4:18:23 AM
to msy...@googlegroups.com

Comment #66 on issue 80 by johannes.schindelin: git-clone fails when repo

Craig, I would like to keep personal attacks out of this issue tracker.
Please understand that. If you think that my comment was personal, consider
this: when I talked about the people not caring enough, I included myself,
and every other developer who is on the mailing list and did not review the
patch series.

So let's keep things professional. In that spirit, I will delete both your
and my comment.

msy...@googlecode.com

Nov 24, 2010, 5:56:40 AM
to msy...@googlegroups.com

Comment #67 on issue 80 by djt301: git-clone fails when repo contains UTF-8
filepath
http://code.google.com/p/msysgit/issues/detail?id=80

I would like to help where I can, but my C skills are pretty weak and I
don't think I am able to review patches beyond just applying them and
seeing if they work for me.

Johannes, I'm not sure I understand you. What do you mean when you
say "make the issue interesting to those who can"?

msy...@googlecode.com

Nov 24, 2010, 4:54:25 PM
to msy...@googlegroups.com

Comment #68 on issue 80 by bl...@dcon.de: git-clone fails when repo

If you want to test my patch series, please note that google groups has
split it in two threads. The second thread
(http://groups.google.com/group/msysgit/browse_thread/thread/d4414235850ce181)
also contains a version rebased to the current v1.7.3.2 (as git bundle).


msy...@googlecode.com

Jan 10, 2011, 8:49:22 PM
to msy...@googlegroups.com

Comment #69 on issue 80 by sam0737: git-clone fails when repo contains
UTF-8 filepath
http://code.google.com/p/msysgit/issues/detail?id=80

My suggestion:
Add a git repository configuration variable that specifies the filename
encoding used in the repo. Future clients that obey this setting could then
convert during checkin/checkout/log, while old clients would still work
as-is. That way existing repos would not break, which they would if we
assumed everyone is already using UTF-8.

msy...@googlecode.com

Jan 11, 2011, 1:10:06 AM
to msy...@googlegroups.com

Comment #70 on issue 80 by robin.rosenberg: git-clone fails when repo

core.pathencoding was suggested for the configuration variable name in a
patch to EGit.
Needless to say, we have the same problem, and we assume UTF-8 as the
default. However, if you are on Windows, you'll get into trouble because
C Git doesn't recognize a UTF-8 encoded filename as being UTF-8.

The premise behind the UTF-8 strategy in JGit/EGit is that, if it smells
like UTF-8, we treat it like UTF-8, else fall back to the locale encoding
(or ISO-8859-1 if that doesn't work either). I believe we are not done with
that, because Java on Mac thinks the 8-bit encoding is MacRoman.
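
A minimal sketch of that fallback chain, written in Python rather than Java and not taken from JGit's source: the function name `decode_path` and its `locale_encoding` parameter are assumptions made for illustration. It tries strict UTF-8 first, then the given locale encoding, then ISO-8859-1.

```python
def decode_path(raw: bytes, locale_encoding: str) -> str:
    """Decode a stored path name: UTF-8 first, then locale, then ISO-8859-1."""
    for encoding in ("utf-8", locale_encoding):
        try:
            return raw.decode(encoding)  # strict mode raises on invalid bytes
        except (UnicodeDecodeError, LookupError):
            continue  # invalid bytes or unknown encoding name: try the next one
    # ISO-8859-1 maps every byte to a code point, so this step cannot fail.
    return raw.decode("iso-8859-1")


# Valid UTF-8 is taken as UTF-8 regardless of the locale.
assert decode_path("表.txt".encode("utf-8"), "cp1252") == "表.txt"
# Bytes that are not valid UTF-8 fall through to the locale encoding.
assert decode_path(b"\x95\x5c.txt", "shift_jis") == "表.txt"
```

The weak spot of any such heuristic is a legacy-encoded name whose bytes happen to form valid UTF-8; it will be decoded as UTF-8 and come out wrong.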

Even if heuristics could work for most people, some would need the
configuration variable anyway.


msy...@googlecode.com

Jan 11, 2011, 5:20:00 AM
to msy...@googlegroups.com

Comment #71 on issue 80 by bl...@dcon.de: git-clone fails when repo

core.pathencoding requires an incompatible change of the repository format,
as the path encoding would have to be stored along with every single file
name in the repo (as i18n.commitencoding does for every single commit
message by means of the encoding header in the commit object).

Consider userA with core.pathencoding=A adds fileA, and userB with
core.pathencoding=B adds fileB; now you have two filenames with different
encodings in the same tree object. You'll never get that sorted out with a
configuration variable alone.
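
The scenario can be sketched as follows. The two encodings (cp1252 for userA, Shift-JIS for userB) and the file names are picked purely for illustration; the code only shows that no single setting decodes both names correctly.

```python
def try_decode(raw: bytes, encoding: str):
    """Return the decoded name, or None if the bytes are invalid in that encoding."""
    try:
        return raw.decode(encoding)
    except UnicodeDecodeError:
        return None


# Two users add files to the same tree with different (hypothetical) settings.
intended = ["é.txt", "表.txt"]
tree = [intended[0].encode("cp1252"), intended[1].encode("shift_jis")]

for encoding in ("cp1252", "shift_jis"):
    decoded = [try_decode(name, encoding) for name in tree]
    # With either single setting, at least one name is wrong or undecodable.
    print(encoding, decoded, decoded == intended)

# Storing both names as UTF-8 removes the ambiguity.
utf8_tree = [name.encode("utf-8") for name in intended]
assert [name.decode("utf-8") for name in utf8_tree] == intended
```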

So, let's just stick with UTF-8 and fix the few platforms that don't
support it yet.

msy...@googlecode.com

Jan 11, 2011, 6:25:25 AM
to msy...@googlegroups.com

Comment #72 on issue 80 by helpmepro1: git-clone fails when repo contains
UTF-8 filepath
http://code.google.com/p/msysgit/issues/detail?id=80

The problem I have is not with files inside the repository
but with the path that contains the repository,

like this:

c:\users\[my username in unicode]\documents\myrepository <- this does not
work

msy...@googlecode.com

Jan 11, 2011, 1:19:39 PM
to msy...@googlegroups.com

Comment #73 on issue 80 by pendulum...@gmail.com: git-clone fails when repo
The problem in comment 72 is related but a bit different.

Some multibyte character encodings widely used on Windows (such as Japanese
Shift-JIS and Chinese Big5) have characters whose second byte is the octet
0x5C (backslash in ASCII). Using non-UTF-8 encodings means you can't even
assume 0x5C is a path delimiter.
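
To make the 0x5C problem concrete, here is a small Python sketch (an illustration only, not code from Git) that compares the Shift-JIS and UTF-8 bytes of the file name '表.txt' and shows what a byte-oriented split on backslash does to each.

```python
name = "表.txt"

sjis = name.encode("shift_jis")
utf8 = name.encode("utf-8")

# In Shift-JIS the second byte of "表" is 0x5C, the ASCII backslash.
print(sjis.hex())  # 955c2e747874
print(utf8.hex())  # e8a1a82e747874

# A byte-oriented splitter that treats 0x5C as a path delimiter cuts the
# Shift-JIS character in half, but leaves the UTF-8 name intact.
parts_sjis = sjis.split(b"\\")
parts_utf8 = utf8.split(b"\\")
assert parts_sjis == [b"\x95", b".txt"]
assert parts_utf8 == [utf8]
```

UTF-8 avoids the problem because every byte of a multibyte sequence is 0x80 or higher, so an ASCII byte such as 0x5C can only ever be a real backslash.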

The same problem is described in TortoiseGit's issue:
http://code.google.com/p/tortoisegit/issues/detail?id=194#c13
A related bug in PHP may also help:
http://bugs.php.net/bug.php?id=53350

Anyway, if you continue to allow legacy non-UTF-8 encodings as path
strings, no matter what encoding is used to store them, you'll end up
converting them into UTF-8 or UTF-16 internally every time before you parse
them.

'pathencoding' config variable may be added only for backward-compatibility
for existing non-UTF-8 repos: such repositories should have carefully
avoided using the set of characters which confuses git.
But continuing to allow commits of files like '表.txt' in non-UTF-8
encodings would be costly.
The right direction seems to be encouraging everyone to migrate to UTF8.

BTW, the same applies to the names of tags/branches. Many wide characters
contain colons (0x3a) and dots (0x2e) and hats (0x5e) in their second bytes
if you use non-UTF-8 encodings.

msy...@googlecode.com

Jan 11, 2011, 1:40:45 PM
to msy...@googlegroups.com

Comment #74 on issue 80 by johannes.schindelin: git-clone fails when repo

Just a little note to all those who missed it: a patch series is already
being discussed on the mailing list, where your review and your help is
needed.
http://groups.google.com/group/msysgit/browse_thread/thread/d4414235850ce181/95bfcc1718fd3f1e?lnk=gst&q=blees#95bfcc1718fd3f1e

msy...@googlecode.com

Jan 12, 2011, 6:12:32 PM
to msy...@googlegroups.com

Comment #75 on issue 80 by robin.rosenberg: git-clone fails when repo

Thanks, Johannes. I pasted that link into an issue for JGit relating to
this, i.e. https://bugs.eclipse.org/bugs/process_bug.cgi

msy...@googlecode.com

Apr 5, 2011, 5:21:34 PM
to msy...@googlegroups.com

Comment #76 on issue 80 by robin.ro...@gmail.com: git-clone fails when repo

What happened to that thread? Did it go somewhere else?


msy...@googlecode.com

Apr 6, 2011, 4:23:30 AM
to msy...@googlegroups.com

Comment #77 on issue 80 by Aleksey....@gmail.com: git-clone fails when repo

Some work has been done in
http://repo.or.cz/w/git/mingw/4msysgit.git/shortlog/refs/heads/kb/unicode
I found that pagination with less has a problem, because less doesn't
process a UTF-16 input stream correctly.
Also, GUIs like TortoiseGit must be updated to work with UTF-8 std streams
and wide-char command-line parameters instead of ASCII.

msy...@googlecode.com

Dec 3, 2011, 8:05:21 AM
to msy...@googlegroups.com

Comment #78 on issue 80 by gaod...@gmail.com: git-clone fails when repo

When can this problem be resolved?

msy...@googlecode.com

Dec 3, 2011, 11:15:29 AM
to msy...@googlegroups.com

Comment #79 on issue 80 by khom...@gmail.com: git-clone fails when repo

Karsten Blees periodically makes available the results of ongoing efforts
to solve this bug. The latest (to this date) patch series and a ready-made
installer are available at
http://groups.google.com/group/msysgit/browse_thread/thread/40112decdc564117

Do your own testing and report any problems you discover to the mailing
list.

Watch the mailing list for mails with "Issue 80" in their subject and
test what's being offered.

P.S.
Note that this bug tracker is officially closed and you're supposed to
discuss any issues on the mailing list instead.

msy...@googlecode.com

Dec 22, 2011, 9:19:22 AM
to msy...@googlegroups.com

msy...@googlecode.com

Apr 2, 2012, 12:09:23 PM
to msy...@googlegroups.com
Updates:
Status: Fixed

Comment #81 on issue 80 by kusmab...@gmail.com: git-clone fails when repo

This issue has been fixed in the source repository, and the fix will be
included in the next release of Git for Windows.

msy...@googlecode.com

May 28, 2012, 8:14:35 AM
to msy...@googlegroups.com

Comment #82 on issue 80 by marinmi...@gmail.com: git-clone fails when repo
No, it's not working. Try it with a character like this: "–"

msy...@googlecode.com

May 28, 2012, 8:24:23 AM
to msy...@googlegroups.com

Comment #83 on issue 80 by kusmab...@gmail.com: git-clone fails when repo
This issue tracker is closed, so please send an e-mail to the mailing list
instead.
