Gerv
Bogus.
"They have at least 77 packs, we only do 48,"
No, we offer 48 fully localized builds of Firefox 3 and they offer 24
fully localized builds of IE7.
A language pack is not the same as a fully localized product.
If you want to count language packs in addition to fully localized
builds, then you should add the 28 langpacks we currently offer on AMO
and we get to a total of 76 locales supported versus 77 for IE.
Furthermore, microsoft and mozilla have a greatly diverging definition
of what a langpack is :
"The Internet Explorer 7 LIP is based on MUI technology. The Internet
Explorer 7 LIP translates a reduced set of UI elements and provides
approximately an 80 percent native language user experience"
80% of the UI done would not be accepted as a final firefox version
promoted on mozilla.com, but would be accepted on AMO where we have
several of the locales you said we don't have like Hindi:
https://addons.mozilla.org/fr/firefox/browse/type:3
Pascal
These are very valid points that Pascal argues. Gerv, can you respond?
Additionally (or perhaps it's more to what Pascal is saying), I'd like to read how you define the term "Language Pack" in your post. We ship locales that are deeply integrated in our build and release system. As Pascal mentions, we also host language packs on AMO. To me, these are two distinctly different offerings that together provide many language offerings for our users. I think your post is a bit misleading...that or I am confused.
Do you include langpacks for FF 2 here? If yes, you can add my private
ones for Upper Sorbian (hsb) and Lower Sorbian (dsb). For both are
packed builds (zip, tar.bz2), too. So you will have 78 locales, 1 more
than IE :-)
Yes, in the "All localizations found" statistic.
Are you able to tell me in which country these two languages are spoken,
and what percentage of the population speaks them as a first language?
Gerv
Whoa, easy, dude. It's just loose language, not some attempt to be rude
to people.
> A language pack is not the same as a fully localized product.
OK, fair enough. I was attempting to simplify and ask the question "is
it possible for this person to have a version of this product with the
UI in their language?". So I lumped together addons, packs, full builds,
MUI packs and all other sorts of things into one. Yes, they offer
somewhat different experiences, but any of them is far, far better than
having the UI in another language.
> If you want to count language packs in addition to fully localized
> builds, then you should add the 28 langpacks we currently offer on AMO
> and we get to a total of 76 locales supported versus 77 for IE.
This is what the "total localizations found" line is. So why is that
number not 28 greater than the FF3 number? A few reasons:
Firstly, I only count one version of English - so for example, English
(South African) isn't counted as separate. Again, this process involves
some simplification. I'm looking at percentage coverage, not absolute
numbers of languages supported, and a South African English person can
get by pretty well with American English (just as I, a British English
person, often get by with American English products). I've also
collapsed the two versions of Tamil although, knowing less about that
language, I'm less sure how reasonable that is.
Secondly, that page offers some packs which are also available as full
builds, such as Brazilian Portuguese. So that's not additional either.
Esperanto is a difficult case, because it is not the language of a
particular country or region, and so doesn't fit well with the
country-based method I'm using. Every method has flaws. There's an
apology to the Esperantists on the first page of the spreadsheet,
although I didn't put one in the blogpost.
Lastly, there are several African languages which we have packs for that
I now realise I haven't yet managed to integrate into the spreadsheet. I
apologise, and I agree that this needs doing. They are: Malagasy,
Ndebele (South), Northern Sotho, Siswati, Southern Sotho, Venda and
Wolof. These should also be added to the list of languages which only we
support. But they will make very little difference to the percentage
figures, because the internet populations of the countries in which they
are spoken is so small.
It's all a work in progress.
> Furthermore, microsoft and mozilla have a greatly diverging definition
> of what a langpack is :
>
> "The Internet Explorer 7 LIP is based on MUI technology. The Internet
> Explorer 7 LIP translates a reduced set of UI elements and provides
> approximately an 80 percent native language user experience"
That quote is from here:
http://www.microsoft.com/globaldev/DrIntl/faqs/lipfaq.mspx
and talks about Windows XP Professional Language Interface Packs (LIP),
not specifically IE. So you aren't comparing like with like. That's 80%
of the strings across the whole OS, noy 80% of IE.
As you will know, a large proportion of translation work for software
relates to error messages and other strings which are seen very rarely.
So (my conjecture is) a LIP doesn't mean that 4 out of 5 IE menu items
will be translated, it means that a whole bunch of obscure OS errors
will come up in e.g. Spanish rather than Catalan.
Quoting further from that page:
"Components not localized include Help and Support Center,
administrative tools, system error messages, event logs, Messenger, and
others. User Assistance (Help) files are not localized, and most
packages add one special localized topic to Help and Support Center."
So I agree that there is the question of what happens with Help. What is
the current position on translating Help? Is it required for a full
build? For an official pack (bottom of the download page)? For
addons.mozilla.org?
If someone has installed such a pack and finds that, in fact, core UI
items in IE are not translated, then I'll reconsider. But based on
available evidence, I think my logic is good here, and that my
simplification is not unreasonable.
Gerv
>
> If someone has installed such a pack and finds that, in fact, core UI
> items in IE are not translated, then I'll reconsider. But based on
> available evidence, I think my logic is good here, and that my
> simplification is not unreasonable.
>
> Gerv
I think your oversimplification is totally unreasonable and shows a lack
of understanding of our own localization process.
Pascal
OK, so there's two things wrong with that paragraph.
Firstly, I had included Northern Sotho and Southern Sotho, just under
different names. So that's OK. Secondly, it turns out it does make some
difference. Once I've got any other feedback that there is, I'll publish
updated figures.
As I said, a work in progress :-)
Gerv
Pascal, with respect, I really don't think that's true. For a start, the
question is about whether Microsoft's packs are as useful to an end-user
as ours are. So it's as much about their processes as ours.
Now let's say that you are right, and that their stuff is not as good as
ours for those LIP packs. Even if that's true, it's way, way better than
having 100% of the UI in your second or third language. What I'm doing
is effectively taking a 90% solution and a 100% solution, putting them
in the same basket, and contrasting them with the 0% solution (no
localization at all). I really don't think that's unreasonable.
What would you prefer? Do you think my spreadsheet should say that
Microsoft has no support at all for Afrikaans, Albanian, Armenian, ...,
Welsh and Zulu (all the LIP packs)? I personally don't think that would
be a fair reflection of the truth. But I'll add a line in for that if
you like.
<edits spreadsheet>
OK. IE with no LIP is at 85.7%, with 32 localizations. (But note that
some of the other figures have moved a bit, so wait till I publish the
sheet again for a comparison.)
Gerv
> pascal wrote:
>> Bogus.
>
> Whoa, easy, dude. It's just loose language, not some attempt to be
> rude
> to people.
Gerv,
With all due respect, I hope you can see that "loose language" to you
would be rude to the people who are tirelessly working to support our
l10n communities.
>> If you want to count language packs in addition to fully localized
>> builds, then you should add the 28 langpacks we currently offer on
>> AMO
>> and we get to a total of 76 locales supported versus 77 for IE.
>
> This is what the "total localizations found" line is. So why is that
> number not 28 greater than the FF3 number? A few reasons:
>
> Firstly, I only count one version of English - so for example, English
> (South African) isn't counted as separate. Again, this process
> involves
> some simplification. I'm looking at percentage coverage, not absolute
> numbers of languages supported, and a South African English person can
> get by pretty well with American English (just as I, a British English
> person, often get by with American English products). I've also
> collapsed the two versions of Tamil although, knowing less about that
> language, I'm less sure how reasonable that is.
The percentage coverage number is one metric that is useful but it
should not be the main metric that we judge our l10n efforts on.
> Secondly, that page offers some packs which are also available as full
> builds, such as Brazilian Portuguese. So that's not additional either.
Completely disregarding the efforts of the en-GB, en-SA, pt-BR, etc.
teams is not a great way engender support for your effort here.
> Lastly, there are several African languages which we have packs for
> that
> I now realise I haven't yet managed to integrate into the
> spreadsheet. I
> apologise, and I agree that this needs doing. They are: Malagasy,
> Ndebele (South), Northern Sotho, Siswati, Southern Sotho, Venda and
> Wolof. These should also be added to the list of languages which
> only we
> support. But they will make very little difference to the percentage
> figures, because the internet populations of the countries in which
> they
> are spoken is so small.
Again, your focus on percentage of usage is, in my opinion, a poor
metric for a number of reasons. It's more important that people can
get Firefox in languages that Microsoft will never support like the
11 languages of South Africa that Dwayne and the Translate.org.za
folks have helped make a reality. In many ways it is the smallest
communities which are the most important as they are the most
marginalized- by the percentage metric as well.
Microsoft will never release a support for Dzongkha due to political
pressures. Therefore Bhutan's Department of Information and
Technology developed their own version of Linux (with Firefox) in
their own language. Please add Dzongkha to your list if you do not
have it on there already.
http://dzongkha.sourceforge.net/
> It's all a work in progress.
Fair enough. That could have been said up front more clearly.
> So I agree that there is the question of what happens with Help.
> What is
> the current position on translating Help? Is it required for a full
> build? For an official pack (bottom of the download page)? For
> addons.mozilla.org?
Help is now online with SUMO, so it is a separate effort.
> If someone has installed such a pack and finds that, in fact, core UI
> items in IE are not translated, then I'll reconsider. But based on
> available evidence, I think my logic is good here, and that my
> simplification is not unreasonable.
If you are asking for feedback, and get feedback from the person who
manages our global l10n effort, and then disregard that information,
I hope you can see why there would be consternation. If you are
looking for support for your efforts, especially from Mozilla's own
l10n community, I humbly suggest that you reconsider your
simplifications.
Gen Kanai
If I may chime in, they are both languages spoken by a small population
in Germany. Wikipedia says it's 55000 and 14000 people for Upper and
Lower Sorbian, respectively, so less than 0.1% of German inhabitants.
Peter.
> Yes, in the "All localizations found" statistic.
>
> Are you able to tell me in which country these two languages are spoken,
> and what percentage of the population speaks them as a first language?
AS Peter wrote in Germany, precisely in Eastern Germany in the German
federal countries Saxony and Brandenburg, see:
http://en.wikipedia.org/wiki/Sorbian_languages
http://en.wikipedia.org/wiki/Upper Sorbian
http://en.wikipedia.org/wiki/Lower Sorbian language
The number of speakers Peter mentioned could be right. I don't know the
exact numbers, I heard about 40,000 (Upper):20,000 (Lower).
Both are autochtoneous West Slavic languages, that means that Upper and
Lower Sorbs aren't ethnic minorities of other nationalities. Danes are
e.g. a non-autochtoneous minority in Germany. Another autochtoneous
languages are Northern Frisian and Eastern (Saterlandic) Frisian which
are spoken in Germany only.
Upper Sorbian is spoken in the Saxon region Upper Lusatia, in a triangle
of the towns Kamenz, Hoyerswerda and Bautzen (to teh north east resp.
east of Dresden)
Lower Sorbian is spoken in the Brandenburg region Lower Lusatia around
the city Cottbus.
Michael
This also applies to Michael's efforts on the Sorbian languages, or even
Esperanto, by the way. People native to languages like Sorbian usually
speak another language (like German in this case) fluently, and they are
small groups, so a for-profit endeavour like MS's will never support
them in the way our contributors can.
The question is if we statistics are about "how much of the population
can we reach?" or about "how many people do we support with software in
their own native language?"
The answers to those questions are a lot different.
Robert Kaiser
Sure. Who said it was? Not me.
>> Secondly, that page offers some packs which are also available as full
>> builds, such as Brazilian Portuguese. So that's not additional either.
>
> Completely disregarding the efforts of the en-GB, en-SA, pt-BR, etc.
> teams is not a great way engender support for your effort here.
I'm not "completely disregarding their efforts"! I'm a Brit - I use the
British English localization, for goodness sake! I'm just trying to take
a higher-level view.
Now if British English and South African English were mutually
unintelligible, then that would be a fair criticism. But they aren't.
Microsoft don't ship en-SA. Does that mean they have no support for
English speakers in South Africa? I don't think it's fair on them to
claim that.
>> Lastly, there are several African languages which we have packs for that
>> I now realise I haven't yet managed to integrate into the spreadsheet. I
>> apologise, and I agree that this needs doing. They are: Malagasy,
>> Ndebele (South), Northern Sotho, Siswati, Southern Sotho, Venda and
>> Wolof. These should also be added to the list of languages which only we
>> support. But they will make very little difference to the percentage
>> figures, because the internet populations of the countries in which they
>> are spoken is so small.
>
> Again, your focus on percentage of usage is, in my opinion, a poor
> metric for a number of reasons.
I think it's the best we are going to get of a measure of the absolute
number of people which we serve in their native language. I agree it's
not the only metric, but it's a useful one. It also gives us some guide
as to where we might focus our efforts next, to help the maximum number
of people.
> It's more important that people can get
> Firefox in languages that Microsoft will never support like the 11
> languages of South Africa that Dwayne and the Translate.org.za folks
> have helped make a reality.
I don't think it's easy to make comparisons like that; how do you say
whether it's more important to give the power of Firefox to 5 million
Bengali speakers, or to give a native-language browser to 89,000 Venda
speakers, many of whom probably speak another language as well? I don't
think the answer to that question is obvious - or useful, for that matter.
> Microsoft will never release a support for Dzongkha due to political
> pressures. Therefore Bhutan's Department of Information and Technology
> developed their own version of Linux (with Firefox) in their own
> language. Please add Dzongkha to your list if you do not have it on
> there already.
>
> http://dzongkha.sourceforge.net/
OK. To add it, I need to know a few things.
Is a Firefox localization available on its own? Or is Firefox in
Dzongkha only available to Linux users who install that distribution? I
don't think it's fair to mark it as supported if you need to install an
entire OS that you weren't using before.
What percentage of the people of Bhutan speak Dzongkha as their first
language? (I can try and look this up, but if you have good information,
let me know.)
>> It's all a work in progress.
>
> Fair enough. That could have been said up front more clearly.
Well, I did post in this group two weeks ago asking for help. What else
would you have liked me to have done? Now I've posted some results, lots
of people are providing more information.
>> So I agree that there is the question of what happens with Help. What is
>> the current position on translating Help? Is it required for a full
>> build? For an official pack (bottom of the download page)? For
>> addons.mozilla.org?
>
> Help is now online with SUMO, so it is a separate effort.
OK. In that case, I think it's fine, in my approximation, to mark MUI
builds down as that language being "supported" by IE. Because they don't
have help translated, but we don't require it either.
> If you are asking for feedback, and get feedback from the person who
> manages our global l10n effort, and then disregard that information, I
> hope you can see why there would be consternation.
I'm happy to make a distinction between IE's "full" localizations and
the MUI ones, and the current spreadsheet has entries for both. But
Pascal has not yet given any concrete reason why MUI builds are not as good.
Gerv
The later question is a lot more interesting to me, and beyond that the
question "how many more *could* we reach by the addition of each new
incremental locale addition?" should get more attention.
That would help to set up a priority list and discussion of what new
locales would be most valuable to add.
Hindi, es-mx, and others are great places where I think we could fill
gaps to improve the experience for users with new locales or "tuned"
versions that better fit the needs of users globally, and in specific
areas, but we could use more analysis to back up those ideas.
From that we could recruit and help organize those locales to get off
the ground and shipping for upcoming releases.
Gerv, could your research be restructured to help answer those questions.
-chofmann
> The answers to those questions are a lot different.
>
> Robert Kaiser
> _______________________________________________
> dev-l10n mailing list
> dev-...@lists.mozilla.org
> https://lists.mozilla.org/listinfo/dev-l10n
>
I've added both. Well done to Michael for his hard work :-)
However, and I hope I can say this without offending either of Peter and
Michael, if we now celebrate because we've got more packs than IE, we're
missing part of the picture.
When those numbers are scaled down by the German internet population
percentage, these two packs serve around 34,000 people, I suspect many
of whom speak German. Some of the packs we are missing, e.g. Bengali,
serve up to 5 million internet citizens, many of whom speak only that
language.
I agree that my spreadsheet is not the only metric. But number of packs
is not the only metric either. (Can we please discuss what good metrics
are calmly, without anyone accusing anyone else of denigrating someone's
work?)
Gerv
| Gen Kanai wrote:
| > The percentage coverage number is one metric that is useful but it
| > should not be the main metric that we judge our l10n efforts on.
|
| Sure. Who said it was? Not me.
|
| >> Secondly, that page offers some packs which are also available as
| full
| >> builds, such as Brazilian Portuguese. So that's not additional
| either.
| >
| > Completely disregarding the efforts of the en-GB, en-SA, pt-BR,
| etc.
| > teams is not a great way engender support for your effort here.
|
| I'm not "completely disregarding their efforts"! I'm a Brit - I use
| the
| British English localization, for goodness sake! I'm just trying to
| take
| a higher-level view.
|
| Now if British English and South African English were mutually
| unintelligible, then that would be a fair criticism. But they aren't.
|
| Microsoft don't ship en-SA. Does that mean they have no support for
| English speakers in South Africa? I don't think it's fair on them to
| claim that.
One interesting thing to consider is the other elements that go into making a localizations...
For instance, all of our localizers work with Mozilla to find the best localized search partners, newsfeeds, and RSS handlers. Imagine if you were using an en-US build in India, but kept receiving U.S.-centric default options. As a user, you might love to have a localized product that actually aligned with the search options, news feeds, and RSS handlers that were local to your community. See what I am getting at?
It doesn't have to be repeated to this crowd, but localization can often be about more than just translation. And, one could make a strong argument that en-GB, en-US, en-CA, etc. do provide a different experience for users. In addition to all that, there are colloquialisms, grammar, and spelling differences within those languages that localizers are working on. Those efforts should either be acknowledged in a different study or included in this one and described why that hard work by those localizers is worthy of inclusion.
(Now, some will probably have good comments to add about why creating localizations for every community might have its challenges...probably another very legitimate conversation to have one day.)
Overall, Gerv, I think you're on to something with this study. It's very cool to see how much coverage in the world's speaking population Mozilla covers.
But, I really think it's important to separate the work of every localization team and count it as a localization. As has been mentioned by Gen and Pascal, our localizer community works so tirelessly to do what they do. I am happy to discuss and collaborate with you on your study methodology...might be very interesting to refine how we are looking at it and then repost something.
-seth
irc nick: sethb (irc.mozilla.org)
Agree on all points.
However, I suggest that all these things are secondary to having the
browser UI in your language at all. As I said to Pascal, what I'm doing
is taking 80%, 90% and 100% solutions and putting them together and
contrasting them with 0% solutions. I don't think that's absolutely
unreasonable.
It's not saying that the 80% solution and the 100% solution are
equivalent in all ways, only that both are way better than a 0% solution.
> Those efforts should
> either be acknowledged in a different study or included in this one
> and described why that hard work by those localizers is worthy of
> inclusion.
I'm not sure how this particular study could include them, unless we
adopted some way of rating a pack. If you do a full en-GB translation
but have en-US search engines, does that count as 90% serving the UK
public? And is that the same as serving 90% of the public 100% well? I
don't think one can really do that sort of metric meaningfully.
> But, I really think it's important to separate the work of every
> localization team and count it as a localization. As has been
> mentioned by Gen and Pascal, our localizer community works so
> tirelessly to do what they do. I am happy to discuss and collaborate
> with you on your study methodology...might be very interesting to
> refine how we are looking at it and then repost something.
Well, I could separate en-US, en-GB and en-SA, but I think it would only
be fair to say that Microsoft's English pack supported them all. And if
I did, it would have the following two effects:
- The percentage figures would stay _exactly_ the same
- Microsoft would appear to have created more language packs than it has.
So I'm not sure modifying the spreadsheet in that way would improve its
clarity.
Gerv
>> Microsoft will never release a support for Dzongkha due to political
>> pressures. Therefore Bhutan's Department of Information and
>> Technology
>> developed their own version of Linux (with Firefox) in their own
>> language. Please add Dzongkha to your list if you do not have it on
>> there already.
>>
>> http://dzongkha.sourceforge.net/
>
> OK. To add it, I need to know a few things.
>
> Is a Firefox localization available on its own? Or is Firefox in
> Dzongkha only available to Linux users who install that
> distribution? I
I have not seen it outside of the Dzongkha Linux distribution.
> don't think it's fair to mark it as supported if you need to
> install an
> entire OS that you weren't using before.
I don't follow your logic. There is no other OS which supports
Dzongkha, so to use a Dzhongkha Mozilla browser, you have to use that
Linux distribution. Whether it is available separately or not is not
relevant.
> What percentage of the people of Bhutan speak Dzongkha as their first
> language? (I can try and look this up, but if you have good
> information,
> let me know.)
Quickly, Wikipedia says first language of 130,000 and second language
of ~470,000.
http://en.wikipedia.org/wiki/Dzongkha_language
How accurate that is, your guess is as good as mine.
> The later question is a lot more interesting to me, and beyond that
> the question "how many more *could* we reach by the addition of each
> new incremental locale addition?" should get more attention.
>
> That would help to set up a priority list and discussion of what new
> locales would be most valuable to add. Hindi, es-mx, and others are
> great places where I think we could fill gaps to improve the
> experience for users with new locales or "tuned" versions that better
> fit the needs of users globally, and in specific areas, but we could
> use more analysis to back up those ideas.
>
> From that we could recruit and help organize those locales to get off
> the ground and shipping for upcoming releases.
>
> Gerv, could your research be restructured to help answer those questions.
>
> -chofmann
>> The answers to those questions are a lot different.
>>
>> Robert Kaiser
Huh, am I the only one _not_ seeing Gerv's post as a direct attack to
Mozilla's L10n community (even though I'm an es-ES localizer)? :-) To
me, what he is trying to find out is how many Internet population can
use Firefox vs. IE feeling that they are using the product in their
language.
Of course that IE "langpacks" will likely be of inferior quality or
less complete than Firefox ones. That will be true even for full
supported languages; after some time being involved in Mozilla L10n, I
found out some wrong translations to Spanish in Microsoft OSes. But,
after all, this is how Microsoft has achieved his position: they don't
try to do the best ever possible product, but the one "good enough"
for most people. And, let's face it, for a bengali person it will be
better to have a "not perfectly translated to bengali" product than an
en-US one. At least, this how I read Gerv's post.
Of course, another completely different question is: does this mean
Mozilla should pay bengali translators just because nobody is
interested to volunteer to do it? If so, how will all the not paid,
volunteer translators to the current XX (45, 48, 76, whatever)
languages feel about such decission?
Or maybe the real point is what can Mozilla do to attract people from
those countries without Mozilla localizations to volunteer?
JM2C
Thank you very much. :-)
>
> However, and I hope I can say this without offending either of Peter and
> Michael, if we now celebrate because we've got more packs than IE, we're
> missing part of the picture.
>
> When those numbers are scaled down by the German internet population
> percentage, these two packs serve around 34,000 people, I suspect many
> of whom speak German. Some of the packs we are missing, e.g. Bengali,
> serve up to 5 million internet citizens, many of whom speak only that
> language.
All Sorbs speak German. IMHO metric for localization statistics
shouldn't be based upon comparison with other software. In German we say
"der Vergleich hinkt", the comparison "goes lame", that means you
compare different things that are not really comparable.
The Sorbian languages are dying languages, they should be preserved. I
translated software into the Sorbian languages that they can meet the
modern requirements of computing. Otherwise these languages would
perhaps stagnate on a folkloristic level and sooner or later they would
be extinct. BTW, I'm not a Sorb, I'm an ethnic German.
The great advantage of Open Source, all Mozilla programs are based upon,
that they consider small languages because there is a community of
volunteers who do this work to realize certain aims and not to gain
money. They don't ask if it is worthwhile. They want to do it and
therefore they do it.
We must create locales both for small languages and for big "exotic"
languages. We should do one of this without leaving the other alone.
Long time ago (one or two years) Axel started a thread in this group
about the target of 100 locales for Fx. Unfortunately that thread ended,
if I remember well, in a discussion what localizations tools would be
better, simple editor, Mozilla Translator, translate toolkit etc. That
is secondary.
Michael
I suppose it is the language barrier that causes troubles.
If ones reads your first two lines in the reply, they might think that
what you are trying to say is "Gerv's post is a direct attack to the
L10n community".
This is because some readers might be primed from the first replies that
Gerv received,
and think negatively. I misread it like that.
> Of course that IE "langpacks" will likely be of inferior quality or
> less complete than Firefox ones. That will be true even for full
> supported languages; after some time being involved in Mozilla L10n, I
> found out some wrong translations to Spanish in Microsoft OSes. But,
> after all, this is how Microsoft has achieved his position: they don't
> try to do the best ever possible product, but the one "good enough"
> for most people. And, let's face it, for a bengali person it will be
> better to have a "not perfectly translated to bengali" product than an
> en-US one. At least, this how I read Gerv's post.
>
> Of course, another completely different question is: does this mean
> Mozilla should pay bengali translators just because nobody is
> interested to volunteer to do it? If so, how will all the not paid,
> volunteer translators to the current XX (45, 48, 76, whatever)
> languages feel about such decission?
>
> Or maybe the real point is what can Mozilla do to attract people from
> those countries without Mozilla localizations to volunteer?
>
The importance of Gerv's post for me is that he put together information
that is lying around.
This information can then be disseminated, etc.
For languages that no translation project exists, there can be
a call for volunteers from Mozilla and local user groups to join up.
For languages that lang packs exist, but are not part of the official
Firefox,
it's an issue of getting in contact with the local teams and asking them
how to help
to get the work in Firefox.
Simos
While I agree that % of internet population covered is an interesting
metric, it doesn't give us any idea on where to focus our efforts.
Small chinese cities will outnumber most of the remaining languages in
terms of internet population, I guess. At least when you consider
"small" to be in chinese proportions.
I can imagine several more or less arbitrary metrics for where to focus,
all of which are some combination of population, internet population,
and the growth of the two.
I'd still add an estimate for the return of investment, which is the
thing we're struggling with the most.
I'll follow up on "recruiting" in a reply to Chris' post.
>> It's more important that people can get
>> Firefox in languages that Microsoft will never support like the 11
>> languages of South Africa that Dwayne and the Translate.org.za folks
>> have helped make a reality.
>
> I don't think it's easy to make comparisons like that; how do you say
> whether it's more important to give the power of Firefox to 5 million
> Bengali speakers, or to give a native-language browser to 89,000 Venda
> speakers, many of whom probably speak another language as well? I don't
> think the answer to that question is obvious - or useful, for that matter.
I wouldn't make statements on the English skills of either, really. I
wouldn't speculate on how English skills map to access to computers,
though. Which is another point, enabling people without "imperialistic"
language skills to feel comfortable in using the internet might empower
them more than the awesomebar.
Again, I really think that the values are good, but they're data points.
They don't really serve as a "oh, and the next on the list is the best
thing to do".
<...>
Axel
Here are some comments for the Greece, Cyprus and the Greek language.
In the ODS file, the population of Greece is listed as 2,000,000 while
it is about 10,000,000.
A similar smaller population (about 350,000) is listed for Cyprus.
The data in the ODS file should be updated to the following:
Greece (population): 10,722,816 (CIA World Factbook)
Cyprus (population): 792,604 (CIA World Factbook)
Greek is also spoken in Cyprus, though a percentage is not shown
at either the CIA World Factbook, or Wikipedia.
My estimate is that the vast majority of the population speaks Greek or
are bilingual.
Currently there is no percentage for the Greek language in Cyprus, in
the ODS file.
Cheers,
Simos
I'm generally skeptical on recruiting localization teams. To me, users
benefit from a growing community, and they get hardly anything from a
one-time effort.
There is historical evidence that the recruiting we've done in the area
of localization hasn't really achieved growing communities, and they we
got stuck with one-time efforts, relatively short-lived ones, even.
So before we head out and make a priority list, we should make sure that
we're spending out limited resources on the right thing. The right thing
is to get users a sustained on-line life with mozilla, the wrong thing
is to make marks on the wall and just count languages.
Understanding how complex the creation and maintainance of a particular
localization should be is another point, as is making sure that people
actually find the localizations they're looking for.
We're testing the limits of our ecosystem at various ends, and we
shouldn't just test them, but expand them. But at the same time, "let's
have a binary for those" isn't always the answer to the question.
I'm feeling fine to call that "return of investment", not just from our
side, but from the side of the community as well.
Axel
We're not going to pay. As I said in another reply to this thread, it's
not about marks on the wall. As you said above, paid work isn't good
work, either, something we see across the ecosystem surprisingly often.
And we do have volunteers for Bengali, both India and Bangladesh.
There are a bunch of things to "invest" in, becoming more agile in how
we get localizations on board is probably the most important. I have
some ideas in that area, and I'm working on others. Yet others are
actually resolved, like not freezing everything for 1.9.0.x.
"Invest" in terms of time, thought, love, frust. Not money.
Axel
Sorry if my post could be interpreted like I was suggesting to ask for
money. Mozilla and Mozilla Europe accounting books will show that I've
been not exactly eager to claim for fundings to travel to FOSDEM and
so. :-)
What I was (unsuccessfully) trying to expose is closer to what you
have said in another post, that "the next language in the list" can be
something difficult to achieve, because the right way to do it is
having volunteers and they should have appeared already by themselves.
If we really have volunteers localizing a language that will add
Firefox population and there is no official binary just because the
bug is waiting in the queue, then the process is where Mozilla needs
to act upon. But if there are registered localizers and a bug for a
language, but no langpack nor activity, then maybe we don't really be
so sure that we have volunteers for that language.
Ricardo
Simos,
Thank you for your comments. The figures given are not population, but
number of Internet users. My source is here:
https://www.cia.gov/library/publications/the-world-factbook/rankorder/2153rank.html
It lists Greece as having 2,048,000 Internet users in 2006. If you have
a better or more recent figure, I would be happy to hear it.
> Greek is also spoken in Cyprus, though a percentage is not shown
> at either the CIA World Factbook, or Wikipedia.
> My estimate is that the vast majority of the population speaks Greek or
> are bilingual.
Do you mean the southern part of Cyprus, or the entire island? I don't
want to use the wrong words or offend anyone, but I was under the
impression that in part of Cyprus, Turkish is mostly spoken.
Is it possible to obtain population figures for the two parts of the
island? We could then divide up the internet population proportionally.
Gerv
That's the question I'm trying to answer.
> and beyond that the
> question "how many more *could* we reach by the addition of each new
> incremental locale addition?" should get more attention.
And this one too. The spreadsheet gives absolute numbers of internet
population for each language, so we can see which language has the
highest number but no localization yet.
Gerv
Axel is right here. There is historical evidence that recruitment hasn't
worked well. Is that because recruitment is a bad idea full stop, or we
just did a bad job of it last time? (I did some of it, for Summer of
Code, so this is self-criticism here.)
But more information about the state of things can't be bad. How we use
that info is another question.
Gerv
That doesn't follow. If someone was struggling along with Windows in
their second or third language, they'd be very happy to use a browser in
their first language.
>> What percentage of the people of Bhutan speak Dzongkha as their first
>> language? (I can try and look this up, but if you have good information,
>> let me know.)
>
> Quickly, Wikipedia says first language of 130,000 and second language of
> ~470,000.
>
> http://en.wikipedia.org/wiki/Dzongkha_language
If we take that 130,000 (out of a population of 652,000) and apply it to
the Internet population of 30,000, we get a target market for Dzongkha
Firefox of 5,800 people.
Gerv
Don't worry, I don't think Axel thought that. When he said "We're not
going to pay", he was making a general statement, not one directed at
you :-)
Gerv
Quite so.
As a general principle, introducing payment into a free software project
community is always a difficult thing. Apache, for example, avoids it
like the plague. Only recently did they agree to hire even a sysadmin,
as long as he did no coding.
Perl, on the other hand, has a history of making grants to key project
contributors to enable them to work on Perl full time for a short
period, to get key things done.
> Or maybe the real point is what can Mozilla do to attract people from
> those countries without Mozilla localizations to volunteer?
That's a good question. Or, to encourage and help the volunteers we do
have to get the work done. As Axel says, we have teams for quite a few
of the missing languages, but no localization yet.
Gerv
I completely agree. There are good reasons to make a localization other
than serving an unserved community.
But we need to realise and accept the consequences of that, one of which
is that not all language packs are equal when you are trying to work out
how "well" the project is doing, whatever that may mean.
Gerv
Yeah, I know, sorry if it sounded like that. Just wanted to make sure
that that was clear. It took me a while to learn that Chris'
"recruiting" has nothing to do with cash. And I need to bang my head on
the wall to re-remember each time he says that word ;-).
> What I was (unsuccessfully) trying to expose is closer to what you
> have said in another post, that "the next language in the list" can be
> something difficult to achieve, because the right way to do it is
> having volunteers and they should have appeared already by themselves.
>
> If we really have volunteers localizing a language that will add
> Firefox population and there is no official binary just because the
> bug is waiting in the queue, then the process is where Mozilla needs
> to act upon. But if there are registered localizers and a bug for a
> language, but no langpack nor activity, then maybe we don't really be
> so sure that we have volunteers for that language.
Yeah
Axel
Gervase Markham wrote:
> Axel Hecht wrote:
>
>> I'm generally skeptical on recruiting localization teams. To me, users
>> benefit from a growing community, and they get hardly anything from a
>> one-time effort.
>>
>> There is historical evidence that the recruiting we've done in the area
>> of localization hasn't really achieved growing communities, and they we
>> got stuck with one-time efforts, relatively short-lived ones, even.
>>
>> So before we head out and make a priority list, we should make sure that
>> we're spending out limited resources on the right thing. The right thing
>> is to get users a sustained on-line life with mozilla, the wrong thing
>> is to make marks on the wall and just count languages.
>>
>
> Axel is right here. There is historical evidence that recruitment hasn't
> worked well. Is that because recruitment is a bad idea full stop, or we
> just did a bad job of it last time? (I did some of it, for Summer of
> Code, so this is self-criticism here.)
>
I think we have made some attempts a recruiting but really haven't put
the kind of effort that I'm thinking it would take to be successful.
I'm thinking about some on-line social networking and recruiting,
followed up by multiple in-person visits to a locale to organize events
to get the localization work off the ground and sustainable. Its the
kind of recruiting work that a few of us have tried to do, but on
steroids! ;-) Visits to Universities to find interested students and
others that want to contribute might be good venues. We should share
notes on the recruiting work that we have all done and figure out what
we might do to make that effort more successful.
As the:
-number of localized web properties expands past website, sumo, addons,
and beyond
-the amount of total localization work grows,
-the complexity of the work increases,
-amount of testing that needs to be done increases to improve quality
we need to step up and be more successful in recruiting if localization
efforts in both existing and new communities. We simply need to expand
the number of good contributors and figure out additional ways to
distribute the expanding work load and sustain all the localizations
that we want to do over the next 10 to 100 years. ;-)
-chofmann
> But more information about the state of things can't be bad. How we use
> that info is another question.
>
> Gerv
>
So you're actually heavily mixing the two questions. "How many (more)
could we reach by...?" is a followup the the first qeustion in my post.
The big difference comes from the fact that people might grow up
speaking two or more languages (or language variants for that matter),
so you reach them by providing them with either of them - but there's
probably one of those languages they like better and for various reasons
consider their first language, and you serve them (sometimes only
ideologically) better by providing software in that language.
I think both those metrics are interesting, while still different. Your
statistics seems to me like they are actually targeting the former one,
i.e. reaching a big number of people, but tying this to their first
language (I know, it's harder to do it otherwise).
Sorbs and en-SA speakers are very similar in this regard, actually: They
both understand a different, probably sometimes similar, language very
well (German / en-US), but they feel more comfortable with the other
language. So if we provide Sorbian or en-SA software does not make a
difference in reaching them at all, they are both served well with us
providing de and en-US and could use those flawlessly. But they feel
specially served and like us much better if we provide Sorbian and South
African English, and they will prefer those versions.
Same with e.g. Austrian Crotians, who usually speak both languages
fluently from when they learn to talk, but often consider Croatian as
their first language, or Argentinian people who will tell you that
Spanish as in "Spain" will sound alien to lots of their population, or
Catalans who probably all speak said Spanish well but are proud of their
own region and language. And there are countless others.
You probably always will come into a problem when you do such a mixup of
reach vs. best-served. Maybe that calls for two stats, and twice (or
more) work to do them. :-/
Robert Kaiser
How do you know they struggle? In fact, lots of people, esp. linguistic
minorities in most countries, speak their second language as fluently as
their first, with the big difference between them only being their
cultural or personal preference.
Robert Kaiser
I'm not seeing it as an attack, on the contrary, it's cool that someone
does such stats, and I'm very happy that Gerv puts his time into this!
I just think there are dangers in the view of the L10n world it presents
and what we derive from that, as it's different if I look what languages
people speak fluently or what languages people prefer, and where to draw
the lines of how one does that.
Robert Kaiser
Yes, all Sorbs understand German, they often write that they have 2
mother tongues, and Sorbian institutions will probably continue using
German software. But private persons use Sorbian software to cultivate
their national identity which is important to keep the Sorbian people
alive. And one important part of this identity is the own language.
Sorbian Firefox and Thunderbird are used e.g in a basic school of a
Sorbian village (maybe in other ones, too). It's a public school which
is usually working according to a state school curriculum. I'm a little
proud of that such a public school is working with my private
translations. Unfortunately, I get few feedback only, for Lower Sorbian
I haven't got feedback because Lower Sorbian Firefox is 2 weeks old only.
Michael
Quite so. Which is why I'm looking at first language.
I think it would be very difficult to get figures of coverage for "any
language the person speaks", because we'd need a list of languages
spoken by every person on earth :-)
> You probably always will come into a problem when you do such a mixup of
> reach vs. best-served.
Indeed. And I hope that everyone will understand that to get any figures
at all, we have to make some approximations (such as "the internet
population of a country is evenly divided among its language groups")
and not be offended.
Gerv
OK, some may well. But other people only speak one language. Like me :-)
My point is, you can't say that "because Dzhongkha Firefox is available
in a Linux distribution, that's the same as having a localized version
available for Windows". People may not want to install a whole new OS to
get software in their language.
Gerv
> OK, some may well. But other people only speak one language. Like
> me :-)
> My point is, you can't say that "because Dzhongkha Firefox is
> available
> in a Linux distribution, that's the same as having a localized version
> available for Windows". People may not want to install a whole new
> OS to
> get software in their language.
That's your opinion.
If, for example, people in Bhutan cannot get the dominant operating
system in their first language they may want to install a whole new
OS in order to get software in their first language.
Maybe found another check mark for unofficial localizations:
is (Icelandic)
http://firefoxis.mozdev.org/installation.html
Firefox 2 version looks complete at glance, requires Firefox 2.0.0.14
Firefox 3 version looks partial: seems to cover dialogs, some menuitems.
(tried in FF3rc3/20080529).
Or maybe in testing...
https://bugzilla.mozilla.org/show_bug.cgi?id=311773
I believe you did not intend to create a perception of discounting the
great efforts of our localization teams worldwide in all forms full
builds, beta builds, language packs, etc.. It seems the statistic or
method used doesn't entirely reflect the importance of opening up new
languages to all communities (small on the relative worldwide scale
but having big impact within their regions; what Pascal and Gen wrote
about this is important). Which leads me to the real question...
I also agree the critical question for us is where RKaiser, Chofmann
and Pike were going in posts above, trying to better understand "how
many people could we support with software in their own native
language?" when adding support for a new locale. I'm not sure how you
can focus your research there differently, but certainly picking up on
Seth's offer in his post would be good as he (along with Stanislav and
others) has done some research on our communities and languages. This
would be super helpful for us.
Maybe (hindsight always being 20/20) in your "Note" section you could
consider indicating that you are exploring this so as to create
discussions towards improving our capabilities in this regards (while
this may seem obvious).
mic
I left out four (I think) of our packs by mistake. When I reissue the
blog post, I'll include updated numbers.
> It might
> be good to clarify that somehow in your post as the pure number
> doesn't seem to be based exactly on apples to apples.
I think it's a reasonable comparison, but I've added a disclaimer to the
top of the current blogpost (which I'm going to reissue today or
tomorrow), and the current spreadsheet includes the other method
calculation that Pascal asked for.
> Also, something
> on the differences in quality between a MS add-on and an Fx add-on or
> something to that affect may be useful.
I'll add some text about this, but as far as I can tell (and no-one's
presented evidence to the contrary), a Microsoft full localization is
about the same as a Firefox full localization, and a Microsoft LIP pack
is about the same sort of thing as a Firefox language pack.
> It seems the statistic or
> method used doesn't entirely reflect the importance of opening up new
> languages to all communities (small on the relative worldwide scale
> but having big impact within their regions; what Pascal and Gen wrote
> about this is important).
It depends on how you define "importance". My spreadsheet treats
everyone in the world's internet population as equally important. More
people == more important. That's not the only way to measure, but I
think it's a reasonable one.
> I also agree the critical question for us is where RKaiser, Chofmann
> and Pike were going in posts above, trying to better understand "how
> many people could we support with software in their own native
> language?" when adding support for a new locale.
That's exactly the question my spreadsheet answers, so I'm puzzled why
both you and chofmann seemed to think it doesn't answer it.
Look on the Totals tab. Directly under each language is the best
estimate of how many people we will cater for in their native language
when we add support for a new locale. For example, adding support for
Balochi would serve about 580,000 people.
> I'm not sure how you
> can focus your research there differently, but certainly picking up on
> Seth's offer in his post would be good as he (along with Stanislav and
> others) has done some research on our communities and languages. This
> would be super helpful for us.
I replied to Seth, but I'm not exactly certain what he's offering.
Perhaps he could clarify, after I've released a new version of the sheet?
> Maybe (hindsight always being 20/20) in your "Note" section you could
> consider indicating that you are exploring this so as to create
> discussions towards improving our capabilities in this regards (while
> this may seem obvious).
I'm exploring it because I thought it would be interesting. What the
l10n community does with the information is an entirely separate
question. :-) But certainly, one potential use is guiding our ideas of
where to focus various types of effort.
Gerv