[Django] #35852: intcomma not locale aware when given a decimal as string

Django

unread,

Oct 18, 2024, 3:08:11 PM10/18/24

to django-...@googlegroups.com

#35852: intcomma not locale aware when given a decimal as string
-------------------------------------+-------------------------------------
Reporter: Jonathan Ströbele | Type: Bug
Status: new | Component:
| contrib.humanize
Version: 5.1 | Severity: Normal
Keywords: | Triage Stage:
| Unreviewed
Has patch: 0 | Needs documentation: 0
Needs tests: 0 | Patch needs improvement: 0
Easy pickings: 0 | UI/UX: 0
-------------------------------------+-------------------------------------
The
[https://docs.djangoproject.com/en/5.1/topics/i18n/formatting/#overview
documentation] regarding localization says that for thousand separators
`USE_THOUSAND_SEPARATOR = True` or `intcomma` can be used. The
[https://docs.djangoproject.com/en/5.1/ref/contrib/humanize/#std-
templatefilter-intcomma documentation] of `intcomma` says either int/float
or string representation works.

This seems to be true for a `en` locale, but in `de` locale this doesn't
hold up if the input is a string:

{{{
en:
9999999999.11 -> 9,999,999,999.11
"9999999999.11" -> 9,999,999,999.11

de:
9999999999.11 -> 9.999.999.999,11
"9999999999.11" -> 9,999,999,999.11
}}}

In the case of a decimal value as string in the `de`-locale the output is
wrong, it should be `9.999.999.999,11` (dot (.) as separator).

This seems to be the case because `intcomma` casts the value to `int` if
its not a `float`/`Decimal` (which raises a `ValueError` for
`"9999999999.11"` and then just puts in commas as thousand separators
without any awareness of the current locale.

Also the
[https://github.com/django/django/blob/3fad712a91a8a8f6f6f904aff3d895e3b06b24c7/django/contrib/humanize/templatetags/humanize.py#L67
DocBlock] of the `intcomma` functions says "Convert an integer to a string
containing commas every three digits." wich contradicts the int/float (or
string thereof) in the documentation.

So either the function is not following the documentation or the
documentation is wrong (?).

I think this is problematic as the function is communicated as an
alternative/equivalent option to ` USE_THOUSAND_SEPARATOR=True`. When a
float value is formatted with `{{ value|floatformat:2|intcomma}}` the
`floatformat` will return a string and as such, `intcomma` is no longer
locale aware (also reported in
[https://code.djangoproject.com/ticket/33771 #33771]). This can lead to
outputs like `9,999,999,999,11` in the `de` locale:

{{{
9999999999.11 -> floatformat:2 -> "9999999999,11" -> intcomma ->
"9,999,999,999,11"
}}}

A solution could be to integrate some sort of float casting into
`intcomma`? Or remove the recommendation of using it in favor of just the
`USE_THOUSAND_SEPARATOR = True` setting and clarifying the documentation
of `intcomma`?
--
Ticket URL: <https://code.djangoproject.com/ticket/35852>
Django <https://code.djangoproject.com/>
The Web framework for perfectionists with deadlines.

Django

unread,

Oct 19, 2024, 5:11:04 PM10/19/24

to django-...@googlegroups.com

#35852: intcomma not locale aware when given a decimal as string

-----------------------------------+--------------------------------------
Reporter: Jonathan Ströbele | Owner: hemant5454
Type: Bug | Status: assigned
Component: contrib.humanize | Version: 5.1
Severity: Normal | Resolution:

Keywords: | Triage Stage: Unreviewed
Has patch: 0 | Needs documentation: 0
Needs tests: 0 | Patch needs improvement: 0
Easy pickings: 0 | UI/UX: 0

-----------------------------------+--------------------------------------
Changes (by hemant5454):

* owner: (none) => hemant5454
* status: new => assigned

--
Ticket URL: <https://code.djangoproject.com/ticket/35852#comment:1>

Django

unread,

Oct 21, 2024, 5:16:52 PM10/21/24

to django-...@googlegroups.com

#35852: intcomma not locale aware when given a decimal as string

-------------------------------------+-------------------------------------
Reporter: Jonathan Ströbele | Owner: HEMANT
| MISHRA

Type: Bug | Status: assigned
Component: contrib.humanize | Version: 5.1
Severity: Normal | Resolution:

Keywords: | Triage Stage: Accepted

Has patch: 0 | Needs documentation: 0
Needs tests: 0 | Patch needs improvement: 0
Easy pickings: 0 | UI/UX: 0

-------------------------------------+-------------------------------------
Changes (by Natalia Bidart):

* stage: Unreviewed => Accepted

Comment:

Hello Jonathan, thank you for taking the time to create this report. I
have reproduced what you said, by setting the server LANGUAGE_CODE to
`de`. I agree that the result `9,999,999,999.11` feels unexpected for the
string `"9999999999.11"` when `intcomma` is applied.

Could you like to prepare a patch?
--
Ticket URL: <https://code.djangoproject.com/ticket/35852#comment:2>

Django

unread,

Oct 23, 2024, 5:04:19 PM10/23/24

to django-...@googlegroups.com

#35852: intcomma not locale aware when given a decimal as string
-------------------------------------+-------------------------------------
Reporter: Jonathan Ströbele | Owner: HEMANT
| MISHRA
Type: Bug | Status: assigned
Component: contrib.humanize | Version: 5.1
Severity: Normal | Resolution:
Keywords: | Triage Stage: Accepted
Has patch: 0 | Needs documentation: 0
Needs tests: 0 | Patch needs improvement: 0
Easy pickings: 0 | UI/UX: 0
-------------------------------------+-------------------------------------

Comment (by Jonathan Ströbele):

Hi Natalia,

to be honest I'm not quite sure what the best fix actually would be and I
need a bit more guidance here. ;) As I mentioned I think there is a mis-
match between the documentation and the actual code, and DocBlock. Also
I'm not sure what `intcomma` actually was intended to mean and how it
should be used:

1. `int[eger]comma` as in separate a integer into groups of 3 digits
separated by a comma (only valid for an `en` locale). This seems to be the
[https://github.com/django/django/commit/fb537e177d7d304d6642ee6005258a82584a8032
original code and intention] of the function when it was added to Django.
2. `int[ernational]comma` as in provide a manual way to print a grouped
number according to the current locale, if the setting
`USE_THOUSAND_SEPARATOR` is `False` and so no automatic number grouping is
happening.

The 2. option is what I figured from reading the documentation and also
think that's the intended function from the current state of the source
code. Though it has the aforementioned bug regarding the floats passed as
string where it's using a fallback to the old logic that's correct in the
`en` locale.

Since `intcomma` is passing down the given value to the
`django.utils.formats.number_format` (with enabled `force_grouping=True`)
if it's a float or Decimal already, and only uses the code that is `en`
locale specific code in case of a string, I would suggest to use
`intcomma` as a wrapper of `number_format` with `force_grouping=True`.
Since the `number_format` already handles the formatting/grouping of
numbers, I think it's no longer needed to reproduce this logic in
`intcomma`?

Making this change seems to conform to nearly all unit tests in
`tests/humanize_tests/tests.py`, except for some unicode stuff introduced
in #28628 (fe76944269c13d59f8bb3bc1cfff7ab6443777e4) and a long string
without numbers (16a8fe18a3b81250f4fa57e3f93f0599dc4895bc).

This seems to be the case, because `number_format` doesn't check if the
string is containing digits, and just inserts the thousand seperator into
the string. Which I think is Okay?

{{{
AssertionError: '１,２３４,５６７' != '1,234,567' : intcomma test failed,
produced '１,２３４,５６７', should've produced '1,234,567'
AssertionError: 'th,e q,uic,k b,row,n f,ox ,jum,ped, ov,er ,the, la,zy
,dog' != 'the quick brown fox jumped over the lazy dog' : intcomma test
failed, produced 'th,e q,uic,k b,row,n f,ox ,jum,ped, ov,er ,the, la,zy
,dog', should've produced 'the quick brown fox jumped over the lazy dog'
}}}

I have created a suggestion for a patch in a fork here:
https://github.com/stroebjo/django/commit/f5302d43c45208ebb92516ab2724e86a1e60d71d
--
Ticket URL: <https://code.djangoproject.com/ticket/35852#comment:3>

Django

unread,

Oct 24, 2024, 9:28:12 AM10/24/24

to django-...@googlegroups.com

#35852: intcomma not locale aware when given a decimal as string
-------------------------------------+-------------------------------------
Reporter: Jonathan Ströbele | Owner: HEMANT
| MISHRA
Type: Bug | Status: assigned
Component: contrib.humanize | Version: 5.1
Severity: Normal | Resolution:
Keywords: | Triage Stage: Accepted
Has patch: 0 | Needs documentation: 0
Needs tests: 0 | Patch needs improvement: 0
Easy pickings: 0 | UI/UX: 0
-------------------------------------+-------------------------------------

Changes (by Natalia Bidart):

* cc: Claude Paroz (added)

Comment:

Claude, would you happen to have an opinion on the above? I can't invest
time into this until November as I'm taking some holidays, but I thought
you might have a quicker answer.
--
Ticket URL: <https://code.djangoproject.com/ticket/35852#comment:4>

Django

unread,

Apr 26, 2025, 5:36:11 AM4/26/25

to django-...@googlegroups.com

#35852: intcomma not locale aware when given a decimal as string
-------------------------------------+-------------------------------------

Reporter: Jonathan Ströbele | Owner: Tim
| McCurrach

Changes (by Tim McCurrach):

* owner: HEMANT MISHRA => Tim McCurrach

Comment:

Thanks all for the conversation above, that's really helpful :)

I think the issue is that for non-float/decimals, we attempt conversion to
an integer, and when that fails, we fallback to using the logic within
intcomma (instead of the logic within `number_format` - which is locale
aware). I think converting to a Decimal instead might solve this issue.
I'll investigate this and make a patch if it seems to fix things.
--
Ticket URL: <https://code.djangoproject.com/ticket/35852#comment:5>

Django

unread,

Apr 26, 2025, 8:47:39 AM4/26/25

to django-...@googlegroups.com

#35852: intcomma not locale aware when given a decimal as string
-------------------------------------+-------------------------------------
Reporter: Jonathan Ströbele | Owner: Tim
| McCurrach
Type: Bug | Status: assigned
Component: contrib.humanize | Version: 5.1
Severity: Normal | Resolution:
Keywords: | Triage Stage: Accepted

Has patch: 1 | Needs documentation: 0

Needs tests: 0 | Patch needs improvement: 0
Easy pickings: 0 | UI/UX: 0
-------------------------------------+-------------------------------------
Changes (by Tim McCurrach):

* has_patch: 0 => 1

--
Ticket URL: <https://code.djangoproject.com/ticket/35852#comment:6>

Django

unread,

May 14, 2025, 4:04:42 AM5/14/25

to django-...@googlegroups.com

#35852: intcomma not locale aware when given a decimal as string
-------------------------------------+-------------------------------------
Reporter: Jonathan Ströbele | Owner: Tim
| McCurrach
Type: Bug | Status: assigned
Component: contrib.humanize | Version: 5.1
Severity: Normal | Resolution:

Keywords: | Triage Stage: Ready for
| checkin

Has patch: 1 | Needs documentation: 0
Needs tests: 0 | Patch needs improvement: 0
Easy pickings: 0 | UI/UX: 0
-------------------------------------+-------------------------------------

Changes (by Sarah Boyce):

* stage: Accepted => Ready for checkin

--
Ticket URL: <https://code.djangoproject.com/ticket/35852#comment:7>

Django

unread,

May 14, 2025, 10:04:15 AM5/14/25

to django-...@googlegroups.com

#35852: intcomma not locale aware when given a decimal as string
-------------------------------------+-------------------------------------
Reporter: Jonathan Ströbele | Owner: Tim
| McCurrach

Type: Bug | Status: closed
Component: contrib.humanize | Version: 5.1
Severity: Normal | Resolution: fixed

Changes (by Sarah Boyce <42296566+sarahboyce@…>):

* resolution: => fixed
* status: assigned => closed

Comment:

In [changeset:"24693a144f5c4cc77e1046e32594ab8dc33776f1" 24693a1]:
{{{#!CommitTicketReference repository=""
revision="24693a144f5c4cc77e1046e32594ab8dc33776f1"
Fixed #35852 -- Fixed intcomma locale-aware formatting of string number
representations.
}}}
--
Ticket URL: <https://code.djangoproject.com/ticket/35852#comment:8>

Reply all

Reply to author

Forward