during migrating from Django 1.7.5 to 1.8 I found a bug(?)
truncate_date = connection.ops.date_trunc_sql('day', 'created_at')
qs =
MyModel.objects.filter(user_id=1).annotate(max_date=models.Max('created_at')).extra({'created_at':
truncate_date}).values('product_id','created_at','max_date')
qs.query.group_by = ['product_id', truncate_date]
qs.order_by('-max_date')[:10]
Traceback (most recent call last):
File "<console>", line 1, in <module>
File "/home/env/local/lib/python2.7/site-
packages/django/db/models/query.py", line 138, in __repr__
data = list(self[:REPR_OUTPUT_SIZE + 1])
File "/home/env/local/lib/python2.7/site-
packages/django/db/models/query.py", line 162, in __iter__
self._fetch_all()
File "/home/env/local/lib/python2.7/site-
packages/django/db/models/query.py", line 965, in _fetch_all
self._result_cache = list(self.iterator())
File "/home/env/local/lib/python2.7/site-
packages/django/db/models/query.py", line 1085, in iterator
for row in self.query.get_compiler(self.db).results_iter():
File "/home/env/local/lib/python2.7/site-
packages/django/db/models/sql/compiler.py", line 783, in results_iter
results = self.execute_sql(MULTI)
File "/home/env/local/lib/python2.7/site-
packages/django/db/models/sql/compiler.py", line 818, in execute_sql
sql, params = self.as_sql()
File "/home/env/local/lib/python2.7/site-
packages/django/db/models/sql/compiler.py", line 367, in as_sql
extra_select, order_by, group_by = self.pre_sql_setup()
File "/home/env/local/lib/python2.7/site-
packages/django/db/models/sql/compiler.py", line 51, in pre_sql_setup
group_by = self.get_group_by(self.select + extra_select, order_by)
File "/home/env/local/lib/python2.7/site-
packages/django/db/models/sql/compiler.py", line 102, in get_group_by
expressions.append(self.query.resolve_ref(expr))
File "/home/env/local/lib/python2.7/site-
packages/django/db/models/sql/query.py", line 1522, in resolve_ref
self.get_initial_alias(), reuse)
File "/home/env/local/lib/python2.7/site-
packages/django/db/models/sql/query.py", line 1461, in setup_joins
names, opts, allow_many, fail_on_missing=True)
File "/home/env/local/lib/python2.7/site-
packages/django/db/models/sql/query.py", line 1386, in names_to_path
"Choices are: %s" % (name, ", ".join(available)))
FieldError: Cannot resolve keyword u"CAST(DATE_FORMAT(created_at,
'%%Y-%%m-%%d 00:00:00') AS DATETIME)" into field.
This code works perfectly on Django 1.7.5
--
Ticket URL: <https://code.djangoproject.com/ticket/24570>
Django <https://code.djangoproject.com/>
The Web framework for perfectionists with deadlines.
* needs_docs: => 0
* needs_tests: => 0
* needs_better_patch: => 0
Old description:
New description:
Hello,
--
--
Ticket URL: <https://code.djangoproject.com/ticket/24570#comment:1>
Old description:
New description:
Hello,
}}}
--
--
Ticket URL: <https://code.djangoproject.com/ticket/24570#comment:2>
--
Ticket URL: <https://code.djangoproject.com/ticket/24570#comment:3>
Old description:
> Hello,
>
> during migrating from Django 1.7.5 to 1.8 I found a bug(?)
>
> {{{
> truncate_date = connection.ops.date_trunc_sql('day', 'created_at')
>
> qs = MyModel.objects.filter(
> user_id=1
> ).annotate(
> max_date=models.Max('created_at')
> ).extra(
New description:
Hello,
during migrating from Django 1.7.5 to 1.8 I found a bug(?)
{{{
truncate_date = connection.ops.date_trunc_sql('day', 'created_at')
qs = MyModel.objects.filter(
user_id=1
).annotate(
max_date=models.Max('created_at')
).extra(
{'created_at': truncate_date}
).values(
'product_id', 'created_at', 'max_date'
)
qs.query.group_by = ['product_id', truncate_date]
qs.order_by('-max_date')[:10]
}}}
The result is:
FieldError: Cannot resolve keyword u"CAST(DATE_FORMAT(created_at,
'%%Y-%%m-%%d 00:00:00') AS DATETIME)" into field.
This code works perfectly on Django 1.7.5
Traceback:
{{{
--
--
Ticket URL: <https://code.djangoproject.com/ticket/24570#comment:4>
* severity: Normal => Release blocker
* stage: Unreviewed => Accepted
Comment:
Hi,
I can indeed reproduce this issue and I bisected it down to commit
0c7633178fa9410f102e4708cef979b873bccb76.
I'm bumping the severity to release blocker because of the regression.
Thanks!
--
Ticket URL: <https://code.djangoproject.com/ticket/24570#comment:5>
Comment (by shaib):
is that `extra(select= {'created_at': truncate_date})`?
(I'm not sure about the Release Blocker status -- I don't think setting
group_by on a QSet's query like this is public API)
--
Ticket URL: <https://code.djangoproject.com/ticket/24570#comment:6>
Comment (by user0007):
Replying to [comment:6 shaib]:
> is that `extra(select= {'created_at': truncate_date})`?
Yes.
--
Ticket URL: <https://code.djangoproject.com/ticket/24570#comment:7>
* severity: Release blocker => Normal
* stage: Accepted => Unreviewed
Comment:
I skimmed through the code too quickly and didn't spot the
`qs.query.group_by = ...`.
As mentionned by Shai, using the attribute like that is not documented so
this doesn't technically count as a regression.
Is there no way to achieve what you're doing with the documented/supported
`queryset.group_by()` method?
--
Ticket URL: <https://code.djangoproject.com/ticket/24570#comment:8>
Comment (by jarshwah):
What you were doing with your original queryset seems suspect to me, it
looks like there are better ways to accomplish your goals in a completely
supported way. Let me highlight some issues with your original queryset
(I'll remove pieces that aren't relevant for brevity):
{{{
MyModel.objects.annotate(
max_date=models.Max('created_at')
).extra(
{'created_at': truncate_date} # here you're overwriting the field
called created_at, you should choose a new alias
).values(
'product_id', 'created_at', 'max_date'
)
qs.query.group_by = ['product_id', truncate_date] # this is private
API, but you'd need to use the alias
# name of the extra clause ("created_at"), which is ambiguous because
# you've used the same name. group_by=['product_id', 'created_extra']
qs.order_by('-max_date')[:10]
}}}
I've written a comparable (but not exact) queryset with 1.8 that I'd like
you to take a look at.
{{{
In [46]: truncate_date = connection.ops.date_trunc_sql('day', 'created')
In [47]: qs = Stats.objects.extra({'created_day':
truncate_date}).values('created_day').annotate(max_growth=Max('growth'))
In [48]: print(qs.query)
SELECT (DATE_TRUNC('day', created)) AS "created_day",
MAX("scratch_stats"."growth") AS "max_growth" FROM "scratch_stats" GROUP
BY (DATE_TRUNC('day', created))
In [49]: for q in qs:
....: print(q)
....:
{'created_day': datetime.datetime(2015, 3, 29, 0, 0, tzinfo=<UTC>),
'max_growth': 16}
{'created_day': datetime.datetime(2015, 3, 24, 0, 0, tzinfo=<UTC>),
'max_growth': 11}
{'created_day': datetime.datetime(2015, 3, 31, 0, 0, tzinfo=<UTC>),
'max_growth': 17}
{'created_day': datetime.datetime(2015, 3, 25, 0, 0, tzinfo=<UTC>),
'max_growth': 18}
{'created_day': datetime.datetime(2015, 3, 23, 0, 0, tzinfo=<UTC>),
'max_growth': 17}
...
}}}
Note that the query groups by the extra created_day clause because we've
not shadowed any model fields, and we've named it in our values call. The
above code I've written **is** fully supported too, although I'd encourage
you to avoid using `extra` and to instead use
[https://docs.djangoproject.com/en/dev/ref/models/expressions/ Query
Expressions]. Django should provide proper db_functions for extracting
date parts, but that's a work in progress (I would imagine Django 1.9 will
have something).
What I think your query should look like:
{{{
truncate_date = connection.ops.date_trunc_sql('day', 'created_at')
qs = MyModel.objects.filter(
user_id=1
).annotate(
max_date=models.Max('created_at')
).extra(
{'created_day': truncate_date}
).values(
'product_id', 'created_day', 'max_date'
).order_by('-max_date')[:10]
}}}
Can you give that a go and let us know the outcome please?
--
Ticket URL: <https://code.djangoproject.com/ticket/24570#comment:9>
Comment (by user0007):
Thank You for the answer.
Replying to [comment:9 jarshwah]:
> What I think your query should look like:
>
> {{{
> truncate_date = connection.ops.date_trunc_sql('day', 'created_at')
>
> qs = MyModel.objects.filter(
> user_id=1
> ).annotate(
> max_date=models.Max('created_at')
> ).extra(
> {'created_day': truncate_date}
> ).values(
> 'product_id', 'created_day', 'max_date'
> ).order_by('-max_date')[:10]
> }}}
>
> Can you give that a go and let us know the outcome please?
I've tried in that way. This QuerySet generates incorrect SQL (it groups
by `id` which is wrong and gives duplicate results).
SQL output:
{{{
SELECT (CAST(DATE_FORMAT(created_at, '%Y-%m-%d 00:00:00') AS DATETIME)) AS
`created_day`, `mymodel`.`product_id`, MAX(`mymodel`.`created_at`) AS
`max_date` FROM `mymodel` WHERE (`mymodel`.`user_id` = 1) GROUP BY
`mymodel`.`id` ORDER BY `max_date` DESC LIMIT 10
}}}
The query which works is:
{{{
SELECT (CAST(DATE_FORMAT(created_at, '%Y-%m-%d 00:00:00') AS DATETIME)),
`mymodel`.`product_id`, MAX(`mymodel`.`created_at`) AS `max_date` FROM
`mymodel` WHERE (`mymodel`.`user_id` = 1) GROUP BY `mymodel`.`product_id`,
(CAST(DATE_FORMAT(created_at, '%%Y-%%m-%%d 00:00:00') AS DATETIME)) ORDER
BY `max_date` DESC LIMIT 10
}}}
And the question is how to write it using QuerySet? :-) I was able to that
in Django 1.7.5 using `query.group by`, but now it throws errors.
--
Ticket URL: <https://code.djangoproject.com/ticket/24570#comment:10>
Comment (by jarshwah):
Which database backend are you using? Is that MySQL?
--
Ticket URL: <https://code.djangoproject.com/ticket/24570#comment:11>
Comment (by user0007):
Replying to [comment:11 jarshwah]:
> Which database backend are you using? Is that MySQL?
Yup, MySQL.
--
Ticket URL: <https://code.djangoproject.com/ticket/24570#comment:12>
* status: new => closed
* resolution: => invalid
Comment:
It looks like this has turned into a support ticket about how to use
private APIs; see TicketClosingReasons/UseSupportChannels.
--
Ticket URL: <https://code.djangoproject.com/ticket/24570#comment:13>