This is the intended patch.
{{{
diff --git a/django/middleware/csrf.py b/django/middleware/csrf.py
index f323ffb..deaf7d8 100644
--- a/django/middleware/csrf.py
+++ b/django/middleware/csrf.py
@@ -22,6 +22,8 @@ from django.utils.log import log_response
logger = logging.getLogger('django.security.csrf')
+ASCII_ALPHANUMERIC_RE = re.compile('[^a-zA-Z0-9]')
+
REASON_BAD_ORIGIN = "Origin checking failed - %s does not match any
trusted origins."
REASON_NO_REFERER = "Referer checking failed - no Referer."
REASON_BAD_REFERER = "Referer checking failed - %s does not match any
trusted origins."
@@ -107,7 +109,7 @@ def rotate_token(request):
def _sanitize_token(token):
# Allow only ASCII alphanumerics
- if re.search('[^a-zA-Z0-9]', token):
+ if ASCII_ALPHANUMERIC_RE.search(token):
return _get_new_csrf_token()
elif len(token) == CSRF_TOKEN_LENGTH:
return token
}}}
I'm not sure how exactly to profile this change. I tried using the
[https://github.com/django/djangobench/ djangobench] package after some
tinkering to its source code. Since it was reporting changes even on
queries, I wasn't sure to trust it. Any leads on this front would be
great.
I would be happy to make the change, if this seems reasonable.
--
Ticket URL: <https://code.djangoproject.com/ticket/32778>
Django <https://code.djangoproject.com/>
The Web framework for perfectionists with deadlines.
Comment (by Abhyudai):
For what's it worth, the results for the `default_middleware` section when
run on my machine when comparing the patch branch to the main branch were:
{{{
Control: Django 4.0.dev20210524043148 (in git branch main)
Experiment: Django 4.0.dev20210524043148 (in git branch feat/optimize-
csrf)
Running 'default_middleware' benchmark ...
Min: -0.000020 -> -0.000111: 0.1772x faster
Avg: 0.000751 -> 0.000740: 1.0160x faster
Not significant
Stddev: 0.00529 -> 0.00522: 1.0144x smaller (N = 50)
}}}
Although, I'm not sure if this just tests the changes for the `Csrf` class
since the default middleware includes a lot more things.
--
Ticket URL: <https://code.djangoproject.com/ticket/32778#comment:1>
* stage: Unreviewed => Accepted
Comment:
Thanks, sounds good. Would you like to prepare patch? Please use
`_lazy_re_compile()` to avoid compilation when importing the module.
--
Ticket URL: <https://code.djangoproject.com/ticket/32778#comment:2>
* easy: 0 => 1
--
Ticket URL: <https://code.djangoproject.com/ticket/32778#comment:3>
* owner: nobody => Abhyudai
* status: new => assigned
--
Ticket URL: <https://code.djangoproject.com/ticket/32778#comment:4>
* has_patch: 0 => 1
Comment:
[https://github.com/django/django/pull/14442 PR]
--
Ticket URL: <https://code.djangoproject.com/ticket/32778#comment:5>
* stage: Accepted => Ready for checkin
--
Ticket URL: <https://code.djangoproject.com/ticket/32778#comment:6>
* status: assigned => closed
* resolution: => fixed
Comment:
In [changeset:"866dccb65075159c7e99e8d165e52761965f3625" 866dccb6]:
{{{
#!CommitTicketReference repository=""
revision="866dccb65075159c7e99e8d165e52761965f3625"
Fixed #32778 -- Avoided unnecessary recompilation of token regex in
_sanitize_token().
}}}
--
Ticket URL: <https://code.djangoproject.com/ticket/32778#comment:7>
Comment (by Chris Jerdonek):
I added a follow-up PR here that renames `token_re`:
https://github.com/django/django/pull/14461
--
Ticket URL: <https://code.djangoproject.com/ticket/32778#comment:8>
Comment (by GitHub <noreply@…>):
In [changeset:"d270dd584e0af12fe6229fb712d0704c232dc7e5" d270dd58]:
{{{
#!CommitTicketReference repository=""
revision="d270dd584e0af12fe6229fb712d0704c232dc7e5"
Refs #32778 -- Improved the name of the regex object detecting invalid
CSRF token characters.
This also improves the comments near where the variable is used.
}}}
--
Ticket URL: <https://code.djangoproject.com/ticket/32778#comment:9>