Formalizing template loader and debug api's

Preston Timmons

unread,

Dec 28, 2014, 12:30:02 AM12/28/14

to django-d...@googlegroups.com

Hi all,

I've been working on #15053, which enables Django template loaders to extend

templates of the same name recursively. The most common use-case for this is

extending the admin templates without needing to copy the application templates

into a project.

To illustrate the new algorithm, here are some examples:

(-> indicates the next file that is extended)

(fs indicates filesystem template folders)

1. Extend admin template from filesystem directory

fs/admin/base.html # extends admin/base.html

-> django/admin/base.html

2. Extend in multiple filesystem directories

fs/base.html # extends base.html

-> fs2/base.html # extends base.html

-> fs3/base.html

3. Extend using both a filesystem and app loader with multiple directories

fs/base.html # extends base.html

-> fs2/base.html # extends base.html

-> app1/base.html # extends base.html

-> app2/base.html # extends base2.html

-> fs/base2.html # extends base2.html

-> fs2/base2.html

I found it difficult to support this algorithm with the existing template

loaders, and as you can see from example 3, the extends algorithm isn't

necessarily linear with respect to template loaders anymore.

Therefore, the patch I submitted changes three main areas. The first two are

mostly separate from Aymeric's recent changes, but the third is not.

1. Added a new loader api and made it consistent across all loaders.

2. Modified the extends node to track extended templates and passes

them as a skip argument to the loaders.

3. Updated the debug view template postmortem api to work with the new

algorithm, as well as added support for the egg and cached loader.

Below is an explanation of what I changed in 1 and 2 and a proposal for

3 that hopefully works with multiple template engines.

## 1. Add new template loader apis

I tried to solve this patch without changing the template loader apis, but I

eventually decided this was inevitable for two reasons:

1. The loaders don't implement a consistent api. For instance, the

filesystem and app loaders define a get_template_source method that is

used elsewhere for debug information, whereas the egg and cached loaders do

not. The cached loader implements load_template but not load_template_source.

2. The loaders need to be able to skip templates that have already been

extended, but adding a new argument to load_template is difficult to do in a

backwards-compatible way.

This led me to the following loader api:

### Loader API

Loader.get_template_sources

A method that yields all paths the loader will look at for a given template
name. For example, if the filesystem loader receives "index.html" as an
argument, this yields the full path of "index.html" for each directory on the
search path. The cached loader yields tuples of (loader_class, template_paths)
for each configured loader.

This method already exists for the filesystem and app loaders. This patch

implements it for all loaders.

Loader.get_internal

Returns the contents for a template given a ``source`` as returned by

``get_template_sources``.

This is where a filesystem loader would read contents from the filesystem,

or a database loader would read from the database. If a matching template

doesn't exist this should raise a ``TemplateDoesNotExist`` error.

BaseLoader.get_template

Returns a ``Template`` object for a given ``template_name`` by looping

through results from ``get_template_sources`` and calling ``get_internal``.

This returns the first matching template. If no template is found,

``TemplateSyntaxError`` is raised.

This method takes an optional ``skip`` argument. ``skip`` is a set

containing template sources. This is used when extending templates to

allow enable extending other templates of the same name. It's also used

to avoid recursion errors.

In general, it will be enough to define ``get_template_sources`` and

``get_internal`` for custom template loaders. ``get_template`` will

usually not need to be overridden.

This loader api enables the following changes:

## 2. Update the extends tag

The new extends tag keeps track of which sources were tried in the local

context. These sources are passed to the loader ``get_template`` method as

the ``skip`` argument. In doing so, the extends tag never extends the same

source file twice. This enables recursive extending, and also avoids

filesystem recursion errors when extending is truly circular.

The main caveat here is that I changed Template.origin to always be available

on the template--not just when TEMPLATE_DEBUG is true. Without this, the

extends tag has no way to know the source path of the current template.

I think this change is okay, but I don't know the original reasons for

excluding this in the first place. It could be simply because there was no

use-case presented outside of debug.

## 3. Debug api

Django displays a template postmortem message when a matching template isn't

found. This is a linear list grouped by template loader. This api is currently

supported only by the filesystem and app directories loaders.

This api works by looping through template loaders that define a

``get_template_sources`` method, calling that method, and using os.path

functions on each source to see if it exists as a file or is readable. This

information is passed in as ``loader_debug_info`` to the debug template.

There are some limitations to this approach:

* It only works with Django loaders that define a ``get_template_sources``

method -- the filesystem and app loaders

* It only works if ``get_template_sources`` are filesystem paths

* It assumes each loader will only be called at most once and in order. This

isn't always the case if template recursion is enabled.

* The postmortem doesn't account for multiple template engines.

This patch proposes a new debug api that works with all loaders. It simplifies

the logic in the debug views quite a bit, and I think it can work for multiple

template engines as well, as explained below.

### The approach

Rather than trying to rerun template engines or loaders, this patch passes in

debug information as part of ``TemplateDoesNotExist``. This is done by updating

the multiple places where Django catches ``TemplateDoesNotExist`` exceptions

and silently throws them away. Instead of simply silencing them, this patch

accumulates which templates failed and includes them in subsequent

``TemplateDoesNotExist`` exceptions that are raised.

This information is kept as a list of tuples saved to a ``tried`` attribute of

the exception.

Here's a somewhat complex example of this attribute when using multiple

loaders. It extends through multiple loaders, but fails when no more matching

templates exist:

[

# Note: (loader, path, status)

(filesystem.Loader, "/path/to/fs/base.html", "Found")

(filesystem.Loader, "/path/to/fs/base.html", "Skipped")

(app_directories.Loader, "/path/to/app/base.html", "Found")

(filesystem.Loader, "/path/to/fs/base.html", "Skipped")

(filesystem.Loader, "/path/to/app/base.html", "Skipped")

(app_directories.Loader, "/path/to/app2/base.html", "Source does not exist")

]

Here's a before and after of what the postmortem looks like:

Before:

After:

After (complex example):

### Multiple engine support

I'll publish an updated pull request once Aymeric's multiple template engines

branch lands. At that time, I think the ``tried`` attribute will change to

this:

[

# (loader, path, status)

(engine, filesystem.Loader, "/path/to/fs/base.html", "Found")

(engine, filesystem.Loader, "/path/to/fs/base.html", "Skipped")

(engine, app_directories.Loader, "/path/to/app/base.html", "Found")

(engine, filesystem.Loader, "/path/to/fs/base.html", "Skipped")

(engine, filesystem.Loader, "/path/to/app/base.html", "Skipped")

(engine, app_directories.Loader, "/path/to/app2/base.html", "Source does not exist")

]

With this api, other engines can display a postmortem by simply passing a

``tried`` value into ``TemplateDoesNotExist``. I'll qualify simply, though,

because other template engines don't necessarily provide an easy way to get the

list of templates that were tried in their internal exceptions. This seems to

be the case with Jinja2.

Jinja2 loaders have a list_templates method, but it's not that helpful here,

escecially since the results are sorted alphabetically. With that said, I don't

think it would be too bad to support the postmortem for the main loaders:

Filesystem, PackageLoader, ChoiceLoader, etc. It does require reimplementing

some code, though, to loop through the loader search paths.

With multiple engine support, I see the template postmortem info changing

to the format below. This information is optional in case engine is unable to

provide it. Here's a simple example with some engines, 1, 2, and 3, a template

that doesn't exist:

Django tried loading templates with <engine1>:

* path/to/template1 (Source does not exist)

* otherpath/to/template1 (Source does not exist)

Django tried loading templates with <engine2>:

* path/to/template1 (Source does not exist)

* otherpath/to/template1 (Source does not exist)

Django tried loading templates with <engine3>:

* template1 (Source does not exist)

(this engine doesn't provide the list of templates that were tried)

## Conclusion

Overall, I'm pretty happy with this new api. I have a few concerns though,

which are as follows:

1. This patch deprecates the old loader apis: Loader.load_template,

Loader.load_template_sources, etc. I think this is good, but there's already

a lot of deprecations happening with the multiple engine branch.

2. The origin api needs a design decision. I made that available regardless of

the debug setting. Also, it sounds like Aymeric may be doing some work to

refactor this api.

3. The debug api is also something Aymeric is scheduled to work on. Aymeric,

if you've got something better in mind I've no problem waiting for that to be

in place.

Feedback is welcome.

Thanks.

Preston

Unai Zalakain

unread,

Dec 28, 2014, 4:34:13 AM12/28/14

to django-d...@googlegroups.com

Thanks for your great work Preston, I tried to tackle #15053 myself and
I found it was quite a complex issue. I hope I will find some time to
look at it in more detail!

><https://lh5.googleusercontent.com/-6QNQEJ2KRgk/VJ-PD146t3I/AAAAAAAAAL4/7I9_bfG0_CA/s1600/ee0e758e-7e63-11e4-9a72-7571d591de87.png>
>After:
>
><https://lh5.googleusercontent.com/-dV-rsPD1Mng/VJ-PWou7LiI/AAAAAAAAAMA/1jMpl_i8u1M/s1600/17f55944-7e64-11e4-8ff5-8ab12d135f48.png>
>After (complex example):
>
><https://lh6.googleusercontent.com/-01W4-h6L8jY/VJ-Pj8bPeEI/AAAAAAAAAMI/cT9qEU8jWj8/s1600/3010746e-7e64-11e4-919b-d222b66a28d7.png>

>--
>You received this message because you are subscribed to the Google Groups "Django developers (Contributions to Django itself)" group.
>To unsubscribe from this group and stop receiving emails from it, send an email to django-develop...@googlegroups.com.
>To post to this group, send email to django-d...@googlegroups.com.
>Visit this group at http://groups.google.com/group/django-developers.
>To view this discussion on the web visit https://groups.google.com/d/msgid/django-developers/9389e64f-a0cb-475b-9b42-3964e03046be%40googlegroups.com.
>For more options, visit https://groups.google.com/d/optout.

--
unai

signature.asc

Riccardo Magliocchetti

unread,

Dec 28, 2014, 8:37:42 AM12/28/14

to django-d...@googlegroups.com

Hello Preston,

Il 28/12/2014 06:30, Preston Timmons ha scritto:
> Hi all,
>
> I've been working on #15053, which enables Django template loaders to extend
> templates of the same name recursively. The most common use-case for this is
> extending the admin templates without needing to copy the application
> templates
> into a project.

Thanks a lot for working on this!

> ### Loader API
[snip]

> Loader.get_internal
>
> Returns the contents for a template given a ``source`` as returned by
> ``get_template_sources``.

The name is a bit confusing to me, what about Loader.get_contents instead?

thanks,
riccardo

Preston Timmons

unread,

Dec 28, 2014, 12:54:51 PM12/28/14

to django-d...@googlegroups.com

Hi Unai. Yes, your previous work was helpful for me in developing this solution. Some of Aymeric's recent refactorings have simplified things since you last worked on it as well.

Hi Riccardo. Thanks for the suggestion. I don't mind changing the name of get_internal if others agree. That's an easy fix.

Aymeric Augustin

unread,

Dec 28, 2014, 1:21:04 PM12/28/14

to django-d...@googlegroups.com

Hi Preston,

Thanks for working on this ticket — and for bearing with my changes :-)

On 28 déc. 2014, at 06:30, Preston Timmons <preston...@gmail.com> wrote:

## 1. Add new template loader apis

I tried to solve this patch without changing the template loader apis, but I
eventually decided this was inevitable for two reasons:

1. The loaders don't implement a consistent api. For instance, the
filesystem and app loaders define a get_template_source method that is
used elsewhere for debug information, whereas the egg and cached loaders do
not. The cached loader implements load_template but not load_template_source.

Even if it isn’t consistently implemented, the loader API is documented:

https://docs.djangoproject.com/en/1.7/ref/templates/api/#using-an-alternative-template-language

To sum up:

load_template_source(template_name) —> (template_string, template_origin)

load_template(template_name) —> (template, template_origin)

The behavior is loosely defined; anything that matches the signature is fine.

template_origin is especially ill-defined.

Are you proposing to change the API outright? Or are you planning to

implement a deprecation path for existing third-party template loaders?

2. The loaders need to be able to skip templates that have already been
extended, but adding a new argument to load_template is difficult to do in a
backwards-compatible way.

At first sight adding a “skip” optional argument to Engine.get_template,

Engine.find_template, BaseLoader.__call__ and BaseLoader.load_template

may work. Could you explain where the difficulty lies?

This led me to the following loader api:

### Loader API

Loader.get_template_sources

A method that yields all paths the loader will look at for a given template
name. For example, if the filesystem loader receives "index.html" as an
argument, this yields the full path of "index.html" for each directory on the
search path. The cached loader yields tuples of (loader_class, template_paths)
for each configured loader.
This method already exists for the filesystem and app loaders. This patch
implements it for all loaders.

How would a loader that loads templates from a database implement this

method?

It seems to me that the proper way to define such a method is “yields some

opaque objects that implement the following API: …” I suspect that API

would be quite similar to the Origin API — maybe an idealized version if

the current version doesn’t suffice.

Loader.get_internal

Returns the contents for a template given a ``source`` as returned by
``get_template_sources``.

This is where a filesystem loader would read contents from the filesystem,
or a database loader would read from the database. If a matching template
doesn't exist this should raise a ``TemplateDoesNotExist`` error.

Attaching this method to the objects yielded by get_template_sources would

provide better encapsulation than attaching it to the Loader itself.

BaseLoader.get_template

Returns a ``Template`` object for a given ``template_name`` by looping
through results from ``get_template_sources`` and calling ``get_internal``.
This returns the first matching template. If no template is found,
``TemplateSyntaxError`` is raised.

This method takes an optional ``skip`` argument. ``skip`` is a set
containing template sources. This is used when extending templates to
allow enable extending other templates of the same name. It's also used
to avoid recursion errors.

In general, it will be enough to define ``get_template_sources`` and
``get_internal`` for custom template loaders. ``get_template`` will
usually not need to be overridden.

With my suggestions, skip would become an iterable of opaque objects.

These objects would have to be hashable and implement equality.

Your design makes sense. My main suggestion is to fit it in the current APIs.

I would try to reuse load_template instead of introducing get_template and to

hook to a revamped Origin API. The Origin API was documented in Django

1.7; changing it it less disruptive than changing the Loader API. Did you try

this, and if you did, why didn’t it work?

## 2. Update the extends tag

The new extends tag keeps track of which sources were tried in the local
context. These sources are passed to the loader ``get_template`` method as
the ``skip`` argument. In doing so, the extends tag never extends the same
source file twice. This enables recursive extending, and also avoids
filesystem recursion errors when extending is truly circular.

The main caveat here is that I changed Template.origin to always be available
on the template--not just when TEMPLATE_DEBUG is true. Without this, the
extends tag has no way to know the source path of the current template.

I think this change is okay, but I don't know the original reasons for
excluding this in the first place. It could be simply because there was no
use-case presented outside of debug.

Yes, origin was a debug-only API for performance reasons. Most of Django's

template engine dates back to 2006. Back then when performance was a

different story from what it i are today, due to improvements in hardware,

improvements in CPython and availability of PyPy.

## 3. Debug api

Django displays a template postmortem message when a matching template isn't
found. This is a linear list grouped by template loader. This api is currently
supported only by the filesystem and app directories loaders.

My general idea is that Django should define generic debugging APIs that all

template backends, built-in or third-party, can implement. The debugging API

when a template isn’t found shouldn’t know about template loaders because

they’re specific to the Django Template Language.

(…)

### The approach

Rather than trying to rerun template engines or loaders, this patch passes in
debug information as part of ``TemplateDoesNotExist``. This is done by updating
the multiple places where Django catches ``TemplateDoesNotExist`` exceptions
and silently throws them away. Instead of simply silencing them, this patch
accumulates which templates failed and includes them in subsequent
``TemplateDoesNotExist`` exceptions that are raised.

This information is kept as a list of tuples saved to a ``tried`` attribute of
the exception.

That makes a lot of sense. django.template.TemplateDoesNotExist is Django’s

canonical API for dealing with these errors. For example Django’s jinja2 backend

reraises jinja2’s TemplateNotFound as TemplateDoesNotExist.

Here's a somewhat complex example of this attribute when using multiple
loaders. It extends through multiple loaders, but fails when no more matching
templates exist:

[
# Note: (loader, path, status)
(filesystem.Loader, "/path/to/fs/base.html", "Found")
(filesystem.Loader, "/path/to/fs/base.html", "Skipped")
(app_directories.Loader, "/path/to/app/base.html", "Found")
(filesystem.Loader, "/path/to/fs/base.html", "Skipped")
(filesystem.Loader, "/path/to/app/base.html", "Skipped")
(app_directories.Loader, "/path/to/app2/base.html", "Source does not exist")
]

This API is still tied to the Django Template Language. I would prefer a generic

list of (“template identifier”, “loading mechanism”, “reason for failure”) that every

backend could implement.

Furthermore, I’m not sure what this list represents. Is it a “template call stack”

i.e. the chain of templates included up to the point where loading a template

failed? Or is it a list of ways the template backend tried to load the failing

template before giving up?

I think the list is intended to be the latter but your complex example below

makes me think it may be the former.

### Multiple engine support

I'll publish an updated pull request once Aymeric's multiple template engines
branch lands. At that time, I think the ``tried`` attribute will change to
this:

[
# (loader, path, status)
(engine, filesystem.Loader, "/path/to/fs/base.html", "Found")
(engine, filesystem.Loader, "/path/to/fs/base.html", "Skipped")
(engine, app_directories.Loader, "/path/to/app/base.html", "Found")
(engine, filesystem.Loader, "/path/to/fs/base.html", "Skipped")
(engine, filesystem.Loader, "/path/to/app/base.html", "Skipped")
(engine, app_directories.Loader, "/path/to/app2/base.html", "Source does not exist")
]

Adding the engine to the API makes sense, however my suggestion to

make the rest of the data more generic remains.

## Conclusion

Overall, I'm pretty happy with this new api. I have a few concerns though,
which are as follows:

1. This patch deprecates the old loader apis: Loader.load_template,
Loader.load_template_sources, etc. I think this is good, but there's already
a lot of deprecations happening with the multiple engine branch.

Yes, you need a better argument than “difficult to do in a backwards-compatible

way” :-)

2. The origin api needs a design decision. I made that available regardless of
the debug setting. Also, it sounds like Aymeric may be doing some work to
refactor this api.

I agree with making it always available. The real question is about the API itself.

We documented the existing concrete implementations in Django 1.7 but they’re

inconsistent and their purpose is still unclear.

https://docs.djangoproject.com/en/1.7/ref/templates/api/#template-origin

3. The debug api is also something Aymeric is scheduled to work on. Aymeric,
if you've got something better in mind I've no problem waiting for that to be
in place.

I was mainly thinking about providing tracebacks for errors in templates. Jinja2

has fancy integration with Python’s traceback system. Django emulates it in its

debug view with exc.django_template_source and get_template_exception_info.

That’s independent from template loading exceptions.

--

Aymeric.

Preston Timmons

unread,

Dec 28, 2014, 10:53:31 PM12/28/14

to django-d...@googlegroups.com

Hi Aymeric,

Thanks for the feedback!

Even if it isn’t consistently implemented, the loader API is documented:
https://docs.djangoproject.com/en/1.7/ref/templates/api/#using-an-alternative-template-language
To sum up:
load_template_source(template_name) —> (template_string, template_origin)
load_template(template_name) —> (template, template_origin)
The behavior is loosely defined; anything that matches the signature is fine.
template_origin is especially ill-defined.
Are you proposing to change the API outright? Or are you planning to
implement a deprecation path for existing third-party template loaders?

The deprecation is gradual. Old loaders continue to work but can't take advantage

of recursive extending until they implement get_template_sources and

get_internal.

How would a loader that loads templates from a database implement this
method?
It seems to me that the proper way to define such a method is “yields some
opaque objects that implement the following API: …” I suspect that API
would be quite similar to the Origin API — maybe an idealized version if
the current version doesn’t suffice.

My original thought was it would yield a model instance which is then passed to

get_internal. Your right, though. It'd be better to return an origin or

other opaque object. This is a nice change. I'll update the loaders so they

have a consistent return value.

Attaching this method to the objects yielded by get_template_sources would
provide better encapsulation than attaching it to the Loader itself.

Agreed.

At first sight adding a “skip” optional argument to Engine.get_template,
Engine.find_template, BaseLoader.__call__ and BaseLoader.load_template
may work. Could you explain where the difficulty lies?

There are a few reasons I chose to go with a new api:

First, the advantage of "get_template" over "load_template" is that it calls

"get_template_sources" directly. This means the skip and debug information is

readily available in a single place.

If "skip" is added to load_template instead, it also needs to be added to

"load_template_source". That means the skip and debug logic needs to be

duplicated in each of the loaders, including third-party ones. This is because

"load_template_source" is where the looping through template sources currently

exists.

Second, the cached loader reimplements most of load_template with additional

logic. If "get_template_sources" can be counted on, this cleans it up quite a

bit.

Third, returning (template, origin) is redundant if origin is a property of

template. If we change load_template, that ends up sticking around, unless

we later introduce another deprecation for it.

With that said, it's possible to refactor "load_template" to stop using

"load_template_source" and instead call "get_template_sources". If it does

this I can address 1 and 2. I'm not sure if we can do 3. We'd still need to

do the following:

* Add a deprecation period because get_template_sources is required to support

template recursion.

* If get_template_sources is used, there still needs to be a get_internal

method of some sort. I could try to reuse load_template_source, but it's

a pretty big change in behavior of what that function currently does. I'm

not sure this lends itself to a nice upgrade path.

Therefore, I see the deprecation period for 3rd-party loaders being one of:

1. Add get_template_sources and get_internal

2. Add get_template_sources and modify load_template_source

Do you still prefer refactoring load_template instead? If so, should I add

get_internal or try to modify load_template_source?

Your design makes sense. My main suggestion is to fit it in the current APIs.
I would try to reuse load_template instead of introducing get_template and to
hook to a revamped Origin API. The Origin API was documented in Django
1.7; changing it it less disruptive than changing the Loader API. Did you try
this, and if you did, why didn’t it work?

Okay. I think this is how the api would look then:

get_template_sources(template_name) -> yields origin(s)

get_internal(origin) -> template_string

get_template(template_name) -> template

--or--

load_template(template_name) -> (template, template_origin)

This API is still tied to the Django Template Language. I would prefer a generic
list of (“template identifier”, “loading mechanism”, “reason for failure”) that every
backend could implement.

Yes, this is an easy change. I'll add it into my branch.

Furthermore, I’m not sure what this list represents. Is it a “template call stack”
i.e. the chain of templates included up to the point where loading a template
failed? Or is it a list of ways the template backend tried to load the failing
template before giving up?

Sorry. I didn't explain that example clearly, and it was slightly wrong. If by

"template call stack" you mean the complete list of sources that were tried, then

yes. This means that for a "get_template" call, it's simply a linear list of all tried

sources. For extending, it includes information about the initial get_template

call, then all the subsequent calls in the extends node.

For instance, if you had a filesystem loader with two folders that extended a

missing "base.html", tried would appear as:

[

(engine, "/path/to/fs/index.html", "filesystem.Loader", "Found")

(engine, "/path/to/fs/base.html", "filesystem.Loader", "Source does not exist")

(engine, "/path/to/fs2/base.html", "filesystem.Loader", "Source does not exist")

]

Assume the same example found "base.html" in fs and it also extended

"base.html". If "base.html" didn't exist in fs2, tried would appear as:

[

(engine, "/path/to/fs/index.html", "filesystem.Loader", "Found")

(engine, "/path/to/fs/base.html", "filesystem.Loader", "Found")

(engine, "/path/to/fs/base.html", "filesystem.Loader", "Skipped")

(engine, "/path/to/fs2/base.html", "filesystem.Loader", "Source does not exist")

]

Again, if neither base.html or index.html exist in fs, and fs2 has index.html but not

base.html:

[

(engine, "/path/to/fs/index.html", "filesystem.Loader", "Source does not exist")

(engine, "/path/to/fs2/index.html", "filesystem.Loader", "Found")

(engine, "/path/to/fs/base.html", "filesystem.Loader", "Source does not exist")

(engine, "/path/to/fs2/base.html", "filesystem.Loader", "Source does not exist")

]

Perhaps it'd be helpful to annotate "Found" as "Extended" when extending.

Did that explain it better? If not, I can try again. :)

I agree with making it always available. The real question is about the API itself.
We documented the existing concrete implementations in Django 1.7 but they’re
inconsistent and their purpose is still unclear.

https://docs.djangoproject.com/en/1.7/ref/templates/api/# template-origin

Oops, that was my fault. :/

I was looking for something easy to contribute to:

https://code.djangoproject.com/ticket/16096

In the pull request discussion Tim was wary about removing the debug check for

adding the origin. The use-case for the ticket was to display the current

template path to front-end devs. I'm not sure it has seen much usage in the

wild.

I was mainly thinking about providing tracebacks for errors in templates. Jinja2
has fancy integration with Python’s traceback system. Django emulates it in its
debug view with exc.django_template_source and get_template_exception_info.
That’s independent from template loading exceptions.

Okay. Great. I see your branch has been merged. I'll start work on getting mine

updated with your suggestions so far.

Thanks!

Preston

Aymeric Augustin

unread,

Dec 29, 2014, 5:21:47 AM12/29/14

to django-d...@googlegroups.com

2014-12-29 4:53 GMT+01:00 Preston Timmons <preston...@gmail.com>:

Did that explain it better? If not, I can try again. :)

Yes, I understand better now.

The debugging information is two dimensional data: for each level of

template extending, for each template loader, you have one entry —

entries are 4-uples in your current proposal.

The format you described is a flattened view of this list. That's good

because it avoids encoding the concepts of "template extension" or

"template loader" in the API, keeping it generic.

If entries contain the template identifier i.e. the string you pass to

get_template then the two dimensional view can almost be

reconstructed — almost because it won't detect when a template

extends another template with the same name, the very feature

you're working on adding...

At this point, I think we should talk about template origins. What

is their purpose? What is the complete abstract Origin API? The

current API only includes a `name` attribute and an unspecified

`reload()` method; I don't think that's sufficient.

I will audit current uses of Origin in Django. Could you specify

the requirements for objects yielded by get_template_sources?

Then we'll see if they're compatible. If we can avoid introducing

a new concept, that's always better.

Thanks,

--

Aymeric.

Preston Timmons

unread,

Dec 29, 2014, 8:12:05 PM12/29/14

to django-d...@googlegroups.com

The debugging information is two dimensional data: for each level of
template extending, for each template loader, you have one entry —

entries are 4-tuples in your current proposal.

Yes, this is correct. Although there may not be an entry for each loader on

each level of extending. The looping stops if a matching template is found.

At this point, I think we should talk about template origins. What
is their purpose? What is the complete abstract Origin API? The
current API only includes a `name` attribute and an unspecified
`reload()` method; I don't think that's sufficient.

Origin currently has these attributes:

name

For filesystem and app loaders this is the full path to the template.

For the egg loader this is "egg:<app_config.name>:<pkg_name>".

For locmem this is just the template name.

loader

This is the load_template_source method of the loader that found the template.

loadname

This is the original template_name used for load_template_source.

dirs

This is a deprecated value set when dirs is passed to get_template.

reload

This is used by the debug views to get the source content of the template. It

calls self.loader--which is really load_template_source--with loadname and

dirs.

My branch makes some changes to this class. (Note: this will change a bit if

get_template_sources returns an origin instead)

1. name is the value as returned by get_template_sources. For filesystem and

app loaders this is still the full path. For the egg and cached loaders it's

a tuple (app_config.name, template_name) or (loader, source).

2. loader is changed to reference the loader instance rather than the

load_template_source method.

3. A tried attribute is added. This is the same format as passed to

TemplateDoesNotExist.

4. reload is updated to call loader.get_internal when available, otherwise

it falls back to loader.load_template_source.

This is what I use origin for:

1. I add origin.name to the loader extends history. This enables the current

template to be skipped rather than re-extended.

2. If an extended template is found, the new origin.name is added to the

extends history. Also, the original origin.tried and new origin.tried are

combined to give a complete list of templates found before and after extending.

3. If an extended template isn't found, the TemplateDoesNotExist tried value

is combined with the original origin.tried to give a complete list of what

templates were tried before failing.

4. Loader.get_template sets tried on origin with the values returned by

get_template_sources.

## Potential changes

If get_template_sources is changed to return an origin, it seems to make sense

that "tried" would simply be a list of origins instead of a 4-tuple. There

would need to be a status attribute added. This would default to None since we

don't know the status until get_internal is called.

"tried" currently lives on origin. This seems weird to me since origin.tried

would contain a reference to itself. It would probably be better to move that

to the template object. The extends node would then access the list of origins

up to that point, rather than just the current origin.

The extends history would be simplified a bit. Currently, it's a dict that

tracks history for each loader. This is done to eliminate any possible clashes

if two template loaders return the same source value. This could be updated to

just be a list of origins.

It's sufficient for my patch if origin has these values:

origin.name -> the source path - may be a string, tuple, model instance, etc.

origin.loader -> the template loader

origin.loadname -> the template_name value

origin.status -> the status of the template. This is set after get_internal is

called.

origin.loader_name -> returns a string representation of the loader class. This

is used in the debug view postmortem.

## Other template engines

If "tried" is a list of origins, the other template engines would need to use

the origin class if they display a postmortem.

In order to get a correct string representation of the underlying loader in the

postmortem, engines could subclass Origin and implement a custom

loader_name method. Using Origin then should be similar to using the tuple.

I don't think it will cause any tie-in to Django-specific loaders.

Would origin.reload be used for other engines? If so, it could also be added in

the subclass. I'll have to take some time to understand how the traceback works

before I could comment much on this, though.

Thanks.

Preston

Preston Timmons

unread,

Jan 14, 2015, 2:43:20 PM1/14/15

to django-d...@googlegroups.com

Hi Aymeric,

I got a chance to update my patch with the use of origins. The good news is

that it's simpler than the old implementation. I have a few api questions

below. The updated branch is here for now:

https://github.com/prestontimmons/django/compare/ticket-15053-origin?expand=1

As a quick recap:

First, this branch uses origins everywhere: for extends history, for

TemplateDoesNotExist errors, etc. There's no more 4-tuple of statuses.

Second, I use these attributes on the origin:

engine

loader

loadname -> template name

name -> full template path

status -> Skipped, Found, Source does not exist

tried -> a list of origins tried up to this one

reload -> gets template contents

__eq__

__hash__ -> added so the cache loader can use origin as a cache key

Third, loaders implement these methods:

get_template_sources -> yields origins

get_internal -> takes an origin and returns the template contents

Fourth, TemplateDoesNotExist takes an optional tried argument which is also

a list of origins. This will be used in the debug views. I didn't change the

debug views in this branch yet, though.

Questions

Origin attributes

I'd like to do some renaming:

origin.loadname -> origin.template_name

I think it's hard to remember the difference between origin.name and

origin.loadname. I think "template_name" is more conventional.

origin.reload -> origin.read

If the method to read a template contents lives on the origin, I chose to

prefer using the origin method rather than the loader method directly. There's

not much consequence here, but it means it's no longer a special method just

for the debug view.

tried and status

The nice thing about putting the tried and status on origin are that it works

well with a recursive api. It even means I could get rid of the extra context

variable used to store extends history. The sucky things are that tried and

status are set and manipulated outside the origin constructor. The cached

loader also has to be sure to reset the origin before returning the cached

template. That might be warranted, but part of me wonders if it's lousy api.

Deprecation path

If we want to maintain the load_template api, I can change it to accept the

skip argument. Within load_template I can add a hasattr check for

"get_internal". If it exists, use the new code. Otherwise, use the old

"load_template_sources" method.

Since adding a kwarg to load_template could potentially break 3rd-party

loaders that override it, I could add an _accepts_skip_kwarg property to

base.Loader like you've done with _accepts_engine_in_init = True. This

would enable Engine.find_template to exclude that if necessary.

If we do this, I'd like to deprecate the (template, template_origin)

return value in favor of just template. Returning a tuple is redundant.

Ditto for Engine.find_template.

If we use get_template instead, we don't need to deprecate the return value and

we don't need _accepts_skip_kwargs. Instead we can just recommend use of the

new method until load_template is removed.

Either way, load_template_sources is obsoleted.

Which deprecation path do you think is preferable?

Reading template contents

If we have an origin.read() function, an option to consider is updating each

loader to return a custom origin object rather than adding "get_internal"

or "load_template_source" to the loader. These could be something like

FileOrigin, EggOrigin, CacheOrigin, DbOrigin, etc.

I can't think of any deeper advantages this holds, though. So I leave it at

that.

Adding skip to Engine.get_template/Engine.find_template

I wondered whether skip should also be added to Engine.get_template. Doing so

eliminates some looping logic from Engine.find_template that is reimplemented

in the extends node. After implementing it though, I've decided against it:

1) In general, users don't work with origins directly. Therefore I don't think

it has much applicability outside the internal loaders.

2) It takes quite a bit of fiddling around to maintain a linear list of

templates that were found or skipped. Even though the new implementation

removed some duplication, it also wasn't any simpler than the old.

Conclusion

If the api so far makes sense, I'll update the debug view post-mortem

messages and add in the docs and deprecation warnings.

Thanks,

Preston

Aymeric Augustin

unread,

Jan 18, 2015, 5:59:19 AM1/18/15

to django-d...@googlegroups.com

Hi Preston,

On 14 janv. 2015, at 20:43, Preston Timmons <preston...@gmail.com> wrote:

As a quick recap:

First, this branch uses origins everywhere: for extends history, for
TemplateDoesNotExist errors, etc. There's no more 4-tuple of statuses.

Good.

Second, I use these attributes on the origin:

engine
loader
loadname -> template name
name -> full template path
status -> Skipped, Found, Source does not exist
tried -> a list of origins tried up to this one
reload -> gets template contents
__eq__
__hash__ -> added so the cache loader can use origin as a cache key

Perhaps we’ll bikeshed names at some point but let’s leave the “naming

things” problem aside for now.

“name” must be optional for loaders that don’t load templates from files.

“loadname” must be optional too for templates instantiated from strings.

Here optional means that things don’t break if the value is None.

Third, loaders implement these methods:

get_template_sources -> yields origins
get_internal -> takes an origin and returns the template contents

I don’t like the code duplication between get_internal and

load_template_source. I assume your design ended up that way because

you needed get_internal and load_template_sources will be deprecated.

Is this correct?

Fourth, TemplateDoesNotExist takes an optional tried argument which is also
a list of origins. This will be used in the debug views. I didn't change the
debug views in this branch yet, though.

TemplateDoesNotExist is used both internally by the DTL and in generic

APIs such as django.template.loader.get_template. If it’s designed to

accept a list of tried origins, then we need:

- a generic Origin without engine, loader, status and tried (at least)

- a DTL Origin as described above.

Does this make sense?

Questions

Origin attributes

I'd like to do some renaming:

origin.loadname -> origin.template_name

I think it's hard to remember the difference between origin.name and
origin.loadname. I think "template_name" is more conventional.

Yes. Maintaining compatibility with LoaderOrigin as documented in 1.7

appears to be a lost cause anyway.

origin.reload -> origin.read

If the method to read a template contents lives on the origin, I chose to
prefer using the origin method rather than the loader method directly. There's
not much consequence here, but it means it's no longer a special method just
for the debug view.

I think the appropriate way for origins to return template code would be to

call back into the template loaders. Reading template contents in the job

of the loader. It should be possible to write a single origin class that works

for most loaders.

The sole purpose of the reload() method is to display tracebacks in templates.

This logic should be moved out of the debug view and into the template

backends. See #24119. As a consequence, we should leave it outside of this

refactoring and simply preserve it for DTL origins until that ticket gets fixed.

tried and status

The nice thing about putting the tried and status on origin are that it works
well with a recursive api. It even means I could get rid of the extra context
variable used to store extends history. The sucky things are that tried and
status are set and manipulated outside the origin constructor. The cached
loader also has to be sure to reset the origin before returning the cached
template. That might be warranted, but part of me wonders if it's lousy api.

I’ll take your word on this. I haven’t spent enough time working on the

implementation to suggest an alternative.

Deprecation path

If we want to maintain the load_template api, I can change it to accept the
skip argument. Within load_template I can add a hasattr check for
"get_internal". If it exists, use the new code. Otherwise, use the old
"load_template_sources" method.

Since adding a kwarg to load_template could potentially break 3rd-party
loaders that override it, I could add an _accepts_skip_kwarg property to
base.Loader like you've done with _accepts_engine_in_init = True. This
would enable Engine.find_template to exclude that if necessary.

If we do this, I'd like to deprecate the (template, template_origin)
return value in favor of just template. Returning a tuple is redundant.
Ditto for Engine.find_template.

If we use get_template instead, we don't need to deprecate the return value and
we don't need _accepts_skip_kwargs. Instead we can just recommend use of the
new method until load_template is removed.

Either way, load_template_sources is obsoleted.

Which deprecation path do you think is preferable?

If you’re going to change both its arguments and its return type, you can change

the method's name :-) The second solution looks better to me. Hopefully we can

write a “just do this” guide to help people maintaining custom template loaders

upgrading.

Conclusion

If the api so far makes sense, I'll update the debug view post-mortem
messages and add in the docs and deprecation warnings.

Yes.

--

Aymeric.

Preston Timmons

unread,

Jan 19, 2015, 5:51:37 PM1/19/15

to django-d...@googlegroups.com

Perhaps we’ll bikeshed names at some point but let’s leave the “naming
things” problem aside for now.
“name” must be optional for loaders that don’t load templates from files.
“loadname” must be optional too for templates instantiated from strings.
Here optional means that things don’t break if the value is None.

If we combine LoaderOrigin and StringOrigin, then yes, it makes sense for

these to be optional. The origin will then gain an additional "source"

attribute from StringOrigin.

I don’t like the code duplication between get_internal and
load_template_source. I assume your design ended up that way because
you needed get_internal and load_template_sources will be deprecated.
Is this correct?

Correct.

To minimize duplication I can modify load_template_sources to use get_internal.

The test suite won't hit this code path anymore, though. That means I should

probably add additional tests to cover the deprecated methods.

That leaves the cached loader. If we modify Loader.load_template to

call Loader.get_template when get_internal is available, then I think the

cached.Loader.load_template and cached.loader.find_template methods can be

safely removed.

I'll go ahead and make these updates in the branch.

TemplateDoesNotExist is used both internally by the DTL and in generic
APIs such as django.template.loader.get_template. If it’s designed to
accept a list of tried origins, then we need:
- a generic Origin without engine, loader, status and tried (at least)
- a DTL Origin as described above.
Does this make sense?

I think this is fine. "tried" and "loader" are internal concepts to Django.

If an engine is able to support the post-mortem an origin needs only to be able

to return a string representation of the load method and a status. These are

optional if the engine can't provide this information.

Yes. Maintaining compatibility with LoaderOrigin as documented in 1.7
appears to be a lost cause anyway.

Okay.

I think the appropriate way for origins to return template code would be to
call back into the template loaders. Reading template contents in the job
of the loader. It should be possible to write a single origin class that works
for most loaders.
The sole purpose of the reload() method is to display tracebacks in templates.
This logic should be moved out of the debug view and into the template
backends. See #24119. As a consequence, we should leave it outside of this
refactoring and simply preserve it for DTL origins until that ticket gets fixed.

Sounds good.

If you’re going to change both its arguments and its return type, you can change
the method's name :-) The second solution looks better to me. Hopefully we can
write a “just do this” guide to help people maintaining custom template loaders
upgrading.

Okay. I'll post back when I've implemented things further.

Preston Timmons

unread,

Jan 29, 2015, 9:24:18 PM1/29/15

to django-d...@googlegroups.com

Hi Aymeric,

It took me awhile, but I haz an update:

https://github.com/django/django/pull/3475

Loader cleanup

This addresses the code duplication between get_internal and

load_template_sources. I also made the app directories loader inherit

from the filesystem loader, since the only real difference between the two

are the directories they look in.

Cached loader

The cached loader didn't clean up so nicely. I thought I could remove

the custom load_template implementation, but a problem arises if the cached

loader is configured with both recursive and non-recursive loaders. Therefore,

I kept the old implementation in place for now.

Updating the cached loader was a bit tricky. The old loader caches templates

and TemplateDoesNotExist exceptions with a key defined by template_name and

template_dirs. This isn't as straightforward when the skip argument is added

to the equation.

The options were then to create a custom cache key per skip argument, or defer

caching to the lower-level methods. During extending, skip always includes the

originating template. That would lead to many more cache misses, so I chose to

do the latter instead. When skip isn't provided, I use the simple algorithm.

There may be a simpler approach, here.

I've done some performance testing without noticing much difference, but this

is something I'll test some more. I also experimented with using lru_cache

instead of local dictionaries. This simplifies the implementation but is

measurably slower.

Origin hash

The cache algorithm I added uses the origin object as a cache key. The "name"

attribute is used as part of the key value. Therefore, the underlying loader

must be sure to set name to a unique value per template.

You mention above that "name" must be optional for loaders that don't load

templates from the filesystem, but I'm not sure this is true. Either name,

or another attribute on origin, needs to uniquely identify the template within

a loader. Otherwise, there's not a way to implement __eq__ and __hash__. The

locmem loader is a non-filesystem loader that is able to do this. I also

implemented a sample db loader that does as well:

https://github.com/prestontimmons/dbloader/blob/master/dbloader/loader.py

Can you think of a loader where we wouldn't want to require a unique name?

Egg and db loader origin

Some loaders use different identifiers for templates. The filesystem loaders

are just file paths and the locmem loader is just a string key, but the egg

loader is a tuple, (app_config.name, pkg_name) and the sample db loader is

the model instance.

The egg and db loaders don't have an obvious attribute on origin to save the

extra information to. You'll see in those cases that I subclassed origin as

EggOrigin and DbOrigin.

Does that makes sense, or should I look for a way to incorporate that

into LoaderOrigin?

LoaderOrigin vs StringOrigin

I looked at combining these, but didn't yet since it doesn't impact the feature

this patch is implementing. If they're combined to a DTLOrigin, should that

live in django.template.base? It can't live in django.template.loader since

that module isn't safe for django.template.base to import.

context.origin

In order for the origin to be available to the ExtendsNode, I also used the

context.origin = self.origin hack in the Context.render method. I saw the logic

you used to only set engine on toplevel_render. When I moved this assignment

into the if statement or not, it didn't seem to make a difference.

Is that a hack we can live with for now?

Debug view

This branch includes an update to the debug views. The best way to view the

output is to run the debug page in a browser. I added a sample project here

with various scenarios:

https://github.com/prestontimmons/project-15053

There's one case where the debug postmortem isn't optimal. If multiple engines

are configured and a template is found by the second engine that fails

extending, the postmortem won't include the templates that were tried by the

initial get_template call to the first engine. I'm not seeing an obvious way

to persist that information at the moment.

Things to do

The next things I plan to do is measure the cached loader performance more

closely and add the docs.

--

Preston

Preston Timmons

unread,

Feb 12, 2015, 5:45:38 PM2/12/15

to django-d...@googlegroups.com

My pull request is updated with a simplified cache loader and docs. I also found

some nicer solutions to some of the hackier things, like making origin available

to the ExtendsNode.

Preston Timmons

unread,

Feb 16, 2015, 11:46:19 PM2/16/15

to django-d...@googlegroups.com

Hi Aymeric,

I'm thinking of proposing an alternative to the cached loader. This new

approach makes Django faster in general.

To start, I put together some benchmarks here:

https://github.com/prestontimmons/templatebench

The goal was to identify where Django spends it's time. Is it the loaders that

are slow? The parsing? The rendering? Something else?

Here are some basic timings from my Macbook air. This is the cumulative time

to run 1000 iterations:

Instantiating a basic template, i.e. Template("hello"):

0.0344369411

Parsing a complex template with extends and includes:

0.3044617176

Unsurprising so far, but the time for parsing measurably grows as the template

has more to parse.

Running get_template with a simple template, like "hello":

0.1308078766

Running get_template on a complex template:

0.4068300724

With a simple template, more time is spent finding the template than parsing

it. As template contents grow, though, the parsing time far outgrows the

template loading time.

Running get_template on a template with 200 includes:

12.2357971668

Here's a classic case where Django bombs. The parsing time really adds up.

Time to render a basic template:

0.0240666866

Time to render a complex template:

0.1018106937

In this case, the rendering of a complex template takes four times more than

a simple template. This is compared to a 10 times increase in parsing time

from the previous benchmark. A chunk of this time is also parsing, though, due

to extends and include nodes. All in all, the parsing time grows much quicker

than the render time does.

Based on these benchmarks, I've come to believe most of the time in Django

templates is spent on parsing, not on loading templates or rendering. The

cached loader is effective because it removes the need to reparse templates

more than once.

Interesting enough, Jinja2 has different results:

Running get_template with a simple template, like "hello":

0.0112802982

Running get_template with a complex template:

0.0122888088

Even complex templates make little difference in parsing time for Jinja2.

Running get_template on a template with 200 includes:

0.0110247135

Many includes don't make a difference.

Time to render a basic template:

0.0134618282

Time to render a complex template:

0.0217206478

For a complex template, Jinja2 rendering is about 50% faster than Django.

Even so, the overall time difference is small since rendering is quick

anyway.

After digging into Jinja2, I think this is because the Jinja2 environment

keeps an internal cache of templates. If a template is in the cache, it

calls the template "uptodate" method. If "uptodate" is true, the cached

template is used. For filesystem loaders, this incurs a filesystem hit each

time, but that's fine. File system calls aren't the bottleneck. Parsing is.

With that, I wondered if we couldn't do something similar in Django. I made an

experimental commit here, based on my branch:

https://github.com/prestontimmons/django/commit/4683300c60033ba0db472c81244f01ff932c6fb3

This adds internal caching to django.template.engine.Engine and to the extends

node. It also adds an "uptodate" method to the template origin so templates are

reparsed when modified. This is different than the cached loader, which never

checks if templates are changed. That means it's also viable in development.

Running get_template with a simple template, like "hello":

Before: 0.1308078766

After: 0.0192785263

Jinja2: 0.0112802982

Running get_template on a complex template:

Before: 0.4068300724

After: 0.0204186440

Jinja2: 0.0122888088

By parsing only when necessary these benchmarks see a 10-20x speed up.

Running get_template on a template with 200 includes:

Before: 12.2357971668

After: 0.0179648399

Using include many times is now an option.

So far, all the tests pass, and I've been testing with other templates. The

implementation seems almost too easy for the increase in speed.

Granted there's not a dealbreaker I haven't noticed yet, I'd like to propose

that we follow Jinja2's example by adding internal caching in place of the

cache loader. It has a nice speed increase and simplifies things for my

recursive loader branch as well.

There is one risk I can think of. External template tags can store state

on the Node instance rather than context.render_context. This is warned

against in the docs and is not thread-safe. In practice though, if the cached

loader isn't used, a developer could be unaware that they have a problem

at all. Switching to an internal cache would cause those to be revealed.

Even so, I think that can be handled with documentation.

Do you think it's worth making an attempt to formalize this?

Preston

Aymeric Augustin

unread,

Feb 17, 2015, 4:34:29 PM2/17/15

to django-d...@googlegroups.com

Hi Preston,

Thanks for your continued work on this and sorry for my slow answers.

On 30 janv. 2015, at 03:24, Preston Timmons <preston...@gmail.com> wrote:

> The cache algorithm I added uses the origin object as a cache key. The "name"
> attribute is used as part of the key value. Therefore, the underlying loader
> must be sure to set name to a unique value per template.

Indeed, since you added skipping, you can’t use template_name (the value
passed to get_template()) as the cache key.

> Can you think of a loader where we wouldn't want to require a unique name?

Well, given what I just said above, we don’t have a choice here. We must
make this a requirement of the template loader API.

> Does that makes sense, or should I look for a way to incorporate that
> into LoaderOrigin?

Looking at the implementation, it feels weird to add attributes to the origin
that are only used by the loader… If origins were dumb bags of data this
would be all right but they have a reload() method that calls the loader.

Such mutual dependencies never end well :-/ That’s a broader design
problem not directly related to your refactoring, but while you’re breaking
all the things, perhaps we could fix it too.

> LoaderOrigin vs StringOrigin
>
> I looked at combining these, but didn't yet since it doesn't impact the feature
> this patch is implementing. If they're combined to a DTLOrigin, should that
> live in django.template.base? It can't live in django.template.loader since
> that module isn't safe for django.template.base to import.

Yes, it can go into django.template.base or django.template.origin, a new
module.

By the way, I would like to check if we can deprecate the dirs argument of
LoaderOrigin. It may only be used by deprecated code as of Django 1.8.

> context.origin
>
> In order for the origin to be available to the ExtendsNode, I also used the
> context.origin = self.origin hack in the Context.render method. I saw the logic
> you used to only set engine on toplevel_render. When I moved this assignment
> into the if statement or not, it didn't seem to make a difference.
>
> Is that a hack we can live with for now?

Well this isn’t an example you’re supposed to follow… ;-)

context.origin doesn’t make a lot of sense. Wouldn’t context.template be better?
Then you could walk to context.template.origin.

In fact context.template.engine would be much better than context.engine. I’ll try
to fix that in 1.8 before it’s too late.

> There's one case where the debug postmortem isn't optimal. If multiple engines
> are configured and a template is found by the second engine that fails
> extending, the postmortem won't include the templates that were tried by the
> initial get_template call to the first engine. I'm not seeing an obvious way
> to persist that information at the moment.

This isn’t a big deal. Using multiple Django template engines is an edge case.

--
Aymeric.

Aymeric Augustin

unread,

Feb 17, 2015, 4:38:01 PM2/17/15

to django-d...@googlegroups.com

On 17 févr. 2015, at 05:46, Preston Timmons <preston...@gmail.com> wrote:

> Even so, I think that can be handled with documentation.

And an option to disable it. This is very easy to implement.

> Do you think it's worth making an attempt to formalize this?

Yes, I do.

This is an independent follow-up to the big refactoring we’re talking about, isn’t it?

--
Aymeric.

Preston Timmons

unread,

Feb 17, 2015, 5:11:14 PM2/17/15

to django-d...@googlegroups.com

Thanks for your continued work on this and sorry for my slow answers.

No worries. I've been taking the time to get a better grasp of Django templates as a whole.

> Does that makes sense, or should I look for a way to incorporate that
> into LoaderOrigin?

Looking at the implementation, it feels weird to add attributes to the origin
that are only used by the loader… If origins were dumb bags of data this
would be all right but they have a reload() method that calls the loader.

Such mutual dependencies never end well :-/ That’s a broader design
problem not directly related to your refactoring, but while you’re breaking
all the things, perhaps we could fix it too.

Yes. I'll look at this. The reload method isn't documented and I think it can

go away if the debug view is updated.

By the way, I would like to check if we can deprecate the dirs argument of
LoaderOrigin. It may only be used by deprecated code as of Django 1.8.

Deprecate or remove? :)

This argument is one I don't think is documented anywhere.

In fact context.template.engine would be much better than context.engine. I’ll try
to fix that in 1.8 before it’s too late.

Sure. I forgot to mention it but my latest patch actually solves this by passing origin

in context.render_context. This makes sense because render_context is available

only in the current render scope, rather than modifying the global context. I'm just

as happy with accessing context.template, though.

Preston Timmons

unread,

Feb 17, 2015, 5:47:09 PM2/17/15

to django-d...@googlegroups.com

Yes, I do.

This is an independent follow-up to the big refactoring we’re talking about, isn’t it?

Cool.

It depends. The cached loader is what I'm least happy with in my refactoring. This

internal cache idea I think is simpler, more performant, and easier to understand.

Adding it makes the cached loader changes unnecessary.

Because the changes to support it are minimal, I'll at least make a side-branch that

replaces the cached loader changes with an internal cache. If it pans out and doesn't

balloon into further unrelated changes, it might be easier to include it in this refactoring.

Preston

Aymeric Augustin

unread,

Feb 18, 2015, 3:44:02 AM2/18/15

to django-d...@googlegroups.com

2015-02-17 23:47 GMT+01:00 Preston Timmons <preston...@gmail.com>:

It depends. The cached loader is what I'm least happy with in my refactoring. This
internal cache idea I think is simpler, more performant, and easier to understand.
Adding it makes the cached loader changes unnecessary.

In that case, would it be possible to do the caching improvements before the main

refactoring? I'm trying to keep patches at a reviewable size :-)

--

Aymeric.

Preston Timmons

unread,

Feb 18, 2015, 2:12:09 PM2/18/15

to django-d...@googlegroups.com

In that case, would it be possible to do the caching improvements before the main
refactoring? I'm trying to keep patches at a reviewable size :-)

--
Aymeric.

Maybe. The existing loaders don't provide a nice way to add Origin.uptodate. It would

require an informal hack that only supported Django builtin loaders initially.

I'll start by separating out some of the cleanup/groundwork changes I did into separate

PRs. If those can go in first, then there'll be a lot less noise in the patch.

At the moment, I think it will be easiest to land the cache improvements after the

loaders are updated.

Preston Timmons

unread,

Mar 6, 2015, 1:15:41 PM3/6/15

to django-d...@googlegroups.com

A small update:

I have a local branch working again, but nothing ready to review.

I have decided the origin.status and origin.tried apis I mentioned above aren't

going to work. For templates to be safely cacheable in a multi-threaded

environment, they basically need to be treated as immutable. Anything that

manipulates a template or it's subobjects is problematic.

I think I can work with this restraint, though.

The next things I'll probably submit is to remove the origin.reload method. As

mentioned above, we should prefer template loading to go through the loaders

instead. I've made progress here but have went down the rabbit hole of cleaning

up the debug implementation.

It looks promising that we can combine the debug parsing, as brought up in another

thread, with minimal overhead. Most of the extra time is due to extra function calls,

not tokenizing. These calls aren't necessary if there's a single parser and lexer

class. I think we can avoid copying debug info to every token and node as well.

This can also address #24119.

Unfortunately, this part of the code isn't tested well. When I have additional tests

written I'll submit a PR.

Preston

Preston Timmons

unread,

Mar 11, 2015, 11:37:21 PM3/11/15

to django-d...@googlegroups.com

My branch is updated. This now combines the origin classes into one. It also

includes a simpler cache algorithm than before. This same algorithm can be

used if we implement an internal cache to the engine, which is now a small

addition to this patch.

I'd like to see origin.reload go away but it's left intact in this branch. My other

PR to rework template debug information for multiple engines removes this.

A related goal for both of these branches is to simplify the debug view to only

deal with the presentation of the debug information rather than creating or

replaying information. It looks like we can simplify it a lot.

Preston

Reply all

Reply to author

Forward