Assume you have a server process running, a pool of worker threads to
perform tasks, and a Queue.Queue() to pass the tasks to the workers.
In order to shut down the server cleanly, you want to ensure that the
workers have all finished their tasks. I like the technique of putting a
None onto the queue, and having each worker check for None, put None back
onto the queue, and terminate itself.
The main program would look something like this -
q.put(None)
for worker in worker_threads:
    worker.join()
At this point you can be sure that each thread has completed its tasks and
terminated itself.
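The whole pattern can be sketched like this (a minimal runnable example, using the Python 3 `queue` module in place of Python 2's `Queue`; the doubling "work" and the names `results` and `worker` are invented for illustration):

```python
import queue
import threading

q = queue.Queue()
results = []
lock = threading.Lock()

def worker():
    while True:
        task = q.get()
        if task is None:
            q.put(None)   # put the sentinel back for the next worker
            return
        with lock:
            results.append(task * 2)   # stand-in for the real work

workers = [threading.Thread(target=worker) for _ in range(3)]
for w in workers:
    w.start()

for n in range(5):
    q.put(n)

q.put(None)       # one sentinel shuts down every worker in turn
for w in workers:
    w.join()

print(sorted(results))
```

After the joins return, every task has been processed, but note that the queue still holds the sentinel that the last worker put back.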
However, the queue is not empty - it still has the final None in it.
Is it advisable to finalise the cleanup like this? -
while not q.empty():
    q.get()
    q.task_done()
q.join()
Or is this completely redundant?
Thanks
Frank Millman
> Assume you have a server process running, a pool of worker threads to
> perform tasks, and a Queue.Queue() to pass the tasks to the workers.
>
> In order to shut down the server cleanly, you want to ensure that the
> workers have all finished their tasks. I like the technique of putting a
> None onto the queue, and having each worker check for None, put None back
> onto the queue, and terminate itself.
>
> The main program would look something like this -
>
> q.put(None)
> for worker in worker_threads:
>     worker.join()
>
> At this point you can be sure that each thread has completed its tasks
> and terminated itself.
>
> However, the queue is not empty - it still has the final None in it.
Yes - but who cares? :)
> Is it advisable to finalise the cleanup like this? -
>
> while not q.empty():
>     q.get()
>     q.task_done()
> q.join()
>
> Or is this completely redundant?
I don't see what you gain by doing that.
On the other hand, you might check whether the queue contains exactly one
item and it is None, just to be sure that all queued work has been done.
(BTW, I'd use a different sentinel instead of None; perhaps this could not
happen in your current code, but it's easy to inadvertently put a None in
the queue, stopping all workers prematurely.)
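A common way to get a collision-proof sentinel is a fresh `object()`, compared by identity. A small sketch of the idea (Python 3 `queue` module; the `SENTINEL` and `seen` names are invented for illustration):

```python
import queue
import threading

# A unique sentinel object cannot collide with any legitimate task,
# unlike None, which might be enqueued by accident.
SENTINEL = object()

q = queue.Queue()
seen = []

def worker():
    while True:
        task = q.get()
        if task is SENTINEL:     # identity check, not equality
            q.put(SENTINEL)      # pass it on for any other workers
            return
        seen.append(task)

t = threading.Thread(target=worker)
t.start()
q.put(None)            # a stray None is now just an ordinary task
q.put("real work")
q.put(SENTINEL)
t.join()
print(seen)
```

Here the accidental None is processed as a normal task instead of killing the worker.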
--
Gabriel Genellina
That was my point. I didn't think I needed to care, but I wanted to be sure
I was not missing something.
I will just discard all the final cleanup code.
>
> (BTW, I'd use a different sentinel instead of None; perhaps this could not
> happen in your current code, but it's easy to inadvertently put a None in
> the queue, stopping all workers prematurely.)
>
Good advice.
Thanks, Gabriel
Frank
Queue objects have support for this signaling baked in with
q.task_done and q.join.
After the server process has put all tasks into the queue, it can join
the queue itself, not the worker threads.
q.join()
This will block until all tasks have been gotten AND completed. The
worker threads would simply do this:
task_data = q.get()
do_task(task_data)
q.task_done()
Using pairs of get and task_done, you no longer need to send a signal.
Just exit the server process and the worker threads will die (assuming
of course, you set .setDaemon(True) before starting each worker
thread).
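Steven's approach can be sketched end to end like this (a minimal example using the Python 3 `queue` module; the `results` list and the `+ 1` "work" stand in for a real `do_task`):

```python
import queue
import threading

q = queue.Queue()
results = []
lock = threading.Lock()

def worker():
    # Daemon workers never leave this loop; they simply die
    # with the process when the main thread exits.
    while True:
        task_data = q.get()
        with lock:
            results.append(task_data + 1)   # stand-in for do_task()
        q.task_done()

for _ in range(3):
    t = threading.Thread(target=worker)
    t.daemon = True       # modern spelling of setDaemon(True)
    t.start()

for n in range(10):
    q.put(n)

q.join()   # blocks until every put() has a matching task_done()
print(sorted(results))
```

No sentinel is needed: once q.join() returns, all queued work is complete, and the daemon threads are cleaned up when the process exits.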
Steven Rumbalski
Thanks, Steven.
This works perfectly in my scenario, and tidies up the code a bit.
Minor point - according to the 2.6 docs, .setDaemon(True) is the old API -
the current way of specifying this is .daemon = True.
Thanks for the tip.
Frank