Confusing about CAFFE's threading

300 views

CPUboostcaffegpukilledubuntu

Skip to first unread message

Zhenpei Yang

unread,

Nov 4, 2016, 3:57:46 PM11/4/16

to Caffe Users

Hi all,

I am new to boost::thread, and i am trying to understand how CAFFE use multi thread to do data prefetching.

What i think about prefetching is that it starts a thread to fetch data into the queue at the beginning. Afterwards during each step, the data layer only need to read data from the queue.

But i am confused after reading the "base_data_layer.cpp".

template <typename Dtype>

void BasePrefetchingDataLayer<Dtype>::Forward_gpu( const vector<Blob<Dtype>*>& bottom, const vector<Blob<Dtype>*>& top)

{ // First, join the thread

JoinPrefetchThread();

// Reshape to loaded data. top[0]->ReshapeLike(this->prefetch_data_);

// Copy the data caffe_copy(prefetch_data_.count(), prefetch_data_.cpu_data(), top[0]->mutable_gpu_data());

if (this->output_labels_)

{

// Reshape to loaded labels. top[1]->ReshapeLike(prefetch_label_);

// Copy the labels. caffe_copy(prefetch_label_.count(), prefetch_label_.cpu_data(), top[1]->mutable_gpu_data());

}

#ifdef USE_MPI

//advance (all_rank - (my_rank+1)) mini-batches to be ready for next run

BaseDataLayer<Dtype>::OffsetCursor(top[0]->num() * (Caffe::MPI_all_rank() - 1));

#endif

// Start a new prefetch thread

CreatePrefetchThread(); }

Seems in every call to the "net.forward", the datalayer run the "JoinPrefetchThread();" which blocks main thread and read a batch data from the disk.

If this is the case, then how this be considered "prefecthing"? I think we should read from a queue that is simultanously be feeded with another thread?

Can anyone shed some light on where i undersand wrong? I appreciate a lot!

Nate Ting

unread,

May 21, 2017, 2:30:17 AM5/21/17

to Caffe Users

Hi Zhenpei,

I know this thread has been a while, but I'm also reading caffe source code now, from an older version of caffe code, pretty consistent with the code you posted above.

If you look into the BasePrefetchingDataLayer<Dtype>::LayerSetUp() function, you'll find it has already called CreatePrefetchThread() to start prefetching the data batch for this round. So next time where CreatePrefetchThread() is called is within the Forward_cpu() function, which is preparing for next batch next round of competition.

Yeah, strictly speaking, if the computation takes very limited time, it might block the main thread as far as I'm concerned.

Correct me if I'm wrong.

Nate

Reply all

Reply to author

Forward

0 new messages