On Thursday, October 2, 2014 7:34:51 PM UTC+8, Riddhiman Dasgupta wrote:
> Hi,
> I'm a graduate student starting out in the area of deep learning, specifically convolutional neural networks. I am interested in scene parsing, and want to implement the following paper(s):
> Learning Hierarchical Features for Scene LabelingScene Parsing with Multiscale Feature Learning, Purity Trees, and Optimal CoversI have previously used CNNs for image classification, but I understand that here, instead of sending the entire image to the CNN, we need to send local patches obtained by a sliding window.
> Is there an efficient way to extract these so-called dense features from the entire image without having to resort to traversing the entire image using the sliding window, which is computationally very expensive?
>
>
> P.S. I noticed that one paper, viz. Joint Training of a Convolutional Network and a Graphical Model for Human Pose Estimation, talks of extracting dense features efficiently without resorting to a sliding window, but I am unable to understand the exact details.
Hi~ Have solve this problem yet? I am confused by dense feature extraction too. Do you have any good idea? Thanks a lot :-)