HDFS Space Utilization keeps on increasing

46 views
Skip to first unread message

Shashi Vishwakarma

unread,
Aug 25, 2015, 9:32:52 AM8/25/15
to apex-dev
Hi,

I have  DataTorrent 3.x installed on my cluster.Even thought there is no data torrent application is running , still my hdfs space utilization goes on increasing. Below is hdfs path that has occupied most of the space.

/user/dtadmin/datatorrent/apps

Why this is happening? Am I missing something here?

Thanks
Shashi

David Yan

unread,
Aug 25, 2015, 1:34:04 PM8/25/15
to Shashi Vishwakarma, apex-dev
Hi Shashi,

That directory is where Apex stores application information, like application jar files, checkpoints, container information, etc.  
Please run this command to see which directory is taking the most space.

$ hdfs dfs -du /user/dtadmin/datatorrent/apps

Then open dtcli and use the get-app-info command look at the information of that application.  For example:

dt> get-app-info application_1439598948299_0557

The field "state" will tell you whether the application is running or not. 

If you don't care about the application, you can safely kill it if it's running and delete the HDFS directory by doing hdfs dfs -rm -r /user/dtadmin/datatorrent/apps/application_xxx_yyy (replace xxx and yyy with appropriate values).  Note that doing so will wipe all stored information about that application.

David

--
You received this message because you are subscribed to the Google Groups "apex-dev" group.
To unsubscribe from this group and stop receiving emails from it, send an email to apex-dev+u...@googlegroups.com.
To post to this group, send email to apex...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/apex-dev/8754d662-4948-4920-96f3-cb58f70d5f39%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Amol Kekre

unread,
Aug 25, 2015, 4:46:18 PM8/25/15
to David Yan, d...@apex.incubator.apache.org, Shashi Vishwakarma, apex-dev

Shashi Vishwakarma

unread,
Aug 26, 2015, 2:44:14 AM8/26/15
to Amol Kekre, David Yan, d...@apex.incubator.apache.org, apex-dev
Thanks David for detailed explanation. I checked apps directory in HDFS,there are around 12858 application in that folder each of having 6.2 M size. It will be a time consuming process to find status of each application by running get-app-info in dtcli. So logged in to web interface of datatorrent(port 9090) but there is no application running at this moment.

Still HDFS space utilization  is increasing,any pointers on this?

Thanks and Regards, 
Shashi

David Yan

unread,
Aug 26, 2015, 12:08:15 PM8/26/15
to Shashi Vishwakarma, Amol Kekre, d...@apex.incubator.apache.org, apex-dev
That's a lot of applications.  I suspect there is something that keeps starting the application, which causes the folder to keep increasing in size. Can you just run get-app-info on dtcli on just one application and see what is being spawned up?

David

Tushar Gosavi

unread,
Aug 26, 2015, 12:51:36 PM8/26/15
to David Yan, Shashi Vishwakarma, Amol Kekre, d...@apex.incubator.apache.org, apex-dev
You can also check yarn resource manager ui and logs to verify which applications are getting restarted continuously.


For more options, visit https://groups.google.com/d/optout.



--
“I'd have blown my top, because I want to beat this damn thing,
 as long as I've gone this far. I can't just leave it after I've found
 out so much about it. I have to keep going to find out ultimately
what is the matter with it in the end."
                Richard P. Feynman

Chetan Narsude

unread,
Aug 26, 2015, 3:31:02 PM8/26/15
to Tushar Gosavi, David Yan, Shashi Vishwakarma, Amol Kekre, d...@apex.incubator.apache.org, apex-dev

Shashi Vishwakarma

unread,
Aug 31, 2015, 10:07:04 AM8/31/15
to Chetan Narsude, Tushar Gosavi, David Yan, Amol Kekre, d...@apex.incubator.apache.org, apex-dev
Hi All,

Thanks for your reply. I believe you guys are right. There is data torrent application which keeps on restarting. I observed resource manager UI, I always see one application running even no one running app from my team.

Chetan,




DTLog.txt

Gaurav Gupta

unread,
Aug 31, 2015, 11:53:17 AM8/31/15
to Shashi Vishwakarma, Chetan Narsude, Tushar Gosavi, David Yan, Amol Kekre, d...@apex.incubator.apache.org, apex-dev
Shashi,
I see what is happening. For now, please stop gateway, clear /user/dtadmin/datatorrent/audit/ folder and start gateway again. This should resolve the issue for now.


Thanks
-Gaurav

Gaurav Gupta

unread,
Aug 31, 2015, 12:09:02 PM8/31/15
to Shashi Vishwakarma, Chetan Narsude, Tushar Gosavi, David Yan, Amol Kekre, d...@apex.incubator.apache.org, apex-dev
Shashi,

Are you running multiple instances of gateway with same license?

Thanks
- Gaurav
Reply all
Reply to author
Forward
0 new messages