$DRUID/var/tmp which grew to almost 1TB (terabyte, yes) now. And $DRUID/var/druid is mere 350GB. Is it safe to delete that $DRUID/var/tmp folder to free the space?--
You received this message because you are subscribed to the Google Groups "Druid User" group.
To unsubscribe from this group and stop receiving emails from it, send an email to druid-user+...@googlegroups.com.
To post to this group, send email to druid...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/druid-user/149265c8-947b-4ae7-864e-2e48d8bda551%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
Hey Nikita,It should be safe to delete anything ingestion-related from tmp if you are not currently doing an ingestion job. Most processes should be cleaning up after themselves, though; could I ask what is in there right now?
Gian
On Wed, Jun 1, 2016 at 1:31 AM, Nikita Salnikov-Tarnovski <ni...@plumbr.eu> wrote:
Good morning.I am testing batch data ingestion in Druid. Currently in my installation there is a folder$DRUID/var/tmpwhich grew to almost 1TB (terabyte, yes) now. And$DRUID/var/druidis mere 350GB. Is it safe to delete that$DRUID/var/tmpfolder to free the space?Thank you in advance,Nikita
--
You received this message because you are subscribed to the Google Groups "Druid User" group.
To unsubscribe from this group and stop receiving emails from it, send an email to druid-user+unsubscribe@googlegroups.com.
tmp/base948886206048112452flush/
└── merged
├── 00000.smoosh
├── meta.smoosh
└── version.bin
1 directory, 3 files
is $DRUID/var/tmp set as the java tmp directory?
On Wednesday, June 1, 2016 at 9:42:21 AM UTC-7, Gian Merlino wrote:
Hey Nikita,It should be safe to delete anything ingestion-related from tmp if you are not currently doing an ingestion job. Most processes should be cleaning up after themselves, though; could I ask what is in there right now?
Gian
On Wed, Jun 1, 2016 at 1:31 AM, Nikita Salnikov-Tarnovski <ni...@plumbr.eu> wrote:
Good morning.I am testing batch data ingestion in Druid. Currently in my installation there is a folder$DRUID/var/tmpwhich grew to almost 1TB (terabyte, yes) now. And$DRUID/var/druidis mere 350GB. Is it safe to delete that$DRUID/var/tmpfolder to free the space?Thank you in advance,Nikita
--
You received this message because you are subscribed to the Google Groups "Druid User" group.
To unsubscribe from this group and stop receiving emails from it, send an email to druid-user+...@googlegroups.com.
--
You received this message because you are subscribed to the Google Groups "Druid User" group.
To unsubscribe from this group and stop receiving emails from it, send an email to druid-user+unsubscribe@googlegroups.com.
To post to this group, send email to druid...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/druid-user/f62efc25-a29b-4a6b-a58d-34d68022eb18%40googlegroups.com.
--
You received this message because you are subscribed to the Google Groups "Druid User" group.
To unsubscribe from this group and stop receiving emails from it, send an email to druid-user+unsubscribe@googlegroups.com.
To post to this group, send email to druid...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/druid-user/79dafc0f-957f-4e04-8f49-3ce5a5e95c41%40googlegroups.com.
Hadoop indexer in YARN mode is totally good in production. It's just Hadoop in _local mode_ that isn't normally suggested for production. That's really just meant for testing and dev. I think a cron job is your best bet for cleaning up the files that it generates.
Gian
On Wed, Jul 12, 2017 at 4:07 PM, <den...@gmail.com> wrote:
Can you please provide a reference where it stated that hadoop-indexer isn't recommended in production and why?--Is there any sane way (besides cronjob) to clean up these files ?Thanks!
On Wednesday, July 12, 2017 at 11:41:38 PM UTC+3, Gian Merlino wrote:It should be safe to clear out the tmp directory when no indexing is running. Also, if you use a remote Hadoop cluster or if you use local mode native indexing (the "index" task in Druid) then this should not be an issue. I believe it should only be an issue with local mode hadoop (which isn't recommended in production anyway).Gian
You received this message because you are subscribed to the Google Groups "Druid User" group.
To unsubscribe from this group and stop receiving emails from it, send an email to druid-user+...@googlegroups.com.
To post to this group, send email to druid...@googlegroups.com.
--
You received this message because you are subscribed to a topic in the Google Groups "Druid User" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/druid-user/_fFdS0impzY/unsubscribe.
To unsubscribe from this group and all its topics, send an email to druid-user+...@googlegroups.com.
To post to this group, send email to druid...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/druid-user/095d244f-0537-4894-a8c3-0ddf3256c27f%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
To unsubscribe from this group and all its topics, send an email to druid-user+unsubscribe@googlegroups.com.
To post to this group, send email to druid...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/druid-user/d69681c2-2bdd-4fb0-9fd4-8d26ba779349%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
To view this discussion on the web visit https://groups.google.com/d/msgid/druid-user/d69681c2-2bdd-4fb0-9fd4-8d26ba779349%40googlegroups.com.
Various parts of Druid use temporary files to interact with the file system. These files can become quite large. This means that systems that have small /tmp directories can cause problems for Druid. Therefore, set the JVM tmp directory to a location with ample space.
Also consider the following when configuring the JVM tmp directory: