Today, our OrientDB crashed without any errors.

161 views
Skip to first unread message

Erik Pragt

unread,
Apr 7, 2016, 8:05:33 PM4/7/16
to OrientDB
Hi all,

As the question title suggests, our OrientDB database crashed today. We got notified by our monitoring server that our sites aren't accepting any requests anymore, which was quickly traced to OrientDB being unavailable, i.e., not running anymore.

However, when checking the logs (orientdb.err/orientdb.log), there was no indication at all of any crash. What can I do to investigate this issue, and, even better, prevent OrientDB from crashing again? This isn't the first time OrientDB crashes, but usually it's with an OutOfMemory exception, which we fixed(?) by giving it more memory. Currently, the DB hardly does anything (1 request per 10 seconds?), and we're a bit worried that once we do get some requests, that OrientDB might crash more often.

Our current settings to run OrientDB look like this:

ORIENTDB_SETTINGS="-Dprofiler.enabled=true"
JAVA_OPTS_SCRIPT="-Djna.nosys=true -XX:+HeapDumpOnOutOfMemoryError -Djava.awt.headless=true -Dfile.encoding=UTF8 -Drhino.opt.level=9"

# ORIENTDB MAXIMUM HEAP. USE SYNTAX -Xmx<memory>, WHERE <memory> HAS THE TOTAL MEMORY AND SIZE UNIT. EXAMPLE: -Xmx512m
MAXHEAP=-Xmx1024m
# ORIENTDB MAXIMUM DISKCACHE IN MB, EXAMPLE, ENTER -Dstorage.diskCache.bufferSize=8192 FOR 8GB
#MAXDISKCACHE=""
MAXDISKCACHE="-Dstorage.diskCache.bufferSize=8192"

If we need to provide some more information, please let me know.

Kind regards,

Erik Pragt

Erik Pragt

unread,
Apr 7, 2016, 8:18:13 PM4/7/16
to OrientDB
Btw, running OrientDB 2.1.9.

Luigi Dell'Aquila

unread,
Apr 8, 2016, 3:34:55 AM4/8/16
to orient-...@googlegroups.com
Hi Erik,

It's hard to give you an answer here, this is the first report we have from a system that crashes without emitting any logs at all.
In other cases, when we had an unexpected crash with few information, it was because of some OS related procedures (eg. process kill under heavy load), but also in that case we had a log stating that the server was killed.

Could you please share your logs? Maybe we will find some hints in the history

Thanks

Luigi


--

---
You received this message because you are subscribed to the Google Groups "OrientDB" group.
To unsubscribe from this group and stop receiving emails from it, send an email to orient-databa...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Luca Garulli

unread,
Apr 8, 2016, 3:39:06 AM4/8/16
to OrientDB
Hi Erik,

If OrientDB dies with no log, it means the OS killed it. The most common reason is the out of memory.


A few questions for you:
  • How much physical RAM do you have on that server and
  • How much is available before you start OrientDB server?
  • Did you configure any swap for your server?

Best Regards,

Luca Garulli
Founder & CEO

Erik Pragt

unread,
Apr 8, 2016, 4:14:17 AM4/8/16
to OrientDB
Hi Luca,

Thanks for your response.

- I'm running on Digital Ocean, 2 core 2 gig version. 
- Around 1500 mb.
- What kind of swap do you mean? On an OS level?

Thanks for the help!
Erik

Erik Pragt

unread,
Apr 8, 2016, 4:15:39 AM4/8/16
to OrientDB
Hi Luigi,

I can certainly share my logs. I'll send you an email. Thanks for the help!

Erik

Luca Garulli

unread,
Apr 8, 2016, 6:11:09 AM4/8/16
to OrientDB
Hi Erik,
Do you have 2GB of RAM on that server? if this is the case, you cannot have 8GB of RAM for the Diskcache:

-Dstorage.diskCache.bufferSize=8192

So this must be the reason. This is the formula:

diskcache + heap < available ram

So with 1,5GB of free RAM, try these settings:

MAXHEAP=-Xmx700m and -Dstorage.diskCache.bufferSize=512

Best Regards,

Luca Garulli

Founder & CEO

Erik Pragt

unread,
Apr 8, 2016, 8:49:00 AM4/8/16
to orient-...@googlegroups.com
Hi Luca,

I'm sorry, that was an error on my part: I changed that after the crash, before the crash it was on a very low setting, around 400. I'll change it back anyway!


Erik
You received this message because you are subscribed to a topic in the Google Groups "OrientDB" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/orient-database/Xith1NqSX0k/unsubscribe.
To unsubscribe from this group and all its topics, send an email to orient-databa...@googlegroups.com.

Andrey Lomakin

unread,
Apr 8, 2016, 10:56:09 AM4/8/16
to orient-...@googlegroups.com
Hi Erik,
If system error happens (that is the only reason which we can think why it was shutted down without any exceptions) your OS log should contain information about
that error.
Which OS do you use ?
Best regards,
Andrey Lomakin, R&D lead. 
OrientDB Ltd

twitter:@Andrey_Lomakin linkedin:https://ua.linkedin.com/in/andreylomakin

Erik Pragt

unread,
Apr 8, 2016, 6:35:11 PM4/8/16
to orient-...@googlegroups.com
Thanks! I'm using Ubuntu, I think 14.x

Andrey Lomakin

unread,
Apr 9, 2016, 1:41:13 AM4/9/16
to orient-...@googlegroups.com
Hi Erik,
Could you inspect following files /var/log/syslog and /var/log/kern.log using grep for example  grep -i orient /var/log/kern.log and send us results.

Erik Pragt

unread,
Apr 10, 2016, 12:20:58 AM4/10/16
to orient-...@googlegroups.com
Thanks Andrey.

I checked both files, and there's nothing of internet. This is my complete kern.log file for example:

Apr  8 10:10:04 localhost kernel: [1282413.300235] audit_printk_skb: 24 callbacks suppressed
Apr  8 10:10:04 localhost kernel: [1282413.300241] type=1400 audit(1460103004.823:19): apparmor="STATUS" operation="profile_replace" parent=21949 profile="unconfined" name="/usr/sbin/mysqld" pid=21956 comm="apparmor_parser"
Apr  8 10:11:27 localhost kernel: [1282496.440652] type=1400 audit(1460103087.963:20): apparmor="STATUS" operation="profile_replace" parent=22456 profile="unconfined" name="/usr/sbin/mysqld" pid=22460 comm="apparmor_parser"

and my /var/log/syslog also doesn't contain anything exciting....



scott molinari

unread,
Apr 10, 2016, 1:21:37 AM4/10/16
to OrientDB, erik....@jworks.nl
Is MySql also running on the same server? Just wondering. 

Scott

Erik Pragt

unread,
Apr 10, 2016, 1:54:24 AM4/10/16
to orient-...@googlegroups.com
It used to run on the same server indeed, but it's not used, so I uninstalled it.

On Sun, Apr 10, 2016 at 3:21 PM, 'scott molinari' via OrientDB <orient-...@googlegroups.com> wrote:
Is MySql also running on the same server? Just wondering. 

Scott

--

Andrey Lomakin

unread,
Apr 10, 2016, 5:08:57 AM4/10/16
to orient-...@googlegroups.com
Hi,
OK let's  look at /var/log/messages then.
If your application  is exited without your intention OS has to log it somewhere .
So let's keep  digging. 
Also what about DB server logs .
Could you send them to us ?

вс, 10 Апр 2016, 8:54 Erik Pragt <erik....@jworks.nl>:
You received this message because you are subscribed to the Google Groups "OrientDB" group.
To unsubscribe from this group and stop receiving emails from it, send an email to orient-databa...@googlegroups.com.

For more options, visit https://groups.google.com/d/optout.

Erik Pragt

unread,
Apr 11, 2016, 6:09:03 AM4/11/16
to orient-...@googlegroups.com
On 10 Apr 2016, at 19:08, Andrey Lomakin <lomakin...@gmail.com> wrote:

Hi,
OK let's  look at /var/log/messages then.

I’m not sure if that’s normal, but I don’t have a /var/log/messages…. Any other location I could look?

If your application  is exited without your intention OS has to log it somewhere .
So let's keep  digging. 
Also what about DB server logs .
Could you send them to us ?

I already sent the server logs…but there was really nothing interesting in it.

Andrey Lomakin

unread,
Apr 11, 2016, 6:19:40 AM4/11/16
to orient-...@googlegroups.com
Hi,
OK let's go further. 

  1. Did you open database after server was shutted down ?
  2. If so do you have exact copy of this database before server was started again ?

Andrey Lomakin

unread,
Apr 11, 2016, 6:21:50 AM4/11/16
to orient-...@googlegroups.com
Also do  you know time when process was shutted down ? Is it possible to send us system logs in interval of hour before and after shutdown ?

Erik Pragt

unread,
Apr 11, 2016, 6:30:00 AM4/11/16
to orient-...@googlegroups.com
On 11 Apr 2016, at 20:19, Andrey Lomakin <lomakin...@gmail.com> wrote:

Hi,
OK let's go further. 

  1. Did you open database after server was shutted down ?

What do you mean exactly? Access it through the console? No, I didn’t. I got a message from my monitoring solution, checked if I could access the site myself, I couldn’t, checked the server, saw that OrientDb wasn’t running anymore, checked the server logs of my app and orientdb, started OrientDB and restarted my app, and that was it.

  1. If so do you have exact copy of this database before server was started again ?
No, unfortunately not. Actually, the databases is completely different now, in terms of data and structure. I think right now, it’s hard to continue with this, without any information. If it happens again next time, I’ll make a copy of the DB, the logs, etc, and go from there….okay?

Erik

Erik Pragt

unread,
Apr 11, 2016, 6:30:54 AM4/11/16
to orient-...@googlegroups.com
On 11 Apr 2016, at 20:21, Andrey Lomakin <lomakin...@gmail.com> wrote:

Also do  you know time when process was shutted down ? Is it possible to send us system logs in interval of hour before and after shutdown ?

Yes, I know the time. Sure, I can do that, but which systems logs?

Erik

Andrey Lomakin

unread,
Apr 11, 2016, 7:18:30 AM4/11/16
to orient-...@googlegroups.com
Hi, 
Is it possible to send us /var/log logs but if possibly only for this time ?

Reply all
Reply to author
Forward
0 new messages