Core Dumped/Kernel Shut down

516 views
Skip to first unread message

emilee...@gmail.com

unread,
Mar 2, 2018, 8:18:16 AM3/2/18
to Keras-users
Hello :)

I am trying to run a simple MNIST example using keras. The code I found is directly taken from online at https://github.com/keras-team/keras/blob/master/examples/mnist_cnn.py
When running this on jupyter-notebook it tries to do the first epoch then a pop says the kernel has died and will restart, and it never runs again.
When running this code in the terminal the following output appears during the epoch run time:

Epoch 1/12
2018-03-02 15:07:31.693371: I tensorflow/core/platform/cpu_feature_guard.cc:140] Your CPU supports instructions that this TensorFlow binary was not compiled to use: AVX2 FMA
2018-03-02 15:07:31.770186: I tensorflow/stream_executor/cuda/cuda_gpu_executor.cc:898] successful NUMA node read from SysFS had negative value (-1), but there must be at least one NUMA node, so returning NUMA node zero
2018-03-02 15:07:31.770453: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1212] Found device 0 with properties: 
name: GeForce GTX 960M major: 5 minor: 0 memoryClockRate(GHz): 1.176
pciBusID: 0000:01:00.0
totalMemory: 3.95GiB freeMemory: 3.38GiB
2018-03-02 15:07:31.770471: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1312] Adding visible gpu devices: 0
2018-03-02 15:07:32.271403: I tensorflow/core/common_runtime/gpu/gpu_device.cc:993] Creating TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 3106 MB memory) -> physical GPU (device: 0, name: GeForce GTX 960M, pci bus id: 0000:01:00.0, compute capability: 5.0)
2018-03-02 15:07:33.138197: E tensorflow/stream_executor/cuda/cuda_dnn.cc:378] Loaded runtime CuDNN library: 7101 (compatibility version 7100) but source was compiled with 7004 (compatibility version 7000).  If using a binary install, upgrade your CuDNN library to match.  If building from sources, make sure the library loaded at runtime matches a compatible version specified during compile configuration.
2018-03-02 15:07:33.139061: F tensorflow/core/kernels/conv_ops.cc:717] Check failed: stream->parent()->GetConvolveAlgorithms( conv_parameters.ShouldIncludeWinogradNonfusedAlgo<T>(), &algorithms) 
Aborted (core dumped)

And it jumps out of the python shell.
Any advice to why this would happen would be great, or how to fix it. I am assuming its something to do with the graphics card/tensorflow installation

Thanks :)

Daπid

unread,
Mar 2, 2018, 10:18:37 AM3/2/18
to emilee...@gmail.com, Keras-users
The traceback tells you exactly what is wrong:


2018-03-02 15:07:33.138197: E tensorflow/stream_executor/cuda/cuda_dnn.cc:378] Loaded runtime CuDNN library: 7101 (compatibility version 7100) but source was compiled with 7004 (compatibility version 7000).  If using a binary install, upgrade your CuDNN library to match.  If building from sources, make sure the library loaded at runtime matches a compatible version specified during compile configuration.

You have installed cudnn 7.1, but TF was built against 7.0. Either downgrade cudnn, or rebuild TF in your setup.

--
You received this message because you are subscribed to the Google Groups "Keras-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to keras-users+unsubscribe@googlegroups.com.
To view this discussion on the web, visit https://groups.google.com/d/msgid/keras-users/3bf4ce6e-86d5-4df9-9461-0a3910f546cb%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Reply all
Reply to author
Forward
0 new messages