Hi Yann,
Yann Sagon wrote on 04/25/2014 05:36 PM:
> I'm using Centos 6.5 and I'm having the same issue. I tried what was
> working for Michael Ruepp with no luck.
>
> @Michael: here is what I did without success:
>
> edit the file /etc/dracut.conf =>
> add_drivers+="rdma_ucm rdma_cm ib_addr ib_ipoib mlx4_core mlx4_ib
> mlx4_en mlx5_core mlx5_ib ib_uverbs ib_umad ib_ucm ib_sa ib_cm ib_mad
> ib_core"
>
> generate the initramfs:
> dracut --force
>
> reboot.
>
> same as before.
>
> Any clue?
Does everything work with RDMA if you restart the fhgfs-client a few
minutes after system boot (e.g. fhgfs-net shows RDMA connections from
the client to the servers), so you only have the problem when the
fhgfs-client is started during system boot?
If this is the case, then:
* Do you see the "ib0: link becomes ready" message in dmesg before or
after the "FhGFS mount ready" message in dmesg during system boot?
* Does your /var/log/fhgfs-client.log list the ib0 interface in the
"Usable NICs" log line during system boot?
* Is the init.d script that brings up your IPoIB and the corresponding
Infiniband interface IP address included as a dependency in the
"Should-Start" line of the /etc/init.d/fhgfs-client script? (Note that
this dependency line was updated in the latest fhgfs 2014.01-r5 release.)
Best regards,
Sven Breuner
Fraunhofer
> <mailto:
mic...@ruepp.at>>:
>
> For your Information:
>
> Add
> rdma_ucm
> rdma_cm
> ib_addr
> ib_ipoib
> mlx4_core
> mlx4_ib
> mlx4_en
> mlx5_core
> mlx5_ib
> ib_uverbs
> ib_umad
> ib_ucm
> ib_sa
> ib_cm
> ib_mad
> ib_core
>
> to the initrd.
>
> Now all fhgfs-net of all nodes are RDMA.
>
> Seems to be the solution. IB takes some time.
>
> Regards Mike
>
>
> _________________
> michael ruepp
>
mic...@ruepp.at <mailto:
mic...@ruepp.at>
> fon
+43 676 911 40 90 <tel:%2B43%20676%20911%2040%2090>
> skype michaelruepp
>
> CONFIDENTIALITY NOTICE
> This message (including any attachments transmitted with it)
> contains confidential information and is intended only for the
> individual named herein. If you are not the herein named addressee
> you should not disseminate, distribute, copy or otherwise make use
> of this message. Please notify the sender immediately by e-mail if
> you have received this message by mistake, and delete it from your
> systems.
>
>
>
>
> On 20.11.2013, at 12:59, Bernd Schubert
> <
bernd.s...@itwm.fraunhofer.de
> <mailto:
bernd.s...@itwm.fraunhofer.de>> wrote:
>
> > Hello Michael,
> >
> > it depends, if fhgfs-client is started when no ib interface is
> up, it will not try to switch later on to IB. If the interface was
> up, but not usable it will every connNonPrimaryExpiration requests
> try to switch to ib.
> >
> > Regards,
> > Bernd
> >
> > On 11/20/2013 12:07 PM, Michael Ruepp wrote:
> >> Hi,
> >>
> >> yes, I know. It takes some time. I will load the modules in the
> initrd.
> >>
> >> But is there any polling mechanism on the fhgfs-side to let
> switch to rdma when possible?
> >>
> >> Should I decrease connNonPrimaryExpiration = 10000?
> >>
> >> Thanks,
> >>
> >> Mike
> >>
> >> _________________
> >> michael ruepp
> >>
mic...@ruepp.at <mailto:
mic...@ruepp.at>
> >> fon
+43 676 911 40 90 <tel:%2B43%20676%20911%2040%2090>
> >> skype michaelruepp
> >>
> >> CONFIDENTIALITY NOTICE
> >> This message (including any attachments transmitted with it)
> contains confidential information and is intended only for the
> individual named herein. If you are not the herein named addressee
> you should not disseminate, distribute, copy or otherwise make use
> of this message. Please notify the sender immediately by e-mail if
> you have received this message by mistake, and delete it from your
> systems.
> >>
> >>
> >>
> >>
> >> On 20.11.2013, at 11:55, Bernd Schubert
> <
bernd.s...@itwm.fraunhofer.de
> <mailto:
bernd.s...@itwm.fraunhofer.de>> wrote:
> >>
> >>> Hello Michael,
> >>>
> >>> fhgfs-client.log should tell you why ib-rdma does not work.
> >>>
> >>> Also, please not that the ib subnet manager may need quite some
> time to properly initialize an ib port - up to several minutes. So
> starting services is not sufficient to know if IB is already working
> properly. Again, fhgfs-client.log should give you some indication.
> >>>
> >>>
> >>> Best regards,
> >>> Bernd
> >>>
> >>>
> >>> On 11/20/2013 09:14 AM, Michael Ruepp wrote:
> >>>> I know this article.
> >>>>
> >>>> However, I am curious why the dependency system is not working
> and I don´t want to fill up my initrd with the mellanox modules,
> because of the image nature of bright (maintain more images than
> absolutely necessary).
> >>>>
> >>>> Thanks anyway,
> >>>>
> >>>> Mike
> >>>> _________________
> >>>> michael ruepp
> >>>>
mic...@ruepp.at <mailto:
mic...@ruepp.at>
> >>>> fon
+43 676 911 40 90 <tel:%2B43%20676%20911%2040%2090>
> >>>> skype michaelruepp
> >>>>
> >>>> CONFIDENTIALITY NOTICE
> >>>> This message (including any attachments transmitted with it)
> contains confidential information and is intended only for the
> individual named herein. If you are not the herein named addressee
> you should not disseminate, distribute, copy or otherwise make use
> of this message. Please notify the sender immediately by e-mail if
> you have received this message by mistake, and delete it from your
> systems.
> >>>>
> >>>>
> >>>>
> >>>>
> >>>> On 20.11.2013, at 02:49, Sven Breuner <
bre...@itwm.fhg.de
> <mailto:
fhgfs-user%2Bunsu...@googlegroups.com>.
> <mailto:
fhgfs-user%2Bunsu...@googlegroups.com>.
> <mailto:
fhgfs-user%2Bunsu...@googlegroups.com>.
> <mailto:
fhgfs-user%2Bunsu...@googlegroups.com>.
> <mailto:
fhgfs-user%2Bunsu...@googlegroups.com>.
> <mailto:
fhgfs-user+...@googlegroups.com>.
> For more options, visit
https://groups.google.com/d/optout.