LNet couldn't load the driver module (kqswlnd) for elan. What are your
lnet module parameters?
Please also run:
ls /lib/modules/2.6.18-53.1.14.el5.lustre/kernel/net/lustre
rpm -ql lustre-modules
Thanks,
Isaac
> Lustre: Removed LNI [4]192.168.0.3@tcp
> LustreError: 2359:0:(events.c:654:ptlrpc_init_portals()) network
> initialisation failed
_______________________________________________
Lustre-discuss mailing list
Lustre-...@lists.lustre.org
http://lists.lustre.org/mailman/listinfo/lustre-discuss
On Mon, Mar 10, 2008 at 10:04:50AM -0500, mitcheloc wrote:
>
> [root@catapult ~]# dmesg
> Lustre: OBD class driver, [2]info@clusterfs.com
It should be in /etc/modprobe.conf or some file under /etc/modprobe.d.
Exact location depends on your distribution. Look for a line that
starts with "options lnet ".
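A quick way to look for that line is sketched below. The helper name find_lnet_options is made up for this example, and the two paths passed to it are only the usual candidates; adjust them for your distribution.

```shell
# Print every "options lnet ..." line found in the given files or directories.
# Paths that do not exist are skipped silently.
# (find_lnet_options is a hypothetical helper, not a Lustre tool.)
find_lnet_options() {
    for path in "$@"; do
        [ -e "$path" ] || continue
        grep -rh '^[[:space:]]*options[[:space:]]\{1,\}lnet[[:space:]]' "$path" || true
    done
}

find_lnet_options /etc/modprobe.conf /etc/modprobe.d
```

If nothing is printed, no lnet options are set in those locations.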
>
> And as you requested:
>
> [root@catapult ~]# ls
> /lib/modules/2.6.18-53.1.14.el5.lustre/kernel/net/lustre
> ksocklnd.ko libcfs.ko lnet.ko lnet_selftest.ko
>
The kqswlnd.ko is missing.
Isaac
> Where should kqswlnd.ko be coming from?
If you don't have Quadrics Elan hardware, you can change it to:
options lnet networks=tcp0
Otherwise, you need to compile lustre with proper QsNet support.
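To check which driver modules are actually on disk for the networks you configure, something along these lines may help. It is only a sketch: the function names are invented here, only the two network types from this thread are mapped, and the module directory assumes the Lustre kernel is the one currently running.

```shell
# Map an LNet network type to the kernel module (LND) that drives it.
# Only the two types discussed in this thread are covered.
# (Both function names are hypothetical helpers.)
lnd_module_for() {
    case "$1" in
        tcp*)  echo ksocklnd ;;
        elan*) echo kqswlnd ;;
        *)     return 1 ;;
    esac
}

# Succeed if <module>.ko exists anywhere below the given directory.
module_present() {
    [ -n "$(find "$1" -name "$2.ko" 2>/dev/null)" ]
}

# Assumption: the running kernel is the Lustre-patched one.
moddir=/lib/modules/$(uname -r)/kernel/net/lustre
for net in tcp0 elan0; do
    mod=$(lnd_module_for "$net")
    if module_present "$moddir" "$mod"; then
        echo "$net: $mod.ko found"
    else
        echo "$net: $mod.ko missing"
    fi
done
```

Any network reported as missing should either be dropped from the "options lnet" line or have its module built.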
Run the commands Isaac mentioned:
ls /lib/modules/2.6.18-53.1.14.el5.lustre/kernel/fs/lustre
rpm -ql lustre-modules
If so, please try running "modprobe mgs" manually to see whether any messages
are displayed.
Jack
> ------------------------------------------------------------------------
#ls /lib/modules/2.6.18-53.1.13.el5_lustre.1.6.4smp/kernel/fs/lustre/
fsfilt_ldiskfs.ko lov.ko lvfs.ko mgc.ko obdecho.ko ost.ko
llite_lloop.ko lquota.ko mdc.ko mgs.ko obdfilter.ko ptlrpc.ko
llog_test.ko lustre.ko mds.ko obdclass.ko osc.ko
I'm not sure which commands you used to compile Lustre (configure or lbuild).
I'd recommend that you recompile Lustre with the patched Lustre modules.
Normally you don't need to specify any flags, but try the following commands,
or follow the steps described in the Lustre manual:
./configure --disable-modules --disable-utils --disable-liblustre --disable-tests --disable-doc
make clean
make dist
sh -x $CUR_LUSTRE/build/lbuild --target=2.6-rhel5 --tag=b1_6 --kerneltree=/path/to/kernel \
--target-arch=$ARCH --lustre=$CUR_LUSTRE/lustre-$VERSION.tar.gz --release ...
> Apparently it is not on my system. Looking in the lustre source, it
> looks like mgs should have been compiled with everything else. Is
> there a special flag to enable it?
>
> Thank you,
> Mitchel
Jack,

I did issue a ./configure and make before trying that command. Here is my second go at the command:

[root@catapult lustre-1.6.4.3]# sh -x /usr/src/lustre-1.6.4.3/build/lbuild --target=2.6-rhel5 --tag=b1_6 --kerneltree=/home/mitchel/rpmbuild/BUILD/kernel-2.6.18/linux-2.6.18.i686 --lustre=/usr/src/lustre-1.6.4.3.tar.gz

I'm not sure where $CUR_LUSTRE is supposed to come from?

I get the exact same error running that command:

.......
+ export CC=gcc
+ CC=gcc
/usr/src/lustre-1.6.4.3/build/lbuild: line 1095: syntax error near unexpected token `<'
/usr/src/lustre-1.6.4.3/build/lbuild: line 1095: ` read a b < <(echo $(pwd)/usr/src/linux-*)'

Sorry if I'm overlooking something obvious....
On Mon, Mar 10, 2008 at 4:08 PM, Jack Chen <Jian...@sun.com> wrote:
Hi Mitcheloc,
I'm also not sure whether the parameters I used are suitable for you; I just
wanted to show you what the Lustre build process looks like. You must run
./configure and make before using the lbuild command, and you need to
add --lustre=$CUR_LUSTRE/lustre-$VERSION.tar.gz to the lbuild command.
I think the steps on the quick start page are correct for building a patched
Lustre kernel. Please recompile Lustre following the quick start doc, with my
method as a reference, and let's see whether the missing mgs issue is fixed.
Regards,
Jack
mitcheloc wrote:
> Jack,
>
> I mostly followed the quick start page on the wiki with a combination
> of other sources to figure out how to patch the CentOS kernel properly.
>
> Where did you get that lbuild command? I've not seen instructions to
> use it anywhere.
>
> I guessed at where the kerneltree param needed to point. Here is what
> happens:
>
> [root@catapult lustre-1.6.4.3]# sh -x /usr/src/lustre-1.6.4.3/build/lbuild
> --target=2.6-rhel5 --tag=b1_6
> --kerneltree=/home/mitchel/rpmbuild/BUILD/kernel-2.6.18/linux-2.6.18.i686/
> + TOPDIR=/usr/src/lustre-1.6.4.3
> + KERNELDIR=
> + LINUX=
> + LUSTRE=
> + RELEASE=false
> + DO_SRC=0
> + DOWNLOAD=1
> + TAG=
> + CANONICAL_TARGET=
> + TARGET=
> ++ uname -m
> + TARGET_ARCH=i686
> + TARGET_ARCHS=
> + TARGET_ARCHS_ALL=i686
> + '[' i686 = i686 ']'
> + TARGET_ARCHS_ALL='i686 i586 i386'
> + CONFIGURE_FLAGS=
> + EXTERNAL_PATCHES=
> + EXTRA_VERSION=
> + LUSTRE_EXTRA_VERSION=
> + STAGEDIR=
> + TMPDIR=/var/tmp
> + TIMESTAMP=
> + REUSERPM=
> + REUSEBUILD=
> + NORPM=false
> + LDISKFSRPM=true
> + SKIPLDISKFSRPM='v1_4_* b1_4'
> + SMPTYPES='smp bigsmp default '\'''\'''
> + KERNCONFSMPTYPE=
> + PATCHLESS=false
> + LINUXOBJ=
> + REUSEDKERNELMASK=
> + DISTRO=
> + KERNELTREE=
> + KERNELRPMSBASE=
> + KERNELRPM=
> + KERNELSOURCERPM=
> + RPMSMPTYPE=
> + KERNELRPMCONFIG=
> + KERNELRPMRELEASE=
> + KERNELCOMPILEDIR=
> + KERNEL=
> + SERIES=
> + CONFIG=
> + VERSION=
> + RHBUILD=0
> + SUSEBUILD=0
> + LINUX26=0
> + SUSEBUILD=0
> + BASE_ARCHS=
> + BIGMEM_ARCHS=
> + BOOT_ARCHS=
> + JENSEN_ARCHS=
> + SMP_ARCHS=
> + BIGSMP_ARCHS=
> + PSERIES64_ARCHS=
> + UP_ARCHS=
> ++ date
> + DATE='Mon Mar 10 13:37:28 PDT 2008'
> + USE_DATESTAMP=1
> + RPMBUILD=
> + export CC=gcc
> + CC=gcc
> /usr/src/lustre-1.6.4.3/build/lbuild: line 1095: syntax error near unexpected token `<'
> /usr/src/lustre-1.6.4.3/build/lbuild: line 1095: ` read a b < <(echo $(pwd)/usr/src/linux-*)'
>
>
> Thanks,
> Mitchel
Cheers,
Jack
> > [root@catapult lustre-1.6.4.3]# sh -x
> Do not use sh to execute the lbuild command; please retry with
> "./path/to/lbuild $parameters"
Or alternatively, if you do want to set flags on execution of lbuild,
use "bash" not "sh":
bash -x .../lbuild.
lbuild contains bashisms that are not supported by "sh" (i.e. the
strictly POSIX-compliant variant of the Bourne shell).
Cheers,
b.
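For illustration, the construct that line 1095 trips over is process substitution, "< <(...)". The sketch below assumes bash is installed and on the PATH. Both function names are made up for this example and are not part of lbuild. The first runs the same kind of read through bash explicitly; the second gets a similar result with a here-document, which plain sh also accepts.

```shell
# Process substitution is a bash extension, so hand the snippet to bash
# explicitly instead of relying on whatever "sh" happens to be.
# (first_word_via_bash and first_word_posix are hypothetical helpers.)
first_word_via_bash() {
    bash -c 'read a b < <(echo "$1"); printf "%s\n" "$a"' _ "$1"
}

# Portable alternative: feed "read" from a here-document instead.
first_word_posix() {
    read a b <<EOF
$1
EOF
    printf '%s\n' "$a"
}

first_word_via_bash "one two"
first_word_posix "one two"
```

Both calls print the first word of their argument.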