open-iscsi + LIO kernel crash
Bart Van Assche  
From: "Bart Van Assche" <bart.vanass...@gmail.com>
Date: Fri, 8 Feb 2008 15:40:26 +0100
Local: Fri, Feb 8 2008 9:40 am
Subject: open-iscsi + LIO kernel crash
I have retested the issue I encountered a few days ago with kernel
2.6.24 and LIO 2.9.0.209, but unfortunately, this still triggers a
kernel crash ... This happens systematically after the first attempt
to perform iSCSI discovery with the open-iscsi command line tools.

Setup:

[ TARGET ]

$ cat /proc/iscsi_target/version_info
Linux-iSCSI.org iSCSI Target Core Stack v2.9.0.209 on Linux/x86_64 on 2.6.24
$ rmmod iscsi_target_mod
$ modprobe iscsi_target_mod
$ target-ctl settargetname targetname=iqn.2007-05.com.example
$ target-ctl addtpg tpgt=0
$ target-ctl settpgattrib tpgt=0 authentication=0
$ target-ctl  addnptotpg tpgt=0 ip=$(ip -family inet addr show dev eth0 \
  | sed -n 's:.*inet \([0-9.]*\).*:\1:p') port=3260
$ target-ctl  addnptotpg tpgt=0 ip=$(ip -family inet addr show dev ib0 \
  | sed -n 's:.*inet \([0-9.]*\).*:\1:p') port=3260
$ target-ctl addhbatotarget hba_id=0 hba_type=5 rd_host_id=0
$ target-ctl createvirtdev hba_id=0 rd_device_id=0 rd_pages=$((2**31/4096))
$ target-ctl addluntodev tpgt=0 iscsi_lun=0 hba_id=0 rd_device_id=0
$ target-ctl enabletpg tpgt=0

[ INITIATOR ]

$ rm -rf /etc/iscsi/nodes /etc/iscsi/send_targets
$ iscsiadm -m discovery -t sendtargets -p ${iscsi_target_ip}
10.100.100.10:3260,0 iqn.2007-05.com.example
192.168.102.10:3260,0 iqn.2007-05.com.example

Result (target kernel log):

Received iSCSI login request from 192.168.102.12 on TCP Network Portal
192.168.102.10:3260
Set np->np_login_tpg to ffff81015f37ba00
------------------------------------------------------------------
HeaderDigest:                 None
DataDigest:                   None
MaxRecvDataSegmentLength:     32768
IFMarker:                     No
OFMarker:                     No
------------------------------------------------------------------
------------------------------------------------------------------
InitiatorName:                iqn.1993-08.org.debian:01:b5698b924985
TargetAlias:                  iSBE Target
InitiatorAlias:               INF012
TargetPortalGroupTag:         0
DefaultTime2Wait:             2
DefaultTime2Retain:           0
ErrorRecoveryLevel:           0
SessionType:                  Discovery
------------------------------------------------------------------
iSCSI Login successful on CID: 0 from 192.168.102.12 to 192.168.102.10:3260,0
Incremented iSCSI Connection count to 1 from node:
iqn.1993-08.org.debian:01:b5698b924985
Established iSCSI session from node: iqn.1993-08.org.debian:01:b5698b924985
Incremented number of active iSCSI sessions to 1 on iSCSI Target Portal Group: 0
Cleared np->np_login_tpg
Decremented iSCSI connection count to 0 from node:
iqn.1993-08.org.debian:01:b5698b924985
Released iSCSI session from node: iqn.1993-08.org.debian:01:b5698b924985
Decremented number of active iSCSI Sessions on iSCSI TPG: 0 to 0

general protection fault: 0000 [1] SMP
CPU 0
Modules linked in: iscsi_target_mod ib_iser iscsi_tcp libiscsi
scsi_transport_iscsi rdma_ucm rdma_cm iw_cm ib_addr ib_uverbs ib_ipoib
ib_cm ib_sa ipv6 parport_pc lp parport loop af_packet ib_mthca
serio_raw pcspkr ib_mad ib_core psmouse iTCO_wdt iTCO_vendor_support
shpchp pci_hotplug evdev ext3 jbd mbcache sg sd_mod sr_mod cdrom
ata_generic ata_piix ehci_hcd libata uhci_hcd scsi_mod usbcore e1000
dm_mirror dm_snapshot dm_mod fuse
Pid: 2296, comm: kjournald Not tainted 2.6.24 #1
RIP: 0010:[<ffffffff80289730>]  [<ffffffff80289730>] kmem_cache_alloc+0x40/0xa0
RSP: 0018:ffff81015da4fce0  EFLAGS: 00010086
RAX: 0000000000000000 RBX: 8000000000000874 RCX: ffffffff8026c2a3
RDX: 0000000000000000 RSI: 0000000000011200 RDI: ffffffff804ef940
RBP: 0000000000000282 R08: 0000000000000000 R09: ffff81015d985800
R10: ffff810006721040 R11: ffffffff8030b150 R12: ffff810006828720
R13: 0000000000011200 R14: ffff81015da4fd10 R15: 0000000000000010
FS:  0000000000000000(0000) GS:ffffffff804e3000(0000) knlGS:0000000000000000
CS:  0010 DS: 0018 ES: 0018 CR0: 000000008005003b
CR2: 00002adae7452060 CR3: 0000000077b8f000 CR4: 00000000000006e0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Process kjournald (pid: 2296, threadinfo ffff81015da4e000, task
ffff81015de8a000)
Stack:  ffff81015d985000 0000000000000000 ffff81015fc4b5a0 0000000000011210
 ffff81015fc4b5d0 ffffffff8026c2a3 ffff81015d948c00 0000000000000001
 ffff81015d55b1cc ffffffff802b0f17 0000000000000020 ffffffff881211aa
Call Trace:
 [<ffffffff8026c2a3>] mempool_alloc+0x43/0x120
 [<ffffffff802b0f17>] generic_block_bmap+0x37/0x50
 [<ffffffff881211aa>] :jbd:__journal_file_buffer+0x8a/0x1a0
 [<ffffffff802b58be>] bio_alloc_bioset+0x7e/0x120
 [<ffffffff802b59c0>] bio_alloc+0x10/0x30
 [<ffffffff802b165f>] submit_bh+0x5f/0x130
 [<ffffffff88123ca3>] :jbd:journal_commit_transaction+0xd13/0xe90
 [<ffffffff8023fc64>] lock_timer_base+0x34/0x70
 [<ffffffff88126c8c>] :jbd:kjournald+0xdc/0x230
 [<ffffffff8024b9f0>] autoremove_wake_function+0x0/0x30
 [<ffffffff88126bb0>] :jbd:kjournald+0x0/0x230
 [<ffffffff8024b64b>] kthread+0x4b/0x80
 [<ffffffff8020d0a8>] child_rip+0xa/0x12
 [<ffffffff8024b600>] kthread+0x0/0x80
 [<ffffffff8020d09e>] child_rip+0x0/0x12

Code: 48 8b 04 c3 49 89 04 24 55 9d 66 45 85 ed 78 2e 48 89 d8 48
RIP  [<ffffffff80289730>] kmem_cache_alloc+0x40/0xa0
 RSP <ffff81015da4fce0>
---[ end trace 8cd4fe4bb7615ee9 ]---

Bart Van Assche.


Nicholas A. Bellinger  
From: "Nicholas A. Bellinger" <n...@linux-iscsi.org>
Date: Fri, 08 Feb 2008 17:04:55 -0800
Local: Fri, Feb 8 2008 8:04 pm
Subject: Re: open-iscsi + LIO kernel crash

Hey Bart,

I just ran:

for ((i=0;i<1000;i++)); do
        iscsiadm -m discovery -t sendtargets -p 172.16.201.129;
done

on a stock Ubuntu 7.10 i386 running 2.6.22-14-generic with the default
Open/iSCSI implementation.  I ran the 1000-iteration loop a dozen
times, and everything looks fine against the Debian LIO-VM, which is
running 2.9.0.188.  As I mentioned previously, nothing has changed
related to SendTargets from .180 to CURRENT, nor anything else that
I can think of that would cause these types of reproducible general
protection faults in the discovery path with any initiator.

This leads me to believe that there is still something messed up about
your LIO builds if running over traditional iSCSI with a known working
configuration is still not working. (ie: non IPoIB, for the moment, but
as we can both agree, this should not make any difference).  Can you
triple check that your running kernel and the source that the module is
being built against match kernel .config and compiler versions..?
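One way to check the first part of that request is sketched below. It is only an illustration, not something run in this thread: the helper name vermagic_matches is made up, and it assumes the first word of the module's vermagic string is the kernel release the module was built for. It does not compare .config files or compiler versions.

```shell
# vermagic_matches RELEASE VERMAGIC
# Succeeds when the first word of the module's vermagic string
# equals the given kernel release (hypothetical helper).
vermagic_matches() {
    release=$1
    vermagic=$2
    # Strip everything after the first space to get the release field.
    [ "${vermagic%% *}" = "$release" ]
}

# Example usage on the target machine (module name taken from this thread):
#   if vermagic_matches "$(uname -r)" "$(modinfo -F vermagic iscsi_target_mod)"; then
#       echo "module was built for the running kernel"
#   else
#       echo "mismatch: rebuild the module against the running kernel"
#   fi
```

The comparison is kept separate from the uname and modinfo calls so it can be tried on any machine with plain strings.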

Also, please send me your kernel configuration for 2.6.24 (are you
copying your .config between versions..?), as your issue seems to be
irrelevant of kernel version.  I will go ahead and do a 2.6.24 build
using your .config and see if I can see an issue inside of a LIO-VM.

Thanks for following up on this,

--nab


Bart Van Assche  
From: "Bart Van Assche" <bart.vanass...@gmail.com>
Date: Mon, 11 Feb 2008 08:55:59 +0100
Local: Mon, Feb 11 2008 2:55 am
Subject: Re: open-iscsi + LIO kernel crash
On Feb 9, 2008 2:04 AM, Nicholas A. Bellinger <n...@linux-iscsi.org> wrote:

I have tested LIO-SE on an x86_64 system, not on an i386 system. I saw
several compiler warnings during compilation of the LIO-SE kernel
module (iscsi_target_mod). Should I send these compiler warnings to
you ?

Regarding potential mismatches between running kernel and the source
that the module is being built against: we can safely exclude this as
a potential cause. I have seen the kernel crashes triggered by
iscsi_target_mod with three different kernel versions (2.6.22.9,
2.6.23.14 and 2.6.24). Each time I had recompiled kernel and
iscsi_target_mod by removing the whole source tree before starting any
compilation steps. Note: before starting compilation, I first applied
the following patches (obtained from
https://scst.svn.sourceforge.net/svnroot/scst/trunk):
* scst_exec_req_fifo-2.6.24.patch
* iscsi-scst/kernel/patches/put_page_callback-2.6.24.patch

> Also, please send me your kernel configuration for 2.6.24 (are you
> copying your .config between versions..?), as your issue seems to be
> irrelevant of kernel version.  I will go ahead and do a 2.6.24 build
> using your .config and see if I can see an issue inside of a LIO-VM.

You can find the kernel config I used below. This config was obtained
by updating an older kernel config via "make oldconfig".

Bart.

$ wc .config
 3495  5941 76131 .config
$ cat .config
#
# Automatically generated make config: don't edit
# Linux kernel version: 2.6.24
# Thu Feb  7 15:15:01 2008
#
CONFIG_64BIT=y
# CONFIG_X86_32 is not set
CONFIG_X86_64=y
CONFIG_X86=y
CONFIG_GENERIC_TIME=y
CONFIG_GENERIC_CMOS_UPDATE=y
CONFIG_CLOCKSOURCE_WATCHDOG=y
CONFIG_GENERIC_CLOCKEVENTS=y
CONFIG_GENERIC_CLOCKEVENTS_BROADCAST=y
CONFIG_LOCKDEP_SUPPORT=y
CONFIG_STACKTRACE_SUPPORT=y
CONFIG_SEMAPHORE_SLEEPERS=y
CONFIG_MMU=y
CONFIG_ZONE_DMA=y
# CONFIG_QUICKLIST is not set
CONFIG_GENERIC_ISA_DMA=y
CONFIG_GENERIC_IOMAP=y
CONFIG_GENERIC_BUG=y
CONFIG_GENERIC_HWEIGHT=y
CONFIG_ARCH_MAY_HAVE_PC_FDC=y
CONFIG_DMI=y
CONFIG_RWSEM_GENERIC_SPINLOCK=y
# CONFIG_RWSEM_XCHGADD_ALGORITHM is not set
# CONFIG_ARCH_HAS_ILOG2_U32 is not set
# CONFIG_ARCH_HAS_ILOG2_U64 is not set
CONFIG_GENERIC_CALIBRATE_DELAY=y
CONFIG_GENERIC_TIME_VSYSCALL=y
CONFIG_ARCH_SUPPORTS_OPROFILE=y
CONFIG_ZONE_DMA32=y
CONFIG_ARCH_POPULATES_NODE_MAP=y
CONFIG_AUDIT_ARCH=y
CONFIG_GENERIC_HARDIRQS=y
CONFIG_GENERIC_IRQ_PROBE=y
CONFIG_GENERIC_PENDING_IRQ=y
CONFIG_X86_HT=y
# CONFIG_KTIME_SCALAR is not set
CONFIG_DEFCONFIG_LIST="/lib/modules/$UNAME_RELEASE/.config"

#
# General setup
#
CONFIG_EXPERIMENTAL=y
CONFIG_LOCK_KERNEL=y
CONFIG_INIT_ENV_ARG_LIMIT=32
CONFIG_LOCALVERSION=""
# CONFIG_LOCALVERSION_AUTO is not set
CONFIG_SWAP=y
CONFIG_SYSVIPC=y
CONFIG_SYSVIPC_SYSCTL=y
CONFIG_POSIX_MQUEUE=y
CONFIG_BSD_PROCESS_ACCT=y
CONFIG_BSD_PROCESS_ACCT_V3=y
# CONFIG_TASKSTATS is not set
# CONFIG_USER_NS is not set
# CONFIG_PID_NS is not set
CONFIG_AUDIT=y
# CONFIG_AUDITSYSCALL is not set
CONFIG_IKCONFIG=y
CONFIG_IKCONFIG_PROC=y
CONFIG_LOG_BUF_SHIFT=17
# CONFIG_CGROUPS is not set
CONFIG_FAIR_GROUP_SCHED=y
CONFIG_FAIR_USER_SCHED=y
# CONFIG_FAIR_CGROUP_SCHED is not set
CONFIG_SYSFS_DEPRECATED=y
CONFIG_RELAY=y
CONFIG_BLK_DEV_INITRD=y
CONFIG_INITRAMFS_SOURCE=""
# CONFIG_CC_OPTIMIZE_FOR_SIZE is not set
CONFIG_SYSCTL=y
CONFIG_EMBEDDED=y
CONFIG_UID16=y
CONFIG_SYSCTL_SYSCALL=y
CONFIG_KALLSYMS=y
# CONFIG_KALLSYMS_EXTRA_PASS is not set
CONFIG_HOTPLUG=y
CONFIG_PRINTK=y
CONFIG_BUG=y
CONFIG_ELF_CORE=y
CONFIG_BASE_FULL=y
CONFIG_FUTEX=y
CONFIG_ANON_INODES=y
CONFIG_EPOLL=y
CONFIG_SIGNALFD=y
CONFIG_EVENTFD=y
CONFIG_SHMEM=y
CONFIG_VM_EVENT_COUNTERS=y
# CONFIG_SLUB_DEBUG is not set
# CONFIG_SLAB is not set
CONFIG_SLUB=y
# CONFIG_SLOB is not set
CONFIG_SLABINFO=y
CONFIG_RT_MUTEXES=y
# CONFIG_TINY_SHMEM is not set
CONFIG_BASE_SMALL=0
CONFIG_MODULES=y
CONFIG_MODULE_UNLOAD=y
# CONFIG_MODULE_FORCE_UNLOAD is not set
CONFIG_MODVERSIONS=y
CONFIG_MODULE_SRCVERSION_ALL=y
CONFIG_KMOD=y
CONFIG_STOP_MACHINE=y
CONFIG_BLOCK=y
CONFIG_BLK_DEV_IO_TRACE=y
# CONFIG_BLK_DEV_BSG is not set
CONFIG_BLOCK_COMPAT=y

#
# IO Schedulers
#
CONFIG_IOSCHED_NOOP=y
CONFIG_IOSCHED_AS=y
CONFIG_IOSCHED_DEADLINE=y
CONFIG_IOSCHED_CFQ=y
# CONFIG_DEFAULT_AS is not set
CONFIG_DEFAULT_DEADLINE=y
# CONFIG_DEFAULT_CFQ is not set
# CONFIG_DEFAULT_NOOP is not set
CONFIG_DEFAULT_IOSCHED="deadline"
CONFIG_PREEMPT_NOTIFIERS=y

#
# Processor type and features
#
# CONFIG_TICK_ONESHOT is not set
# CONFIG_NO_HZ is not set
# CONFIG_HIGH_RES_TIMERS is not set
CONFIG_GENERIC_CLOCKEVENTS_BUILD=y
CONFIG_SMP=y
CONFIG_X86_PC=y
# CONFIG_X86_ELAN is not set
# CONFIG_X86_VOYAGER is not set
# CONFIG_X86_NUMAQ is not set
# CONFIG_X86_SUMMIT is not set
# CONFIG_X86_BIGSMP is not set
# CONFIG_X86_VISWS is not set
# CONFIG_X86_GENERICARCH is not set
# CONFIG_X86_ES7000 is not set
# CONFIG_X86_VSMP is not set
# CONFIG_M386 is not set
# CONFIG_M486 is not set
# CONFIG_M586 is not set
# CONFIG_M586TSC is not set
# CONFIG_M586MMX is not set
# CONFIG_M686 is not set
# CONFIG_MPENTIUMII is not set
# CONFIG_MPENTIUMIII is not set
# CONFIG_MPENTIUMM is not set
# CONFIG_MPENTIUM4 is not set
# CONFIG_MK6 is not set
# CONFIG_MK7 is not set
# CONFIG_MK8 is not set
# CONFIG_MCRUSOE is not set
# CONFIG_MEFFICEON is not set
# CONFIG_MWINCHIPC6 is not set
# CONFIG_MWINCHIP2 is not set
# CONFIG_MWINCHIP3D is not set
# CONFIG_MGEODEGX1 is not set
# CONFIG_MGEODE_LX is not set
# CONFIG_MCYRIXIII is not set
# CONFIG_MVIAC3_2 is not set
# CONFIG_MVIAC7 is not set
# CONFIG_MPSC is not set
# CONFIG_MCORE2 is not set
CONFIG_GENERIC_CPU=y
CONFIG_X86_L1_CACHE_BYTES=128
CONFIG_X86_INTERNODE_CACHE_BYTES=128
CONFIG_X86_CMPXCHG=y
CONFIG_X86_L1_CACHE_SHIFT=7
CONFIG_X86_GOOD_APIC=y
CONFIG_X86_TSC=y
CONFIG_X86_MINIMUM_CPU_FAMILY=64
CONFIG_HPET_TIMER=y
CONFIG_HPET_EMULATE_RTC=y
CONFIG_GART_IOMMU=y
CONFIG_CALGARY_IOMMU=y
CONFIG_CALGARY_IOMMU_ENABLED_BY_DEFAULT=y
CONFIG_SWIOTLB=y
CONFIG_NR_CPUS=4
CONFIG_SCHED_SMT=y
CONFIG_SCHED_MC=y
CONFIG_PREEMPT_NONE=y
# CONFIG_PREEMPT_VOLUNTARY is not set
# CONFIG_PREEMPT is not set
# CONFIG_PREEMPT_BKL is not set
CONFIG_X86_LOCAL_APIC=y
CONFIG_X86_IO_APIC=y
CONFIG_X86_MCE=y
CONFIG_X86_MCE_INTEL=y
CONFIG_X86_MCE_AMD=y
CONFIG_MICROCODE=m
CONFIG_MICROCODE_OLD_INTERFACE=y
CONFIG_X86_MSR=m
CONFIG_X86_CPUID=m
# CONFIG_NUMA is not set
CONFIG_ARCH_FLATMEM_ENABLE=y
CONFIG_ARCH_SPARSEMEM_ENABLE=y
CONFIG_SELECT_MEMORY_MODEL=y
CONFIG_FLATMEM_MANUAL=y
# CONFIG_DISCONTIGMEM_MANUAL is not set
# CONFIG_SPARSEMEM_MANUAL is not set
CONFIG_FLATMEM=y
CONFIG_FLAT_NODE_MEM_MAP=y
# CONFIG_SPARSEMEM_STATIC is not set
CONFIG_SPARSEMEM_VMEMMAP_ENABLE=y
CONFIG_SPLIT_PTLOCK_CPUS=4
CONFIG_RESOURCES_64BIT=y
CONFIG_ZONE_DMA_FLAG=1
CONFIG_BOUNCE=y
CONFIG_VIRT_TO_BUS=y
CONFIG_MTRR=y
CONFIG_SECCOMP=y
# CONFIG_CC_STACKPROTECTOR is not set
CONFIG_HZ_100=y
# CONFIG_HZ_250 is not set
# CONFIG_HZ_300 is not set
# CONFIG_HZ_1000 is not set
CONFIG_HZ=100
CONFIG_KEXEC=y
CONFIG_CRASH_DUMP=y
CONFIG_PHYSICAL_START=0x200000
CONFIG_RELOCATABLE=y
CONFIG_PHYSICAL_ALIGN=0x200000
CONFIG_HOTPLUG_CPU=y
CONFIG_ARCH_ENABLE_MEMORY_HOTPLUG=y

#
# Power management options
#
# CONFIG_PM is not set
CONFIG_SUSPEND_SMP_POSSIBLE=y
CONFIG_HIBERNATION_SMP_POSSIBLE=y

#
# CPU Frequency scaling
#
# CONFIG_CPU_FREQ is not set
# CONFIG_CPU_IDLE is not set

#
# Bus options (PCI etc.)
#
CONFIG_PCI=y
CONFIG_PCI_DIRECT=y
CONFIG_PCI_DOMAINS=y
CONFIG_PCIEPORTBUS=y
CONFIG_HOTPLUG_PCI_PCIE=m
CONFIG_PCIEAER=y
CONFIG_ARCH_SUPPORTS_MSI=y
CONFIG_PCI_MSI=y
CONFIG_PCI_LEGACY=y
CONFIG_HT_IRQ=y
CONFIG_ISA_DMA_API=y
CONFIG_K8_NB=y
CONFIG_PCCARD=m
# CONFIG_PCMCIA_DEBUG is not set
CONFIG_PCMCIA=m
CONFIG_PCMCIA_LOAD_CIS=y
CONFIG_PCMCIA_IOCTL=y
CONFIG_CARDBUS=y

#
# PC-card bridges
#
CONFIG_YENTA=m
CONFIG_YENTA_O2=y
CONFIG_YENTA_RICOH=y
CONFIG_YENTA_TI=y
CONFIG_YENTA_ENE_TUNE=y
CONFIG_YENTA_TOSHIBA=y
CONFIG_PD6729=m
CONFIG_I82092=m
CONFIG_PCCARD_NONSTATIC=m
CONFIG_HOTPLUG_PCI=m
CONFIG_HOTPLUG_PCI_FAKE=m
CONFIG_HOTPLUG_PCI_CPCI=y
CONFIG_HOTPLUG_PCI_CPCI_ZT5550=m
CONFIG_HOTPLUG_PCI_CPCI_GENERIC=m
CONFIG_HOTPLUG_PCI_SHPC=m

#
# Executable file formats / Emulations
#
CONFIG_BINFMT_ELF=y
CONFIG_BINFMT_MISC=m
CONFIG_IA32_EMULATION=y
# CONFIG_IA32_AOUT is not set
CONFIG_COMPAT=y
CONFIG_COMPAT_FOR_U64_ALIGNMENT=y
CONFIG_SYSVIPC_COMPAT=y

#
# Networking
#
CONFIG_NET=y

#
# Networking options
#
CONFIG_PACKET=m
CONFIG_PACKET_MMAP=y
CONFIG_UNIX=y
CONFIG_XFRM=y
CONFIG_XFRM_USER=m
# CONFIG_XFRM_SUB_POLICY is not set
# CONFIG_XFRM_MIGRATE is not set
CONFIG_NET_KEY=m
# CONFIG_NET_KEY_MIGRATE is not set
CONFIG_INET=y
CONFIG_IP_MULTICAST=y
CONFIG_IP_ADVANCED_ROUTER=y
CONFIG_ASK_IP_FIB_HASH=y
# CONFIG_IP_FIB_TRIE is not set
CONFIG_IP_FIB_HASH=y
CONFIG_IP_MULTIPLE_TABLES=y
CONFIG_IP_ROUTE_MULTIPATH=y
CONFIG_IP_ROUTE_VERBOSE=y
# CONFIG_IP_PNP is not set
CONFIG_NET_IPIP=m
CONFIG_NET_IPGRE=m
CONFIG_NET_IPGRE_BROADCAST=y
CONFIG_IP_MROUTE=y
CONFIG_IP_PIMSM_V1=y
CONFIG_IP_PIMSM_V2=y
# CONFIG_ARPD is not set
CONFIG_SYN_COOKIES=y
CONFIG_INET_AH=m ...

Nicholas A. Bellinger  
From: "Nicholas A. Bellinger" <n...@linux-iscsi.org>
Date: Mon, 11 Feb 2008 01:02:54 -0800
Local: Mon, Feb 11 2008 4:02 am
Subject: Re: open-iscsi + LIO kernel crash

Sure, you can send me these offline if you like..

> Regarding potential mismatches between running kernel and the source
> that the module is being built against: we can safely exclude this as
> a potential cause. I have seen the kernel crashes triggered by
> iscsi_target_mod with three different kernel versions (2.6.22.9,
> 2.6.23.14 and 2.6.24). Each time I had recompiled kernel and
> iscsi_target_mod by removing the whole source tree before starting any
> compilation steps. Note: before starting compilation, I first applied
> the following patches (obtained from
> https://scst.svn.sourceforge.net/svnroot/scst/trunk):
> * scst_exec_req_fifo-2.6.24.patch
> * iscsi-scst/kernel/patches/put_page_callback-2.6.24.patch

Hmm, the first patch does touch code that LIO-SE uses for the PSCSI
plugin (the SCST patch makes scsi_execute_async() inlined), but I can't
see why this would be an issue.  I will be using a fresh 2.6.24 from
kernel.org for my first test with your config, without these two SCST
patches.

> > Also, please send me your kernel configuration for 2.6.24 (are you
> > copying your .config between versions..?), as your issue seems to be
> > irrelevant of kernel version.  I will go ahead and do a 2.6.24 build
> > using your .config and see if I can see an issue inside of a LIO-VM.

> You can find the kernel config I used below. This config was obtained
> by updating an older kernel config via "make oldconfig".

I can't think of why copying the config would be an issue, but we are
basically down to the separate patches causing the issue, or something
in your config.  I will have a look at your config and give it a try on
an x86_64 VM with the same Open/iSCSI test.

--nab
PS:  I am still running fine on 2.6.24 ppc64 from ps3-linux from last
week with typical usage btw..


Bart Van Assche  
From: "Bart Van Assche" <bart.vanass...@gmail.com>
Date: Mon, 11 Feb 2008 12:10:33 +0100
Local: Mon, Feb 11 2008 6:10 am
Subject: Re: open-iscsi + LIO kernel crash
On Feb 11, 2008 10:02 AM, Nicholas A. Bellinger <n...@linux-iscsi.org> wrote:

By this time I have been able to reproduce the crash with an
unmodified 2.6.24 kernel, so the crash is not related to the SCST
patches.

From the kernel messages:

------------------------------------------------------------------
InitiatorName:                iqn.1993-08.org.debian:01:e52cfb64aea
TargetAlias:                  iSBE Target
InitiatorAlias:               INF010
TargetPortalGroupTag:         0
DefaultTime2Wait:             2
DefaultTime2Retain:           0
ErrorRecoveryLevel:           0
SessionType:                  Discovery
------------------------------------------------------------------
iSCSI Login successful on CID: 0 from 192.168.102.10 to 192.168.102.12:3260,0
Incremented iSCSI Connection count to 1 from node:
iqn.1993-08.org.debian:01:e52cfb64aea
Established iSCSI session from node: iqn.1993-08.org.debian:01:e52cfb64aea
Incremented number of active iSCSI sessions to 1 on iSCSI Target Portal Group: 0
Cleared np->np_login_tpg
Received iSCSI login request from 192.168.102.10 on TCP Network Portal
192.168.102.12:3260
Set np->np_login_tpg to ffff81007c3c5a00
general protection fault: 0000 [1] SMP
CPU 0
Modules linked in: iscsi_target_mod ib_iser iscsi_tcp libiscsi
scsi_transport_iscsi rdma_ucm rdma_cm iw_cm ib_addr ib_uverbs ib_ipoib
ib_cm ib_sa ipv6 parport_pc lp parport loop af_packet ib_mthca psmouse
ib_mad ib_core iTCO_wdt pcspkr serio_raw iTCO_vendor_support shpchp
pci_hotplug evdev ext3 jbd
mod sr_mod cdrom ata_piix ata_generic ehci_hcd libata scsi_mod
uhci_hcd usbcore e1000 fuse
Pid: 10115, comm: iscsi_np Not tainted 2.6.24 #1
RIP: 0010:[<ffffffff8028a81b>]  [<ffffffff8028a81b>] __kmalloc+0x5b/0x100
RSP: 0018:ffff810065131d40  EFLAGS: 00010002
RAX: 0000000000000000 RBX: 6c41746567726154 RCX: ffffffff882959c2
RDX: 0000000000000000 RSI: 00000000000000d0 RDI: ffffffff804f0940
RBP: 0000000000000286 R08: 0000000000000006 R09: ffff81006513e300
R10: 0000000000000000 R11: 0000000000000000 R12: ffff810002c02720
R13: 00000000000000d0 R14: ffff81007dcce280 R15: 0000000000000001
FS:  00002b80783956e0(0000) GS:ffffffff804e4000(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
CR2: 00000000006bd540 CR3: 0000000000201000 CR4: 00000000000006e0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Process iscsi_np (pid: 10115, threadinfo ffff810065130000, task
ffff810037bd2ec0)
Stack:  ffff810074efe800 ffff81006513e300 ffff810065373080 ffff81006513e320
 ffff81007bf68800 ffffffff882959c2 ffff81007f41cad0 ffff810074efe800
 ffff81007f41c800 ffff81007de03a00 ffff810065131eb0 ffff810065131e00
Call Trace:
 [<ffffffff882959c2>] :iscsi_target_mod:iscsi_copy_param_list+0x132/0x520
 [<ffffffff882ad275>] :iscsi_target_mod:iscsi_target_login_thread+0xe45/0x2350
 [<ffffffff8020d0a8>] child_rip+0xa/0x12
 [<ffffffff882ac430>] :iscsi_target_mod:iscsi_target_login_thread+0x0/0x2350
 [<ffffffff8020d09e>] child_rip+0x0/0x12

Code: 48 8b 04 c3 49 89 04 24 55 9d 66 45 85 ed 78 58 48 89 d8 48
RIP  [<ffffffff8028a81b>] __kmalloc+0x5b/0x100
 RSP <ffff810065131d40>
---[ end trace 28b49b7dd5f78d80 ]---
Decremented iSCSI connection count to 0 from node:
iqn.1993-08.org.debian:01:e52cfb64aea
Released iSCSI session from node: iqn.1993-08.org.debian:01:e52cfb64aea
Decremented number of active iSCSI Sessions on iSCSI TPG: 0 to 0
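An aside on reading the register dump above, not part of the original exchange: a quick way to check whether a register in an oops holds text rather than a pointer. The helper name decode_le_ascii is made up for this sketch. It assumes an even number of hex digits and printable bytes, and it walks the value from its last two digits because x86_64 stores the lowest-addressed byte last in the printed value.

```shell
# decode_le_ascii HEX
# Print the bytes of a little-endian register value as ASCII
# (hypothetical helper; expects an even number of hex digits).
decode_le_ascii() {
    hex=$1
    out=""
    # Refuse odd-length input so the loop below always terminates.
    [ $(( ${#hex} % 2 )) -eq 0 ] || return 1
    while [ -n "$hex" ]; do
        pair=${hex#"${hex%??}"}   # last two hex digits
        hex=${hex%??}             # drop them from the value
        # Convert the byte to octal and let printf emit that character.
        out="$out$(printf "\\$(printf '%03o' "0x$pair")")"
    done
    printf '%s\n' "$out"
}

decode_le_ascii 6c41746567726154   # RBX from the __kmalloc fault; prints "TargetAl"
```

The result looks like the start of the TargetAlias key printed in the login log, so it may be worth checking whether login parameter strings are being written over slab memory. That is a guess from a single register, not a confirmed diagnosis.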


Bart Van Assche  
From: "Bart Van Assche" <bart.vanass...@gmail.com>
Date: Mon, 11 Feb 2008 15:04:28 +0100
Local: Mon, Feb 11 2008 9:04 am
Subject: Re: open-iscsi + LIO kernel crash
On Feb 11, 2008 10:02 AM, Nicholas A. Bellinger <n...@linux-iscsi.org> wrote:

> PS:  I am still running fine on 2.6.24 ppc64 from ps3-linux from last
> week with typical usage btw..

Hello Nicholas,

Have you tested the LIO-SE kernel module with kernel debugging enabled
on the 2.6.24 kernel ? This is what I get while configuring the LIO-SE
kernel module on a 2.6.24.2 kernel with kernel debugging enabled (no
iSCSI discovery has yet been attempted):

------------[ cut here ]------------
kernel BUG at include/linux/scatterlist.h:59!
invalid opcode: 0000 [1] SMP
CPU 0
Modules linked in: iscsi_target_mod ib_iser iscsi_tcp libiscsi
scsi_transport_iscsi rdma_ucm rdma_cm iw_cm ib_addr
poib ib_cm ib_sa ipv6 parport_pc lp parport loop af_packet psmouse
serio_raw pcspkr ib_mthca ib_mad ib_core iTCO_wdt iTCO_vendor_support
shpchp pci_hotplug evdev ext3 jbd mbcache sg sr_mod sd_mod cdrom
ata_piix ata_generic libata scsi_mod ehci_hcd uhci_hcd usbcore e1000
fuse
Pid: 18611, comm: target-ctl.bin Not tainted 2.6.24.2 #2
RIP: 0010:[<ffffffff882f3230>]  [<ffffffff882f3230>]
:iscsi_target_mod:rd_create_virtdevice+0x440/0x570
RSP: 0018:ffff81006f4cb8d8  EFLAGS: 00010293
RAX: 0000000087654321 RBX: 0000000000000000 RCX: ffff81007b891b40
RDX: 0000000000000000 RSI: ffff810003dcdb60 RDI: ffff810003dcdb60
RBP: ffff81007b891b40 R08: 0000000000000000 R09: 0000000000000000
R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000004
R13: ffff81007d33c180 R14: 0000000000000004 R15: ffff81007d33cb80
FS:  00002b50d259c6e0(0000) GS:ffffffff80572000(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
CR2: 00002b50d230d6c0 CR3: 000000007010e000 CR4: 00000000000006e0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Process target-ctl.bin (pid: 18611, threadinfo ffff81006f4ca000, task
ffff810070014000)
Stack:  ffff81006f4a0118 0000000000000246 ffff81006f4cbc98 ffff81006f4a0000
 ffff81007a13a780 0000000000000004 0000000000000000 ffffffff8832f580
 ffff81006f4a0000 00007fffd8a7fc20 ffff81006f278000 ffff81006f4cbc98
Call Trace:
 [<ffffffff882ae1d9>] :iscsi_target_mod:iscsi_create_virtual_device+0xe9/0x140
 [<ffffffff882bae6b>] :iscsi_target_mod:iscsi_ioctl+0x51b/0x1e70
 [<ffffffff802b33a7>] __d_lookup+0xd7/0x180
 [<ffffffff802304b1>] task_rq_lock+0x51/0xa0
 [<ffffffff802304b1>] task_rq_lock+0x51/0xa0
 [<ffffffff80232415>] try_to_wake_up+0x65/0x480
 [<ffffffff80232415>] try_to_wake_up+0x65/0x480
 [<ffffffff802308fd>] __wake_up+0x2d/0x70
 [<ffffffff8027800e>] find_lock_page+0x1e/0xb0
 [<ffffffff8027800e>] find_lock_page+0x1e/0xb0
 [<ffffffff8022db9a>] __wake_up_common+0x5a/0x90
 [<ffffffff80435def>] do_page_fault+0x1ef/0x840
 [<ffffffff802ad46d>] do_ioctl+0x7d/0xa0
 [<ffffffff802ad504>] vfs_ioctl+0x74/0x2d0
 [<ffffffff802ad7f1>] sys_ioctl+0x91/0xb0
 [<ffffffff8020c2ee>] system_call+0x7e/0x83

Code: 0f 0b eb fe 0f 0b eb fe 48 bf 5f 64 65 76 5f 74 0a 00 49 bb
RIP  [<ffffffff882f3230>] :iscsi_target_mod:rd_create_virtdevice+0x440/0x570
 RSP <ffff81006f4cb8d8>
---[ end trace 12f35ae0aa44b76c ]---

Bart.


nab@linux-iscsi.org  
From: "n...@linux-iscsi.org" <n...@linux-iscsi.org>
Date: Tue, 12 Feb 2008 03:43:57 -0800 (PST)
Local: Tues, Feb 12 2008 6:43 am
Subject: Re: open-iscsi + LIO kernel crash

Hi Bart,

I added proper usage of sg_init_table() and sg_mark_end() within the
LIO code; their absence is what was causing the BUG() with
createvirtdev.  I am now recompiling with CONFIG_DEBUG_SG and will let
you know what I find.

I believe this BUG() only exists with the changes in scatterlist.h in
2.6.24 btw, so I still can't explain the general protection faults..
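For anyone reproducing this, a small sketch for confirming that an option such as CONFIG_DEBUG_SG is actually set in the config being tested. The helper name config_enabled is made up for illustration. The usage lines assume the running kernel exposes /proc/config.gz, which needs CONFIG_IKCONFIG_PROC=y as in the config posted earlier in this thread.

```shell
# config_enabled OPTION FILE
# Succeeds when OPTION is set to y or m in the given kernel .config file
# (hypothetical helper).
config_enabled() {
    option=$1
    file=$2
    # Match the full line so CONFIG_FOO does not also match CONFIG_FOO_BAR.
    grep -q -E "^${option}=(y|m)\$" "$file"
}

# Example usage against the running kernel:
#   zcat /proc/config.gz > /tmp/running.config
#   config_enabled CONFIG_DEBUG_SG /tmp/running.config \
#       && echo "scatterlist debugging is on" \
#       || echo "scatterlist debugging is off"
```

Lines of the form "# CONFIG_FOO is not set" do not match, so disabled options are reported as off.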

Also, I was able to reproduce the GPFs with discovery and your
config.  I will let you know if my changes resolve the other issue or
if I need to enable more debug code and put kdb into the VM.

Thanks again for spending the extra time to help track this down..

--nab


nab@linux-iscsi.org  
From: "n...@linux-iscsi.org" <n...@linux-iscsi.org>
Date: Tue, 12 Feb 2008 04:33:55 -0800 (PST)
Local: Tues, Feb 12 2008 7:33 am
Subject: Re: open-iscsi + LIO kernel crash