New OES2018 node cannot join NCS cluster - it crashes as soon as sbdlib.ko is loaded

  • 7023311
  • 29-Aug-2018
  • 03-Sep-2018

Environment

Open Enterprise Server 2018 (OES 2018) Linux

Situation

Existing cluster of OES 2015 SP1 nodes cluster on physical hardware or VMware virtualized.

The shared disks are in a SAN that is connected per Fibre Channel to host bus adapters of the cluster nodes.

SAN connected the to a host bus adapter of an ESXi 6.5 host and created a Raw Disk Mapping for each shared disk of the SAN, also for the SBD partition.

New OES 2018 server as VMWare guest on the ESXi 6.5 host is created and added it to the cluster.

When booting the OES 2018 cluster node, which is a VMWare guest at the ESXi 6.5 host, it crashes as soon as sbdlib.ko is loaded.

dmesg reads the following details about the crash:

"
[   29.865841] Request for unknown module key 'Micro Focus Open Enterprise Secure Boot Signkey: bda8219e250e119aa5c78ef67efa2a3ee78e2e16' err -11
[   29.866617] cis: cis module initialization Successful
[   31.571718] Request for unknown module key 'Micro Focus Open Enterprise Secure Boot Signkey: bda8219e250e119aa5c78ef67efa2a3ee78e2e16' err -11
[   31.581120] Request for unknown module key 'Micro Focus Open Enterprise Secure Boot Signkey: bda8219e250e119aa5c78ef67efa2a3ee78e2e16' err -11
[   31.734905] Request for unknown module key 'Micro Focus Open Enterprise Secure Boot Signkey: bda8219e250e119aa5c78ef67efa2a3ee78e2e16' err -11
[   31.740864] BUG: unable to handle kernel paging request at 00007ff18926f000
[   31.740899] IP: [<ffffffffa07140be>] sbdlib_proc_write+0x6e/0xe0 [sbdlib]
[   31.740926] PGD 37d56a067 PUD 422080067 PMD 37e56c067 PTE 800000037bd7d867
[   31.740956] Oops: 0001 [#1] SMP 
[   31.740971] Modules linked in: sbdlib(OE) clstrlib(OE) ncs_timer(OE) cis(OE) novfs(OE) nebdrv(OE) zapi(OE) nsssa(OE) nsslsa(OE) nssmanage(OE) nsszlss64(OE) nsszlss(OE) nsscomn(OE) ndpmod(OE) nss(OE) nsslibrary(OE) nsslnxlib(OE) libnss(OE) admindrv(OE) nwraid(OE) nf_log_ipv6 xt_pkttype nf_log_ipv4 nf_log_common xt_LOG xt_limit tcp_diag inet_diag adminfs(OE) adminfsdrv(OE) af_packet iscsi_ibft iscsi_boot_sysfs ip6t_REJECT xt_tcpudp nf_conntrack_ipv6 nf_defrag_ipv6 ip6table_raw ipt_REJECT iptable_raw xt_CT iptable_filter ip6table_mangle nf_conntrack_netbios_ns nf_conntrack_broadcast nf_conntrack_ipv4 nf_defrag_ipv4 ip_tables xt_conntrack nf_conntrack ip6table_filter ip6_tables 8021q garp mrp stp x_tables llc vmw_vsock_vmci_transport vsock crct10dif_pclmul crc32_pclmul ghash_clmulni_intel drbg ansi_cprng
[   31.741283]  ppdev vmw_balloon aesni_intel aes_x86_64 lrw gf128mul glue_helper ablk_helper cryptd pcspkr joydev vmxnet3 mptctl sr_mod i2c_piix4 cdrom vmw_vmci shpchp nfit libnvdimm parport_pc parport fjes battery processor ac ata_generic ext4 crc16 jbd2 mbcache ata_piix sd_mod crc32c_intel serio_raw vmwgfx drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops ahci libahci ttm drm libata mptspi scsi_transport_spi mptscsih mptbase button sg dm_multipath dm_mod scsi_dh_rdac scsi_dh_emc scsi_dh_alua scsi_mod autofs4
[   31.741537] Supported: Yes
[   31.741551] CPU: 0 PID: 4163 Comm: tail Tainted: G           OE      4.4.103-92.53-default #1
[   31.741579] Hardware name: VMware, Inc. VMware Virtual Platform/440BX Desktop Reference Platform, BIOS 6.00 04/05/2016
[   31.741617] task: ffff880094dd9640 ti: ffff88037d56c000 task.ti: ffff88037d56c000
[   31.741642] RIP: 0010:[<ffffffffa07140be>]  [<ffffffffa07140be>] sbdlib_proc_write+0x6e/0xe0 [sbdlib]
[   31.741675] RSP: 0018:ffff88037d56fe38  EFLAGS: 00010202
[   31.741694] RAX: 000000000000000d RBX: 000000000000000b RCX: 000000000000000c
[   31.741717] RDX: 000000000000000b RSI: 00007ff18926f000 RDI: ffffffffa0716139
[   31.742461] RBP: 000000000000000d R08: 6465726168536f4e R09: ffff88037e774ee4
[   31.743187] R10: 0000000000000001 R11: 0000000000000246 R12: 00007ff18926f000
[   31.743912] R13: ffff88037d56ff28 R14: 000000000000000d R15: 0000000000000000
[   31.744620] FS:  00007ff189254700(0000) GS:ffff88043fc00000(0000) knlGS:0000000000000000
[   31.745359] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[   31.746074] CR2: 00007ff18926f000 CR3: 000000037d471000 CR4: 00000000003406f0
[   31.746826] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[   31.747557] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[   31.748274] Stack:
[   31.748975]  ffff88037e774e80 ffff88037d56ff28 000000000000000d ffffffff81262359
[   31.749694]  ffff88043f0eeca0 ffff88041e032a00 ffffffff811fc993 ffff88037ed4d200
[   31.750403]  0000000000000010 ffff8800b6fca008 ffffffff81cb8f60 ffff88041e032a00
[   31.751110] Call Trace:
[   31.751855]  [<ffffffff81262359>] proc_reg_write+0x39/0x70
[   31.752550]  [<ffffffff811fc993>] __vfs_write+0x23/0x100
[   31.753222]  [<ffffffff811fd01d>] vfs_write+0x9d/0x190
[   31.753878]  [<ffffffff811fdce2>] SyS_write+0x42/0xa0
[   31.754518]  [<ffffffff815ea9ee>] entry_SYSCALL_64_fastpath+0x12/0x6d
[   31.756528] DWARF2 unwinder stuck at entry_SYSCALL_64_fastpath+0x12/0x6d
[   31.757145] 
[   31.757740] Leftover inexact backtrace:
               
[   31.758885] Code: a0 00 83 eb 01 48 63 d3 0f b6 8a 80 75 71 a0 80 f9 20 74 e7 83 e9 09 80 f9 01 76 df b9 0c 00 00 00 48 c7 c7 39 61 71 a0 4c 89 e6 <f3> a6 74 51 b9 08 00 00 00 48 c7 c7 46 61 71 a0 4c 89 e6 f3 a6 
[   31.760671] RIP  [<ffffffffa07140be>] sbdlib_proc_write+0x6e/0xe0 [sbdlib]
[   31.761368]  RSP <ffff88037d56fe38>
[   31.761938] CR2: 00007ff18926f000

Resolution

A fix for this problem has been released via the standard OES2018 update.
Update 2 - OES 2018 - Mandatory 6 or later provides this fix.

A PTF is available from Micro Focus as well. When required, please open a Service Request with Micro Focus Customer Support and refer to this TID.