I'm facing difficulties to implement Stratum as a kubernetes container. This approach is presented in the
.
- Control VM (8 cores, 16GB RAM, Ubuntu 18.04) - i.e., where the sd-fabric helm charts are instanciated
The sdfabric-helm-charts/sdfabric/values.yaml instruct me to set the taints and label the switch node as decpited below:
Then, I label the switch as suggested in the
stratum kubernetes deployment guide:
1. kubectl label node <node-name>
node-role.kubernetes.io=switch
2. kubectl taint node <node-name>
node-role.kubernetes.io=switch:NoSchedule
The problem is: Once stratum is instantiated, a SIGBUS errors suddenly appears at the kubectl logs:
+ LOCAL_CHASSIS_CONFIG=/config/chassis_config.pb.txt
+ [ -f /config/chassis_config.pb.txt ]
+ /usr/bin/start-stratum.sh -enable_onlp=false -chassis_config_file=/config/chassis_config.pb.txt -max_log_size=0 -write_req_log_file= -read_req_log_file= -v=0 -minloglevel=0 -bf_switchd_background=false -colorlogtostderr=false -logtostderr=true -experimental_enable_p4runtime_translation
Mounting hugepages...
Skipping kernel module installation.
I20230724 18:11:57.530447 7 logging.cc:64] Stratum version: e22940edeadbee23956a903ed5580fa2248830df built at 2022-03-25T00:05:58+00:00 on host 7a47aab43a31 by user root.
I20230724 18:11:57.530922 7 bf_sde_wrapper.cc:1754] bf_sysfs_fname: /sys/class/bf/bf0/device/dev_add
Install dir: /usr (0x2396020)
bf_switchd: system services initialized
bf_switchd: loading conf_file /usr/share/stratum/tofino_skip_p4.conf...
bf_switchd: processing device configuration...
Configuration for dev_id 0
Family : Tofino
pci_sysfs_str : /sys/devices/pci0000:00/0000:00:03.0/0000:05:00.0
pci_domain : 0
pci_bus : 5
pci_fn : 0
pci_dev : 0
pci_int_mode : 1
sbus_master_fw: /usr/
pcie_fw : /usr/
serdes_fw : /usr/
sds_fw_path : /usr/share/tofino_sds_fw/avago/firmware
microp_fw_path:
bf_switchd: processing P4 configuration...
P4 profile for dev_id 0
p4_name: dummy
libpd:
libpdthrift:
context:
config:
Agent[0]: /usr/lib/libpltfm_mgr.so
diag:
accton diag:
non_default_port_ppgs: 0
SAI default initialize: 1
bf_switchd: library /usr/lib/libpltfm_mgr.so loaded
bf_switchd: agent[0] initialized
Health monitor started
Operational mode set to ASIC
Initialized the device types using platforms infra API
ASIC detected at PCI /sys/class/bf/bf0/device
ASIC pci device id is 16
bf_switchd: drivers initialized
Skipping P4 program load for dev_id 0
*** Aborted at 1690222321 (unix time) try "date -d @1690222321" if you are using GNU date ***
PC: @ 0x0 (unknown)
*** SIGBUS (@0x7f49e8800000) received by PID 7 (TID 0x7f4a083b8880) from PID 18446744073315287040; stack trace: ***
@ 0x7f49f68f50e0 (unknown)
@ 0x7f4a0744e865 bf_sys_dma_pool_create
@ 0x7f49f82e77e7 (unknown)
@ 0x7f49f82e9d7e bf_switchd_lib_init
@ 0x43766e (unknown)
@ 0x41bf7b (unknown)
@ 0x41d114 (unknown)
@ 0x7f49f5ac42e1 __libc_start_main
@ 0x41bbea (unknown)
@ 0x0 (unknown)
Bus error (core dumped)Has anyone ever experienced this problem?