Hi Thomas,
It sounds like starting 'pacemakerd' is what fails. Can you try running
'pacemakerd' directly from the shell, with something like this:
pacemakerd --verbose
Then report the output.
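If it helps, here is a minimal sketch of one way to keep a copy of that output so it can be pasted into a reply. The helper name `run_and_capture` and the log path are only placeholders I made up, not anything ESOS provides, and it assumes `tee` is available on the node:

```shell
# Run a command, show its output on the terminal, and keep a copy in a file.
# run_and_capture and the log path in the example are illustrative names only.
run_and_capture() {
    log_file="$1"
    shift
    # Merge stderr into stdout so error messages end up in the log too.
    "$@" 2>&1 | tee "$log_file"
}

# Example (stop it with Ctrl-C once the errors have appeared):
# run_and_capture /tmp/pacemakerd-verbose.log pacemakerd --verbose
```

The first argument is the file to write; everything after it is the command to run.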
--Marc
On Wed, Jun 17, 2020 at 9:31 AM T. Sch. <tre...@web.de> wrote:
>
> Hi Marc and all others,
>
> I have been using ESOS for a while as a stand-alone storage server,
> which has worked very well for the last few years.
>
> Now I am learning how to set up an ESOS cluster config with a two-node
> cluster, but I am not able to even get pacemaker to start.
> (Other tests I learned from, e.g. with CentOS, were successful, so I am
> not entirely inexperienced with corosync/pacemaker cluster setups.)
>
> I tried to adapt some pieces from
> "http://marcitland.blogspot.com/2013/04/building-using-highly-available-esos.html"
> but had no success at all.
>
> I even tried multiple ESOS branches:
> 2.0.16 and master_3ab9c22_dgvszq
>
> So, I am nearly sure the mistake is mine, but I have no more ideas where to look for it.
> (Maybe it is only an "ESOS peculiarity".)
>
> My config looks like this:
> #fresh install on two test nodes#
>
> #uname -n#
> sv0xx.domain.local (xx per host 11 and 12)
>
> #xtra_hosts#
> 192.168.0.31 sv011 sv011.domain.local
> 192.168.0.32 sv012 sv012.domain.local
> 10.35.6.21 corosync011
> 10.35.6.22 corosync012
>
> #/etc/corosync/corosync.conf#
> totem {
>     version: 2
>     crypto_cipher: none
>     crypto_hash: none
>     interface {
>         ringnumber: 0
>         bindnetaddr: 10.35.6.0
>         mcastaddr: 239.255.1.1
>         mcastport: 5405
>         ttl: 1
>     }
> }
> nodelist {
>     node {
>         ring0_addr: 10.35.6.21
>         nodeid: 1
>         name: corosync011
>     }
>     node {
>         ring0_addr: 10.35.6.22
>         nodeid: 2
>         name: corosync012
>     }
> }
> logging {
>     fileline: off
>     to_stderr: no
>     to_logfile: yes
>     logfile: /var/log/corosync.log
>     to_syslog: yes
>     syslog_facility: local2
>     debug: off
>     timestamp: off
>     logger_subsys {
>         subsys: QUORUM
>         debug: off
>     }
> }
> quorum {
>     provider: corosync_votequorum
>     expected_votes: 1
>     two_node: 1
> }
>
> #crm corosync status is fine (seems to be) on both#
> crm corosync status
> Printing ring status.
> Local node ID 1
> RING ID 0
> id = 10.35.6.21
> status = ring 0 active with no faults
> Quorum information
> ------------------
> Date: Mon Jun 15 18:21:34 2020
> Quorum provider: corosync_votequorum
> Nodes: 2
> Node ID: 1
> Ring ID: 1/40
> Quorate: Yes
>
> Votequorum information
> ----------------------
> Expected votes: 2
> Highest expected: 2
> Total votes: 2
> Quorum: 1
> Flags: 2Node Quorate WaitForAll
>
> Membership information
> ----------------------
> Nodeid Votes Name
> 1 1 10.35.6.21 (local)
> 2 1 10.35.6.22
>
> BUT 10 seconds after starting rc.pacemaker manually on both nodes,
> every node stops the pacemaker and corosync daemons.
>
> Things I noticed:
> ping -c 1 $(uname -n) resolves to 127.0.0.1 on both nodes
> daemon.log says: "Error in connection setup (/dev/shm...Broken pipe"
> local2.log says: FATAL: Cannot exec /usr/libexec/pacemaker/cib: Permission denied (13)
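The "Permission denied (13)" line is probably the one to chase first. Below is a rough sketch, assuming a POSIX shell, of checking whether that binary is actually executable. `check_exec` is just a made-up helper name, and a missing execute bit is only one possible cause: a filesystem mounted noexec, or a directory the daemon's user cannot enter, could produce the same error, so a check run as root may not tell the whole story.

```shell
# Report whether a path exists and whether the current user may execute it.
# check_exec is an illustrative helper, not part of ESOS or pacemaker.
check_exec() {
    path="$1"
    if [ ! -e "$path" ]; then
        echo "$path: missing"
        return 2
    fi
    if [ -x "$path" ]; then
        echo "$path: executable"
        return 0
    fi
    echo "$path: not executable"
    return 1
}

# Example, using the path from local2.log:
# check_exec /usr/libexec/pacemaker/cib
# ls -l /usr/libexec/pacemaker/
```

The return code distinguishes a missing file (2) from one that is present but not executable (1), which would point in different directions.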
>
> Some logs and commands are attached to this post
> (so as not to impair readability too much).
> -corosync.log
> -daemon.log
> -local2.log
> -messages
> -pacemaker.log
>
> Please be so kind as to show me a way that could solve my problem.
>
> Let me know if you need more info or log files to take a look ;-)
>
> (As I said: I am just learning ESOS clustering, and I am willing to give you
> well-formatted documentation if I have success with it.)
>
> Greets from Germany - Thomas
>