I've followed the docs and I'm up to the point where's the system is
running some validation scripts after the installation of clusterware.
The cluvfy script is failing when "Checking the existence of VIP Node
Applications"
Checking existence of VIP node application
Node Name Required Status
Comment
------------ ------------------------ ------------------------
----------
MCECLNX2 yes does not exist
failed
MCECLNX1 yes does not exist
failed
Result: Check failed.
I tried to go to the [clusterware node install]/bin/ path and run
vipca as "root" and this is the output...
Exception in thread "main"
I've read that the "oracle" user shouldn't be the owner of the
clusterware directory, but I'm starting to wonder if my problem isn't
because oracle wasn't the owner... root was (as recommended by
oracle's documentation.)
Any thoughts on this? If I run if I try and run "vipca" as "oracle"
then the results are much more interesting and I receive the
following....
[main] [18:27:46:306] [VipCA.setDefaultTrace:321] trace log file is
not null
[main] [18:27:46:307] [VipCA.getNodeList:1210] VIPCA: GetNodeList
[main] [18:27:46:337] [GetActiveNodes.create:213] Going into
GetActiveNodes constructor...
[main] [18:27:46:346] [HASContext.getInstance:190] Module init : 16
[main] [18:27:46:347] [HASContext.getInstance:214] Local Module
init : 19
[main] [18:27:46:380] [Library.getInstance:95] Created instance of
Library.
[main] [18:27:46:380] [Library.load:195] Loading libsrvmhas10.so...
[main] [18:27:46:380] [Library.load:201] oracleHome null
[main] [18:27:46:381] [Library.load:236] Loading library srvmhas10
from path=
/opt/oracle_crs/product/10.2.0/CRS/jdk/jre/lib/i386/client:/opt/
oracle_crs/product/10.2.0/CRS/jdk/jre/lib/i386:/opt/oracle_crs/product/
10.2.0/CRS/jdk/jre/../lib/i386:/opt/oracle_crs/product/10.2.0/CRS/lib:/
opt/oracle_crs/product/10.2.0/CRS/srvm/lib:/opt/CA/AVEngine/ino/
config:/opt/CA/AVEngine/ino/lib::/opt/CA/BrightStorARCserve/lib:/opt/
CA/BrightStorARCserve/dbase/ingres/ingres/lib:/opt/CA/SharedComponents/
lib:/opt/CA/CAlib
[main] [18:27:46:413] [Library.load:244] Loaded library srvmhas10
from path=
/opt/oracle_crs/product/10.2.0/CRS/jdk/jre/lib/i386/client:/opt/
oracle_crs/product/10.2.0/CRS/jdk/jre/lib/i386:/opt/oracle_crs/product/
10.2.0/CRS/jdk/jre/../lib/i386:/opt/oracle_crs/product/10.2.0/CRS/lib:/
opt/oracle_crs/product/10.2.0/CRS/srvm/lib:/opt/CA/AVEngine/ino/
config:/opt/CA/AVEngine/ino/lib::/opt/CA/BrightStorARCserve/lib:/opt/
CA/BrightStorARCserve/dbase/ingres/ingres/lib:/opt/CA/SharedComponents/
lib:/opt/CA/CAlib
[main] [18:27:46:422] [HASContext.getInstance:249] HAS Context
Allocated: 1 to oracle.ops.mgmt.has.ClusterUtil@18020cc
[main] [18:27:46:423] [ClusterUtil.<init>:55] ClusterUtil Instance
created
[main] [18:27:46:423] [ClusterUtil.<init>:58] ClusterUtil
m_bSKGXN=true
[main] [18:27:46:428] [GetActiveNodes.<init>:185] Inside
GetActiveNodes :mceclnx1
[main] [18:27:46:433] [GetActiveNodes.<init>:197] Started clusterware
daemon thread
[GetActiveNodesThread] [18:27:46:433] [Semaphore.acquire:109]
SyncBufferFull:Acquire called by thread GetActiveNodesThread m_count=0
[main] [18:27:46:433] [GetActiveNodes.create:215] Out of
GetActiveNodes constructor.
[main] [18:27:46:436] [VipCA.processGUIArgs:435] Nodeapps will be
configured for cluster nodes
[main] [18:27:46:436] [VipCA.processGUIArgs:451] Retrieving the list
of interfaces
[main] [18:27:46:436] [VipCA.getInterfaces:579] Retrieving the list
of public interfaces from OCR
[main] [18:27:46:436] [Cluster.listPublicInterfaces:1478] Listing
public interfaces from OCR
[main] [18:27:46:437] [Cluster.listInterfaces:1536] Listing all the
interfaces of type:4 & IP type: 0
[main] [18:27:46:451] [OCRTree.init:201] calling OCRTree.init
[main] [18:27:46:451] [Version.isPre10i:189] isPre10i.java: Returning
FALSE
[main] [18:27:46:462] [OCRTree.<init>:157] calling OCR.init at level
7
[main] [18:27:46:465] [OCR.<init>:228] Called into OCR.java
Constructor: level =7
[main] [18:27:46:465] [OCR.loadLibrary:284]
Inside constructor of OCR
[main] [18:27:46:465] [OCR.loadLibrary:292] Going to load the ocr
library
[main] [18:27:46:466] [Library.load:195] Loading libsrvmocr10.so...
[main] [18:27:46:466] [Library.load:201] oracleHome null
[main] [18:27:46:466] [Library.load:236] Loading library srvmocr10
from path=
/opt/oracle_crs/product/10.2.0/CRS/jdk/jre/lib/i386/client:/opt/
oracle_crs/product/10.2.0/CRS/jdk/jre/lib/i386:/opt/oracle_crs/product/
10.2.0/CRS/jdk/jre/../lib/i386:/opt/oracle_crs/product/10.2.0/CRS/lib:/
opt/oracle_crs/product/10.2.0/CRS/srvm/lib:/opt/CA/AVEngine/ino/
config:/opt/CA/AVEngine/ino/lib::/opt/CA/BrightStorARCserve/lib:/opt/
CA/BrightStorARCserve/dbase/ingres/ingres/lib:/opt/CA/SharedComponents/
lib:/opt/CA/CAlib
[main] [18:27:46:468] [Library.load:244] Loaded library srvmocr10
from path=
/opt/oracle_crs/product/10.2.0/CRS/jdk/jre/lib/i386/client:/opt/
oracle_crs/product/10.2.0/CRS/jdk/jre/lib/i386:/opt/oracle_crs/product/
10.2.0/CRS/jdk/jre/../lib/i386:/opt/oracle_crs/product/10.2.0/CRS/lib:/
opt/oracle_crs/product/10.2.0/CRS/srvm/lib:/opt/CA/AVEngine/ino/
config:/opt/CA/AVEngine/ino/lib::/opt/CA/BrightStorARCserve/lib:/opt/
CA/BrightStorARCserve/dbase/ingres/ingres/lib:/opt/CA/SharedComponents/
lib:/opt/CA/CAlib
[main] [18:27:46:468] [OCR.loadLibrary:294] loaded ocr libraries
[main] [18:27:46:469] [OCR.<init>:241] Calling Native function
[main] [18:27:46:470] [OCR.<init>:245] Returned after initializing
OCR
[main] [18:27:46:471] [HASContext.getInstance:190] Module init : 6
[main] [18:27:46:472] [HASContext.getInstance:214] Local Module
init : 4
[main] [18:27:46:472] [HASContext.getInstance:249] HAS Context
Allocated: 2 to oracle.ops.mgmt.has.Util@1ac1fe4
[main] [18:27:46:472] [Util.<init>:86] Util Instance created.
[main] [18:27:46:474] [OCR.<init>:259] Primary Group of Current User:
oinstall
[main] [18:27:46:476] [HASContext.getInstance:190] Module init : 24
[main] [18:27:46:476] [HASContext.getInstance:214] Local Module
init : 8
[main] [18:27:46:476] [HASContext.getInstance:249] HAS Context
Allocated: 3 to oracle.ops.mgmt.has.ClusterLock@1d520c4
[main] [18:27:46:476] [ClusterLock.<init>:60] ClusterLock Instance
created.
[main] [18:27:46:479] [OCRTreeHA.listInterfaces:1797] Retrieving list
of all the interfaces from OCR
[main] [18:27:46:479] [OCRTreeHA.listInterfaceNames:1752] Acquiring
shared CSS lock SRVM.SYSTEM.css.interfaces
[main] [18:27:46:481] [OCR.keyExists:664]
OCR.keyExists(SYSTEM.css.interfaces)
[main] [18:27:46:482] [OCR.keyExists:664]
OCR.keyExists(SYSTEM.css.interfaces.global)
[main] [18:27:46:482] [OCRTreeHA.listInterfaceNames:1769] Retrieving
list of interfaces from OCR
[main] [18:27:46:483] [OCR.listSubKeys:615]
OCR.listSubKeys(SYSTEM.css.interfaces.global)
[main] [18:27:46:483] [OCR.listSubKeys:634] got 2 subKeys
[main] [18:27:46:483] [OCR.getKeyValue:411]
OCR.getKeyValue(SYSTEM.css.interfaces.global.eth0)
[main] [18:27:46:484] [OCR.listSubKeys:641] subKeys[0]=[eth0],
val=[null]
[main] [18:27:46:484] [OCR.getKeyValue:411]
OCR.getKeyValue(SYSTEM.css.interfaces.global.eth1)
[main] [18:27:46:485] [OCR.listSubKeys:641] subKeys[1]=[eth1],
val=[null]
[main] [18:27:46:487] [OCRTreeHA.listInterfaceNames:1779] Releasing
shared CSS lock SRVM.SYSTEM.css.interfaces
[main] [18:27:46:488] [OCRTreeHA.listInterfaces:1824] interface :
eth0
[main] [18:27:46:488] [OCR.listSubKeys:615]
OCR.listSubKeys(SYSTEM.css.interfaces.global.eth0)
[main] [18:27:46:488] [OCR.listSubKeys:625] Returning with just the
subkey names
[main] [18:27:46:489] [OCRTreeHA.listInterfaces:1833] Found 1 subnets
configured for interface:eth0
[main] [18:27:46:489] [OCRTreeHA.listInterfaces:1838] interface
subnet : 10|d1|d1|d0
[main] [18:27:46:489] [OCR.listSubKeys:615]
OCR.listSubKeys(SYSTEM.css.interfaces.global.eth0.10|d1|d1|d0)
[main] [18:27:46:490] [OCR.listSubKeys:625] Returning with just the
subkey names
[main] [18:27:46:490] [OCRTreeHA.listInterfaces:1852]
OCR(ipTypeKey)=1
[main] [18:27:46:490] [OCR.getKeyValue:411]
OCR.getKeyValue(SYSTEM.css.interfaces.global.eth0.10|d1|d1|d0.1)
[main] [18:27:46:490] [OCRTreeHA.listInterfaces:1860] OCR(ifType)
=public
[main] [18:27:46:491] [OCRTreeHA.listInterfaces:1862] Adding
interface: eth0 to the list
[main] [18:27:46:491] [OCRTreeHA.listInterfaces:1824] interface :
eth1
[main] [18:27:46:491] [OCR.listSubKeys:615]
OCR.listSubKeys(SYSTEM.css.interfaces.global.eth1)
[main] [18:27:46:491] [OCR.listSubKeys:625] Returning with just the
subkey names
[main] [18:27:46:492] [OCRTreeHA.listInterfaces:1833] Found 1 subnets
configured for interface:eth1
[main] [18:27:46:492] [OCRTreeHA.listInterfaces:1838] interface
subnet : 10|d1|d10|d0
[main] [18:27:46:492] [OCR.listSubKeys:615]
OCR.listSubKeys(SYSTEM.css.interfaces.global.eth1.10|d1|d10|d0)
[main] [18:27:46:492] [OCR.listSubKeys:625] Returning with just the
subkey names
[main] [18:27:46:493] [OCRTreeHA.listInterfaces:1852]
OCR(ipTypeKey)=1
[main] [18:27:46:494] [OCR.getKeyValue:411]
OCR.getKeyValue(SYSTEM.css.interfaces.global.eth1.10|d1|d10|d0.1)
[main] [18:27:46:494] [OCRTreeHA.listInterfaces:1860] OCR(ifType)
=cluster_interconnect
[main] [18:27:46:495] [OCRTreeHA.listInterfaces:1862] Adding
interface: eth1 to the list
[main] [18:27:46:495] [Cluster.extractInterfaces:1560] ifType = 4
[main] [18:27:46:495] [Cluster.extractInterfaces:1561] ipType = 0
[main] [18:27:46:495] [Cluster.extractInterfaces:1565] No. of
interfaces found in OCR = 2
[main] [18:27:46:495] [Cluster.extractInterfaces:1586] netIFs.length
= 1
[main] [18:27:46:495] [VipCA.getInterfaces:587] Public interfaces are
obtained from OCR
[main] [18:27:46:496] [HASContext.getInstance:190] Module init : 6
[main] [18:27:46:496] [HASContext.getInstance:214] Local Module
init : 0
[main] [18:27:46:496] [HASContext.getInstance:249] HAS Context
Allocated: 4 to oracle.ops.mgmt.has.Util@1ef9f1d
[main] [18:27:46:496] [Util.<init>:86] Util Instance created.
[main] [18:27:46:512] [VipCA.runApplication:2891]
oracle.ops.vipca.VipCA.runApplication(VipCA.java:2853)
oracle.ops.vipca.VipCA.main(VipCA.java:3147)
Insufficient privileges.
Insufficient privileges.
oracle@mceclnx1:/opt/oracle_crs/product/10.2.0/CRS/bin> ./olsnodes
mceclnx1
mceclnx2
These are the results when I run:
./runcluvfy.sh stage -pre crsinst -n mceclnx1,mceclnx2
Performing pre-checks for cluster services setup
Checking node reachability...
Node reachability check passed from node "mceclnx1".
Checking user equivalence...
User equivalence check passed for user "oracle".
Checking administrative privileges...
User existence check passed for "oracle".
Group existence check passed for "oinstall".
Membership check for user "oracle" in group "oinstall" [as Primary]
passed.
Administrative privileges check passed.
Checking node connectivity...
Node connectivity check passed for subnet "10.1.1.0" with node(s)
mceclnx2,mceclnx1.
Node connectivity check passed for subnet "10.1.10.0" with node(s)
mceclnx2,mceclnx1.
Suitable interfaces for the private interconnect on subnet "10.1.1.0":
mceclnx2 eth0:10.1.1.6
mceclnx1 eth0:10.1.1.5 eth0:10.1.1.5 eth0:10.1.1.5
Suitable interfaces for the private interconnect on subnet
"10.1.10.0":
mceclnx2 eth1:10.1.10.2
mceclnx1 eth1:10.1.10.1
ERROR:
Could not find a suitable set of interfaces for VIPs.
Node connectivity check failed.
Checking system requirements for 'crs'...
Total memory check passed.
Free disk space check passed.
Swap space check passed.
System architecture check passed.
Kernel version check passed.
Package existence check passed for "make-3.79".
Package existence check passed for "binutils-2.14".
Package existence check passed for "gcc-3.2".
Package existence check passed for "glibc-2.3.2-95.27".
Package existence check failed for "compat-db-4.0.14-5".
Check failed on nodes:
mceclnx2,mceclnx1
Package existence check failed for "compat-gcc-7.3-2.96.128".
Check failed on nodes:
mceclnx2,mceclnx1
Package existence check failed for "compat-gcc-c++-7.3-2.96.128".
Check failed on nodes:
mceclnx2,mceclnx1
Package existence check failed for "compat-libstdc++-7.3-2.96.128".
Check failed on nodes:
mceclnx2,mceclnx1
Package existence check failed for "compat-libstdc++-
devel-7.3-2.96.128".
Check failed on nodes:
mceclnx2,mceclnx1
Package existence check failed for "openmotif-2.2.3".
Check failed on nodes:
mceclnx2,mceclnx1
Package existence check failed for "setarch-1.3-1".
Check failed on nodes:
mceclnx2,mceclnx1
Group existence check passed for "dba".
Group existence check passed for "oinstall".
User existence check passed for "nobody".
System requirement failed for 'crs'
Pre-check for cluster services setup was unsuccessful on all the
nodes.
http://forums.oracle.com/forums/thread.jspa?threadID=337286&tstart=60
oracle@mceclnx1:/opt/oracle_crs/product/10.2.0/CRS/bin> ./vipca -
nodelist mceclnx1,mceclnx2 -nodevips 'mceclnx1/mceclnx1-vip/
255.255.255.0/eth0,mceclnx2/mceclnx2-vip/255.255.255.0/eth0'
The given interface(s), "eth0" is not public. Public interfaces should
be used to configure virtual IPs.
...
ETH0 is the interface I want... and I flagged it as "public" during
the installation process (using the universal installer...)
eth0 Link encap:Ethernet HWaddr [ this would be the MAC ]
inet addr:10.1.1.5 Bcast:10.1.1.255 Mask:255.255.255.0
inet6 addr: fe80::219:bbff:fe22:a37a/64 Scope:Link
UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
RX packets:1109605 errors:0 dropped:0 overruns:0 frame:0
TX packets:915021 errors:0 dropped:0 overruns:0 carrier:0
collisions:0 txqueuelen:1000
RX bytes:108961439 (103.9 Mb) TX bytes:791088532 (754.4 Mb)
Interrupt:19
eth1 Link encap:Ethernet HWaddr [ MAC goes here :) ]
inet addr:10.1.10.1 Bcast:10.1.10.255 Mask:255.255.255.0
inet6 addr: fe80::219:bbff:fe22:a379/64 Scope:Link
UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
RX packets:414553 errors:0 dropped:0 overruns:0 frame:0
TX packets:457072 errors:0 dropped:0 overruns:0 carrier:0
collisions:0 txqueuelen:1000
RX bytes:49661427 (47.3 Mb) TX bytes:50974811 (48.6 Mb)
Interrupt:20
Anyway -- here's the fix:
http://www.oracle.com/technology/pub/articles/hunter_rac10gr2_3.html
It is a *known issue* with the 10.2.0.1 install...
To quote the article:
"This issue is specific to Oracle 10.2.0.1 (noted in bug 4437727) and
needs to be resolved before continuing. The easiest workaround is to
re-run vipca (GUI) manually as root from the last node in which the
error occurred. Please keep in mind that vipca is a GUI and will need
to set your DISPLAY variable accordingly to your X server:"
... so plain english...
Log in as root to the X Server and run $ORA_CRS_HOME/bin/vipca and
hit next ... type in your first node's VIP hostname / alias and then
it'll automatically populate everything -- hit next again ... it
installs... rerun the postint script and presto -- everything works...
move on to install the actual Oracle DB software.
Performing post-checks for cluster services setup
Checking node reachability...
Check: Node reachability from node "mceclnx1"
Destination Node Reachable?
------------------------------------ ------------------------
mceclnx1 yes
mceclnx2 yes
Result: Node reachability check passed from node "mceclnx1".
Checking user equivalence...
Check: User equivalence for user "oracle"
Node Name Comment
------------------------------------ ------------------------
mceclnx2 passed
mceclnx1 passed
Result: User equivalence check passed for user "oracle".
Checking Cluster manager integrity...
Checking CSS daemon...
Node Name Status
------------------------------------ ------------------------
mceclnx2 running
mceclnx1 running
Result: Daemon status check passed for "CSS daemon".
Cluster manager integrity check passed.
Checking cluster integrity...
Node Name
------------------------------------
mceclnx1
mceclnx2
Cluster integrity check passed
Checking OCR integrity...
Checking the absence of a non-clustered configuration...
All nodes free of non-clustered, local-only configurations.
Uniqueness check for OCR device passed.
Checking the version of OCR...
OCR of correct Version "2" exists.
Checking data integrity of OCR...
Data integrity check for OCR passed.
OCR integrity check passed.
Checking CRS integrity...
Checking daemon liveness...
Check: Liveness for "CRS daemon"
Node Name Running
------------------------------------ ------------------------
mceclnx2 yes
mceclnx1 yes
Result: Liveness check passed for "CRS daemon".
Checking daemon liveness...
Check: Liveness for "CSS daemon"
Node Name Running
------------------------------------ ------------------------
mceclnx2 yes
mceclnx1 yes
Result: Liveness check passed for "CSS daemon".
Checking daemon liveness...
Check: Liveness for "EVM daemon"
Node Name Running
------------------------------------ ------------------------
mceclnx2 yes
mceclnx1 yes
Result: Liveness check passed for "EVM daemon".
Liveness of all the daemons
Node Name CRS daemon CSS daemon
EVM daemon
------------ ------------------------ ------------------------
----------
mceclnx2 yes yes
yes
mceclnx1 yes yes
yes
Checking CRS health...
Check: Health of CRS
Node Name CRS OK?
------------------------------------ ------------------------
mceclnx2 yes
mceclnx1 yes
Result: CRS health check passed.
CRS integrity check passed.
Checking node application existence...
Checking existence of VIP node application
Node Name Required Status
Comment
------------ ------------------------ ------------------------
----------
mceclnx2 yes exists
passed
mceclnx1 yes exists
passed
Result: Check passed.
Checking existence of ONS node application
Node Name Required Status
Comment
------------ ------------------------ ------------------------
----------
mceclnx2 no exists
passed
mceclnx1 no exists
passed
Result: Check passed.
Checking existence of GSD node application
Node Name Required Status
Comment
------------ ------------------------ ------------------------
----------
mceclnx2 no exists
passed
mceclnx1 no exists
passed
Result: Check passed.
Post-check for cluster services setup was successful.
Post your /etc/hosts file.
--
Daniel A. Morgan
University of Washington
damo...@x.washington.edu
(replace x with u to respond)
Puget Sound Oracle Users Group
www.psoug.org
127.0.0.1 localhost.localdomain localhost
# special IPv6 addresses
::1 localhost ipv6-localhost ipv6-loopback
fe00::0 ipv6-localnet
ff00::0 ipv6-mcastprefix
ff02::1 ipv6-allnodes
ff02::2 ipv6-allrouters
ff02::3 ipv6-allhosts
10.1.1.5 mceclnx1.int.domain.com mceclnx1
10.1.1.6 mceclnx2.int.domain.com mceclnx2
10.1.10.1 mceclnx1-priv.int.domain.com mceclnx1-priv
10.1.10.2 mceclnx2-priv.int.domain.com mceclnx2-priv
10.1.1.15 mceclnx1-vip.int.domain.com mceclnx1-vip
10.1.1.16 mceclnx2-vip.int.domain.com mceclnx2-vip
10.1.1.20 mcecluster.int.domain.com mcecluster
bye
Cristian Cudizio
http://oracledb.wordpress.com
http://cristiancudizio.wordpress.com
Really i don't know precisely how time it is that 10gR2 is certified
on SLES 10 but i'm sure is at least two months.
I think that sales people is not very well informed.
Have you tried to see at http://www.oracle.com/technology/tech/linux/validated-configurations/index.html
?
there can be interesting informations on packages required and useful.
Bye
It is a known issue with the 10.2.0.1 installation on Linux... you
have to run the VIP gui config manually after the installation...
then rerun the cluvfy script and it'll pass.
It just can't do the VIP assignment during the installation.
Ryan