--
You received this message because you are subscribed to the Google Groups "lmt-discuss" group.
To post to this group, send email to lmt-d...@googlegroups.com.
To unsubscribe from this group, send email to lmt-discuss...@googlegroups.com.
For more options, visit this group at http://groups.google.com/group/lmt-discuss?hl=en.
Hi Lisa, that's a known issue that really only affects cerebro-stat.
The data is hashed by hostname so the different ost's overwrite each other.
However the cerebro monitor module gets a callback on arrival of each message
(before the hash) so it doesn't affect lmt.
> Second - I don't see a metric for the mgs. Only lmt_mds. is there a
> metric for the mgs?
Right, there isn't one.
> Third I am not sure where the lmt server needs to be installed. I
> tried to load it on a machine
> that was already running mysql but this machine is not a lustre client
> or server. once cerebro is
> installed it does not see any of the lmt lustre metrics. Does this
> machine have to at least be a
> lustre client? Is it advisable to run it on the mgs or mds?
We typically run it on a management node that is not running any lustre
services. You will need cerebro configured to listen on the network the
others are talking on.
> thanks for any insight you can provide.
>
> I am using cerebro version cerebro-1.10-1 with lmt 1.8.3 and lustre
> server version 1.8.2.
I think you have an lmt 2.x version there, based on your description.
Jim
Generally the MGS has not been a problem for us so we felt no strong need
to develop software to monitor it.
> Regarding the LMT server configuration:
> Is there any documentation on how to configure cerebro to listen on
> the
> network the others are talking on? I did not have to do anything on
> the lustre servers
> to get this to happen. And I see some minimal documentation about an /
> etc/cerebro.conf file but none
> of the machines actually has one of these.
There is a man page cerebro.conf - I think this is the relevant section:
cerebrod_listen ( on | off )
Specify if the cerebrod daemon should receive packets or not.
Generally speaking, this option should be turned off for all but
just a few machines in a cluster. Just a few of the machines
should be responsible for collecting the Cerebro data so that
computational power of the cluster as a whole isn’t lost. The
default configuration turns this option on.
(Although I admit I'm not the expert on how to do this).
Jim
FYI the newest cerebro comes with a template /etc/cerebro.conf. But to
answer your question, cerebro may have chosen a different network by
default to speak/listen on. You can tell cerebro to use a different
interface via:
cerebrod_listen_message_config ip [port network_interface]
the manpage cerebro.conf will have more config details.
Al
> > > > For more options, visit this group athttp://*groups.google.com/group/lmt-discuss?hl=en.
> >
> > --
> > You received this message because you are subscribed to the Google Groups "lmt-discuss" group.
> > To post to this group, send email to lmt-d...@googlegroups.com.
> > To unsubscribe from this group, send email to lmt-discuss...@googlegroups.com.
> > For more options, visit this group at http://*groups.google.com/group/lmt-discuss?hl=en.
> >
>
--
Albert Chu
ch...@llnl.gov
Computer Scientist
High Performance Systems Division
Lawrence Livermore National Laboratory
# test.cf
mds {
name cmsls0
uuid test-MDT0000
}
ost {
name cmsls[1-5]
numdevs 6
uuid foo-OST{HEXINDEX}
skip 1
}
create_lmtconfig -d test -f test.cf >test.sql
Sorry this utility is a pain. There is some (possibly deficient)
discussion of how this all works in the create_lmtconfig man page.
Jim
1. Does monitor module have insert permission? It reads its database
user/password from /usr/share/lmt/cron/lmtrc.
2. The database is named filesystem_fsname. The devices (e.g.
fsname-OST0000) have to match the database name.
3. Is there a mismatch between hostnames used in metrics vs. hostnames
in the _INFO tables? I.e. does one use the fully qualified domain
name and one not?
4. Check syslog - there may be some errors from the monitor module there.
Jim
> ...
>
> read more »
> ...
>
> read more »
This suggests that the plugin isn't linked correctly. Do libmysql show
up if you run
ldd /usr/lib64/cerebro/cerebro_monitor_lmt.so
Al
> --
> You received this message because you are subscribed to the Google Groups "lmt-discuss" group.
> To post to this group, send email to lmt-d...@googlegroups.com.
> To unsubscribe from this group, send email to lmt-discuss...@googlegroups.com.
> For more options, visit this group at http://BLOCKEDgroups.google.com/group/lmt-discuss?hl=en.
All you do is list each host in the cluster, one host per line.
However, if you don't have to have it. Cerebro is probably demanding
you have it because you installed the hostsfile plugin
(cerebro-clusterlist-hostsfile or something like that).
Al