pgsql and net-snmp

"Jerome Blomart" <j.blomart@xxxxxxxxx> · Thu, 17 Jan 2008 18:43:27 +0100

Hello,

**** I really should ask if i am mailing to the correct mailing list in regarding the question i will ask.
**** But ...

I am working for a  small company  offering IT services.
We are in  the acceptance stage  for a  migration of an old UNIX app managing a huge load of data.

This acceptance environment is using a VMWARE ESX platform, a jboss as application server ( java servlet as client application ) 
and a pgsql database. As the developpers knew there will be a lot of data, the server specs are quite high...

( on RAM space for instance... jboss + pgsql on the same server ).

As it is ... this environement gets poor results ... 

This could be a lot of things and as i am charged of the applications monitoring....

How-to monitor pgsql ? 
( considering the server's general statistics and events are already covered : 
cpu usage, memory usage,  shutdown and reboot events, unplanned down time )

- i have got a net-snmp at service

- i have got an event listener too ( traplistener + event-rules)

After looking through the pgsql manual and google ...
- i want base info on the pgsql processes: cpu usage, memory usage and so on...   
- i want base healthiness information on the database 

lets get to pgsql...

- base processes information:
-------------------------------------
As i have understood it pgsql runs as as a single process and forks itself for each clients that connects to the database.

As i have seen the number of starting processes is set into pgsql.
( when there is no connection to the server i see six process postgres in idle mode )
The pgsql client processes names are giving informations on themselves:

using egrep and ps:
   #ps aux | egrep ".*postgres: .* .*:ffff:.*\..*\..*\..*:.*"
   returns a list of pgsqlclient processes

net-snmp ?
   as i look into monitoring pgsql client processes i have written a shell/awk script returning little data on those.

   #./pgsql.cli.mon
   $ [ number of processes ]
   $ [ min cpu usage % ]
   $ [ average cpu usage % ]
   $ [ max cpu usage % ]
   $ [ min mem usage % ]
   $ [ average mem usage % ]
   $ [ max mem usage % ]

   $ [ min  virtual mem  ]
   $ [ average virtual mem ]
   $ [ max virtual mem ]
   $ [ min real mem ]
   $ [ average real mem ]
   $ [ max real mem ]
   and by feeding it into snmpd.conf (exec statement) i get those data  through snmp

The maximum and minimum number of processes allowed are already managed by the net-snmp base agent.

*** That is a real tiny base for some performance testing. 
*** The script should be rewritten in an other language ( perl ): 

***    making sums and averages of doubles in shell lead me ( as too often... ) use pipes and awk....

- database healthiness ?
------------------------------
For that part i have only google'd  and looked into the pgsql manual (  and extracted a "SELECT * from pg_stats" from pgsql ).

I could use an extraction script that maps selected data into snmp ( as a have done for the processes informations ).
But using such a way to pack pgsql's pg_stats into snmp will include regular query to this table.

Is there any performance issues generated by this type of monitoring ?

As  i have not found any pgsql monitoring agent using snmp (yet) ,  i am trying to gather any data concerning pgsql and feed it to snmp.

But i always feel like i have not looked enough for anything.

Is there any snmp agent, for pgsql ? If i gather statistics from pg_stats, is there any values/properties, specific to pgsql that i should look into  ?