[check_postgres] Detection of paused PgBouncer

Cyril Bouthors cyril.bouthors at isvtec.com
Thu Mar 28 19:08:27 UTC 2013


Hi,

First of all, I have to thank you for the excellent quality level that you
provided with check-postgres and the time it saved us when configuring Nagios to
check both PostgreSQL and PgBouncer.

We had a major downtime today on one of our live PostgreSQL cluster because a
maintenance script failed and paused PgBouncer [1] without resuming it [2].

PgBouncer is checked with Nagios by the following check-postgres scripts:

 - pgbouncer_backends
 - pgb_pool_maxwait
 - pgb_pool_cl_waiting
 - pgb_pool_sv_active

Unfortunately, none of them detected that PgBouncer was unable to handle queries
because it was paused.

Is there any particular check-postgres script that we can use to make sure that
this can be detected earlier?

If so, which script?

If not, would it be possible to check this? Maybe pgbouncer_backends should
check that?

Regards,

[1] psql -p 6432 pgbouncer -c PAUSE
[2] psql -p 6432 pgbouncer -c RESUME
-- 
Cyril Bouthors - ISVTEC: Building strong and fast web infrastructure
14 avenue de l'Opéra, 75001 Paris. 1 rue Émile Zola, 69002 Lyon
Tél : 01 84 16 16 17 - Ligne directe : 0x7B9EE3B0E - Fax : 01 77 72 57 24


More information about the Check_postgres mailing list