[Bucardo-general] indirect bucardo problem - stuck triggers?

Paul Theodoropoulos paul at anastrophe.com
Fri Apr 25 21:30:07 UTC 2014


On 4/25/2014 1:50 PM, Paul Theodoropoulos wrote:
> On 4/25/2014 1:10 PM, Greg Sabino Mullane wrote:
>> On Fri, Apr 25, 2014 at 12:46:55PM -0700, Paul Theodoropoulos wrote:
>>> INFO:  "bucardo_truncate_trigger": found 623880 removable, 181642
>>> nonremovable row versions in 1111573 pages
>> Interesting. Smells like a bug to me. I'll look into this. So -
>> did any of your tables have truncates?
>
> Unfortunately, I've no idea.
>
> Even after making the adjustments I mentioned before, the database 
> continued growing - and stopping bucardo and vacuuming brought it back 
> down. The errors look just like the ones I was getting last week. On 
> exit, i get the familiar 'corrupted double-linked list' backtrace. I'm 
> going to upgrade to 4.9.12 and see if it helps.
>
Replying to myself - upgraded to 4.99.12, still getting errors, but 
seemingly not as noisily. Also, not sure what to make of this on startup:

  bucardo_current_version   => '4.99.12'
  bucardo_vac               => '1'
  bucardo_version           => '4.99.10'

But both /usr/local/bin/bucardo and 
/usr/local/share/perl/5.10.1/Bucardo.pm show 4.99.12 on their VERSION line.

Log output:

(27323) [Fri Apr 25 14:03:04 2014] CTL New controller for sync 
"trumgr_main_sync". Relgroup is "trumgr_main_rels", dbs is 
"trumgr_main_dbs". PID=27323
(27330) [Fri Apr 25 14:03:05 2014] KID (trumgr_detailed_tracking_sync) 
New kid, sync "trumgr_detailed_tracking_sync" alive=1 Parent=27320 
PID=27330 kicked=1
(27333) [Fri Apr 25 14:03:06 2014] KID (trumgr_main_sync) New kid, sync 
"trumgr_main_sync" alive=1 Parent=27323 PID=27333 kicked=1
(27336) [Fri Apr 25 14:03:06 2014] KID (trumgr_group_sync) New kid, sync 
"trumgr_group_sync" alive=1 Parent=27317 PID=27336 kicked=1
Exiting eval via redo at /usr/local/share/perl/5.10.1/Bucardo.pm line 4963.
(27336) [Fri Apr 25 14:16:42 2014] KID (trumgr_group_sync) Warning! 
Aborting due to exception for public.unit_history_47:? Error was 
DBD::Pg::db pg_ready failed: No asynchronous query is running at 
/usr/local/share/perl/5.10.1/Bucardo.pm line 8531.
(27336) [Fri Apr 25 14:16:42 2014] KID (trumgr_group_sync) Kid has died, 
error is: DBD::Pg::db pg_ready failed: No asynchronous query is running 
at /usr/local/share/perl/5.10.1/Bucardo.pm line 8531.
Line: 4924
Main DB state: ? Error: none
DB local_trumgr_group state: ? Error: 7
DB remote_trumgr_group state: ? Error: none
(27336) [Fri Apr 25 14:16:42 2014] KID (trumgr_group_sync) Kid has died, 
error is: DBD::Pg::db pg_cancel failed: No asynchronous query is running 
at /usr/local/share/perl/5.10.1/Bucardo.pm line 2300.
Line: 4965
Main DB state: ? Error: none
DB local_trumgr_group state: ? Error: 7
DB remote_trumgr_group state: ? Error: none
DBD::Pg::db pg_cancel failed: No asynchronous query is running at 
/usr/local/share/perl/5.10.1/Bucardo.pm line 2300.
(27317) [Fri Apr 25 14:16:47 2014] CTL Warning: Kid 27336 is not 
responding, will respawn
(29483) [Fri Apr 25 14:16:47 2014] KID (trumgr_group_sync) New kid, sync 
"trumgr_group_sync" alive=1 Parent=27317 PID=29483 kicked=1
Exiting eval via redo at /usr/local/share/perl/5.10.1/Bucardo.pm line 4963.
(29483) [Fri Apr 25 14:16:54 2014] KID (trumgr_group_sync) Warning! 
Aborting due to exception for public.units:? Error was DBD::Pg::db 
pg_ready failed: No asynchronous query is running at 
/usr/local/share/perl/5.10.1/Bucardo.pm line 8531.
(29483) [Fri Apr 25 14:16:54 2014] KID (trumgr_group_sync) Kid has died, 
error is: DBD::Pg::db pg_ready failed: No asynchronous query is running 
at /usr/local/share/perl/5.10.1/Bucardo.pm line 8531.
Line: 4924
Main DB state: ? Error: none
DB local_trumgr_group state: ? Error: none
DB remote_trumgr_group state: ? Error: 7
(29483) [Fri Apr 25 14:16:54 2014] KID (trumgr_group_sync) Kid has died, 
error is: DBD::Pg::db pg_cancel failed: No asynchronous query is running 
at /usr/local/share/perl/5.10.1/Bucardo.pm line 2300.
Line: 4965
Main DB state: ? Error: none
DB local_trumgr_group state: ? Error: none
DB remote_trumgr_group state: ? Error: 7
(29483) [Fri Apr 25 14:16:54 2014] KID (trumgr_group_sync) Ping failed 
for database local_trumgr_group
DBD::Pg::db pg_cancel failed: No asynchronous query is running at 
/usr/local/share/perl/5.10.1/Bucardo.pm line 2300.
(27317) [Fri Apr 25 14:16:58 2014] CTL Warning: Kid 29483 is not 
responding, will respawn
(29538) [Fri Apr 25 14:16:58 2014] KID (trumgr_group_sync) New kid, sync 
"trumgr_group_sync" alive=1 Parent=27317 PID=29538 kicked=1

-- 
Paul Theodoropoulos
www.anastrophe.com



More information about the Bucardo-general mailing list