Hi all, <div><br></div><div>I'm trying to cluster a PostgreSQL database in active-passive mode with two nodes. </div><div><br></div><div>I have tested it, and if I fence one node, the service fails over to the other node. But when I run the command to manually relocate the service:</div>
<div><br></div><div>clusvcadm -r BBDD -m NODO1 </div><div><br></div><div>while there are open connections to the database, I get these error messages in /var/log/messages:</div><div><br></div><div><div>Jul 15 09:27:29 NODO2 clurgmgrd[2493]: <notice> Stopping service service:BBDD</div>
<div>Jul 15 09:27:33 NODO2 clurgmgrd: [2493]: <err> Stopping Service postgres-8:BBDD > Failed - Application Is Still Running</div><div>Jul 15 09:27:33 NODO2 clurgmgrd: [2493]: <err> Stopping Service postgres-8:BBDD > Failed</div>
<div>Jul 15 09:27:33 NODO2 clurgmgrd[2493]: <notice> stop on postgres-8 "BBDD" returned 1 (generic error)</div><div>Jul 15 09:27:33 NODO2 clurgmgrd: [2493]: <err> Stopping Service postgres-8:BBDD > Failed</div>
<div>Jul 15 09:27:33 NODO2 clurgmgrd[2493]: <notice> stop on postgres-8 "BBDD" returned 1 (generic error)</div><div>Jul 15 09:27:33 NODO2 avahi-daemon[2304]: Withdrawing address record for 192.168.1.183 on eth0.</div>
<div>Jul 15 09:27:43 NODO2 clurgmgrd[2493]: <crit> #12: RG service:BBDD failed to stop; intervention required</div><div>Jul 15 09:27:43 NODO2 clurgmgrd[2493]: <notice> Service service:BBDD is failed</div><div>
Jul 15 09:27:43 NODO2 clurgmgrd[2493]: <warning> #70: Failed to relocate service:BBDD; restarting locally</div><div>Jul 15 09:27:43 NODO2 clurgmgrd[2493]: <err> #43: Service service:BBDD has failed; can not start.</div>
<div>Jul 15 09:27:44 NODO2 clurgmgrd[2493]: <alert> #2: Service service:BBDD returned failure code. Last Owner: NODO2.</div><div>Jul 15 09:27:44 NODO2 clurgmgrd[2493]: <alert> #4: Administrator intervention required.</div>
<div><br></div><div>If I check the status of the service: </div><div><div>Service Name Owner (Last) State</div><div> ------- ---- ----- ------ -----</div>
<div> service:BBDD (NODO2) failed</div><div><br></div><div>And if I check with ps:</div><div><br></div><div><div># ps aux | grep postgres</div>
<div>root 21552 0.0 0.2 2844 1120 ? S< 09:27 0:00 su - postgres -c /usr/bin/postmaster -c config_file="/etc/cluster/postgres-8/postgres-8:BBDD/postgresql.conf" -D /nfsvol/pgsql/data</div>
<div>postgres 21553 0.1 0.5 21504 3076 ? S<s 09:27 0:00 /usr/bin/postmaster -c config_file=/etc/cluster/postgres-8/postgres-8:BBDD/postgresql.conf -D /nfsvol/pgsql/data</div><div>postgres 21591 0.0 0.1 11284 608 ? S< 09:27 0:00 postgres: logger process </div>
<div>postgres 21593 0.0 0.1 21504 896 ? S< 09:27 0:00 postgres: writer process </div><div>postgres 21594 0.0 0.1 12284 608 ? S< 09:27 0:00 postgres: stats buffer process </div>
<div>postgres 21595 0.0 0.1 11428 804 ? S< 09:27 0:00 postgres: stats collector process </div><div>postgres 21720 0.0 0.8 22280 4328 ? S< 09:27 0:00 postgres: postgres postgres 192.168.1.170(2849) idle </div>
<div> </div><div><br></div><div>So I think the problem is with the postgres-8 stop script that comes with the cluster suite: it should either close all the open connections or wait until they finish.</div>
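<div><br></div><div>For reference, here is a rough sketch of what I would expect the stop step to do. PostgreSQL treats SIGTERM as a "smart" shutdown, which waits for all clients to disconnect, and SIGINT as a "fast" shutdown, which rolls back open transactions and disconnects the clients. If the stop script only attempts a smart shutdown, an idle session like the one from 192.168.1.170 would keep it waiting. The helper names signal_for_mode and stop_postmaster are made up for illustration and are not part of the cluster suite's script; the data directory is taken from the ps output.</div>

```shell
#!/bin/sh
# Sketch only, not the cluster suite's stop script.
# signal_for_mode and stop_postmaster are hypothetical helper names.
PGDATA="${PGDATA:-/nfsvol/pgsql/data}"

# Map a PostgreSQL shutdown mode to the signal the postmaster expects.
signal_for_mode() {
    case "$1" in
        smart)     echo TERM ;;  # wait for clients to disconnect
        fast)      echo INT ;;   # roll back transactions, disconnect clients
        immediate) echo QUIT ;;  # abort without a clean shutdown
        *)         return 1 ;;
    esac
}

# Send the signal for the given mode to the postmaster, then wait up to
# the given number of seconds (default 30) for it to exit.
# Returns 0 if the postmaster is gone, 1 on timeout, 2 on a bad mode.
stop_postmaster() {
    mode="$1"
    timeout="${2:-30}"
    sig=$(signal_for_mode "$mode") || return 2
    # The first line of postmaster.pid holds the postmaster PID.
    # A missing file is treated as "already stopped".
    pid=$(head -n 1 "$PGDATA/postmaster.pid" 2>/dev/null) || return 0
    [ -n "$pid" ] || return 0
    kill -s "$sig" "$pid" 2>/dev/null || return 0
    while [ "$timeout" -gt 0 ]; do
        kill -0 "$pid" 2>/dev/null || return 0
        sleep 1
        timeout=$((timeout - 1))
    done
    return 1
}
```

<div>With these helpers, something like "stop_postmaster smart 10 || stop_postmaster fast 30" would first give clients a chance to disconnect and then drop them, instead of reporting "Application Is Still Running".</div>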
<div><br></div><div>Has anyone else had this problem, and how can it be solved?</div><div><br></div><div>Thanks in advance</div><div><br></div><div>ESG</div><div><br></div><div><br></div></div></div></div><div><br></div><div><br></div>