PDA

Afficher la version complète : Cluster HA DRBD



Hexoseth
28/05/2010, 14h39
Bonjour,

Je continue mon avancé sur mon cluster HA. J'ai installé DRBD pour faire du raid tcp/ip entre mes deux serveurs.

Je vous donne les infos :
Version noyau 2.6.31
package drbd8-utils
/home sur partiton /dev/sda6 en ext4


/etc/drbd.conf

# déclaration du cluster
resource drbd {

**protocol B;

**** syncer {
**** rate 100M;
#** group 1;
}

**on nodeA-1 {
****device**** /dev/drbd0;
****disk****** /dev/sda6;
****address****192.168.xxx.xxx:7788;
****meta-disk**internal;
}

****on nodeA-2 {
****device****/dev/drbd0;
****disk******/dev/sda6;
****address** 192.168.xxx.xxx:7788;
****meta-disk internal;
}
}

Lorsque j'ai lancer drbd j'ai eu une erreur

0: Failure: (119) No valid meta-data signature found.
et il me disait d'excuter cette commande

drbdadm create-md drbd

Maintenant voici le résultat

root@nodeA-1:~# drbdadm create-md drbd
md_offset 59295973376
al_offset 59295940608
bm_offset 59294130176

Found ext3 filesystem
****57906228 kB data area apparently used
****57904424 kB left usable by current configuration

Device size would be truncated, which
would corrupt data and result in
'access beyond end of device' errors.
You need to either
** * use external meta data (recommended)
** * shrink that filesystem first
** * zero out the device (destroy the filesystem)
Operation refused.

Command 'drbdmeta 0 v08 /dev/sda6 internal create-md' terminated with exit code 40
drbdadm create-md drbd: exited with code 40

Je pense que drbd n'accepte pas ext4!? Si c'est le cas est-ce que je peux modifier ma partition pour la mettre en ext3 tout en gardant les données dessus?

jluce
28/05/2010, 15h32
slt

tiens moi j'ai ca chez moi qui fonctionne en prod sur une partition d1To par contre en Reiserfs

http://www.linuxpedia.fr/doku.php/opensuse/drdb

j'ai pas trop le temps la je regarderais ca en détail ce week end ou lundi

a+

Hexoseth
28/05/2010, 15h37
Salut,

ext4 est rétro compatible avec le ext3 donc a priori pas de problème de ce coté. Je vais voir avec la doc de linuxpédia j'ai peu être mal fait un truc.

a+

Hexoseth
28/05/2010, 22h59
J'ai fais un essaye en change le fs en ext3 au lieu du ext4 précédement.

Ca ne change rien, toujours la même erreur.

J'arrête la pour ce soir.

A+

Hexoseth
31/05/2010, 09h10
Salut,

J'ai cherché plusieurs tutos surtout ceux qui montre les erreurs qui peuvent arrivé fréquenment.
Le problème viens du faite qu'il y a des données sur la partition du coup j'ai utiliser cette commande pour ecrire des zéros

shred -zvf -n 1 /dev/sda6

J'ai relancé la commande

drbdadm create-md drbd

ET la nikel, je poursuit la configuration, jusqu'a la prochaine panne. Non soyons optimiste.

A+

jluce
31/05/2010, 13h21
<div class='quotetop'>Citation </div>
ET la nikel, je poursuit la configuration, jusqu'a la prochaine panne. Non soyons optimiste.

A+[/b]


ouais surtout que ca tourne pas mal et en plus (je sais pas si tu en est déjà la) heartbeat s'occupe tout seul de passé le DRDB esclave en maitre ce qui fait que t'as pas de soucis pour le rebasculer ;)

a+

tynho
29/06/2010, 16h17
slt Hexoseth
je suis nouveau dans ce forum cependant je voulais savoir si tu as pas un tuto sur la haute disponibilité et surttt sur la configuration de drbd .je suis actuellement sur un noyau 2.6 de debian.
Tres cordialement.

jluce
29/06/2010, 16h39
slt Hexoseth
je suis nouveau dans ce forum cependant je voulais savoir si tu as pas un tuto sur la haute disponibilité et surttt sur la configuration de drbd .je suis actuellement sur un noyau 2.6 de debian.
Tres cordialement.[/b]
slt et bienvenue a toi

comme tuto y'a ca:

http://www.linuxpedia.fr/doku.php/opensuse/drdb

c'est pas sous debian mais la conf de drdb est la meme sur tous les linux ;)

et si t'as des soucis pose tes questions j'en ai un qui tourne a la maison

a+

Hexoseth
29/06/2010, 22h39
slt Hexoseth
je suis nouveau dans ce forum cependant je voulais savoir si tu as pas un tuto sur la haute disponibilité et surttt sur la configuration de drbd .je suis actuellement sur un noyau 2.6 de debian.
Tres cordialement.[/b]


Salut,

J'ai utilisé se tuto trés bien avec des applications serveur qui peuvent avoir l'utilité du raid de serveur. C'est sur ubuntu sa sera trés proche au niveau configuration vu que ubuntu vient de debian.

Mirroir de deux serveurs (http://doc.ubuntu-fr.org/tutoriel/mirroring_sur_deux_serveurs)

J'ai pu résoudre les pannes principalement rencontré car à la fin il indique comment les solutionner.

Bonne continuation

tynho
30/06/2010, 14h12
slt jluce
j'ai bien vu et meme suivie le tuto cependant j'ai une preoccupation à savoir est que les deux serveurs doivent avoir la même architecture physique car moi j'ai actuellement deux serveurs dont l'un a un disque disque sda (avec 2 partition dont 1 sda4 de 45 gb ) et l'autre avec un disque hda (ide ) de 80 gb.

jluce
30/06/2010, 15h12
slt jluce
j'ai bien vu et meme suivie le tuto cependant j'ai une preoccupation à savoir est que les deux serveurs doivent avoir la même architecture physique car moi j'ai actuellement deux serveurs dont l'un a un disque disque sda (avec 2 partition dont 1 sda4 de 45 gb ) et l'autre avec un disque hda (ide ) de 80 gb.[/b]
slt

quand on parle d'architecture c'est de deux partitions ayant la même taille dont on parle

le support physique n'as que peu d'importance que le disque soit en sata/scsi(sda) ou ide(hda)

le seul changement est ca déclaration dans le paramètres pour chaque serveur /dev/sda4 dans le premier et /dev/hda1 pour le deuxième du moment qu'elles sont de la meme taille

a+

tynho
30/06/2010, 16h02
slt
je veux savoir si dans le fichier /etc/drbd.conf ne contient que la ressource que j'aurais choisi en locurence r0 et mes deux noeuds de cluster car jusque là je ne progresse pas .
SOS
tres cordialemnet

jluce
30/06/2010, 18h22
slt
je veux savoir si dans le fichier /etc/drbd.conf ne contient que la ressource que j'aurais choisi en locurence r0 et mes deux noeuds de cluster car jusque là je ne progresse pas .
SOS
tres cordialemnet[/b]
tu peux poster ton drdb.conf pour voir

et nous renvoyer le résultat de la commande suivante:


cat /proc/drbd


a+

tynho
02/07/2010, 00h43
Voici le contenu de mon fichier drbd.conf sur les deux machines:

skip {
As you can see, you can also comment chunks of text
with a 'skip[optional nonsense]{ skipped text }' section.
This comes in handy, if you just want to comment out
some 'resource <some name> {...}' section:
just precede it with 'skip'.

The basic format of option assignment is
<option name><linear whitespace><value>;

It should be obvious from the examples below,
but if you really care to know the details:

<option name> :=
valid options in the respective scope
<value> := <num>|<string>|<choice>|...
depending on the set of allowed values
for the respective option.
<num> := [0-9]+, sometimes with an optional suffix of K,M,G
<string> := (<name>|\"([^\"\\\n]*|\\.)*\")+
<name> := [/_.A-Za-z0-9-]+
}
global {
usage-count yes;
}
common {
syncer { rate 10M; }
}
resource r0 {
protocol C;

handlers {
pri-on-incon-degr "echo o > /proc/sysrq-trigger ; halt -f";
pri-lost-after-sb "echo o > /proc/sysrq-trigger ; halt -f";
local-io-error "echo o > /proc/sysrq-trigger ; halt -f";
outdate-peer "/usr/lib/heartbeat/drbd-peer-outdater -t 5";
}

startup {
degr-wfc-timeout 120; # 2 minutes.
}

disk {
on-io-error detach;
}

net {
after-sb-0pri disconnect;
after-sb-1pri disconnect;
after-sb-2pri disconnect;
rr-conflict disconnect;
}

syncer {
rate 10M;
al-extents 257;
}
# Voici mon premier noeud de cluster
on destaingk {
device /dev/drbd0;
disk /dev/sda4;
address 192.168.1.162:7788;
meta-disk internal;
}
# et en voici le second
on debian {
device /dev/drbd0;
disk /dev/hda4;
address 192.168.1.20:7788;
meta-disk internal;
}
}
resource "r1" {
protocol C;
startup {
wfc-timeout 0; ## Infinite!
degr-wfc-timeout 120; ## 2 minutes.
}
disk {
on-io-error detach;
}
net {
}
syncer {
}

on amd {
device /dev/drbd1;
disk /dev/hde6;
address 192.168.22.11:7789;
meta-disk /dev/somewhere [7];
}

on alf {
device /dev/drbd1;
disk /dev/hdc6;
address 192.168.22.12:7789;
meta-disk /dev/somewhere [7];
}
}

resource r2 {
protocol C;

startup { wfc-timeout 0; degr-wfc-timeout 120; }
disk { on-io-error detach; }
net { timeout 60; connect-int 10; ping-int 10;
max-buffers 2048; max-epoch-size 2048; }
syncer { rate 4M; } # sync when r0 and r1 are finished syncing.
on amd {
address 192.168.22.11:7790;
disk /dev/hde7; device /dev/drbd2; meta-disk "internal";
}
on alf {
device "/dev/drbd2"; disk "/dev/hdc7"; meta-disk "internal";
address 192.168.22.12:7790;
}
}

resource r3 {
protocol C;

startup { wfc-timeout 0; degr-wfc-timeout 120; }
disk { on-io-error detach; }
syncer {
}
on amd {
device /dev/drbd3;
disk /dev/hde8;
address 192.168.22.11:7791;
meta-disk internal;
}
on alf {
device /dev/drbd3;
disk /dev/hdc8;
address 192.168.22.12:7791;
meta-disk /some/where[8];
}
}

mais cela ne marche pas tt a fait et je suis pret pour refaire mon config merci !

tynho
02/07/2010, 09h44
Je suis bloqué qlq'un peut jetter un coup d'oeil à ma config
Merci

jluce
02/07/2010, 10h36
Je suis bloqué qlq'un peut jetter un coup d'oeil à ma config
Merci[/b]
slt

tu devrais viré tous ce qui ne te sert pas dans cette conf... genre:

<div class='quotetop'>Citation </div>
Voici le contenu de mon fichier drbd.conf sur les deux machines:

skip {
As you can see, you can also comment chunks of text
with a 'skip[optional nonsense]{ skipped text }' section.
This comes in handy, if you just want to comment out
some 'resource <some name> {...}' section:
just precede it with 'skip'.

The basic format of option assignment is
<option name><linear whitespace><value>;

It should be obvious from the examples below,
but if you really care to know the details:

<option name> :=
valid options in the respective scope
<value> := <num>|<string>|<choice>|...
depending on the set of allowed values
for the respective option.
<num> := [0-9]+, sometimes with an optional suffix of K,M,G
<string> := (<name>|\"([^\"\\\n]*|\\.)*\")+
<name> := [/_.A-Za-z0-9-]+
}[/b]

et ne garder que ce qui t'interresse

de plus peux tu envoyer le résultat de la commande


cat /proc/drdb

de plus y'a un bout de temps y'avais un article sur DRDB sous DEBIAN dans linuxmag que j'avais scanner, tu peux le télécharger ici:

http://www.megaupload.com/?d=YJZ1PJ64

si ca peux aider

a+


de plus y'a des ressources pas configurer tu peux les viré en gros faut qu'il te reste un truc comme le fichier du premier post

a+

tynho
02/07/2010, 11h08
Voici ce que j'obtient en lancant drbd avec la configuration precedent sans les resources r1 à r3 qui ont ete commenté sur les deux noeuds avec skip {}
/etc/init.d/drbd start
Starting DRBD resources: In resource r0:
resource 'r2' mentioned in 'after' option is not known.
WARN:
You are using the 'drbd-peer-outdater' as outdate-peer program.
If you use that mechanism the dopd heartbeat plugin program needs
to be able to call drbdsetup and drbdmeta with root privileges.

You need to fix this with these commands:
chgrp haclient /sbin/drbdsetup
chmod o-x /sbin/drbdsetup
chmod u+s /sbin/drbdsetup

chgrp haclient /sbin/drbdmeta
chmod o-x /sbin/drbdmeta
chmod u+s /sbin/drbdmeta


et donc je vais lance ces commandes et puis je croix commenter le parametre after r2 dans la section syncer

Apres avoir commenter after "r2" je lance drbd et là j'ai
Starting DRBD resources: WARN:
You are using the 'drbd-peer-outdater' as outdate-peer program.
If you use that mechanism the dopd heartbeat plugin program needs
to be able to call drbdsetup and drbdmeta with root privileges.

You need to fix this with these commands:
chgrp haclient /sbin/drbdsetup
chmod o-x /sbin/drbdsetup
chmod u+s /sbin/drbdsetup

chgrp haclient /sbin/drbdmeta
chmod o-x /sbin/drbdmeta
chmod u+s /sbin/drbdmeta

[ d(r0) /dev/drbd0: Failure: (119) No valid meta-data signature found.

==> Use 'drbdadm create-md res' to initialize meta-data area. <==


[r0] cmd /sbin/drbdsetup /dev/drbd0 disk /dev/sda4 /dev/sda4 internal --set-defaults --create-device --on-io-error=detach failed - continuing!

s(r0) n(r0) ]WARN:
You are using the 'drbd-peer-outdater' as outdate-peer program.
If you use that mechanism the dopd heartbeat plugin program needs
to be able to call drbdsetup and drbdmeta with root privileges.

You need to fix this with these commands:
chgrp haclient /sbin/drbdsetup
chmod o-x /sbin/drbdsetup
chmod u+s /sbin/drbdsetup

chgrp haclient /sbin/drbdmeta
chmod o-x /sbin/drbdmeta
chmod u+s /sbin/drbdmeta

.
WARN:
You are using the 'drbd-peer-outdater' as outdate-peer program.
If you use that mechanism the dopd heartbeat plugin program needs
to be able to call drbdsetup and drbdmeta with root privileges.

You need to fix this with these commands:
chgrp haclient /sbin/drbdsetup
chmod o-x /sbin/drbdsetup
chmod u+s /sbin/drbdsetup

chgrp haclient /sbin/drbdmeta
chmod o-x /sbin/drbdmeta
chmod u+s /sbin/drbdmeta

puis j'ai lancé ces cmd jusqu'a la cmd
# drbdadm create-md r0
md_offset 5700112384
al_offset 5700079616
bm_offset 5699903488

Found ext3 filesystem which uses 5566520 kB
current configuration leaves usable 5566312 kB

Device size would be truncated, which
would corrupt data and result in
'access beyond end of device' errors.
You need to either
* use external meta data (recommended)
* shrink that filesystem first
* zero out the device (destroy the filesystem)
Operation refused.

Command 'drbdmeta /dev/drbd0 v08 /dev/sda4 internal create-md' terminated with exit code 40
drbdadm aborting




Voici son contenu
cat /proc/drbd
version: 8.0.14 (api:86/proto:86)
GIT-hash: bb447522fc9a87d0069b7e14f0234911ebdab0f7 build by phil@fat-tyre, 2008-11-12 16:40:33
0: cs:Unconfigured

tynho
02/07/2010, 12h22
slt jluce
Merci pour ton tuto
J'ai pu finalement resoudre le probleme et voici mes resultats:
~# drbdadm create-md r0
Writing meta data...
initialising activity log
NOT initialized bitmap
New drbd meta data block sucessfully created.
success
~# /etc/init.d/drbd start
Starting DRBD resources: [ d(r0) n(r0) ].
ps -e | grep drbd
11665 ? 00:00:00 drbd0_worker
11667 ? 00:00:00 drbd0_receiver
11673 ? 00:00:00 drbd0_asender
~# cat /proc/drbd
version: 8.0.14 (api:86/proto:86)
GIT-hash: bb447522fc9a87d0069b7e14f0234911ebdab0f7 build by phil@fat-tyre, 2008-11-12 16:40:33
0: cs:Connected st:Secondary/Secondary ds:Inconsistent/Inconsistent C r---
ns:0 nr:0 dw:0 dr:0 al:0 bm:0 lo:0 pe:0 ua:0 ap:0
resync: used:0/61 hits:0 misses:0 starving:0 dirty:0 changed:0
act_log: used:0/257 hits:0 misses:0 starving:0 dirty:0 changed:0


Et je crois c'est un bon debut ou bien et là je passe à la configuration de heartbeat

jluce
02/07/2010, 13h00
re

quel modif as tu fait ???

il te faut un en primary l'autre en secondary

tu peux faire ca sur ton serveur principal :


drbdadm – –overwrite-data-of-peer primary r0
tu devrais te retrouver avec ce genre de truc quand tu fais un cat /proc/drdb:


version: 8.2.6 (api:88/proto:86-88)
GIT-hash: 3e69822d3bb4920a8c1bfdf7d647169eba7d2eb4 build by phil@fat-tyre, 2008-05-30 12:59:17
0: cs:SyncSource st:Primary/Secondary ds:UpToDate/Inconsistent C r---
ns:2240 nr:0 dw:0 dr:2240 al:0 bm:0 lo:0 pe:0 ua:0 ap:0 oos:149132568
[>....................] sync'ed: 0.1% (1345637/1345639)M
finish: 17:15:38 speed: 83,2 (83,2) K/sec

ensuite il te faut creer le syteme de fichier je crois....

a+

tynho
02/07/2010, 15h25
J'ai actu un pb de synchronisation entre les deux noeuds

~# cat /proc/drbd
version: 8.0.14 (api:86/proto:86)
GIT-hash: bb447522fc9a87d0069b7e14f0234911ebdab0f7 build by phil@fat-tyre, 2008-11-12 16:40:33
0: cs:StandAlone st:Primary/Unknown ds:UpToDate/DUnknown r---
ns:5566312 nr:0 dw:0 dr:5566312 al:0 bm:340 lo:0 pe:0 ua:0 ap:0
resync: used:0/61 hits:0 misses:0 starving:0 dirty:0 changed:0
act_log: used:0/257 hits:0 misses:0 starving:0 dirty:0 changed:0

quoi??

jluce
02/07/2010, 15h50
re

ils se voient plus:(

redémarre tes drdb sur les deux ;)

a+

tynho
02/07/2010, 17h17
Pour la configuration de heartbeat es ce que le fichier de configuration de heartbeat doit etre le meme sur les deux serveurs?

Hexoseth
02/07/2010, 17h55
Pour la configuration de heartbeat es ce que le fichier de configuration de heartbeat doit etre le meme sur les deux serveurs?[/b]

Salut

Oui il faut avoir les mêmes fichiers de configuration pour heartbeat

A+

jluce
02/07/2010, 23h15
slt

dans le ha.cf y'a quand meme le ucast a mettre coherent...je crois...

a+

tynho
05/07/2010, 12h42
slt jluce tu as parfiatement raison pour le ucast dans le fichier ha.cf
ce qu'il faut noter c'est qu'il doit prendre respectivement l'adresse de chaque noeud sur lequel il est.

Cependant j'ai un petit soucis au lancement de heartbeat qui m'afiiche le message suivant :
Starting High-Availability services:
2010/07/05_11:26:24 INFO: Resource is stopped
Done.

mais qd je l'arrête sur mon noeud principal, le service apache 2 qu'il gere est pris en cpte par le noeud secondaire alors là je me demande si c'est vraiment un pb car tout parait fonctionner
il me reste à l'essayer depis une machine client.

jluce
05/07/2010, 12h49
slt jluce tu as parfiatement raison pour le ucast dans le fichier ha.cf
ce qu'il faut noter c'est qu'il doit prendre respectivement l'adresse de chaque noeud sur lequel il est.

Cependant j'ai un petit soucis au lancement de heartbeat qui m'afiiche le message suivant :
Starting High-Availability services:
2010/07/05_11:26:24 INFO: Resource is stopped
Done.

mais qd je l'arrête sur mon noeud principal, le service apache 2 qu'il gere est pris en cpte par le noeud secondaire alors là je me demande si c'est vraiment un pb car tout parait fonctionner
il me reste à l'essayer depis une machine client.[/b]
slt

heu non je crois que justement c'est pour la surveillance du noeud d'en face........

verifie ce qu'il te dit dans les log

a+

tynho
08/07/2010, 01h46
Bonsoir
je veux assurer le load balancing du service apache afin de l'attaquer depuis un client via une adresse virtuelle .
Maintenant ma question est comment procedé? puis que sur mes deux noeuds j'ai qu'une seule carte rx chacun
je sais qu'il faut crée par exemple une adresse virtuelle sur l'une des interfaces mais deql noeud maitre ou esclave?
et je pense qu'il faut aussi preciser l'interface sur le fichier /etc/ha.cf/haresources
Mais pour ce qui est des pages web où dois-je les placer?
Merci!!

jluce
08/07/2010, 09h00
Bonsoir
je veux assurer le load balancing du service apache afin de l'attaquer depuis un client via une adresse virtuelle .
Maintenant ma question est comment procedé? puis que sur mes deux noeuds j'ai qu'une seule carte rx chacun
je sais qu'il faut crée par exemple une adresse virtuelle sur l'une des interfaces mais deql noeud maitre ou esclave?
et je pense qu'il faut aussi preciser l'interface sur le fichier /etc/ha.cf/haresources
Mais pour ce qui est des pages web où dois-je les placer?
Merci!![/b]

slt

pour l'interface virtuelle tu n'as pas a la configurer sur une carte tu as juste a la configurer dans heartbeat

dans le fichier haressources a la fin tu dois avoir un truc comme ca:


media1**IPaddr::192.168.202.182 drbddisk::r0 Filesystem::/dev/drbd1::/srv apache2

le premier champs etant le nom du primaire (media1)
le deuxieme champs est pour l'addresse IP virtuel (IPaddr::192.168.202.182 )
le troisieme est pour gerer drdb ainsi que la ressources déclarer (drbddisk::r0)
le quatrieme est le montage du disque et son point de montage (Filesystem::/dev/drbd1::/srv)
le cinquieme les différents servicces a lancer par heartbeat (apache2)


pour le serveur web cela dépends des distribution

sous suse tu pose tes pages dans /srv/www/htdocs

je te conseille de faire un


find / -name htdocs

c'est la qu'il faut les poser mais je te conseille de lira ca:

http://www.linuxpedia.fr/doku.php/apache

c'est sous debian ;)

a+

tynho
08/07/2010, 23h42
slt
Merci pour ton lien mais j'ai deja mis en place un site sous debian cependant je voulais utiliser RAID IP mis en place avec heartbeat pour le mirroring et je demande où je devais mettre mes pages sinon que mon /dev/drbd0 est monté sur le repertoire /data.
Alors où réellement je dois les placer?

Merci!!!

jluce
09/07/2010, 00h24
slt
Merci pour ton lien mais j'ai deja mis en place un site sous debian cependant je voulais utiliser RAID IP mis en place avec heartbeat pour le mirroring et je demande où je devais mettre mes pages sinon que mon /dev/drbd0 est monté sur le repertoire /data.
Alors où réellement je dois les placer?

Merci!!![/b]
slt

tu as deux choix:

soit tu dis a apache que son documentroot c'est /data et tu mets tes pages dans /data

soit tu configure drdb pour qu'il te change le point de montage et qu'il te monte la partition dans le documentroot par default de apache et a ce moment la tu les mets comme d'hab ;)

a+

tynho
09/07/2010, 12h50
slt

tu as deux choix:

soit tu dis a apache que son documentroot c'est /data et tu mets tes pages dans /data

soit tu configure drdb pour qu'il te change le point de montage et qu'il te monte la partition dans le documentroot par default de apache et a ce moment la tu les mets comme d'hab ;)

a+[/b]

Slt
J'ai decidé d'adopter la premiere methode en changeant le DocumentRoot de mes sites virtuel par /data/www/htdocs sur le noeud principal
Maintement la question que je me pose est de savoir si je dois faire la même chose sur mon noeud secondaire?où laisser le drbd faire la replication à ma place
ce que je trouve etre l'idéal . Je te tiens au courant des resultats des différents scénarios.
Merci

jluce
09/07/2010, 12h58
Slt
J'ai decidé d'adopter la premiere methode en changeant le DocumentRoot de mes sites virtuel par /data/www/htdocs sur le noeud principal
Maintement la question que je me pose est de savoir si je dois faire la même chose sur mon noeud secondaire?où laisser le drbd faire la replication à ma place
ce que je trouve etre l'idéal . Je te tiens au courant des resultats des différents scénarios.
Merci[/b]
slt

de toute facon la réplication ne concernera que le répertoire /data donc pas la conf d'apache

pour palier ce probleme j'ai mis en place sur le secondaire un replication via rsync du répertoire de conf d'apache

comme ca des que tu change qqchose sur le primaire au niveau de la conf c'est directement balancer sur le secondaire ;)

a+

tynho
09/07/2010, 13h13
slt

de toute facon la réplication ne concernera que le répertoire /data donc pas la conf d'apache

pour palier ce probleme j'ai mis en place sur le secondaire un replication via rsync du répertoire de conf d'apache

comme ca des que tu change qqchose sur le primaire au niveau de la conf c'est directement balancer sur le secondaire ;)

a+[/b]

stl
Mais qu'en ait il des pages web car c'est ça que je veux repliquer
Cord

jluce
09/07/2010, 14h05
stl
Mais qu'en ait il des pages web car c'est ça que je veux repliquer
Cord[/b]
slt

c'est tout le répertoire /data qui est répliquer via drbd donc tous ce qu'il y a dedans (donc tes pages si elle sont dans /data/www/htdocs ;))

les fichiers de conf sont en général dans /etc/apache par consequent pas dans la partition drbd d'ou la syncro via rsync ;)

a+

tynho
10/07/2010, 11h58
slt
j'ai un ptit soucis ,mon second serveur ne monte pas la partition /dev/drbd0 lors du balancement au point ou sur le serveur secondaire heartbeat ne lance meme pas apache ,je me demande quel est le probleme?
card

jluce
11/07/2010, 11h37
slt
j'ai un ptit soucis ,mon second serveur ne monte pas la partition /dev/drbd0 lors du balancement au point ou sur le serveur secondaire heartbeat ne lance meme pas apache ,je me demande quel est le probleme?
card[/b]slt

envois les fichiers /var/log/messages des deux neouds au moment de la bascule

a+

tynho
12/07/2010, 14h53
Slt voici le resultat de la commande dmesg sur mon secondaire que je n'arrive pas à dechiffré!!!


[ 0.000000] Initializing cgroup subsys cpuset
[ 0.000000] Initializing cgroup subsys cpu
[ 0.000000] Linux version 2.6.26-2-686 (Debian 2.6.26-24) (dannf@debian.org) (gcc version 4.1.3 20080704 (prerelease) (Debian 4.1.2-25)) #1 SMP Mon Jun 21 05:58:44 UTC 2010
[ 0.000000] BIOS-provided physical RAM map:
[ 0.000000] BIOS-e820: 0000000000000000 - 000000000009fc00 (usable)
[ 0.000000] BIOS-e820: 000000000009fc00 - 00000000000a0000 (reserved)
[ 0.000000] BIOS-e820: 00000000000e7000 - 0000000000100000 (reserved)
[ 0.000000] BIOS-e820: 0000000000100000 - 000000001dfc0000 (usable)
[ 0.000000] BIOS-e820: 000000001dfc0000 - 000000001dfce000 (ACPI data)
[ 0.000000] BIOS-e820: 000000001dfce000 - 000000001dff0000 (ACPI NVS)
[ 0.000000] BIOS-e820: 000000001dff0000 - 000000001e000000 (reserved)
[ 0.000000] BIOS-e820: 00000000fec00000 - 00000000fec01000 (reserved)
[ 0.000000] BIOS-e820: 00000000fee00000 - 00000000fee01000 (reserved)
[ 0.000000] 0MB HIGHMEM available.
[ 0.000000] 479MB LOWMEM available.
[ 0.000000] found SMP MP-table at [c00ff780] 000ff780
[ 0.000000] Entering add_active_range(0, 0, 122816) 0 entries of 256 used
[ 0.000000] Zone PFN ranges:
[ 0.000000] DMA 0 -> 4096
[ 0.000000] Normal 4096 -> 122816
[ 0.000000] HighMem 122816 -> 122816
[ 0.000000] Movable zone start PFN for each node
[ 0.000000] early_node_map[1] active PFN ranges
[ 0.000000] 0: 0 -> 122816
[ 0.000000] On node 0 totalpages: 122816
[ 0.000000] DMA zone: 32 pages used for memmap
[ 0.000000] DMA zone: 0 pages reserved
[ 0.000000] DMA zone: 4064 pages, LIFO batch:0
[ 0.000000] Normal zone: 928 pages used for memmap
[ 0.000000] Normal zone: 117792 pages, LIFO batch:31
[ 0.000000] HighMem zone: 0 pages used for memmap
[ 0.000000] Movable zone: 0 pages used for memmap
[ 0.000000] DMI 2.3 present.
[ 0.000000] ACPI: RSDP 000F82F0, 0014 (r0 ACPIAM)
[ 0.000000] ACPI: RSDT 1DFC0000, 0030 (r1 A M I OEMRSDT 11000502 MSFT 97)
[ 0.000000] ACPI: FACP 1DFC0200, 0084 (r2 A M I OEMFACP 11000502 MSFT 97)
[ 0.000000] ACPI: DSDT 1DFC0400, 3A9F (r1 12345 12345123 123 INTL 2002026)
[ 0.000000] ACPI: FACS 1DFCE000, 0040
[ 0.000000] ACPI: APIC 1DFC0390, 006C (r1 A M I OEMAPIC 11000502 MSFT 97)
[ 0.000000] ACPI: OEMB 1DFCE040, 0046 (r1 A M I AMI_OEM 11000502 MSFT 97)
[ 0.000000] ACPI: PM-Timer IO Port: 0x808
[ 0.000000] ACPI: Local APIC address 0xfee00000
[ 0.000000] ACPI: LAPIC (acpi_id[0x01] lapic_id[0x00] enabled)
[ 0.000000] ACPI: LAPIC (acpi_id[0x02] lapic_id[0x81] disabled)
[ 0.000000] ACPI: LAPIC (acpi_id[0x03] lapic_id[0x82] disabled)
[ 0.000000] ACPI: LAPIC (acpi_id[0x04] lapic_id[0x83] disabled)
[ 0.000000] ACPI: IOAPIC (id[0x01] address[0xfec00000] gsi_base[0])
[ 0.000000] IOAPIC[0]: apic_id 1, version 3, address 0xfec00000, GSI 0-23
[ 0.000000] ACPI: INT_SRC_OVR (bus 0 bus_irq 0 global_irq 2 dfl dfl)
[ 0.000000] ACPI: INT_SRC_OVR (bus 0 bus_irq 9 global_irq 9 low level)
[ 0.000000] ACPI: IRQ0 used by override.
[ 0.000000] ACPI: IRQ2 used by override.
[ 0.000000] ACPI: IRQ9 used by override.
[ 0.000000] Enabling APIC mode: Flat. Using 1 I/O APICs
[ 0.000000] Using ACPI (MADT) for SMP configuration information
[ 0.000000] Allocating PCI resources starting at 20000000 (gap: 1e000000:e0c00000)
[ 0.000000] PM: Registered nosave memory: 000000000009f000 - 00000000000a0000
[ 0.000000] PM: Registered nosave memory: 00000000000a0000 - 00000000000e7000
[ 0.000000] PM: Registered nosave memory: 00000000000e7000 - 0000000000100000
[ 0.000000] SMP: Allowing 4 CPUs, 3 hotplug CPUs
[ 0.000000] PERCPU: Allocating 37992 bytes of per cpu data
[ 0.000000] NR_CPUS: 8, nr_cpu_ids: 4
[ 0.000000] Built 1 zonelists in Zone *****, mobility grouping on. Total pages: 121856
[ 0.000000] Kernel command line: root=/dev/hda1 ro quiet
[ 0.000000] mapped APIC to ffffb000 (fee00000)
[ 0.000000] mapped IOAPIC to ffffa000 (fec00000)
[ 0.000000] Enabling fast FPU save and restore... done.
[ 0.000000] Enabling unmasked SIMD FPU exception support... done.
[ 0.000000] Initializing CPU#0
[ 0.000000] PID hash table entries: 2048 (*****: 11, 8192 bytes)
[ 0.000000] Detected 2660.215 MHz processor.
[ 0.004000] Console: colour VGA+ 80x25
[ 0.004000] console [tty0] enabled
[ 0.004000] Dentry cache hash table entries: 65536 (*****: 6, 262144 bytes)
[ 0.004000] Inode-cache hash table entries: 32768 (*****: 5, 131072 bytes)
[ 0.004000] Memory: 476996k/491264k available (1771k kernel code, 13732k reserved, 750k data, 244k init, 0k highmem)
[ 0.004000] virtual kernel memory layout:
[ 0.004000] fixmap : 0xfff4c000 - 0xfffff000 ( 716 kB)
[ 0.004000] pkmap : 0xff800000 - 0xffc00000 (4096 kB)
[ 0.004000] vmalloc : 0xde800000 - 0xff7fe000 ( 527 MB)
[ 0.004000] lowmem : 0xc0000000 - 0xddfc0000 ( 479 MB)
[ 0.004000] .init : 0xc037f000 - 0xc03bc000 ( 244 kB)
[ 0.004000] .data : 0xc02badcd - 0xc0376620 ( 750 kB)
[ 0.004000] .text : 0xc0100000 - 0xc02badcd (1771 kB)
[ 0.004000] Checking if this processor honours the WP bit even in supervisor mode...Ok.
[ 0.004000] CPA: page pool initialized 1 of 1 pages preallocated
[ 0.084014] Calibrating delay using timer specific routine.. 5328.92 BogoMIPS (lpj=10657843)
[ 0.084064] Security Framework initialized
[ 0.084070] SELinux: Disabled at boot.
[ 0.084075] Capability LSM initialized
[ 0.084095] Mount-cache hash table entries: 512
[ 0.084293] Initializing cgroup subsys ns
[ 0.084299] Initializing cgroup subsys cpuacct
[ 0.084303] Initializing cgroup subsys devices
[ 0.084335] CPU: Trace cache: 12K uops, L1 D cache: 16K
[ 0.084339] CPU: L2 cache: 1024K
[ 0.084342] CPU: Hyper-Threading is disabled
[ 0.084347] Intel machine check architecture supported.
[ 0.084355] Intel machine check reporting enabled on CPU#0.
[ 0.084359] CPU0: Intel P4/Xeon Extended MCE MSRs (24) available
[ 0.084364] CPU0: Thermal monitoring enabled
[ 0.084367] using mwait in idle threads.
[ 0.084383] Checking 'hlt' instruction... OK.
[ 0.100479] SMP alternatives: switching to UP code
[ 0.111936] ACPI: Core revision 20080321
[ 0.124748] ENABLING IO-APIC IRQs
[ 0.125069] ..TIMER: vector=0x31 apic1=0 pin1=2 apic2=-1 pin2=-1
[ 0.164832] CPU0: Intel® Pentium® 4 CPU 2.66GHz stepping 01
[ 0.168010] Brought up 1 CPUs
[ 0.168010] Total of 1 processors activated (5328.92 BogoMIPS).
[ 0.168010] CPU0 attaching sched-domain:
[ 0.168010] domain 0: span 0
[ 0.168010] groups: 0
[ 0.168010] net_namespace: 660 bytes
[ 0.168010] Booting paravirtualized kernel on bare hardware
[ 0.168010] NET: Registered protocol family 16
[ 0.168010] ACPI: bus type pci registered
[ 0.168010] PCI: PCI BIOS revision 2.10 entry at 0xf0031, last bus=1
[ 0.168010] PCI: Using configuration type 1 for base access
[ 0.168010] Setting up standard PCI resources
[ 0.176731] ACPI: EC: Look up EC in DSDT
[ 0.189321] ACPI: Interpreter enabled
[ 0.189326] ACPI: (supports S0 S1 S3 S4 S5)
[ 0.189349] ACPI: Using IOAPIC for interrupt routing
[ 0.203565] ACPI: PCI Root Bridge [PCI0] (0000:00)
[ 0.204962] ACPI: PCI Interrupt Routing Table [\_SB_.PCI0._PRT]
[ 0.205337] ACPI: PCI Interrupt Routing Table [\_SB_.PCI0.P0P1._PRT]
[ 0.218789] ACPI: PCI Interrupt Link [LNKA] (IRQs 3 4 5 6 7 *10 11 12 14 15)
[ 0.218954] ACPI: PCI Interrupt Link [LNKB] (IRQs 3 4 *5 6 7 10 11 12 14 15)
[ 0.219111] ACPI: PCI Interrupt Link [LNKC] (IRQs 3 4 5 6 7 10 *11 12 14 15)
[ 0.219265] ACPI: PCI Interrupt Link [LNKD] (IRQs 3 4 5 6 7 10 11 12 14 15) *0, disabled.
[ 0.219420] ACPI: PCI Interrupt Link [LNKE] (IRQs 3 4 5 6 7 10 11 12 14 15) *0, disabled.
[ 0.219575] ACPI: PCI Interrupt Link [LNKF] (IRQs 3 4 5 6 7 10 11 12 14 15) *0, disabled.
[ 0.219730] ACPI: PCI Interrupt Link [LNKG] (IRQs 3 4 5 6 7 10 11 12 14 15) *0, disabled.
[ 0.219885] ACPI: PCI Interrupt Link [LNKH] (IRQs 3 4 5 6 7 10 11 12 14 15) *0, disabled.
[ 0.220143] ACPI Warning (tbutils-0217): Incorrect checksum in table [OEMB] - 18, should be 0B [20080321]
[ 0.220158] Linux Plug and Play Support v0.97 © Adam Belay
[ 0.220199] pnp: PnP ACPI init
[ 0.220209] ACPI: bus type pnp registered
[ 0.230940] pnp: PnP ACPI: found 13 devices
[ 0.230944] ACPI: ACPI bus type pnp unregistered
[ 0.230949] PnPBIOS: Disabled by ACPI PNP
[ 0.231292] PCI: Using ACPI for IRQ routing
[ 0.231538] ACPI: RTC can wake from S4
[ 0.231576] system 00:08: ioport range 0xa00-0xa0f has been reserved
[ 0.231581] system 00:08: ioport range 0x290-0x29f has been reserved
[ 0.231584] system 00:08: ioport range 0xa20-0xa2f has been reserved
[ 0.231587] system 00:08: ioport range 0xa30-0xa3f has been reserved
[ 0.231590] system 00:08: ioport range 0xa40-0xa4f has been reserved
[ 0.231599] system 00:09: ioport range 0x290-0x297 has been reserved
[ 0.231602] system 00:09: ioport range 0xc00-0xc05 has been reserved
[ 0.231608] system 00:09: ioport range 0x3e0-0x3e7 has been reserved
[ 0.231612] system 00:09: ioport range 0x4d0-0x4d1 has been reserved
[ 0.231615] system 00:09: ioport range 0x800-0x87f has been reserved
[ 0.231618] system 00:09: ioport range 0x400-0x41f has been reserved
[ 0.231628] system 00:0a: iomem range 0xfec00000-0xfec00fff could not be reserved
[ 0.231632] system 00:0a: iomem range 0xfee00000-0xfee00fff could not be reserved
[ 0.231641] system 00:0c: iomem range 0x0-0x9ffff could not be reserved
[ 0.231645] system 00:0c: iomem range 0xc0000-0xcffff could not be reserved
[ 0.231648] system 00:0c: iomem range 0xe0000-0xfffff could not be reserved
[ 0.231652] system 00:0c: iomem range 0x100000-0x1dffffff could not be reserved
[ 0.231656] system 00:0c: iomem range 0xfff80000-0xffffffff has been reserved
[ 0.262202] PCI: Bridge: 0000:00:01.0
[ 0.262205] IO window: disabled.
[ 0.262211] MEM window: 0xfca00000-0xfeafffff
[ 0.262216] PREFETCH window: 0x00000000eff00000-0x00000000f7efffff
[ 0.262243] PCI: Setting latency timer of device 0000:00:01.0 to 64
[ 0.262270] NET: Registered protocol family 2
[ 0.262405] IP route cache hash table entries: 4096 (*****: 2, 16384 bytes)
[ 0.262704] TCP established hash table entries: 16384 (*****: 5, 131072 bytes)
[ 0.262827] TCP bind hash table entries: 16384 (*****: 5, 131072 bytes)
[ 0.262943] TCP: Hash tables configured (established 16384 bind 16384)
[ 0.262947] TCP reno registered
[ 0.263059] NET: Registered protocol family 1
[ 0.263212] checking if image is initramfs... it is
[ 0.764022] Switched to high resolution mode on CPU 0
[ 0.832296] Freeing initrd memory: 6068k freed
[ 0.833258] audit: initializing netlink socket (disabled)
[ 0.833288] type=2000 audit(1278927470.832:1): initialized
[ 0.833449] Total HugeTLB memory allocated, 0
[ 0.833546] VFS: Disk quotas dquot_6.5.1
[ 0.833583] Dquot-cache hash table entries: 1024 (***** 0, 4096 bytes)
[ 0.833636] msgmni has been set to 943
[ 0.833775] Block layer SCSI generic (bsg) driver version 0.4 loaded (major 253)
[ 0.833780] io scheduler noop registered
[ 0.833783] io scheduler anticipatory registered
[ 0.833785] io scheduler deadline registered
[ 0.833797] io scheduler cfq registered (default)
[ 0.833818] PCI: VIA PCI bridge detected.Disabling DAC.
[ 0.833905] pci 0000:00:11.0: Bypassing VIA 8237 APIC De-Assert Message
[ 0.833916] pci 0000:01:00.0: Boot video device
[ 0.834360] isapnp: Scanning for PnP cards...
[ 1.186625] isapnp: No Plug & Play device found
[ 1.190097] Serial: 8250/16550 driver $Revision: 1.90 $ 4 ports, IRQ sharing enabled
[ 1.190238] serial8250: ttyS0 at I/O 0x3f8 (irq = 4) is a 16550A
[ 1.190801] 00:0b: ttyS0 at I/O 0x3f8 (irq = 4) is a 16550A
[ 1.193250] brd: module loaded
[ 1.193374] PNP: PS/2 Controller [PNP0303:PS2K] at 0x60,0x64 irq 1
[ 1.193377] PNP: PS/2 appears to have AUX port disabled, if this is incorrect please boot with i8042.nopnp
[ 1.193535] serio: i8042 KBD port at 0x60,0x64 irq 1
[ 1.193690] mice: PS/2 mouse device common for all mice
[ 1.193841] rtc_cmos 00:02: rtc core: registered rtc_cmos as rtc0
[ 1.193877] rtc0: alarms up to one year, y3k
[ 1.193923] cpuidle: using governor ladder
[ 1.193926] cpuidle: using governor menu
[ 1.193933] No iBFT detected.
[ 1.194488] TCP cubic registered
[ 1.194493] NET: Registered protocol family 17
[ 1.194502] Using IPI No-Shortcut mode
[ 1.194649] registered taskstats version 1
[ 1.194851] rtc_cmos 00:02: setting system clock to 2010-07-12 09:37:52 UTC (1278927472)
[ 1.195106] Freeing unused kernel memory: 244k freed
[ 1.212966] input: AT Translated Set 2 keyboard as /class/input/input0
[ 1.393065] ACPI Warning (tbutils-0217): Incorrect checksum in table [ à] - 00, should be 24 [20080321]
[ 1.393108] ACPI: à FFFF0000, 0000 (r0 0 0)
[ 1.393137] ACPI Error (psparse-0530): Method parse/execution failed [\_PR_.CPU1._PDC] (Node dd4342d4), AE_BAD_HEADER
[ 1.393324] ACPI: ACPI0007:00 is registered as cooling_device0
[ 1.393329] ACPI: Processor [CPU1] (supports 16 throttling states)
[ 2.123587] Uniform Multi-Platform E-IDE driver
[ 2.123596] ide: Assuming 33MHz system bus speed for PIO modes; override with idebus=xx
[ 2.140640] No dock devices found.
[ 2.176097] SCSI subsystem initialized
[ 2.178126] VP_IDE: IDE controller (0x1106:0x0571 rev 0x06) at PCI slot 0000:00:0f.1
[ 2.178165] ACPI: PCI Interrupt 0000:00:0f.1[A] -> GSI 20 (level, low) -> IRQ 20
[ 2.178181] VP_IDE: not 100% native mode: will probe irqs later
[ 2.178199] VP_IDE: VIA vt8237 (rev 00) IDE UDMA133 controller on pci0000:00:0f.1
[ 2.178212] ide0: BM-DMA at 0xfc00-0xfc07
[ 2.178221] ide1: BM-DMA at 0xfc08-0xfc0f
[ 2.178225] Probing IDE interface ide0...
[ 2.196470] usbcore: registered new interface driver usbfs
[ 2.196513] usbcore: registered new interface driver hub
[ 2.196554] usbcore: registered new device driver usb
[ 2.201318] USB Universal Host Controller Interface driver v3.0
[ 2.244632] libata version 3.00 loaded.
[ 2.265983] via-rhine.c:v1.10-LK1.4.3 2007-03-06 Written by Donald Becker
[ 2.362965] Floppy drive(s): fd0 is 1.44M
[ 2.379933] FDC 0 is a post-1991 82077
[ 2.592115] hda: WDC WD800BB-00JHC0, ATA DISK drive
[ 3.264070] hda: host max PIO5 wanted PIO255(auto-tune) selected PIO4
[ 3.264142] hda: UDMA/100 mode selected
[ 3.264214] Probing IDE interface ide1...
[ 4.128115] hdc: LITE-ON CD-RW SOHR-5239S, ATAPI CD/DVD-ROM drive
[ 4.800040] hdc: host max PIO5 wanted PIO255(auto-tune) selected PIO4
[ 4.800165] hdc: UDMA/33 mode selected
[ 4.800336] ide0 at 0x1f0-0x1f7,0x3f6 on irq 14
[ 4.811715] ide1 at 0x170-0x177,0x376 on irq 15
[ 4.816003] ACPI: PCI Interrupt 0000:00:10.0[A] -> GSI 21 (level, low) -> IRQ 21
[ 4.816003] uhci_hcd 0000:00:10.0: UHCI Host Controller
[ 4.816003] uhci_hcd 0000:00:10.0: new USB bus registered, assigned bus number 1
[ 4.816003] uhci_hcd 0000:00:10.0: irq 21, io base 0x0000e000
[ 4.816003] usb usb1: configuration #1 chosen from 1 choice
[ 4.816003] hub 1-0:1.0: USB hub found
[ 4.816003] hub 1-0:1.0: 2 ports detected
[ 4.876008] usb usb1: New USB device found, idVendor=1d6b, idProduct=0001
[ 4.876008] usb usb1: New USB device strings: Mfr=3, Product=2, SerialNumber=1
[ 4.876008] usb usb1: Product: UHCI Host Controller
[ 4.876008] usb usb1: Manufacturer: Linux 2.6.26-2-686 uhci_hcd
[ 4.876008] usb usb1: SerialNumber: 0000:00:10.0
[ 4.876008] ACPI: PCI Interrupt 0000:00:10.1[A] -> GSI 21 (level, low) -> IRQ 21
[ 4.876008] uhci_hcd 0000:00:10.1: UHCI Host Controller
[ 4.876008] uhci_hcd 0000:00:10.1: new USB bus registered, assigned bus number 2
[ 4.876008] uhci_hcd 0000:00:10.1: irq 21, io base 0x0000dc00
[ 4.876008] usb usb2: configuration #1 chosen from 1 choice
[ 4.876008] hub 2-0:1.0: USB hub found
[ 4.876008] hub 2-0:1.0: 2 ports detected
[ 5.028152] usb usb2: New USB device found, idVendor=1d6b, idProduct=0001
[ 5.028158] usb usb2: New USB device strings: Mfr=3, Product=2, SerialNumber=1
[ 5.028162] usb usb2: Product: UHCI Host Controller
[ 5.028164] usb usb2: Manufacturer: Linux 2.6.26-2-686 uhci_hcd
[ 5.028167] usb usb2: SerialNumber: 0000:00:10.1
[ 5.028241] ACPI: PCI Interrupt 0000:00:10.2[B] -> GSI 21 (level, low) -> IRQ 21
[ 5.028257] uhci_hcd 0000:00:10.2: UHCI Host Controller
[ 5.028297] uhci_hcd 0000:00:10.2: new USB bus registered, assigned bus number 3
[ 5.028326] uhci_hcd 0000:00:10.2: irq 21, io base 0x0000d480
[ 5.028434] usb usb3: configuration #1 chosen from 1 choice
[ 5.028473] hub 3-0:1.0: USB hub found
[ 5.028484] hub 3-0:1.0: 2 ports detected
[ 5.132111] usb usb3: New USB device found, idVendor=1d6b, idProduct=0001
[ 5.132117] usb usb3: New USB device strings: Mfr=3, Product=2, SerialNumber=1
[ 5.132120] usb usb3: Product: UHCI Host Controller
[ 5.132123] usb usb3: Manufacturer: Linux 2.6.26-2-686 uhci_hcd
[ 5.132125] usb usb3: SerialNumber: 0000:00:10.2
[ 5.132179] ACPI: PCI Interrupt 0000:00:10.3[B] -> GSI 21 (level, low) -> IRQ 21
[ 5.132191] uhci_hcd 0000:00:10.3: UHCI Host Controller
[ 5.132222] uhci_hcd 0000:00:10.3: new USB bus registered, assigned bus number 4
[ 5.132246] uhci_hcd 0000:00:10.3: irq 21, io base 0x0000d400
[ 5.132339] usb usb4: configuration #1 chosen from 1 choice
[ 5.132376] hub 4-0:1.0: USB hub found
[ 5.132385] hub 4-0:1.0: 2 ports detected
[ 5.236108] usb usb4: New USB device found, idVendor=1d6b, idProduct=0001
[ 5.236113] usb usb4: New USB device strings: Mfr=3, Product=2, SerialNumber=1
[ 5.236116] usb usb4: Product: UHCI Host Controller
[ 5.236119] usb usb4: Manufacturer: Linux 2.6.26-2-686 uhci_hcd
[ 5.236121] usb usb4: SerialNumber: 0000:00:10.3
[ 5.236187] ACPI: PCI Interrupt 0000:00:10.4[C] -> GSI 21 (level, low) -> IRQ 21
[ 5.236207] ehci_hcd 0000:00:10.4: EHCI Host Controller
[ 5.236236] ehci_hcd 0000:00:10.4: new USB bus registered, assigned bus number 5
[ 5.236282] ehci_hcd 0000:00:10.4: irq 21, io mem 0xfebff800
[ 5.248013] ehci_hcd 0000:00:10.4: USB 2.0 started, EHCI 1.00, driver 10 Dec 2004
[ 5.248102] usb usb5: configuration #1 chosen from 1 choice
[ 5.248140] hub 5-0:1.0: USB hub found
[ 5.248150] hub 5-0:1.0: 8 ports detected
[ 5.352108] usb usb5: New USB device found, idVendor=1d6b, idProduct=0002
[ 5.352112] usb usb5: New USB device strings: Mfr=3, Product=2, SerialNumber=1
[ 5.352116] usb usb5: Product: EHCI Host Controller
[ 5.352118] usb usb5: Manufacturer: Linux 2.6.26-2-686 ehci_hcd
[ 5.352121] usb usb5: SerialNumber: 0000:00:10.4
[ 5.356387] ACPI: PCI Interrupt 0000:00:12.0[A] -> GSI 23 (level, low) -> IRQ 23
[ 5.360692] eth0: VIA Rhine II at 0x1c800, 00:14:2a:8e:78:33, IRQ 23.
[ 5.361408] eth0: MII PHY found at address 1, status 0x786d advertising 05e1 Link 0021.
[ 5.361810] sata_via 0000:00:0f.0: version 2.3
[ 5.361832] ACPI: PCI Interrupt 0000:00:0f.0[B] -> GSI 20 (level, low) -> IRQ 20
[ 5.361888] sata_via 0000:00:0f.0: routed to hard irq line 11
[ 5.361964] scsi0 : sata_via
[ 5.362073] scsi1 : sata_via
[ 5.362113] ata1: SATA max UDMA/133 cmd 0xe880 ctl 0xe800 bmdma 0xe080 irq 20
[ 5.362116] ata2: SATA max UDMA/133 cmd 0xe480 ctl 0xe400 bmdma 0xe088 irq 20
[ 5.392488] hda: max request size: 128KiB
[ 5.392779] hda: 156301488 sectors (80026 MB) w/2048KiB Cache, CHS=65535/16/63
[ 5.393918] hda: cache flushes supported
[ 5.393983] hda: hda1 hda2 < hda5 > hda3
[ 5.534069] hdc: ATAPI 52X CD-ROM CD-R/RW drive, 1536kB Cache
[ 5.534080] Uniform CD-ROM driver Revision: 3.20
[ 5.564069] ata1: SATA link down 1.5 Gbps (SStatus 0 SControl 300)
[ 5.776014] ata2: SATA link down 1.5 Gbps (SStatus 0 SControl 300)
[ 6.372011] usb 4-1: new low speed USB device using uhci_hcd and address 2
[ 6.520002] usb 4-1: configuration #1 chosen from 1 choice
[ 6.520002] usb 4-1: New USB device found, idVendor=1bcf, idProduct=0007
[ 6.520002] usb 4-1: New USB device strings: Mfr=0, Product=2, SerialNumber=0
[ 6.520002] usb 4-1: Product: USB Optical Mouse
[ 6.569425] usbcore: registered new interface driver hiddev
[ 6.586153] input: USB Optical Mouse as /class/input/input1
[ 6.586153] input,hiddev96,hidraw0: USB HID v1.10 Mouse [USB Optical Mouse] on usb-0000:00:10.3-1
[ 6.586153] usbcore: registered new interface driver usbhid
[ 6.586153] usbhid: v2.6:USB HID core driver
[ 8.576853] PM: Starting manual resume from disk
[ 8.596619] EXT3-fs: INFO: recovery required on readonly filesystem.
[ 8.596625] EXT3-fs: write access will be enabled during recovery.
[ 9.905855] kjournald starting. Commit interval 5 seconds
[ 9.905877] EXT3-fs: recovery complete.
[ 9.907734] EXT3-fs: mounted filesystem with ordered data mode.
[ 11.358000] udevd version 125 started
[ 12.711298] Linux agpgart interface v0.103
[ 12.716798] agpgart: Detected VIA VT3314 chipset
[ 12.721768] agpgart: AGP aperture is 64M @ 0xf8000000
[ 12.866479] pci_hotplug: PCI Hot Plug PCI Core version: 0.5
[ 12.875815] shpchp: Standard Hot Plug PCI Controller Driver version: 0.4
[ 13.031021] input: Power Button (FF) as /class/input/input2
[ 13.056102] ACPI: Power Button (FF) [PWRF]
[ 13.056233] input: Sleep Button (CM) as /class/input/input3
[ 13.088107] ACPI: Sleep Button (CM) [SLPB]
[ 13.088199] input: Power Button (CM) as /class/input/input4
[ 13.120110] ACPI: Power Button (CM) [PWRB]
[ 14.091592] input: PC Speaker as /class/input/input5
[ 14.409636] parport_pc 00:07: reported by Plug and Play ACPI
[ 14.409714] parport0: PC-style at 0x378 (0x778), irq 7 [PCSPP,TRISTATE]
[ 14.501932] ACPI: PCI Interrupt 0000:00:11.5[C] -> GSI 22 (level, low) -> IRQ 22
[ 14.502091] PCI: Setting latency timer of device 0000:00:11.5 to 64
[ 16.246318] Unable to find swap-space signature
[ 113.743785] EXT3 FS on hda1, internal journal
[ 113.901995] loop: module loaded
[ 114.253935] Unable to find swap-space signature
[ 114.897214] eth0: link up, 10Mbps, half-duplex, lpa 0x0021
[ 115.328379] NET: Registered protocol family 10
[ 115.329024] lo: Disabled Privacy Extensions
[ 117.388734] lp0: using parport0 (interrupt-driven).
[ 117.412936] ppdev: user-space parallel port driver
[ 126.063499] eth0: no IPv6 routers present
[ 134.692010] warning: `ntpd' uses 32-bit capabilities (legacy support in use)
[ 140.033505] [drm] Initialized drm 1.1.0 20060810
[ 140.040396] ACPI: PCI Interrupt 0000:01:00.0[A] -> GSI 16 (level, low) -> IRQ 16
[ 140.040396] [drm] Initialized via 2.11.1 20070202 on minor 0
[ 140.084480] agpgart: Found an AGP 3.5 compliant device at 0000:00:00.0.
[ 140.084514] agpgart: Putting AGP V3 device at 0000:00:00.0 into 8x mode
[ 140.084601] agpgart: Putting AGP V3 device at 0000:01:00.0 into 8x mode
[ 230.029334] cdrom: hdc: mrw address space DMA selected
[ 230.542413] cdrom: hdc: mrw address space DMA selected
[ 232.028172] UDF-fs: No VRS found
[ 232.112118] ISO 9660 Extensions: Microsoft Joliet Level 3
[ 232.182010] ISO 9660 Extensions: RRIP_1991A
[ 646.946478] drbd: initialised. Version: 8.0.14 (api:86/proto:86)
[ 646.946484] drbd: GIT-hash: bb447522fc9a87d0069b7e14f0234911ebdab0f7 build by phil@fat-tyre, 2008-11-12 16:40:33
[ 646.946488] drbd: registered as block device major 147
[ 646.946490] drbd: minor_table @ 0xd4997200
[ 646.971910] drbd0: disk( Diskless -> Attaching )
[ 646.971917] drbd0: Starting worker thread (from cqueue [5693])
[ 646.993548] drbd0: Found 6 transactions (6 active extents) in activity log.
[ 646.993556] drbd0: max_segment_size ( = BIO size ) = 32768
[ 646.993563] drbd0: drbd_bm_resize called with capacity == 11132624
[ 646.993733] drbd0: resync bitmap: bits=1391578 words=43488
[ 646.993739] drbd0: size = 5436 MB (5566312 KB)
[ 647.001232] drbd0: recounting of set bits took additional 0 jiffies
[ 647.001238] drbd0: 12 KB (3 bits) marked out-of-sync by on disk bit-map.
[ 647.001247] drbd0: disk( Attaching -> UpToDate )
[ 647.113915] padlock: VIA PadLock Hash Engine not detected.
[ 647.143381] drbd0: conn( StandAlone -> Unconnected )
[ 647.149269] drbd0: Starting receiver thread (from drbd0_worker [5700])
[ 647.149315] drbd0: receiver (re)started
[ 647.149324] drbd0: conn( Unconnected -> WFConnection )
[ 706.816029] drbd0: Handshake successful: DRBD Network Protocol version 86
[ 706.817783] drbd0: Peer authenticated using 20 bytes of 'sha1' HMAC
[ 706.817793] drbd0: conn( WFConnection -> WFReportParams )
[ 706.817816] drbd0: Starting asender thread (from drbd0_receiver [5721])
[ 706.819203] drbd0: drbd_sync_handshake:
[ 706.819208] drbd0: self AEA3E675AC967616:20DCC1731627015E:13CBBB8A57019E90 :9FE83E076086E633
[ 706.819213] drbd0: peer 0802A231F0146ADA:20DCC1731627015F:13CBBB8A57019E90 :9FE83E076086E633
[ 706.819216] drbd0: uuid_compare()=100 by rule 9
[ 706.819219] drbd0: Split-Brain detected, dropping connection!
[ 706.819224] drbd0: self AEA3E675AC967616:20DCC1731627015E:13CBBB8A57019E90 :9FE83E076086E633
[ 706.819228] drbd0: peer 0802A231F0146ADA:20DCC1731627015F:13CBBB8A57019E90 :9FE83E076086E633
[ 706.819232] drbd0: helper command: /sbin/drbdadm split-brain minor-0
[ 706.821751] drbd0: helper command: /sbin/drbdadm split-brain minor-0 exit code 0 (0x0)
[ 706.821761] drbd0: conn( WFReportParams -> Disconnecting )
[ 706.821770] drbd0: error receiving ReportState, l: 4!
[ 706.821789] drbd0: asender terminated
[ 706.821794] drbd0: Terminating asender thread
[ 706.821865] drbd0: Connection closed
[ 706.821877] drbd0: conn( Disconnecting -> StandAlone )
[ 706.821889] drbd0: receiver terminated
[ 706.821891] drbd0: Terminating receiver thread
[ 710.525656] drbd0: role( Secondary -> Primary )
[ 710.927075] kjournald starting. Commit interval 5 seconds
[ 710.927075] EXT3-fs warning: maximal mount count reached, running e2fsck is recommended
[ 710.939230] EXT3 FS on drbd0, internal journal
[ 710.939238] EXT3-fs: mounted filesystem with ordered data mode.
[ 711.546663] drbd0: role( Primary -> Secondary )
[ 737.526703] drbd0: conn( StandAlone -> Unconnected )
[ 737.526730] drbd0: Starting receiver thread (from drbd0_worker [5700])
[ 737.526772] drbd0: receiver (re)started
[ 737.526778] drbd0: conn( Unconnected -> WFConnection )
[ 784.144030] drbd0: Handshake successful: DRBD Network Protocol version 86
[ 784.145911] drbd0: Peer authenticated using 20 bytes of 'sha1' HMAC
[ 784.145922] drbd0: conn( WFConnection -> WFReportParams )
[ 784.145946] drbd0: Starting asender thread (from drbd0_receiver [6565])
[ 784.147983] drbd0: drbd_sync_handshake:
[ 784.147989] drbd0: self AEA3E675AC967616:20DCC1731627015E:13CBBB8A57019E90 :9FE83E076086E633
[ 784.148013] drbd0: peer 0802A231F0146ADB:20DCC1731627015F:13CBBB8A57019E90 :9FE83E076086E633
[ 784.148017] drbd0: uuid_compare()=100 by rule 9
[ 784.148020] drbd0: Split-Brain detected, dropping connection!
[ 784.148026] drbd0: self AEA3E675AC967616:20DCC1731627015E:13CBBB8A57019E90 :9FE83E076086E633
[ 784.148030] drbd0: peer 0802A231F0146ADB:20DCC1731627015F:13CBBB8A57019E90 :9FE83E076086E633
[ 784.148034] drbd0: helper command: /sbin/drbdadm split-brain minor-0
[ 784.149681] drbd0: meta connection shut down by peer.
[ 784.149691] drbd0: conn( WFReportParams -> NetworkFailure )
[ 784.149701] drbd0: asender terminated
[ 784.149704] drbd0: Terminating asender thread
[ 784.150906] drbd0: helper command: /sbin/drbdadm split-brain minor-0 exit code 0 (0x0)
[ 784.150913] drbd0: conn( NetworkFailure -> Disconnecting )
[ 784.150921] drbd0: error receiving ReportState, l: 4!
[ 784.150971] drbd0: Connection closed
[ 784.150984] drbd0: conn( Disconnecting -> StandAlone )
[ 784.150994] drbd0: receiver terminated
[ 784.150997] drbd0: Terminating receiver thread
[ 831.335905] drbd0: conn( StandAlone -> Unconnected )
[ 831.336260] drbd0: Starting receiver thread (from drbd0_worker [5700])
[ 831.336307] drbd0: receiver (re)started
[ 831.336313] drbd0: conn( Unconnected -> WFConnection )
[ 831.436656] drbd0: Handshake successful: DRBD Network Protocol version 86
[ 831.436656] drbd0: Peer authenticated using 20 bytes of 'sha1' HMAC
[ 831.436656] drbd0: conn( WFConnection -> WFReportParams )
[ 831.436656] drbd0: Starting asender thread (from drbd0_receiver [6841])
[ 831.436656] drbd0: drbd_sync_handshake:
[ 831.436656] drbd0: self AEA3E675AC967616:20DCC1731627015E:13CBBB8A57019E90 :9FE83E076086E633
[ 831.436656] drbd0: peer 0802A231F0146ADB:20DCC1731627015F:13CBBB8A57019E90 :9FE83E076086E633
[ 831.436656] drbd0: uuid_compare()=100 by rule 9
[ 831.436656] drbd0: Split-Brain detected, dropping connection!
[ 831.436656] drbd0: self AEA3E675AC967616:20DCC1731627015E:13CBBB8A57019E90 :9FE83E076086E633
[ 831.436656] drbd0: peer 0802A231F0146ADB:20DCC1731627015F:13CBBB8A57019E90 :9FE83E076086E633
[ 831.436656] drbd0: helper command: /sbin/drbdadm split-brain minor-0
[ 831.441800] drbd0: helper command: /sbin/drbdadm split-brain minor-0 exit code 0 (0x0)
[ 831.441810] drbd0: conn( WFReportParams -> Disconnecting )
[ 831.441820] drbd0: error receiving ReportState, l: 4!
[ 831.441839] drbd0: asender terminated
[ 831.441844] drbd0: Terminating asender thread
[ 831.441909] drbd0: Connection closed
[ 831.441922] drbd0: conn( Disconnecting -> StandAlone )
[ 831.441933] drbd0: receiver terminated
[ 831.441936] drbd0: Terminating receiver thread
[ 843.788151] drbd0: conn( StandAlone -> Unconnected )
[ 843.788245] drbd0: Starting receiver thread (from drbd0_worker [5700])
[ 843.788290] drbd0: receiver (re)started
[ 843.788296] drbd0: conn( Unconnected -> WFConnection )
[ 845.436025] drbd0: Handshake successful: DRBD Network Protocol version 86
[ 845.437219] drbd0: Peer authenticated using 20 bytes of 'sha1' HMAC
[ 845.437230] drbd0: conn( WFConnection -> WFReportParams )
[ 845.437253] drbd0: Starting asender thread (from drbd0_receiver [6848])
[ 845.438817] drbd0: drbd_sync_handshake:
[ 845.438823] drbd0: self AEA3E675AC967616:20DCC1731627015E:13CBBB8A57019E90 :9FE83E076086E633
[ 845.438828] drbd0: peer 0802A231F0146ADB:20DCC1731627015F:13CBBB8A57019E90 :9FE83E076086E633
[ 845.438831] drbd0: uuid_compare()=100 by rule 9
[ 845.438834] drbd0: Split-Brain detected, dropping connection!
[ 845.438839] drbd0: self AEA3E675AC967616:20DCC1731627015E:13CBBB8A57019E90 :9FE83E076086E633
[ 845.438843] drbd0: peer 0802A231F0146ADB:20DCC1731627015F:13CBBB8A57019E90 :9FE83E076086E633
[ 845.438847] drbd0: helper command: /sbin/drbdadm split-brain minor-0
[ 845.441237] drbd0: meta connection shut down by peer.
[ 845.441249] drbd0: conn( WFReportParams -> NetworkFailure )
[ 845.441260] drbd0: asender terminated
[ 845.441263] drbd0: Terminating asender thread
[ 845.441504] drbd0: helper command: /sbin/drbdadm split-brain minor-0 exit code 0 (0x0)
[ 845.441509] drbd0: conn( NetworkFailure -> Disconnecting )
[ 845.441516] drbd0: error receiving ReportState, l: 4!
[ 845.441565] drbd0: Connection closed
[ 845.441578] drbd0: conn( Disconnecting -> StandAlone )
[ 845.441588] drbd0: receiver terminated
[ 845.441591] drbd0: Terminating receiver thread
[ 982.593345] drbd0: conn( StandAlone -> Unconnected )
[ 982.593371] drbd0: Starting receiver thread (from drbd0_worker [5700])
[ 982.593422] drbd0: receiver (re)started
[ 982.593428] drbd0: conn( Unconnected -> WFConnection )
[ 1050.187746] drbd0: conn( WFConnection -> Disconnecting )
[ 1050.187794] drbd0: Discarding network configuration.
[ 1050.187815] drbd0: Connection closed
[ 1050.187833] drbd0: conn( Disconnecting -> StandAlone )
[ 1050.187847] drbd0: receiver terminated
[ 1050.187851] drbd0: Terminating receiver thread
[ 1050.187921] drbd0: disk( UpToDate -> Diskless )
[ 1050.187971] drbd0: drbd_bm_resize called with capacity == 0
[ 1050.188023] drbd0: worker terminated
[ 1050.188027] drbd0: Terminating worker thread
[ 1050.254199] drbd: module cleanup done.
[ 1050.273486] drbd: initialised. Version: 8.0.14 (api:86/proto:86)
[ 1050.273493] drbd: GIT-hash: bb447522fc9a87d0069b7e14f0234911ebdab0f7 build by phil@fat-tyre, 2008-11-12 16:40:33
[ 1050.273496] drbd: registered as block device major 147
[ 1050.273500] drbd: minor_table @ 0xd4997200
[ 1050.277179] drbd0: disk( Diskless -> Attaching )
[ 1050.277187] drbd0: Starting worker thread (from cqueue [5693])
[ 1050.304981] drbd0: Found 6 transactions (6 active extents) in activity log.
[ 1050.304981] drbd0: max_segment_size ( = BIO size ) = 32768
[ 1050.304981] drbd0: drbd_bm_resize called with capacity == 11132624
[ 1050.304981] drbd0: resync bitmap: bits=1391578 words=43488
[ 1050.304981] drbd0: size = 5436 MB (5566312 KB)
[ 1050.313598] drbd0: recounting of set bits took additional 0 jiffies
[ 1050.313603] drbd0: 12 KB (3 bits) marked out-of-sync by on disk bit-map.
[ 1050.313612] drbd0: disk( Attaching -> UpToDate )
[ 1050.324649] drbd0: conn( StandAlone -> Unconnected )
[ 1050.324673] drbd0: Starting receiver thread (from drbd0_worker [6874])
[ 1050.324716] drbd0: receiver (re)started
[ 1050.324721] drbd0: conn( Unconnected -> WFConnection )
[ 1100.372028] drbd0: Handshake successful: DRBD Network Protocol version 86
[ 1100.373429] drbd0: Peer authenticated using 20 bytes of 'sha1' HMAC
[ 1100.373440] drbd0: conn( WFConnection -> WFReportParams )
[ 1100.373462] drbd0: Starting asender thread (from drbd0_receiver [6883])
[ 1100.375403] drbd0: drbd_sync_handshake:
[ 1100.375408] drbd0: self AEA3E675AC967616:20DCC1731627015E:13CBBB8A57019E90 :9FE83E076086E633
[ 1100.375413] drbd0: peer 0802A231F0146ADB:20DCC1731627015F:13CBBB8A57019E90 :9FE83E076086E633
[ 1100.375416] drbd0: uuid_compare()=100 by rule 9
[ 1100.375419] drbd0: Split-Brain detected, dropping connection!
[ 1100.375425] drbd0: self AEA3E675AC967616:20DCC1731627015E:13CBBB8A57019E90 :9FE83E076086E633
[ 1100.375429] drbd0: peer 0802A231F0146ADB:20DCC1731627015F:13CBBB8A57019E90 :9FE83E076086E633
[ 1100.375433] drbd0: helper command: /sbin/drbdadm split-brain minor-0
[ 1100.377791] drbd0: helper command: /sbin/drbdadm split-brain minor-0 exit code 0 (0x0)
[ 1100.377801] drbd0: conn( WFReportParams -> Disconnecting )
[ 1100.377810] drbd0: error receiving ReportState, l: 4!
[ 1100.377828] drbd0: asender terminated
[ 1100.377834] drbd0: Terminating asender thread
[ 1100.377906] drbd0: Connection closed
[ 1100.377918] drbd0: conn( Disconnecting -> StandAlone )
[ 1100.377928] drbd0: receiver terminated
[ 1100.377931] drbd0: Terminating receiver thread
[ 1193.848607] drbd0: conn( StandAlone -> Unconnected )
[ 1193.848633] drbd0: Starting receiver thread (from drbd0_worker [6874])
[ 1193.848681] drbd0: receiver (re)started
[ 1193.848686] drbd0: conn( Unconnected -> WFConnection )
[ 1244.348031] drbd0: Handshake successful: DRBD Network Protocol version 86
[ 1244.349954] drbd0: Peer authenticated using 20 bytes of 'sha1' HMAC
[ 1244.349964] drbd0: conn( WFConnection -> WFReportParams )
[ 1244.349987] drbd0: Starting asender thread (from drbd0_receiver [6891])
[ 1244.351734] drbd0: drbd_sync_handshake:
[ 1244.351739] drbd0: self AEA3E675AC967616:20DCC1731627015E:13CBBB8A57019E90 :9FE83E076086E633
[ 1244.351744] drbd0: peer 0802A231F0146ADB:20DCC1731627015F:13CBBB8A57019E90 :9FE83E076086E633
[ 1244.351747] drbd0: uuid_compare()=100 by rule 9
[ 1244.351750] drbd0: Split-Brain detected, dropping connection!
[ 1244.351756] drbd0: self AEA3E675AC967616:20DCC1731627015E:13CBBB8A57019E90 :9FE83E076086E633
[ 1244.351760] drbd0: peer 0802A231F0146ADB:20DCC1731627015F:13CBBB8A57019E90 :9FE83E076086E633
[ 1244.351764] drbd0: helper command: /sbin/drbdadm split-brain minor-0
[ 1244.353602] drbd0: meta connection shut down by peer.
[ 1244.353613] drbd0: conn( WFReportParams -> NetworkFailure )
[ 1244.353622] drbd0: asender terminated
[ 1244.353626] drbd0: Terminating asender thread
[ 1244.354380] drbd0: helper command: /sbin/drbdadm split-brain minor-0 exit code 0 (0x0)
[ 1244.354386] drbd0: conn( NetworkFailure -> Disconnecting )
[ 1244.354394] drbd0: error receiving ReportState, l: 4!
[ 1244.354448] drbd0: Connection closed
[ 1244.354461] drbd0: conn( Disconnecting -> StandAlone )
[ 1244.354471] drbd0: receiver terminated
[ 1244.354474] drbd0: Terminating receiver thread
[ 1337.535919] drbd0: conn( StandAlone -> Unconnected )
[ 1337.536129] drbd0: Starting receiver thread (from drbd0_worker [6874])
[ 1337.536166] drbd0: receiver (re)started
[ 1337.536173] drbd0: conn( Unconnected -> WFConnection )
[ 1523.776024] drbd0: Handshake successful: DRBD Network Protocol version 86
[ 1523.778045] drbd0: Peer authenticated using 20 bytes of 'sha1' HMAC
[ 1523.778055] drbd0: conn( WFConnection -> WFReportParams )
[ 1523.778078] drbd0: Starting asender thread (from drbd0_receiver [6903])
[ 1523.779006] drbd0: drbd_sync_handshake:
[ 1523.779012] drbd0: self AEA3E675AC967616:20DCC1731627015E:13CBBB8A57019E90 :9FE83E076086E633
[ 1523.779016] drbd0: peer 0802A231F0146ADB:20DCC1731627015F:13CBBB8A57019E90 :9FE83E076086E633
[ 1523.779020] drbd0: uuid_compare()=100 by rule 9
[ 1523.779023] drbd0: Split-Brain detected, dropping connection!
[ 1523.779028] drbd0: self AEA3E675AC967616:20DCC1731627015E:13CBBB8A57019E90 :9FE83E076086E633
[ 1523.779034] drbd0: peer 0802A231F0146ADB:20DCC1731627015F:13CBBB8A57019E90 :9FE83E076086E633
[ 1523.779039] drbd0: helper command: /sbin/drbdadm split-brain minor-0
[ 1523.781657] drbd0: helper command: /sbin/drbdadm split-brain minor-0 exit code 0 (0x0)
[ 1523.781667] drbd0: conn( WFReportParams -> Disconnecting )
[ 1523.781677] drbd0: error receiving ReportState, l: 4!
[ 1523.781697] drbd0: asender terminated
[ 1523.781702] drbd0: Terminating asender thread
[ 1523.781775] drbd0: Connection closed
[ 1523.781789] drbd0: conn( Disconnecting -> StandAlone )
[ 1523.781800] drbd0: receiver terminated
[ 1523.781804] drbd0: Terminating receiver thread
[ 1741.595863] drbd0: role( Secondary -> Primary )
[ 1741.962343] kjournald starting. Commit interval 5 seconds
[ 1741.962343] EXT3-fs warning: maximal mount count reached, running e2fsck is recommended
[ 1741.974248] EXT3 FS on drbd0, internal journal
[ 1741.974248] EXT3-fs: mounted filesystem with ordered data mode.
[ 1742.507542] drbd0: role( Primary -> Secondary )
[ 1844.237543] drbd0: conn( StandAlone -> Unconnected )
[ 1844.237634] drbd0: Starting receiver thread (from drbd0_worker [6874])
[ 1844.237679] drbd0: receiver (re)started
[ 1844.237684] drbd0: conn( Unconnected -> WFConnection )
[ 1844.337471] drbd0: Handshake successful: DRBD Network Protocol version 86
[ 1844.337471] drbd0: Peer authenticated using 20 bytes of 'sha1' HMAC
[ 1844.337471] drbd0: conn( WFConnection -> WFReportParams )
[ 1844.337471] drbd0: Starting asender thread (from drbd0_receiver [8009])
[ 1844.340528] drbd0: drbd_sync_handshake:
[ 1844.340535] drbd0: self AEA3E675AC967616:20DCC1731627015E:13CBBB8A57019E90 :9FE83E076086E633
[ 1844.340539] drbd0: peer 0802A231F0146ADB:20DCC1731627015F:13CBBB8A57019E90 :9FE83E076086E633
[ 1844.340542] drbd0: uuid_compare()=100 by rule 9
[ 1844.340545] drbd0: Split-Brain detected, dropping connection!
[ 1844.340551] drbd0: self AEA3E675AC967616:20DCC1731627015E:13CBBB8A57019E90 :9FE83E076086E633
[ 1844.340555] drbd0: peer 0802A231F0146ADB:20DCC1731627015F:13CBBB8A57019E90 :9FE83E076086E633
[ 1844.340559] drbd0: helper command: /sbin/drbdadm split-brain minor-0
[ 1844.342792] drbd0: helper command: /sbin/drbdadm split-brain minor-0 exit code 0 (0x0)
[ 1844.342803] drbd0: conn( WFReportParams -> Disconnecting )
[ 1844.342812] drbd0: error receiving ReportState, l: 4!
[ 1844.342825] drbd0: asender terminated
[ 1844.342830] drbd0: Terminating asender thread
[ 1844.342891] drbd0: Connection closed
[ 1844.342905] drbd0: conn( Disconnecting -> StandAlone )
[ 1844.342916] drbd0: receiver terminated
[ 1844.342919] drbd0: Terminating receiver thread
[ 1862.285573] drbd0: conn( StandAlone -> Unconnected )
[ 1862.285667] drbd0: Starting receiver thread (from drbd0_worker [6874])
[ 1862.285711] drbd0: receiver (re)started
[ 1862.285716] drbd0: conn( Unconnected -> WFConnection )
[ 1963.232029] drbd0: Handshake successful: DRBD Network Protocol version 86
[ 1963.233412] drbd0: Peer authenticated using 20 bytes of 'sha1' HMAC
[ 1963.233423] drbd0: conn( WFConnection -> WFReportParams )
[ 1963.233446] drbd0: Starting asender thread (from drbd0_receiver [8017])
[ 1963.235460] drbd0: drbd_sync_handshake:
[ 1963.235466] drbd0: self AEA3E675AC967616:20DCC1731627015E:13CBBB8A57019E90 :9FE83E076086E633
[ 1963.235470] drbd0: peer 0802A231F0146ADB:20DCC1731627015F:13CBBB8A57019E90 :9FE83E076086E633
[ 1963.235473] drbd0: uuid_compare()=100 by rule 9
[ 1963.235476] drbd0: Split-Brain detected, dropping connection!
[ 1963.235482] drbd0: self AEA3E675AC967616:20DCC1731627015E:13CBBB8A57019E90 :9FE83E076086E633
[ 1963.235486] drbd0: peer 0802A231F0146ADB:20DCC1731627015F:13CBBB8A57019E90 :9FE83E076086E633
[ 1963.235490] drbd0: helper command: /sbin/drbdadm split-brain minor-0
[ 1963.237803] drbd0: meta connection shut down by peer.
[ 1963.237815] drbd0: conn( WFReportParams -> NetworkFailure )
[ 1963.237825] drbd0: asender terminated
[ 1963.237828] drbd0: Terminating asender thread
[ 1963.238272] drbd0: helper command: /sbin/drbdadm split-brain minor-0 exit code 0 (0x0)
[ 1963.238278] drbd0: conn( NetworkFailure -> Disconnecting )
[ 1963.238285] drbd0: error receiving ReportState, l: 4!
[ 1963.238336] drbd0: Connection closed
[ 1963.238349] drbd0: conn( Disconnecting -> StandAlone )
[ 1963.238359] drbd0: receiver terminated
[ 1963.238362] drbd0: Terminating receiver thread
[ 1992.550487] drbd0: conn( StandAlone -> Unconnected )
[ 1992.550578] drbd0: Starting receiver thread (from drbd0_worker [6874])
[ 1992.550622] drbd0: receiver (re)started
[ 1992.550627] drbd0: conn( Unconnected -> WFConnection )
[ 2000.452026] drbd0: Handshake successful: DRBD Network Protocol version 86
[ 2000.453346] drbd0: Peer authenticated using 20 bytes of 'sha1' HMAC
[ 2000.453356] drbd0: conn( WFConnection -> WFReportParams )
[ 2000.453380] drbd0: Starting asender thread (from drbd0_receiver [8038])
[ 2000.455176] drbd0: drbd_sync_handshake:
[ 2000.455182] drbd0: self AEA3E675AC967616:20DCC1731627015E:13CBBB8A57019E90 :9FE83E076086E633
[ 2000.455186] drbd0: peer 0802A231F0146ADB:20DCC1731627015F:13CBBB8A57019E90 :9FE83E076086E633
[ 2000.455189] drbd0: uuid_compare()=100 by rule 9
[ 2000.455192] drbd0: Split-Brain detected, dropping connection!
[ 2000.455198] drbd0: self AEA3E675AC967616:20DCC1731627015E:13CBBB8A57019E90 :9FE83E076086E633
[ 2000.455202] drbd0: peer 0802A231F0146ADB:20DCC1731627015F:13CBBB8A57019E90 :9FE83E076086E633
[ 2000.455206] drbd0: helper command: /sbin/drbdadm split-brain minor-0
[ 2000.457677] drbd0: helper command: /sbin/drbdadm split-brain minor-0 exit code 0 (0x0)
[ 2000.457687] drbd0: conn( WFReportParams -> Disconnecting )
[ 2000.457697] drbd0: error receiving ReportState, l: 4!
[ 2000.457716] drbd0: asender terminated
[ 2000.457721] drbd0: Terminating asender thread
[ 2000.457791] drbd0: Connection closed
[ 2000.457804] drbd0: conn( Disconnecting -> StandAlone )
[ 2000.457815] drbd0: receiver terminated
[ 2000.457818] drbd0: Terminating receiver thread
[ 2085.513723] drbd0: conn( StandAlone -> Unconnected )
[ 2085.513750] drbd0: Starting receiver thread (from drbd0_worker [6874])
[ 2085.513797] drbd0: receiver (re)started
[ 2085.513803] drbd0: conn( Unconnected -> WFConnection )
[ 2085.614707] drbd0: Handshake successful: DRBD Network Protocol version 86
[ 2085.616129] drbd0: Peer authenticated using 20 bytes of 'sha1' HMAC
[ 2085.616140] drbd0: conn( WFConnection -> WFReportParams )
[ 2085.616163] drbd0: Starting asender thread (from drbd0_receiver [8045])
[ 2085.617310] drbd0: drbd_sync_handshake:
[ 2085.617316] drbd0: self AEA3E675AC967616:20DCC1731627015E:13CBBB8A57019E90 :9FE83E076086E633
[ 2085.617320] drbd0: peer 0802A231F0146ADB:20DCC1731627015F:13CBBB8A57019E90 :9FE83E076086E633
[ 2085.617323] drbd0: uuid_compare()=100 by rule 9
[ 2085.617327] drbd0: Split-Brain detected, dropping connection!
[ 2085.617332] drbd0: self AEA3E675AC967616:20DCC1731627015E:13CBBB8A57019E90 :9FE83E076086E633
[ 2085.617336] drbd0: peer 0802A231F0146ADB:20DCC1731627015F:13CBBB8A57019E90 :9FE83E076086E633
[ 2085.617340] drbd0: helper command: /sbin/drbdadm split-brain minor-0
[ 2085.618852] drbd0: helper command: /sbin/drbdadm split-brain minor-0 exit code 0 (0x0)
[ 2085.618860] drbd0: conn( WFReportParams -> Disconnecting )
[ 2085.618869] drbd0: error receiving ReportState, l: 4!
[ 2085.618886] drbd0: asender terminated
[ 2085.618891] drbd0: Terminating asender thread
[ 2085.618950] drbd0: Connection closed
[ 2085.618963] drbd0: conn( Disconnecting -> StandAlone )
[ 2085.618973] drbd0: receiver terminated
[ 2085.618976] drbd0: Terminating receiver thread
[ 2141.700316] drbd0: disk( UpToDate -> Diskless )
[ 2141.700376] drbd0: drbd_bm_resize called with capacity == 0
[ 2141.700408] drbd0: worker terminated
[ 2141.700412] drbd0: Terminating worker thread
[ 2141.750328] drbd: module cleanup done.
[ 2141.769616] drbd: initialised. Version: 8.0.14 (api:86/proto:86)
[ 2141.769622] drbd: GIT-hash: bb447522fc9a87d0069b7e14f0234911ebdab0f7 build by phil@fat-tyre, 2008-11-12 16:40:33
[ 2141.769626] drbd: registered as block device major 147
[ 2141.769629] drbd: minor_table @ 0xd4997200
[ 2141.775289] drbd0: disk( Diskless -> Attaching )
[ 2141.775297] drbd0: Starting worker thread (from cqueue [5693])
[ 2141.794454] drbd0: Found 6 transactions (6 active extents) in activity log.
[ 2141.794454] drbd0: max_segment_size ( = BIO size ) = 32768
[ 2141.794454] drbd0: drbd_bm_resize called with capacity == 11132624
[ 2141.794454] drbd0: resync bitmap: bits=1391578 words=43488
[ 2141.794454] drbd0: size = 5436 MB (5566312 KB)
[ 2141.802701] drbd0: recounting of set bits took additional 0 jiffies
[ 2141.802701] drbd0: 12 KB (3 bits) marked out-of-sync by on disk bit-map.
[ 2141.802701] drbd0: disk( Attaching -> UpToDate )
[ 2141.814814] drbd0: conn( StandAlone -> Unconnected )
[ 2141.814840] drbd0: Starting receiver thread (from drbd0_worker [8067])
[ 2141.814885] drbd0: receiver (re)started
[ 2141.814890] drbd0: conn( Unconnected -> WFConnection )
[ 2141.915267] drbd0: Handshake successful: DRBD Network Protocol version 86
[ 2141.916986] drbd0: Peer authenticated using 20 bytes of 'sha1' HMAC
[ 2141.916997] drbd0: conn( WFConnection -> WFReportParams )
[ 2141.917021] drbd0: Starting asender thread (from drbd0_receiver [8075])
[ 2141.917919] drbd0: drbd_sync_handshake:
[ 2141.917925] drbd0: self AEA3E675AC967616:20DCC1731627015E:13CBBB8A57019E90 :9FE83E076086E633
[ 2141.917929] drbd0: peer 0802A231F0146ADB:20DCC1731627015F:13CBBB8A57019E90 :9FE83E076086E633
[ 2141.917932] drbd0: uuid_compare()=100 by rule 9
[ 2141.917935] drbd0: Split-Brain detected, dropping connection!
[ 2141.917941] drbd0: self AEA3E675AC967616:20DCC1731627015E:13CBBB8A57019E90 :9FE83E076086E633
[ 2141.917945] drbd0: peer 0802A231F0146ADB:20DCC1731627015F:13CBBB8A57019E90 :9FE83E076086E633
[ 2141.917949] drbd0: helper command: /sbin/drbdadm split-brain minor-0
[ 2141.919464] drbd0: helper command: /sbin/drbdadm split-brain minor-0 exit code 0 (0x0)
[ 2141.919473] drbd0: conn( WFReportParams -> Disconnecting )
[ 2141.919482] drbd0: error receiving ReportState, l: 4!
[ 2141.919499] drbd0: asender terminated
[ 2141.919505] drbd0: Terminating asender thread
[ 2141.919569] drbd0: Connection closed
[ 2141.919581] drbd0: conn( Disconnecting -> StandAlone )
[ 2141.919591] drbd0: receiver terminated
[ 2141.919594] drbd0: Terminating receiver thread
[ 2230.158051] drbd0: conn( StandAlone -> Unconnected )
[ 2230.158142] drbd0: Starting receiver thread (from drbd0_worker [8067])
[ 2230.158184] drbd0: receiver (re)started
[ 2230.158190] drbd0: conn( Unconnected -> WFConnection )
[ 2230.244003] drbd0: Handshake successful: DRBD Network Protocol version 86
[ 2230.244003] drbd0: Peer authenticated using 20 bytes of 'sha1' HMAC
[ 2230.244003] drbd0: conn( WFConnection -> WFReportParams )
[ 2230.244003] drbd0: Starting asender thread (from drbd0_receiver [8085])
[ 2230.244003] drbd0: drbd_sync_handshake:
[ 2230.244003] drbd0: self AEA3E675AC967616:20DCC1731627015E:13CBBB8A57019E90 :9FE83E076086E633
[ 2230.244003] drbd0: peer 0802A231F0146ADB:20DCC1731627015F:13CBBB8A57019E90 :9FE83E076086E633
[ 2230.244003] drbd0: uuid_compare()=100 by rule 9
[ 2230.244003] drbd0: Split-Brain detected, dropping connection!
[ 2230.244003] drbd0: self AEA3E675AC967616:20DCC1731627015E:13CBBB8A57019E90 :9FE83E076086E633
[ 2230.244003] drbd0: peer 0802A231F0146ADB:20DCC1731627015F:13CBBB8A57019E90 :9FE83E076086E633
[ 2230.244003] drbd0: helper command: /sbin/drbdadm split-brain minor-0
[ 2230.261817] drbd0: helper command: /sbin/drbdadm split-brain minor-0 exit code 0 (0x0)
[ 2230.261828] drbd0: conn( WFReportParams -> Disconnecting )
[ 2230.261837] drbd0: error receiving ReportState, l: 4!
[ 2230.261857] drbd0: asender terminated
[ 2230.261862] drbd0: Terminating asender thread
[ 2230.261929] drbd0: Connection closed
[ 2230.261944] drbd0: conn( Disconnecting -> StandAlone )
[ 2230.261954] drbd0: receiver terminated
[ 2230.261957] drbd0: Terminating receiver thread
[ 2604.548545] drbd0: conn( StandAlone -> Unconnected )
[ 2604.548572] drbd0: Starting receiver thread (from drbd0_worker [8067])
[ 2604.548618] drbd0: receiver (re)started
[ 2604.548623] drbd0: conn( Unconnected -> WFConnection )
[ 2604.646862] drbd0: Handshake successful: DRBD Network Protocol version 86
[ 2604.646862] drbd0: Peer authenticated using 20 bytes of 'sha1' HMAC
[ 2604.646862] drbd0: conn( WFConnection -> WFReportParams )
[ 2604.646862] drbd0: Starting asender thread (from drbd0_receiver [8103])
[ 2604.652502] drbd0: drbd_sync_handshake:
[ 2604.652508] drbd0: self AEA3E675AC967616:20DCC1731627015E:13CBBB8A57019E90 :9FE83E076086E633
[ 2604.652513] drbd0: peer 0802A231F0146ADB:20DCC1731627015F:13CBBB8A57019E90 :9FE83E076086E633
[ 2604.652516] drbd0: uuid_compare()=100 by rule 9
[ 2604.652519] drbd0: Split-Brain detected, dropping connection!
[ 2604.652524] drbd0: self AEA3E675AC967616:20DCC1731627015E:13CBBB8A57019E90 :9FE83E076086E633
[ 2604.652528] drbd0: peer 0802A231F0146ADB:20DCC1731627015F:13CBBB8A57019E90 :9FE83E076086E633
[ 2604.652532] drbd0: helper command: /sbin/drbdadm split-brain minor-0
[ 2604.654057] drbd0: helper command: /sbin/drbdadm split-brain minor-0 exit code 0 (0x0)
[ 2604.654068] drbd0: conn( WFReportParams -> Disconnecting )
[ 2604.654078] drbd0: error receiving ReportState, l: 4!
[ 2604.654096] drbd0: asender terminated
[ 2604.654101] drbd0: Terminating asender thread
[ 2604.654164] drbd0: Connection closed
[ 2604.654176] drbd0: conn( Disconnecting -> StandAlone )
[ 2604.654186] drbd0: receiver terminated
[ 2604.654189] drbd0: Terminating receiver thread
[ 2663.300433] drbd0: conn( StandAlone -> Unconnected )
[ 2663.300526] drbd0: Starting receiver thread (from drbd0_worker [8067])
[ 2663.300571] drbd0: receiver (re)started
[ 2663.300577] drbd0: conn( Unconnected -> WFConnection )
[ 2663.616026] drbd0: Handshake successful: DRBD Network Protocol version 86
[ 2663.617221] drbd0: Peer authenticated using 20 bytes of 'sha1' HMAC
[ 2663.617231] drbd0: conn( WFConnection -> WFReportParams )
[ 2663.617253] drbd0: Starting asender thread (from drbd0_receiver [8111])
[ 2663.618732] drbd0: drbd_sync_handshake:
[ 2663.618738] drbd0: self AEA3E675AC967616:20DCC1731627015E:13CBBB8A57019E90 :9FE83E076086E633
[ 2663.618742] drbd0: peer 0802A231F0146ADB:20DCC1731627015F:13CBBB8A57019E90 :9FE83E076086E633
[ 2663.618746] drbd0: uuid_compare()=100 by rule 9
[ 2663.618749] drbd0: Split-Brain detected, manually solved. Sync from peer node
[ 2663.619099] drbd0: peer( Unknown -> Primary ) conn( WFReportParams -> WFBitMapT ) pdsk( DUnknown -> UpToDate )
[ 2663.998850] drbd0: conn( WFBitMapT -> WFSyncUUID )
[ 2664.019795] drbd0: conn( WFSyncUUID -> SyncTarget ) disk( UpToDate -> Inconsistent )
[ 2664.019795] drbd0: Began resync as SyncTarget (will sync 24 KB [6 bits set]).
[ 2664.170316] drbd0: Resync done (total 1 sec; paused 0 sec; 24 K/sec)
[ 2664.170316] drbd0: conn( SyncTarget -> Connected ) disk( Inconsistent -> UpToDate )

jluce
12/07/2010, 15h27
slt

c'est plus simple et plus explicite dans /var/log/messages et pedant que t'y est renvois le résultat de la commande cat /proc/drbd parceque de ce que je lis dans le dmesg ils n'ont plus l'air de ce voir...

a+

tynho
12/07/2010, 17h04
Slt
voici le contenu /proc/drbd

version: 8.0.14 (api:86/proto:86)
GIT-hash: bb447522fc9a87d0069b7e14f0234911ebdab0f7 build by phil@fat-tyre, 2008-11-12 16:40:33
0: cs:Connected st:Primary/Secondary ds:UpToDate/UpToDate C r---
ns:0 nr:24 dw:24 dr:0 al:0 bm:3 lo:0 pe:0 ua:0 ap:0
resync: used:0/61 hits:5 misses:3 starving:0 dirty:0 changed:3
act_log: used:0/257 hits:0 misses:0 starving:0 dirty:0 changed:0


Slt
j'avais oublier le heartbeat du noeud secondaire ne lance pas apache et ne monte non plus mon bloc drbd. Cependant il faut aussi signaler que j'ai mis une BDD derrier qui n'est pas gèré par heartbeat.

jluce
13/07/2010, 08h30
slt

bon a priori ton raid réseaux est bon et se porte bien
<div class='quotetop'>Citation </div>
0: cs:Connected st:Primary/Secondary ds:UpToDate/UpToDate C r---[/b]

donc le problème se situe plus au niveau de heartbeat

fais un


grep heartbeat /var/log/syslog

je crois que c'est le fichier qui remplace messages sous debian

fais le sur les deux neoud lorsque tu lui demandes de basculer et envoies le résultat

a+

tynho
13/07/2010, 12h35
Slt voici le resultat sur le principal avant le basculement

destaingk:~# grep heartbeat /var/log/syslog
Jul 13 08:52:01 destaingk heartbeat: [3172]: info: Heartbeat shutdown in progress. (3172)
Jul 13 08:52:01 destaingk heartbeat: [10584]: info: Giving up all HA resources.
Jul 13 08:52:02 destaingk heartbeat: [10584]: info: All HA resources relinquished.
Jul 13 08:52:04 destaingk heartbeat: [3172]: info: killing HBFIFO process 3181 with signal 15
Jul 13 08:52:04 destaingk heartbeat: [3172]: info: killing HBWRITE process 3182 with signal 15
Jul 13 08:52:04 destaingk heartbeat: [3172]: info: killing HBREAD process 3183 with signal 15
Jul 13 08:52:04 destaingk heartbeat: [3172]: info: killing HBWRITE process 3184 with signal 15
Jul 13 08:52:04 destaingk heartbeat: [3172]: info: killing HBREAD process 3185 with signal 15
Jul 13 08:52:04 destaingk heartbeat: [3172]: info: Core process 3181 exited. 5 remaining
Jul 13 08:52:04 destaingk heartbeat: [3172]: info: Core process 3182 exited. 4 remaining
Jul 13 08:52:04 destaingk heartbeat: [3172]: info: Core process 3183 exited. 3 remaining
Jul 13 08:52:04 destaingk heartbeat: [3172]: info: Core process 3184 exited. 2 remaining
Jul 13 08:52:04 destaingk heartbeat: [3172]: info: Core process 3185 exited. 1 remaining
Jul 13 08:52:04 destaingk heartbeat: [3172]: info: destaingk Heartbeat shutdown complete.
Jul 13 09:46:44 destaingk heartbeat: [3249]: WARN: Core dumps could be lost if multiple dumps occur.
Jul 13 09:46:44 destaingk heartbeat: [3249]: WARN: Consider setting non-default value in /proc/sys/kernel/core_pattern (or equivalent) for maximum supportability
Jul 13 09:46:44 destaingk heartbeat: [3249]: WARN: Consider setting /proc/sys/kernel/core_uses_pid (or equivalent) to 1 for maximum supportability
Jul 13 09:46:44 destaingk heartbeat: [3249]: info: Version 2 support: false
Jul 13 09:46:44 destaingk heartbeat: [3249]: WARN: Logging daemon is disabled --enabling logging daemon is recommended
Jul 13 09:46:44 destaingk heartbeat: [3249]: info: **************************
Jul 13 09:46:44 destaingk heartbeat: [3249]: info: Configuration validated. Starting heartbeat 2.1.3
Jul 13 09:46:44 destaingk heartbeat: [3250]: info: heartbeat: version 2.1.3
Jul 13 09:46:45 destaingk heartbeat: [3250]: info: Heartbeat generation: 1278090103
Jul 13 09:46:45 destaingk heartbeat: [3250]: info: glib: ucast: write socket priority set to IPTOS_LOWDELAY on eth0
Jul 13 09:46:45 destaingk heartbeat: [3250]: info: glib: ucast: bound send socket to device: eth0
Jul 13 09:46:45 destaingk heartbeat: [3250]: info: glib: ucast: bound receive socket to device: eth0
Jul 13 09:46:45 destaingk heartbeat: [3250]: info: glib: ucast: started on port 694 interface eth0 to 192.168.1.140
Jul 13 09:46:45 destaingk heartbeat: [3250]: info: glib: UDP Broadcast heartbeat started on port 694 (694) interface eth0
Jul 13 09:46:45 destaingk heartbeat: [3250]: info: glib: UDP Broadcast heartbeat closed on port 694 interface eth0 - Status: 1
Jul 13 09:46:45 destaingk heartbeat: [3250]: info: G_main_add_TriggerHandler: Added signal manual handler
Jul 13 09:46:45 destaingk heartbeat: [3250]: info: G_main_add_TriggerHandler: Added signal manual handler
Jul 13 09:46:45 destaingk heartbeat: [3250]: info: G_main_add_SignalHandler: Added signal handler for signal 17
Jul 13 09:46:46 destaingk heartbeat: [3250]: info: Local status now set to: 'up'
Jul 13 09:46:47 destaingk heartbeat: [3250]: info: Link destaingk:eth0 up.
Jul 13 09:47:06 destaingk heartbeat: [3250]: WARN: node debian: is dead
Jul 13 09:47:06 destaingk heartbeat: [3250]: info: Comm_now_up(): updating status to active
Jul 13 09:47:06 destaingk heartbeat: [3250]: info: Local status now set to: 'active'
Jul 13 09:47:06 destaingk heartbeat: [3250]: WARN: No STONITH device configured.
Jul 13 09:47:06 destaingk heartbeat: [3250]: WARN: Shared disks are not protected.
Jul 13 09:47:06 destaingk heartbeat: [3250]: info: Resources being acquired from debian.
Jul 13 09:47:06 destaingk heartbeat: [3832]: debug: notify_world: setting SIGCHLD Handler to SIG_DFL
Jul 13 09:47:06 destaingk mach_down[3862]: info: /usr/share/heartbeat/mach_down: nice_failback: foreign resources acquired
Jul 13 09:47:06 destaingk heartbeat: [3250]: info: Initial resource acquisition complete (T_RESOURCES(us))
Jul 13 09:47:06 destaingk heartbeat: [3250]: info: mach_down takeover complete.
Jul 13 09:47:06 destaingk heartbeat: [3250]: debug: StartNextRemoteRscReq(): child count 1
Jul 13 09:47:07 destaingk heartbeat: [3833]: info: Local Resource acquisition completed.
Jul 13 09:47:07 destaingk heartbeat: [3250]: debug: StartNextRemoteRscReq(): child count 1
Jul 13 09:47:07 destaingk heartbeat: [3969]: debug: notify_world: setting SIGCHLD Handler to SIG_DFL
Jul 13 09:47:17 destaingk heartbeat: [3250]: info: Local Resource acquisition completed. (none)
Jul 13 09:47:17 destaingk heartbeat: [3250]: info: local resource transition completed.
Jul 13 09:47:33 destaingk heartbeat: [4730]: ERROR: Cannot open keyfile [/etc/ha.d/authkeys]. Stop.
Jul 13 09:47:33 destaingk heartbeat: Cannot append to /var/log/ha-debug: Permission denied
Jul 13 09:47:33 destaingk heartbeat: Cannot append to /var/log/ha-log: Permission denied
Jul 13 09:47:33 destaingk heartbeat: [4730]: ERROR: Authentication configuration error.
Jul 13 09:47:33 destaingk heartbeat: Cannot append to /var/log/ha-debug: Permission denied
Jul 13 09:47:33 destaingk heartbeat: Cannot append to /var/log/ha-log: Permission denied
Jul 13 09:47:33 destaingk heartbeat: [4730]: ERROR: Configuration error, heartbeat not started.
Jul 13 09:47:33 destaingk heartbeat: Cannot append to /var/log/ha-debug: Permission denied
Jul 13 09:47:33 destaingk heartbeat: Cannot append to /var/log/ha-log: Permission denied
Jul 13 09:47:45 destaingk heartbeat: [3250]: info: destaingk wants to go standby [foreign]
Jul 13 09:47:56 destaingk heartbeat: [3250]: WARN: No reply to standby request. Standby request cancelled.
Jul 13 09:49:49 destaingk heartbeat: [3250]: info: Heartbeat shutdown in progress. (3250)
Jul 13 09:49:49 destaingk heartbeat: [5149]: info: Giving up all HA resources.
Jul 13 09:49:50 destaingk heartbeat: [5149]: info: All HA resources relinquished.
Jul 13 09:49:52 destaingk heartbeat: [3250]: info: killing HBFIFO process 3259 with signal 15
Jul 13 09:49:52 destaingk heartbeat: [3250]: info: killing HBWRITE process 3260 with signal 15
Jul 13 09:49:52 destaingk heartbeat: [3250]: info: killing HBREAD process 3261 with signal 15
Jul 13 09:49:52 destaingk heartbeat: [3250]: info: killing HBWRITE process 3262 with signal 15
Jul 13 09:49:52 destaingk heartbeat: [3250]: info: killing HBREAD process 3263 with signal 15
Jul 13 09:49:52 destaingk heartbeat: [3250]: info: Core process 3259 exited. 5 remaining
Jul 13 09:49:52 destaingk heartbeat: [3250]: info: Core process 3260 exited. 4 remaining
Jul 13 09:49:52 destaingk heartbeat: [3250]: info: Core process 3261 exited. 3 remaining
Jul 13 09:49:52 destaingk heartbeat: [3250]: info: Core process 3262 exited. 2 remaining
Jul 13 09:49:52 destaingk heartbeat: [3250]: info: Core process 3263 exited. 1 remaining
Jul 13 09:49:52 destaingk heartbeat: [3250]: info: destaingk Heartbeat shutdown complete.
Jul 13 09:50:12 destaingk heartbeat: [5563]: WARN: Core dumps could be lost if multiple dumps occur.
Jul 13 09:50:12 destaingk heartbeat: [5563]: WARN: Consider setting non-default value in /proc/sys/kernel/core_pattern (or equivalent) for maximum supportability
Jul 13 09:50:12 destaingk heartbeat: [5563]: WARN: Consider setting /proc/sys/kernel/core_uses_pid (or equivalent) to 1 for maximum supportability
Jul 13 09:50:12 destaingk heartbeat: [5563]: info: Version 2 support: false
Jul 13 09:50:12 destaingk heartbeat: [5563]: WARN: Logging daemon is disabled --enabling logging daemon is recommended
Jul 13 09:50:12 destaingk heartbeat: [5563]: info: **************************
Jul 13 09:50:12 destaingk heartbeat: [5563]: info: Configuration validated. Starting heartbeat 2.1.3
Jul 13 09:50:12 destaingk heartbeat: [5564]: info: heartbeat: version 2.1.3
Jul 13 09:50:12 destaingk heartbeat: [5564]: info: Heartbeat generation: 1278090104
Jul 13 09:50:12 destaingk heartbeat: [5564]: info: glib: ucast: write socket priority set to IPTOS_LOWDELAY on eth0
Jul 13 09:50:12 destaingk heartbeat: [5564]: info: glib: ucast: bound send socket to device: eth0
Jul 13 09:50:12 destaingk heartbeat: [5564]: info: glib: ucast: bound receive socket to device: eth0
Jul 13 09:50:12 destaingk heartbeat: [5564]: info: glib: ucast: started on port 694 interface eth0 to 192.168.1.140
Jul 13 09:50:12 destaingk heartbeat: [5564]: info: glib: UDP Broadcast heartbeat started on port 694 (694) interface eth0
Jul 13 09:50:12 destaingk heartbeat: [5564]: info: glib: UDP Broadcast heartbeat closed on port 694 interface eth0 - Status: 1
Jul 13 09:50:12 destaingk heartbeat: [5564]: info: G_main_add_TriggerHandler: Added signal manual handler
Jul 13 09:50:12 destaingk heartbeat: [5564]: info: G_main_add_TriggerHandler: Added signal manual handler
Jul 13 09:50:12 destaingk heartbeat: [5564]: info: G_main_add_SignalHandler: Added signal handler for signal 17
Jul 13 09:50:12 destaingk heartbeat: [5564]: info: Local status now set to: 'up'
Jul 13 09:50:13 destaingk heartbeat: [5564]: info: Link destaingk:eth0 up.
Jul 13 09:50:32 destaingk heartbeat: [5564]: WARN: node debian: is dead
Jul 13 09:50:32 destaingk heartbeat: [5564]: info: Comm_now_up(): updating status to active
Jul 13 09:50:32 destaingk heartbeat: [5564]: info: Local status now set to: 'active'
Jul 13 09:50:32 destaingk heartbeat: [5564]: WARN: No STONITH device configured.
Jul 13 09:50:32 destaingk heartbeat: [5564]: WARN: Shared disks are not protected.
Jul 13 09:50:32 destaingk heartbeat: [5564]: info: Resources being acquired from debian.
Jul 13 09:50:32 destaingk heartbeat: [5582]: debug: notify_world: setting SIGCHLD Handler to SIG_DFL
Jul 13 09:50:32 destaingk mach_down[5613]: info: /usr/share/heartbeat/mach_down: nice_failback: foreign resources acquired
Jul 13 09:50:32 destaingk heartbeat: [5564]: info: Initial resource acquisition complete (T_RESOURCES(us))
Jul 13 09:50:32 destaingk heartbeat: [5564]: info: mach_down takeover complete.
Jul 13 09:50:32 destaingk heartbeat: [5564]: debug: StartNextRemoteRscReq(): child count 1
Jul 13 09:50:33 destaingk heartbeat: [5583]: info: Local Resource acquisition completed.
Jul 13 09:50:33 destaingk heartbeat: [5564]: debug: StartNextRemoteRscReq(): child count 1
Jul 13 09:50:33 destaingk heartbeat: [5719]: debug: notify_world: setting SIGCHLD Handler to SIG_DFL
Jul 13 09:50:43 destaingk heartbeat: [5564]: info: Local Resource acquisition completed. (none)
Jul 13 09:50:43 destaingk heartbeat: [5564]: info: local resource transition completed.
Jul 13 09:52:46 destaingk heartbeat: [5564]: info: Link debian:eth0 up.
Jul 13 09:52:46 destaingk heartbeat: [5564]: info: Status update for node debian: status init
Jul 13 09:52:46 destaingk heartbeat: [6368]: debug: notify_world: setting SIGCHLD Handler to SIG_DFL
Jul 13 09:52:46 destaingk heartbeat: [5564]: info: Status update for node debian: status up
Jul 13 09:52:46 destaingk heartbeat: [5564]: debug: StartNextRemoteRscReq(): child count 1
Jul 13 09:52:46 destaingk heartbeat: [6385]: debug: notify_world: setting SIGCHLD Handler to SIG_DFL
Jul 13 09:52:47 destaingk heartbeat: [5564]: debug: get_delnodelist: delnodelist=
Jul 13 09:52:47 destaingk heartbeat: [5564]: info: Status update for node debian: status active
Jul 13 09:52:47 destaingk heartbeat: [6401]: debug: notify_world: setting SIGCHLD Handler to SIG_DFL
Jul 13 09:52:48 destaingk heartbeat: [5564]: info: remote resource transition completed.
Jul 13 09:53:33 destaingk heartbeat: [5564]: info: Received shutdown notice from 'debian'.
Jul 13 09:53:33 destaingk heartbeat: [5564]: info: Resources being acquired from debian.
Jul 13 09:53:33 destaingk heartbeat: [6458]: debug: notify_world: setting SIGCHLD Handler to SIG_DFL
Jul 13 09:53:33 destaingk mach_down[6488]: info: /usr/share/heartbeat/mach_down: nice_failback: foreign resources acquired
Jul 13 09:53:33 destaingk heartbeat: [5564]: info: mach_down takeover complete.
Jul 13 09:53:33 destaingk heartbeat: [5564]: debug: StartNextRemoteRscReq(): child count 1
Jul 13 09:53:33 destaingk heartbeat: [6459]: info: Local Resource acquisition completed.
Jul 13 09:53:44 destaingk heartbeat: [5564]: WARN: node debian: is dead
Jul 13 09:53:44 destaingk heartbeat: [5564]: info: Dead node debian gave up resources.
Jul 13 09:53:44 destaingk heartbeat: [5564]: info: Link debian:eth0 dead.
Jul 13 09:53:57 destaingk heartbeat: [5564]: info: Heartbeat restart on node debian
Jul 13 09:53:57 destaingk heartbeat: [5564]: info: Link debian:eth0 up.
Jul 13 09:53:57 destaingk heartbeat: [5564]: info: Status update for node debian: status init
Jul 13 09:53:57 destaingk heartbeat: [5564]: info: Status update for node debian: status up
Jul 13 09:53:57 destaingk heartbeat: [6610]: debug: notify_world: setting SIGCHLD Handler to SIG_DFL
Jul 13 09:53:57 destaingk heartbeat: [5564]: debug: StartNextRemoteRscReq(): child count 1
Jul 13 09:53:57 destaingk heartbeat: [5564]: debug: get_delnodelist: delnodelist=
Jul 13 09:53:57 destaingk heartbeat: [5564]: info: Status update for node debian: status active
Jul 13 09:53:57 destaingk heartbeat: [5564]: debug: StartNextRemoteRscReq(): child count 1
Jul 13 09:53:57 destaingk heartbeat: [6626]: debug: notify_world: setting SIGCHLD Handler to SIG_DFL
Jul 13 09:53:57 destaingk heartbeat: [6642]: debug: notify_world: setting SIGCHLD Handler to SIG_DFL
Jul 13 09:53:58 destaingk heartbeat: [5564]: info: remote resource transition completed.
Jul 13 10:57:09 destaingk heartbeat: [5564]: info: Heartbeat shutdown in progress. (5564)
Jul 13 10:57:09 destaingk heartbeat: [8941]: info: Giving up all HA resources.
Jul 13 10:57:10 destaingk heartbeat: [8941]: info: All HA resources relinquished.
Jul 13 10:57:12 destaingk heartbeat: [5564]: info: killing HBFIFO process 5567 with signal 15
Jul 13 10:57:12 destaingk heartbeat: [5564]: info: killing HBWRITE process 5568 with signal 15
Jul 13 10:57:12 destaingk heartbeat: [5564]: info: killing HBREAD process 5569 with signal 15
Jul 13 10:57:12 destaingk heartbeat: [5564]: info: killing HBWRITE process 5570 with signal 15
Jul 13 10:57:12 destaingk heartbeat: [5564]: info: killing HBREAD process 5571 with signal 15
Jul 13 10:57:12 destaingk heartbeat: [5564]: info: Core process 5567 exited. 5 remaining
Jul 13 10:57:12 destaingk heartbeat: [5564]: info: Core process 5568 exited. 4 remaining
Jul 13 10:57:12 destaingk heartbeat: [5564]: info: Core process 5569 exited. 3 remaining
Jul 13 10:57:12 destaingk heartbeat: [5564]: info: Core process 5570 exited. 2 remaining
Jul 13 10:57:12 destaingk heartbeat: [5564]: info: Core process 5571 exited. 1 remaining
Jul 13 10:57:12 destaingk heartbeat: [5564]: info: destaingk Heartbeat shutdown complete.
Jul 13 10:57:43 destaingk heartbeat: [9398]: WARN: Core dumps could be lost if multiple dumps occur.
Jul 13 10:57:43 destaingk heartbeat: [9398]: WARN: Consider setting non-default value in /proc/sys/kernel/core_pattern (or equivalent) for maximum supportability
Jul 13 10:57:43 destaingk heartbeat: [9398]: WARN: Consider setting /proc/sys/kernel/core_uses_pid (or equivalent) to 1 for maximum supportability
Jul 13 10:57:43 destaingk heartbeat: [9398]: info: Version 2 support: false
Jul 13 10:57:43 destaingk heartbeat: [9398]: WARN: Logging daemon is disabled --enabling logging daemon is recommended
Jul 13 10:57:43 destaingk heartbeat: [9398]: info: **************************
Jul 13 10:57:43 destaingk heartbeat: [9398]: info: Configuration validated. Starting heartbeat 2.1.3
Jul 13 10:57:43 destaingk heartbeat: [9399]: info: heartbeat: version 2.1.3
Jul 13 10:57:43 destaingk heartbeat: [9399]: info: Heartbeat generation: 1278090105
Jul 13 10:57:43 destaingk heartbeat: [9399]: info: glib: ucast: write socket priority set to IPTOS_LOWDELAY on eth0
Jul 13 10:57:43 destaingk heartbeat: [9399]: info: glib: ucast: bound send socket to device: eth0
Jul 13 10:57:43 destaingk heartbeat: [9399]: info: glib: ucast: bound receive socket to device: eth0
Jul 13 10:57:43 destaingk heartbeat: [9399]: info: glib: ucast: started on port 694 interface eth0 to 192.168.1.140
Jul 13 10:57:43 destaingk heartbeat: [9399]: info: glib: UDP Broadcast heartbeat started on port 694 (694) interface eth0
Jul 13 10:57:43 destaingk heartbeat: [9399]: info: glib: UDP Broadcast heartbeat closed on port 694 interface eth0 - Status: 1
Jul 13 10:57:43 destaingk heartbeat: [9399]: info: G_main_add_TriggerHandler: Added signal manual handler
Jul 13 10:57:43 destaingk heartbeat: [9399]: info: G_main_add_TriggerHandler: Added signal manual handler
Jul 13 10:57:43 destaingk heartbeat: [9399]: info: G_main_add_SignalHandler: Added signal handler for signal 17
Jul 13 10:57:43 destaingk heartbeat: [9399]: info: Local status now set to: 'up'
Jul 13 10:57:44 destaingk heartbeat: [9399]: info: Link debian:eth0 up.
Jul 13 10:57:44 destaingk heartbeat: [9399]: info: Status update for node debian: status active
Jul 13 10:57:44 destaingk heartbeat: [9408]: debug: notify_world: setting SIGCHLD Handler to SIG_DFL
Jul 13 10:57:44 destaingk heartbeat: [9399]: info: Link destaingk:eth0 up.
Jul 13 10:57:45 destaingk heartbeat: [9399]: info: Comm_now_up(): updating status to active
Jul 13 10:57:45 destaingk heartbeat: [9399]: info: Local status now set to: 'active'
Jul 13 10:57:45 destaingk heartbeat: [9399]: info: remote resource transition completed.
Jul 13 10:57:45 destaingk heartbeat: [9399]: info: remote resource transition completed.
Jul 13 10:57:45 destaingk heartbeat: [9399]: info: Local Resource acquisition completed. (none)
Jul 13 10:57:45 destaingk heartbeat: [9399]: info: Initial resource acquisition complete (T_RESOURCES(them))
Jul 13 10:57:45 destaingk heartbeat: [9399]: info: debian wants to go standby [foreign]
Jul 13 10:57:45 destaingk heartbeat: [9399]: info: remote resource transition completed.
Jul 13 10:57:46 destaingk heartbeat: [9399]: info: remote resource transition completed.
Jul 13 10:57:46 destaingk heartbeat: [9399]: info: standby: acquire [foreign] resources from debian
Jul 13 10:57:46 destaingk heartbeat: [9430]: info: acquire local HA resources (standby).
Jul 13 10:57:47 destaingk heartbeat: [9430]: info: local HA resource acquisition completed (standby).
Jul 13 10:57:47 destaingk heartbeat: [9399]: info: Standby resource acquisition done [foreign].


Et le contenu du noeud secondaire avant basculement


debian:~# grep heartbeat /var/log/syslog
Jul 13 10:57:11 debian heartbeat: [3434]: info: Received shutdown notice from 'destaingk'.
Jul 13 10:57:11 debian heartbeat: [3434]: info: Resources being acquired from destaingk.
Jul 13 10:57:11 debian heartbeat: [3434]: debug: StartNextRemoteRscReq(): child count 1
Jul 13 10:57:11 debian heartbeat: [3698]: info: acquire all HA resources (standby).
Jul 13 10:57:11 debian heartbeat: [3699]: info: No local resources [/usr/share/heartbeat/ResourceManager listkeys debian] to acquire.
Jul 13 10:57:11 debian heartbeat: [3434]: debug: StartNextRemoteRscReq(): child count 1
Jul 13 10:57:13 debian heartbeat: [3698]: info: all HA resource acquisition completed (standby).
Jul 13 10:57:13 debian heartbeat: [3434]: info: Standby resource acquisition done [all].
Jul 13 10:57:13 debian heartbeat: [4490]: debug: notify_world: setting SIGCHLD Handler to SIG_DFL
Jul 13 10:57:15 debian mach_down[4506]: info: /usr/share/heartbeat/mach_down: nice_failback: foreign resources acquired
Jul 13 10:57:15 debian heartbeat: [3434]: info: mach_down takeover complete.
Jul 13 10:57:23 debian heartbeat: [3434]: WARN: node destaingk: is dead
Jul 13 10:57:23 debian heartbeat: [3434]: info: Dead node destaingk gave up resources.
Jul 13 10:57:23 debian heartbeat: [3434]: info: Link destaingk:eth0 dead.
Jul 13 10:57:43 debian heartbeat: [3434]: info: debian wants to go standby [foreign]
Jul 13 10:57:45 debian heartbeat: [3434]: info: Heartbeat restart on node destaingk
Jul 13 10:57:45 debian heartbeat: [3434]: info: Link destaingk:eth0 up.
Jul 13 10:57:45 debian heartbeat: [3434]: info: Status update for node destaingk: status init
Jul 13 10:57:45 debian heartbeat: [3434]: info: Status update for node destaingk: status up
Jul 13 10:57:45 debian heartbeat: [3434]: debug: StartNextRemoteRscReq(): child count 1
Jul 13 10:57:45 debian heartbeat: [3434]: debug: get_delnodelist: delnodelist=
Jul 13 10:57:45 debian heartbeat: [5337]: debug: notify_world: setting SIGCHLD Handler to SIG_DFL
Jul 13 10:57:45 debian heartbeat: [5367]: debug: notify_world: setting SIGCHLD Handler to SIG_DFL
Jul 13 10:57:45 debian heartbeat: [3434]: info: Status update for node destaingk: status active
Jul 13 10:57:45 debian heartbeat: [3434]: debug: StartNextRemoteRscReq(): child count 1
Jul 13 10:57:45 debian heartbeat: [3434]: WARN: Standby in progress- new request from debian ignored [9 seconds left]
Jul 13 10:57:45 debian heartbeat: [5392]: debug: notify_world: setting SIGCHLD Handler to SIG_DFL
Jul 13 10:57:45 debian heartbeat: [3434]: info: remote resource transition completed.
Jul 13 10:57:45 debian heartbeat: [3434]: info: standby: destaingk can take our foreign resources
Jul 13 10:57:45 debian heartbeat: [3434]: debug: StartNextRemoteRscReq(): child count 1
Jul 13 10:57:45 debian heartbeat: [5408]: info: give up foreign HA resources (standby).
Jul 13 10:57:45 debian heartbeat: [3434]: info: remote resource transition completed.
Jul 13 10:57:46 debian heartbeat: [5408]: info: foreign HA resource release completed (standby).
Jul 13 10:57:46 debian heartbeat: [3434]: info: Local standby process completed [foreign].
Jul 13 10:57:47 debian heartbeat: [3434]: WARN: 1 lost packet(s) for [destaingk] [12:14]
Jul 13 10:57:47 debian heartbeat: [3434]: info: No pkts missing from destaingk!
Jul 13 10:57:47 debian heartbeat: [3434]: info: Other node completed standby takeover of foreign resources.


Maintenant apres basculement sur le noeud secondaire

Pour le noeud principal on a:


destaingk:~# grep heartbeat /var/log/syslog
Jul 13 08:52:01 destaingk heartbeat: [3172]: info: Heartbeat shutdown in progress. (3172)
Jul 13 08:52:01 destaingk heartbeat: [10584]: info: Giving up all HA resources.
Jul 13 08:52:02 destaingk heartbeat: [10584]: info: All HA resources relinquished.
Jul 13 08:52:04 destaingk heartbeat: [3172]: info: killing HBFIFO process 3181 with signal 15
Jul 13 08:52:04 destaingk heartbeat: [3172]: info: killing HBWRITE process 3182 with signal 15
Jul 13 08:52:04 destaingk heartbeat: [3172]: info: killing HBREAD process 3183 with signal 15
Jul 13 08:52:04 destaingk heartbeat: [3172]: info: killing HBWRITE process 3184 with signal 15
Jul 13 08:52:04 destaingk heartbeat: [3172]: info: killing HBREAD process 3185 with signal 15
Jul 13 08:52:04 destaingk heartbeat: [3172]: info: Core process 3181 exited. 5 remaining
Jul 13 08:52:04 destaingk heartbeat: [3172]: info: Core process 3182 exited. 4 remaining
Jul 13 08:52:04 destaingk heartbeat: [3172]: info: Core process 3183 exited. 3 remaining
Jul 13 08:52:04 destaingk heartbeat: [3172]: info: Core process 3184 exited. 2 remaining
Jul 13 08:52:04 destaingk heartbeat: [3172]: info: Core process 3185 exited. 1 remaining
Jul 13 08:52:04 destaingk heartbeat: [3172]: info: destaingk Heartbeat shutdown complete.
Jul 13 09:46:44 destaingk heartbeat: [3249]: WARN: Core dumps could be lost if multiple dumps occur.
Jul 13 09:46:44 destaingk heartbeat: [3249]: WARN: Consider setting non-default value in /proc/sys/kernel/core_pattern (or equivalent) for maximum supportability
Jul 13 09:46:44 destaingk heartbeat: [3249]: WARN: Consider setting /proc/sys/kernel/core_uses_pid (or equivalent) to 1 for maximum supportability
Jul 13 09:46:44 destaingk heartbeat: [3249]: info: Version 2 support: false
Jul 13 09:46:44 destaingk heartbeat: [3249]: WARN: Logging daemon is disabled --enabling logging daemon is recommended
Jul 13 09:46:44 destaingk heartbeat: [3249]: info: **************************
Jul 13 09:46:44 destaingk heartbeat: [3249]: info: Configuration validated. Starting heartbeat 2.1.3
Jul 13 09:46:44 destaingk heartbeat: [3250]: info: heartbeat: version 2.1.3
Jul 13 09:46:45 destaingk heartbeat: [3250]: info: Heartbeat generation: 1278090103
Jul 13 09:46:45 destaingk heartbeat: [3250]: info: glib: ucast: write socket priority set to IPTOS_LOWDELAY on eth0
Jul 13 09:46:45 destaingk heartbeat: [3250]: info: glib: ucast: bound send socket to device: eth0
Jul 13 09:46:45 destaingk heartbeat: [3250]: info: glib: ucast: bound receive socket to device: eth0
Jul 13 09:46:45 destaingk heartbeat: [3250]: info: glib: ucast: started on port 694 interface eth0 to 192.168.1.140
Jul 13 09:46:45 destaingk heartbeat: [3250]: info: glib: UDP Broadcast heartbeat started on port 694 (694) interface eth0
Jul 13 09:46:45 destaingk heartbeat: [3250]: info: glib: UDP Broadcast heartbeat closed on port 694 interface eth0 - Status: 1
Jul 13 09:46:45 destaingk heartbeat: [3250]: info: G_main_add_TriggerHandler: Added signal manual handler
Jul 13 09:46:45 destaingk heartbeat: [3250]: info: G_main_add_TriggerHandler: Added signal manual handler
Jul 13 09:46:45 destaingk heartbeat: [3250]: info: G_main_add_SignalHandler: Added signal handler for signal 17
Jul 13 09:46:46 destaingk heartbeat: [3250]: info: Local status now set to: 'up'
Jul 13 09:46:47 destaingk heartbeat: [3250]: info: Link destaingk:eth0 up.
Jul 13 09:47:06 destaingk heartbeat: [3250]: WARN: node debian: is dead
Jul 13 09:47:06 destaingk heartbeat: [3250]: info: Comm_now_up(): updating status to active
Jul 13 09:47:06 destaingk heartbeat: [3250]: info: Local status now set to: 'active'
Jul 13 09:47:06 destaingk heartbeat: [3250]: WARN: No STONITH device configured.
Jul 13 09:47:06 destaingk heartbeat: [3250]: WARN: Shared disks are not protected.
Jul 13 09:47:06 destaingk heartbeat: [3250]: info: Resources being acquired from debian.
Jul 13 09:47:06 destaingk heartbeat: [3832]: debug: notify_world: setting SIGCHLD Handler to SIG_DFL
Jul 13 09:47:06 destaingk mach_down[3862]: info: /usr/share/heartbeat/mach_down: nice_failback: foreign resources acquired
Jul 13 09:47:06 destaingk heartbeat: [3250]: info: Initial resource acquisition complete (T_RESOURCES(us))
Jul 13 09:47:06 destaingk heartbeat: [3250]: info: mach_down takeover complete.
Jul 13 09:47:06 destaingk heartbeat: [3250]: debug: StartNextRemoteRscReq(): child count 1
Jul 13 09:47:07 destaingk heartbeat: [3833]: info: Local Resource acquisition completed.
Jul 13 09:47:07 destaingk heartbeat: [3250]: debug: StartNextRemoteRscReq(): child count 1
Jul 13 09:47:07 destaingk heartbeat: [3969]: debug: notify_world: setting SIGCHLD Handler to SIG_DFL
Jul 13 09:47:17 destaingk heartbeat: [3250]: info: Local Resource acquisition completed. (none)
Jul 13 09:47:17 destaingk heartbeat: [3250]: info: local resource transition completed.
Jul 13 09:47:33 destaingk heartbeat: [4730]: ERROR: Cannot open keyfile [/etc/ha.d/authkeys]. Stop.
Jul 13 09:47:33 destaingk heartbeat: Cannot append to /var/log/ha-debug: Permission denied
Jul 13 09:47:33 destaingk heartbeat: Cannot append to /var/log/ha-log: Permission denied
Jul 13 09:47:33 destaingk heartbeat: [4730]: ERROR: Authentication configuration error.
Jul 13 09:47:33 destaingk heartbeat: Cannot append to /var/log/ha-debug: Permission denied
Jul 13 09:47:33 destaingk heartbeat: Cannot append to /var/log/ha-log: Permission denied
Jul 13 09:47:33 destaingk heartbeat: [4730]: ERROR: Configuration error, heartbeat not started.
Jul 13 09:47:33 destaingk heartbeat: Cannot append to /var/log/ha-debug: Permission denied
Jul 13 09:47:33 destaingk heartbeat: Cannot append to /var/log/ha-log: Permission denied
Jul 13 09:47:45 destaingk heartbeat: [3250]: info: destaingk wants to go standby [foreign]
Jul 13 09:47:56 destaingk heartbeat: [3250]: WARN: No reply to standby request. Standby request cancelled.
Jul 13 09:49:49 destaingk heartbeat: [3250]: info: Heartbeat shutdown in progress. (3250)
Jul 13 09:49:49 destaingk heartbeat: [5149]: info: Giving up all HA resources.
Jul 13 09:49:50 destaingk heartbeat: [5149]: info: All HA resources relinquished.
Jul 13 09:49:52 destaingk heartbeat: [3250]: info: killing HBFIFO process 3259 with signal 15
Jul 13 09:49:52 destaingk heartbeat: [3250]: info: killing HBWRITE process 3260 with signal 15
Jul 13 09:49:52 destaingk heartbeat: [3250]: info: killing HBREAD process 3261 with signal 15
Jul 13 09:49:52 destaingk heartbeat: [3250]: info: killing HBWRITE process 3262 with signal 15
Jul 13 09:49:52 destaingk heartbeat: [3250]: info: killing HBREAD process 3263 with signal 15
Jul 13 09:49:52 destaingk heartbeat: [3250]: info: Core process 3259 exited. 5 remaining
Jul 13 09:49:52 destaingk heartbeat: [3250]: info: Core process 3260 exited. 4 remaining
Jul 13 09:49:52 destaingk heartbeat: [3250]: info: Core process 3261 exited. 3 remaining
Jul 13 09:49:52 destaingk heartbeat: [3250]: info: Core process 3262 exited. 2 remaining
Jul 13 09:49:52 destaingk heartbeat: [3250]: info: Core process 3263 exited. 1 remaining
Jul 13 09:49:52 destaingk heartbeat: [3250]: info: destaingk Heartbeat shutdown complete.
Jul 13 09:50:12 destaingk heartbeat: [5563]: WARN: Core dumps could be lost if multiple dumps occur.
Jul 13 09:50:12 destaingk heartbeat: [5563]: WARN: Consider setting non-default value in /proc/sys/kernel/core_pattern (or equivalent) for maximum supportability
Jul 13 09:50:12 destaingk heartbeat: [5563]: WARN: Consider setting /proc/sys/kernel/core_uses_pid (or equivalent) to 1 for maximum supportability
Jul 13 09:50:12 destaingk heartbeat: [5563]: info: Version 2 support: false
Jul 13 09:50:12 destaingk heartbeat: [5563]: WARN: Logging daemon is disabled --enabling logging daemon is recommended
Jul 13 09:50:12 destaingk heartbeat: [5563]: info: **************************
Jul 13 09:50:12 destaingk heartbeat: [5563]: info: Configuration validated. Starting heartbeat 2.1.3
Jul 13 09:50:12 destaingk heartbeat: [5564]: info: heartbeat: version 2.1.3
Jul 13 09:50:12 destaingk heartbeat: [5564]: info: Heartbeat generation: 1278090104
Jul 13 09:50:12 destaingk heartbeat: [5564]: info: glib: ucast: write socket priority set to IPTOS_LOWDELAY on eth0
Jul 13 09:50:12 destaingk heartbeat: [5564]: info: glib: ucast: bound send socket to device: eth0
Jul 13 09:50:12 destaingk heartbeat: [5564]: info: glib: ucast: bound receive socket to device: eth0
Jul 13 09:50:12 destaingk heartbeat: [5564]: info: glib: ucast: started on port 694 interface eth0 to 192.168.1.140
Jul 13 09:50:12 destaingk heartbeat: [5564]: info: glib: UDP Broadcast heartbeat started on port 694 (694) interface eth0
Jul 13 09:50:12 destaingk heartbeat: [5564]: info: glib: UDP Broadcast heartbeat closed on port 694 interface eth0 - Status: 1
Jul 13 09:50:12 destaingk heartbeat: [5564]: info: G_main_add_TriggerHandler: Added signal manual handler
Jul 13 09:50:12 destaingk heartbeat: [5564]: info: G_main_add_TriggerHandler: Added signal manual handler
Jul 13 09:50:12 destaingk heartbeat: [5564]: info: G_main_add_SignalHandler: Added signal handler for signal 17
Jul 13 09:50:12 destaingk heartbeat: [5564]: info: Local status now set to: 'up'
Jul 13 09:50:13 destaingk heartbeat: [5564]: info: Link destaingk:eth0 up.
Jul 13 09:50:32 destaingk heartbeat: [5564]: WARN: node debian: is dead
Jul 13 09:50:32 destaingk heartbeat: [5564]: info: Comm_now_up(): updating status to active
Jul 13 09:50:32 destaingk heartbeat: [5564]: info: Local status now set to: 'active'
Jul 13 09:50:32 destaingk heartbeat: [5564]: WARN: No STONITH device configured.
Jul 13 09:50:32 destaingk heartbeat: [5564]: WARN: Shared disks are not protected.
Jul 13 09:50:32 destaingk heartbeat: [5564]: info: Resources being acquired from debian.
Jul 13 09:50:32 destaingk heartbeat: [5582]: debug: notify_world: setting SIGCHLD Handler to SIG_DFL
Jul 13 09:50:32 destaingk mach_down[5613]: info: /usr/share/heartbeat/mach_down: nice_failback: foreign resources acquired
Jul 13 09:50:32 destaingk heartbeat: [5564]: info: Initial resource acquisition complete (T_RESOURCES(us))
Jul 13 09:50:32 destaingk heartbeat: [5564]: info: mach_down takeover complete.
Jul 13 09:50:32 destaingk heartbeat: [5564]: debug: StartNextRemoteRscReq(): child count 1
Jul 13 09:50:33 destaingk heartbeat: [5583]: info: Local Resource acquisition completed.
Jul 13 09:50:33 destaingk heartbeat: [5564]: debug: StartNextRemoteRscReq(): child count 1
Jul 13 09:50:33 destaingk heartbeat: [5719]: debug: notify_world: setting SIGCHLD Handler to SIG_DFL
Jul 13 09:50:43 destaingk heartbeat: [5564]: info: Local Resource acquisition completed. (none)
Jul 13 09:50:43 destaingk heartbeat: [5564]: info: local resource transition completed.
Jul 13 09:52:46 destaingk heartbeat: [5564]: info: Link debian:eth0 up.
Jul 13 09:52:46 destaingk heartbeat: [5564]: info: Status update for node debian: status init
Jul 13 09:52:46 destaingk heartbeat: [6368]: debug: notify_world: setting SIGCHLD Handler to SIG_DFL
Jul 13 09:52:46 destaingk heartbeat: [5564]: info: Status update for node debian: status up
Jul 13 09:52:46 destaingk heartbeat: [5564]: debug: StartNextRemoteRscReq(): child count 1
Jul 13 09:52:46 destaingk heartbeat: [6385]: debug: notify_world: setting SIGCHLD Handler to SIG_DFL
Jul 13 09:52:47 destaingk heartbeat: [5564]: debug: get_delnodelist: delnodelist=
Jul 13 09:52:47 destaingk heartbeat: [5564]: info: Status update for node debian: status active
Jul 13 09:52:47 destaingk heartbeat: [6401]: debug: notify_world: setting SIGCHLD Handler to SIG_DFL
Jul 13 09:52:48 destaingk heartbeat: [5564]: info: remote resource transition completed.
Jul 13 09:53:33 destaingk heartbeat: [5564]: info: Received shutdown notice from 'debian'.
Jul 13 09:53:33 destaingk heartbeat: [5564]: info: Resources being acquired from debian.
Jul 13 09:53:33 destaingk heartbeat: [6458]: debug: notify_world: setting SIGCHLD Handler to SIG_DFL
Jul 13 09:53:33 destaingk mach_down[6488]: info: /usr/share/heartbeat/mach_down: nice_failback: foreign resources acquired
Jul 13 09:53:33 destaingk heartbeat: [5564]: info: mach_down takeover complete.
Jul 13 09:53:33 destaingk heartbeat: [5564]: debug: StartNextRemoteRscReq(): child count 1
Jul 13 09:53:33 destaingk heartbeat: [6459]: info: Local Resource acquisition completed.
Jul 13 09:53:44 destaingk heartbeat: [5564]: WARN: node debian: is dead
Jul 13 09:53:44 destaingk heartbeat: [5564]: info: Dead node debian gave up resources.
Jul 13 09:53:44 destaingk heartbeat: [5564]: info: Link debian:eth0 dead.
Jul 13 09:53:57 destaingk heartbeat: [5564]: info: Heartbeat restart on node debian
Jul 13 09:53:57 destaingk heartbeat: [5564]: info: Link debian:eth0 up.
Jul 13 09:53:57 destaingk heartbeat: [5564]: info: Status update for node debian: status init
Jul 13 09:53:57 destaingk heartbeat: [5564]: info: Status update for node debian: status up
Jul 13 09:53:57 destaingk heartbeat: [6610]: debug: notify_world: setting SIGCHLD Handler to SIG_DFL
Jul 13 09:53:57 destaingk heartbeat: [5564]: debug: StartNextRemoteRscReq(): child count 1
Jul 13 09:53:57 destaingk heartbeat: [5564]: debug: get_delnodelist: delnodelist=
Jul 13 09:53:57 destaingk heartbeat: [5564]: info: Status update for node debian: status active
Jul 13 09:53:57 destaingk heartbeat: [5564]: debug: StartNextRemoteRscReq(): child count 1
Jul 13 09:53:57 destaingk heartbeat: [6626]: debug: notify_world: setting SIGCHLD Handler to SIG_DFL
Jul 13 09:53:57 destaingk heartbeat: [6642]: debug: notify_world: setting SIGCHLD Handler to SIG_DFL
Jul 13 09:53:58 destaingk heartbeat: [5564]: info: remote resource transition completed.
Jul 13 10:57:09 destaingk heartbeat: [5564]: info: Heartbeat shutdown in progress. (5564)
Jul 13 10:57:09 destaingk heartbeat: [8941]: info: Giving up all HA resources.
Jul 13 10:57:10 destaingk heartbeat: [8941]: info: All HA resources relinquished.
Jul 13 10:57:12 destaingk heartbeat: [5564]: info: killing HBFIFO process 5567 with signal 15
Jul 13 10:57:12 destaingk heartbeat: [5564]: info: killing HBWRITE process 5568 with signal 15
Jul 13 10:57:12 destaingk heartbeat: [5564]: info: killing HBREAD process 5569 with signal 15
Jul 13 10:57:12 destaingk heartbeat: [5564]: info: killing HBWRITE process 5570 with signal 15
Jul 13 10:57:12 destaingk heartbeat: [5564]: info: killing HBREAD process 5571 with signal 15
Jul 13 10:57:12 destaingk heartbeat: [5564]: info: Core process 5567 exited. 5 remaining
Jul 13 10:57:12 destaingk heartbeat: [5564]: info: Core process 5568 exited. 4 remaining
Jul 13 10:57:12 destaingk heartbeat: [5564]: info: Core process 5569 exited. 3 remaining
Jul 13 10:57:12 destaingk heartbeat: [5564]: info: Core process 5570 exited. 2 remaining
Jul 13 10:57:12 destaingk heartbeat: [5564]: info: Core process 5571 exited. 1 remaining
Jul 13 10:57:12 destaingk heartbeat: [5564]: info: destaingk Heartbeat shutdown complete.
Jul 13 10:57:43 destaingk heartbeat: [9398]: WARN: Core dumps could be lost if multiple dumps occur.
Jul 13 10:57:43 destaingk heartbeat: [9398]: WARN: Consider setting non-default value in /proc/sys/kernel/core_pattern (or equivalent) for maximum supportability
Jul 13 10:57:43 destaingk heartbeat: [9398]: WARN: Consider setting /proc/sys/kernel/core_uses_pid (or equivalent) to 1 for maximum supportability
Jul 13 10:57:43 destaingk heartbeat: [9398]: info: Version 2 support: false
Jul 13 10:57:43 destaingk heartbeat: [9398]: WARN: Logging daemon is disabled --enabling logging daemon is recommended
Jul 13 10:57:43 destaingk heartbeat: [9398]: info: **************************
Jul 13 10:57:43 destaingk heartbeat: [9398]: info: Configuration validated. Starting heartbeat 2.1.3
Jul 13 10:57:43 destaingk heartbeat: [9399]: info: heartbeat: version 2.1.3
Jul 13 10:57:43 destaingk heartbeat: [9399]: info: Heartbeat generation: 1278090105
Jul 13 10:57:43 destaingk heartbeat: [9399]: info: glib: ucast: write socket priority set to IPTOS_LOWDELAY on eth0
Jul 13 10:57:43 destaingk heartbeat: [9399]: info: glib: ucast: bound send socket to device: eth0
Jul 13 10:57:43 destaingk heartbeat: [9399]: info: glib: ucast: bound receive socket to device: eth0
Jul 13 10:57:43 destaingk heartbeat: [9399]: info: glib: ucast: started on port 694 interface eth0 to 192.168.1.140
Jul 13 10:57:43 destaingk heartbeat: [9399]: info: glib: UDP Broadcast heartbeat started on port 694 (694) interface eth0
Jul 13 10:57:43 destaingk heartbeat: [9399]: info: glib: UDP Broadcast heartbeat closed on port 694 interface eth0 - Status: 1
Jul 13 10:57:43 destaingk heartbeat: [9399]: info: G_main_add_TriggerHandler: Added signal manual handler
Jul 13 10:57:43 destaingk heartbeat: [9399]: info: G_main_add_TriggerHandler: Added signal manual handler
Jul 13 10:57:43 destaingk heartbeat: [9399]: info: G_main_add_SignalHandler: Added signal handler for signal 17
Jul 13 10:57:43 destaingk heartbeat: [9399]: info: Local status now set to: 'up'
Jul 13 10:57:44 destaingk heartbeat: [9399]: info: Link debian:eth0 up.
Jul 13 10:57:44 destaingk heartbeat: [9399]: info: Status update for node debian: status active
Jul 13 10:57:44 destaingk heartbeat: [9408]: debug: notify_world: setting SIGCHLD Handler to SIG_DFL
Jul 13 10:57:44 destaingk heartbeat: [9399]: info: Link destaingk:eth0 up.
Jul 13 10:57:45 destaingk heartbeat: [9399]: info: Comm_now_up(): updating status to active
Jul 13 10:57:45 destaingk heartbeat: [9399]: info: Local status now set to: 'active'
Jul 13 10:57:45 destaingk heartbeat: [9399]: info: remote resource transition completed.
Jul 13 10:57:45 destaingk heartbeat: [9399]: info: remote resource transition completed.
Jul 13 10:57:45 destaingk heartbeat: [9399]: info: Local Resource acquisition completed. (none)
Jul 13 10:57:45 destaingk heartbeat: [9399]: info: Initial resource acquisition complete (T_RESOURCES(them))
Jul 13 10:57:45 destaingk heartbeat: [9399]: info: debian wants to go standby [foreign]
Jul 13 10:57:45 destaingk heartbeat: [9399]: info: remote resource transition completed.
Jul 13 10:57:46 destaingk heartbeat: [9399]: info: remote resource transition completed.
Jul 13 10:57:46 destaingk heartbeat: [9399]: info: standby: acquire [foreign] resources from debian
Jul 13 10:57:46 destaingk heartbeat: [9430]: info: acquire local HA resources (standby).
Jul 13 10:57:47 destaingk heartbeat: [9430]: info: local HA resource acquisition completed (standby).
Jul 13 10:57:47 destaingk heartbeat: [9399]: info: Standby resource acquisition done [foreign].
Jul 13 11:17:37 destaingk heartbeat: [9399]: info: Heartbeat shutdown in progress. (9399)
Jul 13 11:17:37 destaingk heartbeat: [10761]: info: Giving up all HA resources.
Jul 13 11:17:44 destaingk heartbeat: [10761]: info: All HA resources relinquished.
Jul 13 11:17:44 destaingk heartbeat: [9399]: WARN: 1 lost packet(s) for [debian] [2532:2534]
Jul 13 11:17:44 destaingk heartbeat: [9399]: info: No pkts missing from debian!
Jul 13 11:17:46 destaingk heartbeat: [9399]: info: killing HBWRITE process 9405 with signal 15
Jul 13 11:17:46 destaingk heartbeat: [9399]: info: killing HBREAD process 9406 with signal 15
Jul 13 11:17:46 destaingk heartbeat: [9399]: info: killing HBFIFO process 9402 with signal 15
Jul 13 11:17:46 destaingk heartbeat: [9399]: info: killing HBWRITE process 9403 with signal 15
Jul 13 11:17:46 destaingk heartbeat: [9399]: info: killing HBREAD process 9404 with signal 15
Jul 13 11:17:46 destaingk heartbeat: [9399]: info: Core process 9402 exited. 5 remaining
Jul 13 11:17:46 destaingk heartbeat: [9399]: info: Core process 9403 exited. 4 remaining
Jul 13 11:17:46 destaingk heartbeat: [9399]: info: Core process 9404 exited. 3 remaining
Jul 13 11:17:46 destaingk heartbeat: [9399]: info: Core process 9405 exited. 2 remaining
Jul 13 11:17:46 destaingk heartbeat: [9399]: info: Core process 9406 exited. 1 remaining
Jul 13 11:17:46 destaingk heartbeat: [9399]: info: destaingk Heartbeat shutdown complete.


Et sur le secondaire on a

debian:~# grep heartbeat /var/log/syslog
Jul 13 10:57:11 debian heartbeat: [3434]: info: Received shutdown notice from 'destaingk'.
Jul 13 10:57:11 debian heartbeat: [3434]: info: Resources being acquired from destaingk.
Jul 13 10:57:11 debian heartbeat: [3434]: debug: StartNextRemoteRscReq(): child count 1
Jul 13 10:57:11 debian heartbeat: [3698]: info: acquire all HA resources (standby).
Jul 13 10:57:11 debian heartbeat: [3699]: info: No local resources [/usr/share/heartbeat/ResourceManager listkeys debian] to acquire.
Jul 13 10:57:11 debian heartbeat: [3434]: debug: StartNextRemoteRscReq(): child count 1
Jul 13 10:57:13 debian heartbeat: [3698]: info: all HA resource acquisition completed (standby).
Jul 13 10:57:13 debian heartbeat: [3434]: info: Standby resource acquisition done [all].
Jul 13 10:57:13 debian heartbeat: [4490]: debug: notify_world: setting SIGCHLD Handler to SIG_DFL
Jul 13 10:57:15 debian mach_down[4506]: info: /usr/share/heartbeat/mach_down: nice_failback: foreign resources acquired
Jul 13 10:57:15 debian heartbeat: [3434]: info: mach_down takeover complete.
Jul 13 10:57:23 debian heartbeat: [3434]: WARN: node destaingk: is dead
Jul 13 10:57:23 debian heartbeat: [3434]: info: Dead node destaingk gave up resources.
Jul 13 10:57:23 debian heartbeat: [3434]: info: Link destaingk:eth0 dead.
Jul 13 10:57:43 debian heartbeat: [3434]: info: debian wants to go standby [foreign]
Jul 13 10:57:45 debian heartbeat: [3434]: info: Heartbeat restart on node destaingk
Jul 13 10:57:45 debian heartbeat: [3434]: info: Link destaingk:eth0 up.
Jul 13 10:57:45 debian heartbeat: [3434]: info: Status update for node destaingk: status init
Jul 13 10:57:45 debian heartbeat: [3434]: info: Status update for node destaingk: status up
Jul 13 10:57:45 debian heartbeat: [3434]: debug: StartNextRemoteRscReq(): child count 1
Jul 13 10:57:45 debian heartbeat: [3434]: debug: get_delnodelist: delnodelist=
Jul 13 10:57:45 debian heartbeat: [5337]: debug: notify_world: setting SIGCHLD Handler to SIG_DFL
Jul 13 10:57:45 debian heartbeat: [5367]: debug: notify_world: setting SIGCHLD Handler to SIG_DFL
Jul 13 10:57:45 debian heartbeat: [3434]: info: Status update for node destaingk: status active
Jul 13 10:57:45 debian heartbeat: [3434]: debug: StartNextRemoteRscReq(): child count 1
Jul 13 10:57:45 debian heartbeat: [3434]: WARN: Standby in progress- new request from debian ignored [9 seconds left]
Jul 13 10:57:45 debian heartbeat: [5392]: debug: notify_world: setting SIGCHLD Handler to SIG_DFL
Jul 13 10:57:45 debian heartbeat: [3434]: info: remote resource transition completed.
Jul 13 10:57:45 debian heartbeat: [3434]: info: standby: destaingk can take our foreign resources
Jul 13 10:57:45 debian heartbeat: [3434]: debug: StartNextRemoteRscReq(): child count 1
Jul 13 10:57:45 debian heartbeat: [5408]: info: give up foreign HA resources (standby).
Jul 13 10:57:45 debian heartbeat: [3434]: info: remote resource transition completed.
Jul 13 10:57:46 debian heartbeat: [5408]: info: foreign HA resource release completed (standby).
Jul 13 10:57:46 debian heartbeat: [3434]: info: Local standby process completed [foreign].
Jul 13 10:57:47 debian heartbeat: [3434]: WARN: 1 lost packet(s) for [destaingk] [12:14]
Jul 13 10:57:47 debian heartbeat: [3434]: info: No pkts missing from destaingk!
Jul 13 10:57:47 debian heartbeat: [3434]: info: Other node completed standby takeover of foreign resources.
Jul 13 11:17:44 debian heartbeat: [3434]: info: Received shutdown notice from 'destaingk'.
Jul 13 11:17:44 debian heartbeat: [3434]: info: Resources being acquired from destaingk.
Jul 13 11:17:44 debian heartbeat: [3434]: debug: StartNextRemoteRscReq(): child count 1
Jul 13 11:17:44 debian heartbeat: [5725]: info: acquire local HA resources (standby).
Jul 13 11:17:44 debian heartbeat: [5725]: info: local HA resource acquisition completed (standby).
Jul 13 11:17:44 debian heartbeat: [3434]: info: Standby resource acquisition done [foreign].
Jul 13 11:17:44 debian heartbeat: [3434]: debug: StartNextRemoteRscReq(): child count 1
Jul 13 11:17:44 debian heartbeat: [5726]: info: No local resources [/usr/share/heartbeat/ResourceManager listkeys debian] to acquire.
Jul 13 11:17:44 debian heartbeat: [5751]: debug: notify_world: setting SIGCHLD Handler to SIG_DFL
Jul 13 11:17:46 debian mach_down[5767]: info: /usr/share/heartbeat/mach_down: nice_failback: foreign resources acquired
Jul 13 11:17:46 debian heartbeat: [3434]: info: mach_down takeover complete.
Jul 13 11:17:56 debian heartbeat: [3434]: WARN: node destaingk: is dead
Jul 13 11:17:56 debian heartbeat: [3434]: info: Dead node destaingk gave up resources.
Jul 13 11:17:56 debian heartbeat: [3434]: info: Link destaingk:eth0 dead.
Jul 13 11:18:16 debian heartbeat: [3434]: info: debian wants to go standby [foreign]
Jul 13 11:18:26 debian heartbeat: [3434]: WARN: No reply to standby request. Standby request cancelled.


J'ai l'impression qu'il ya un pb sur le serveur secondaire puis qu'en faisant
cat /proc/drbd j'ai ceci sur les deux serveurs:

:~# cat /proc/drbd
version: 8.0.14 (api:86/proto:86)
GIT-hash: bb447522fc9a87d0069b7e14f0234911ebdab0f7 build by phil@fat-tyre, 2008-11-12 16:40:33
0: cs:Connected st:Secondary/Secondary ds:UpToDate/UpToDate C r---
ns:40 nr:24 dw:64 dr:145 al:1 bm:6 lo:0 pe:0 ua:0 ap:0
resync: used:0/61 hits:1 misses:1 starving:0 dirty:0 changed:1
act_log: used:0/257 hits:9 misses:1 starving:0 dirty:0 changed:1


i,e que les deux sont vue comme des secondaires

Trés Cord !!!

jluce
13/07/2010, 13h22
slt

<div class='quotetop'>Citation </div>
Jul 13 09:47:33 destaingk heartbeat: [4730]: ERROR: Cannot open keyfile [/etc/ha.d/authkeys]. Stop.[/b]

déjà ca ca va pas envois les droits que tu as su /etc/ha.d/*


ls -alrt /etc/ha.d/

de plus on dirais que ton secondaire en as un peu rien a foutre que le primaire tombe :blink:

envois le résultat des commandes suivantes stp:


cat /etc/ha.d/authkeys|grep -v ^#
cat /etc/ha.d/haresources|grep -v ^#
cat /etc/ha.d/ha.cf|grep -v ^#

pour debian et destaingk

a+

tynho
13/07/2010, 14h49
Slt voici le contenu des fichiers heartbeat de mes serveurs

~# ls -alrt /etc/ha.d/
total 60
drwxr-xr-x 2 root root 4096 avr 30 2009 cts
drwxr-xr-x 2 root root 4096 avr 30 2009 conf
-rw-r--r-- 1 root root 7184 avr 30 2009 shellfuncs
-rw-r--r-- 1 root root 692 avr 30 2009 README.config
-rwxr-xr-x 1 root root 745 avr 30 2009 harc
drwxr-xr-x 2 root root 4096 jun 28 12:55 rc.d
drwxr-xr-x 2 root root 4096 jui 2 15:43 resource.d
-rw------- 1 root root 22 jui 2 16:52 authkeys
-rw-r--r-- 1 root root 774 jui 5 10:05 ha.cf
-rw-r--r-- 1 root root 110 jui 12 13:45 haresources
drwxr-xr-x 6 root root 4096 jui 12 18:25 .
drwxr-xr-x 126 root root 12288 jui 13 11:25 ..


cat /etc/ha.d/authkeys | grep -v ^#
auth 1
1 md5 "motdepasser"

cat /etc/ha.d/haresources | grep -v ^#
destaingk IPaddr::192.168.1.199/24/eth0 drbddisk::r0 Filesystem::/dev/drbd0::/data::ext3 apache2 MailTo::root

cat /etc/ha.d/ha.cf | grep -v ^#
ucast eth0 192.168.1.140
debugfile /var/log/ha-debug
logfile /var/log/ha-log
logfacility local0
keepalive 2
deadtime 10
initdead 20
udpport 694
bcast eth0
auto_failback off
node destaingk
node debian

Pour le second serveur
cat /etc/ha.d/ha.cf | grep -v ^#
ucast eth0 192.168.1.20
debugfile /var/log/ha-debug
logfile /var/log/ha-log
logfacility local0
keepalive 2
deadtime 10
initdead 20
udpport 694
bcast eth0
auto_failback off
node destaingk
node debian



Merci!!!

jluce
13/07/2010, 15h26
re

si tu as un ucast (pour uniquement un host), tu n'as pas besoins du bcast eth0

de plus moi j'ai cette option en plus

respawn hacluster /usr/lib/heartbeat/ipfail dans les ha.cf
dans les ha.cf

a coté de ca essaye de voir pour viré ton mailto::root et le type de systeme de fichiers (ext3, t'inquiètes il devrais le reconnaitre tout seul...) dans haressources

et les droits sur le authkeys sont les memes sur les deux ???

peux tu faire un


ps auxf|grep heartbeat

pour qu'on vois avec quel utilisateur il tourne...

a+

tynho
13/07/2010, 16h42
Slt
Voici les droits
th0
debian:/etc/ha.d# ps auxf | grep heartbeat
root 13394 0.0 0.1 3144 768 pts/0 S+ 15:40 0:00 \_ grep heartbeat
root 13355 0.0 2.6 12848 12848 ? SLs 15:37 0:00 heartbeat: master control process
nobody 13357 0.0 1.2 6252 6252 ? SL 15:37 0:00 \_ heartbeat: FIFO reader
nobody 13358 0.0 1.2 6248 6248 ? SL 15:37 0:00 \_ heartbeat: write: ucast eth0
nobody 13359 0.0 1.2 6248 6248 ? SL 15:37 0:00 \_ heartbeat: read: ucast eth0
nobody 13360 0.0 1.2 6248 6248 ? SL 15:37 0:00 \_ heartbeat: write: bcast eth0
nobody 13361 0.0 1.2 6248 6248 ? SL 15:37 0:00 \_ heartbeat: read: bcast eth0


Cepdt tjrs rien donc j'ai decidé de garder les mm conf

jluce
15/07/2010, 09h02
slt

et tu as fait quoi exactement ??

a+