CodeVerge.Net Beta


   Item Entry   Register  Login  
Microsoft News
Asp.Net Forums
IBM Software
Borland Forums
Adobe Forums
Novell Forums




Can Reply:  Yes Members Can Edit: No Online: Yes
Zone: > NEWSGROUP > Novell Forums > novell.support.cluster-services Tags:
Item Type: Date Entered: 10/21/2008 8:06:04 PM Date Modified: Subscribers: 0 Subscribe Alert
Rate It:
NR
XPoints: N/A Replies: 7 Views: 7 Favorited: 0 Favorite
8 Items, 1 Pages 1 |< << Go >> >|
"Alexander Lore
NewsGroup User
Poison pill given to cluster node10/21/2008 8:06:04 PM
Reply

0

HI all,

i have a strange behavior of my two-node cluster (OES2-Linux, iSCSI-SAN). I
set up a clustered NSS Pool with Volumes. Everything works fine, but if i
trigger a migration from node one to node two during a file copy process on
a Windows XP workstation, the current active node gets a poison pill and is
"killed". The other node is still active and runnig. Is this working as
designed or do i have a problem? If so where should i look for the reason??

Thanks in advance

Alex
"Klaus Arpe" <a
NewsGroup User
Re: Poison pill given to cluster node10/22/2008 8:14:32 AM
Reply

0

Not really as designed. Poison pills happen if one node doesnot answer
timely to th heartbeat. As you describe it happens during high load (copy).
I would look for communication problems, harddisk driver problems (is the
iSCSI device on a seperate box or one of the nodes?).
False Poison Pills may also happen due to 2 node Clusters. For 3 and more
Nodes it is easier for the nodes to find out which node is alive.

Klaus

"Alexander Lorenz" <Alexander.Lorenz@4plus.de> wrote in message
news:48FE5246.3C49.0001.0@4plus.de...
> HI all,
>
> i have a strange behavior of my two-node cluster (OES2-Linux, iSCSI-SAN).
> I
> set up a clustered NSS Pool with Volumes. Everything works fine, but if i
> trigger a migration from node one to node two during a file copy process
> on
> a Windows XP workstation, the current active node gets a poison pill and
> is
> "killed". The other node is still active and runnig. Is this working as
> designed or do i have a problem? If so where should i look for the
> reason??
>
> Thanks in advance
>
> Alex


"Alexander Lore
NewsGroup User
Antw: Re: Poison pill given to cluster node10/23/2008 9:27:26 AM
Reply

0

Could it be helpfull to adjust the timeout settings? (Takt, Toleranz, Master
und Slave Watchdog)?

The iSCSI device resides on a HP MSA 1510i connected via a HP Switch.

Thanks so far.

Alex

>>> <arpe@etech.haw-hamburg.de> schrieb am 22.10.2008 um 10:14:
> Not really as designed. Poison pills happen if one node doesnot answer
> timely to th heartbeat. As you describe it happens during high load
> (copy).
> I would look for communication problems, harddisk driver problems (is
> the
> iSCSI device on a seperate box or one of the nodes?).
> False Poison Pills may also happen due to 2 node Clusters. For 3 and
> more
> Nodes it is easier for the nodes to find out which node is alive.
>
> Klaus
>
> "Alexander Lorenz" <Alexander.Lorenz@4plus.de> wrote in message
> news:48FE5246.3C49.0001.0@4plus.de...
>> HI all,
>>
>> i have a strange behavior of my two-node cluster (OES2-Linux, iSCSI-SAN).

>> I
>> set up a clustered NSS Pool with Volumes. Everything works fine, but if
> i
>> trigger a migration from node one to node two during a file copy process

>
>> on
>> a Windows XP workstation, the current active node gets a poison pill and

>
>> is
>> "killed". The other node is still active and runnig. Is this working as
>> designed or do i have a problem? If so where should i look for the
>> reason??
>>
>> Thanks in advance
>>
>> Alex
"Klaus Arpe" <a
NewsGroup User
Re: Re: Poison pill given to cluster node10/23/2008 12:41:59 PM
Reply

0

Might be, but before tuning that you should carefully read the TID 10053882
"NetWare Cluster Services: The Gory details of Heartbeats, Split Brains and
Poison Pills"

Klaus

"Alexander Lorenz" <Alexander.Lorenz@4plus.de> wrote in message
news:49005F9B.3C49.0001.0@4plus.de...
> Could it be helpfull to adjust the timeout settings? (Takt, Toleranz,
> Master
> und Slave Watchdog)?
>
> The iSCSI device resides on a HP MSA 1510i connected via a HP Switch.
>
> Thanks so far.
>
> Alex
>
>>>> <arpe@etech.haw-hamburg.de> schrieb am 22.10.2008 um 10:14:
>> Not really as designed. Poison pills happen if one node doesnot answer
>> timely to th heartbeat. As you describe it happens during high load
>> (copy).
>> I would look for communication problems, harddisk driver problems (is
>> the
>> iSCSI device on a seperate box or one of the nodes?).
>> False Poison Pills may also happen due to 2 node Clusters. For 3 and
>> more
>> Nodes it is easier for the nodes to find out which node is alive.
>>
>> Klaus
>>
>> "Alexander Lorenz" <Alexander.Lorenz@4plus.de> wrote in message
>> news:48FE5246.3C49.0001.0@4plus.de...
>>> HI all,
>>>
>>> i have a strange behavior of my two-node cluster (OES2-Linux,
>>> iSCSI-SAN).
>
>>> I
>>> set up a clustered NSS Pool with Volumes. Everything works fine, but if
>> i
>>> trigger a migration from node one to node two during a file copy process
>
>>
>>> on
>>> a Windows XP workstation, the current active node gets a poison pill and
>
>>
>>> is
>>> "killed". The other node is still active and runnig. Is this working as
>>> designed or do i have a problem? If so where should i look for the
>>> reason??
>>>
>>> Thanks in advance
>>>
>>> Alex


ataubman <ataub
NewsGroup User
Re: Poison pill given to cluster node10/24/2008 12:06:02 AM
Reply

0


See TID 3839149. Set the Tolerance and Slave Watchdog from 8 to 16
seconds each.


--
Andrew C Taubman
Novell Support Forums Volunteer SysOp
http://forums.novell.com/
(Sorry, support is not provided via e-mail)

Opinions expressed above are not
necessarily those of Novell Inc.
------------------------------------------------------------------------
ataubman's Profile: http://forums.novell.com/member.php?userid=34
View this thread: http://forums.novell.com/showthread.php?t=348139

"Alexander Lore
NewsGroup User
Antw: Re: Poison pill given to cluster node10/27/2008 8:52:37 PM
Reply

0

OK i will test this. Thanks for the support.

Results will be reported here.

Alex

>>> <ataubman@no-mx.forums.novell.com> schrieb am 24.10.2008 um 02:06:
> See TID 3839149. Set the Tolerance and Slave Watchdog from 8 to 16seconds

> each.-- Andrew C Taubman
> Novell Support Forums Volunteer SysOp
> http://forums.novell.com/
> (Sorry, support is not provided via e-mail)
>
> Opinions expressed above are not
> necessarily those of Novell
Inc.------------------------------------------------------------------------
ataubman's Profile:
> http://forums.novell.com/member.php?userid=34View this thread:
> http://forums.novell.com/showthread.php?t=348139
"Alexander Lore
NewsGroup User
Antw: Re: Poison pill given to cluster node11/5/2008 11:31:37 PM
Reply

0

After playing around with the watchdog timers are the results the same. What
now?

Greetings Alex

>>> <ataubman@no-mx.forums.novell.com> schrieb am 24.10.2008 um 02:06:
> See TID 3839149. Set the Tolerance and Slave Watchdog from 8 to 16seconds

> each.-- Andrew C Taubman
> Novell Support Forums Volunteer SysOp
> http://forums.novell.com/
> (Sorry, support is not provided via e-mail)
>
> Opinions expressed above are not
> necessarily those of Novell
Inc.------------------------------------------------------------------------
ataubman's Profile:
> http://forums.novell.com/member.php?userid=34View this thread:
> http://forums.novell.com/showthread.php?t=348139
ataubman <ataub
NewsGroup User
Re: Poison pill given to cluster node11/5/2008 11:56:02 PM
Reply

0


Well that's not good. It means during these copy sessions, the heartbeat
is failing to get even one out of the 16 packets it is sendign through
the wire. You might need a lan trace to see what's going on and whether
it's the master or slave node not playing nice.

Is the same lan used for the heartbeat and iSCSI and the workstation
client access?


--
Andrew C Taubman
Novell Support Forums Volunteer SysOp
http://forums.novell.com/
(Sorry, support is not provided via e-mail)

Opinions expressed above are not
necessarily those of Novell Inc.
------------------------------------------------------------------------
ataubman's Profile: http://forums.novell.com/member.php?userid=34
View this thread: http://forums.novell.com/showthread.php?t=348139

8 Items, 1 Pages 1 |< << Go >> >|


Free Download:







another auto-complete problem

gridview syntax

saving a word doc inside application server w/o showing the page

can someone demistify or mappers?

how i can remove te unbderline of hyperlink

format sql server true/false to yes/no

some questions

web:get live image from scanner...

need your help on source code

how to create newsletter module

httpweb request/response

using the function of the event

why the asp code gets ignored

how to validate textboxes if users uses netscape.

asp.net integration with voice message system

passing a string value to userb wizard control?

do you use sqlhelper ?

how to developed mvc application

which control to use?

help! microsoft jscript runtime error: 'length' is null or not an object

force download file with swedish character

force page to refresh after button click /postback

asp.net / vb.net text file manipulation question

i need help to get started with this simple asp.net page

exchange global address book?

return value on new page

can anyone explain difference between page_load and pre_render

login.aspx does not start up

error 1935 on 60 trial vs.net 2003

data adapter clause: time, will not fill

licenses

error trying to pull data from oracle number fields.

geekspeak tutorial

looking for sample code that will read / write file to a users desktop.

needs of learning asp 3.0 before asp.net?

3.x

web matrix server does not open ie to display page?

object reference not set to an instance of an object error on an unbound dropdownlist

how to save a picture in a sql image field and how to show a picture from a sql image field

asp.net e-commerce application help

how do you prefer to name and link your web pages?

datalist with multipe datasources?

visual studio formatting

assign current date to a textbox at design time

mcp exam, where to start?

radio button, boolean problem

function to change backcolor

asp.net book opinions

why has asp.net appeared as an account holder on my computer?

.net certification

   
  Privacy | Contact Us
All Times Are GMT