Hi All,
At last month-end a lot of deadlocks occurred on our production server, and as a result the hold queue filled up with messages. The next day I found 74 messages in the hold queue and replayed all of them, but only 40 were processed successfully; the remaining 34 are still in the hold queue. Yesterday I tried to replay only the first message, and it caused exceptions and deadlocks and also restarted the execution server instances. I have pasted the exception and deadlock details below for more information.
Error message from fmcerr.log:
------------------------------
MQSeries Workflow 3.3 Error Report
Report creation = 10/18/06 15:46:08
Related message = FMC31050E An error has occurred which has terminated processing.
Error location = File=/projects/fmc/drvp/lbld/v332/src/fmcdtprc.cxx, Line=2887, Function=FmcTOMForBlocks::MaterializeForUpdateImpl2(const MaterializeInputParm &, const FmcBIID &, TomForAbstractBlocksDict::iterator &)
Error data = FmcTOMNotFoundException, KeyValues= BIID=OID(00000001153f00030000000000000000),OID(00000001415647490000000000000000)
Exceptions from fmcsys.log:
---------------------------
10/18/06 15:46:07 FMC12170I The processing of hold queue messages started.
10/18/06 15:46:07 FMC12150I 1 messages have been moved to the execution server input queue.
10/18/06 15:46:08 FMC31050E An error has occurred which has terminated processing.
10/18/06 15:46:08 FmcTOMNotFoundException, KeyValues= BIID=OID(00000001153f00030000000000000000),OID(00000001415647490000000000000000)
10/18/06 15:46:08 FMC31050E An error has occurred which has terminated processing.
10/18/06 15:46:08 FmcTOMNotFoundException, KeyValues= BIID=OID(00000001153f00030000000000000000),OID(00000001415647490000000000000000)
10/18/06 15:46:09 FMC12240E Execution server instance(s) stopped with an error.
10/18/06 15:46:09 FMC12240E Execution server instance(s) stopped with an error.
10/18/06 15:46:09 FMC31050E An error has occurred which has terminated processing.
10/18/06 15:46:09 FmcTOMNotFoundException, KeyValues= BIID=OID(00000001153f00030000000000000000),OID(00000001415647490000000000000000)
10/18/06 15:46:09 FMC12240E Execution server instance(s) stopped with an error.
10/18/06 15:46:10 FMC10500I Execution server instance started.
10/18/06 15:46:10 FMC31050E An error has occurred which has terminated processing.
10/18/06 15:46:10 FmcTOMNotFoundException, KeyValues= BIID=OID(00000001153f00030000000000000000),OID(00000001415647490000000000000000)
10/18/06 15:46:10 FMC12240E Execution server instance(s) stopped with an error.
10/18/06 15:46:10 FMC10500I Execution server instance started.
10/18/06 15:46:10 FMC31050E An error has occurred which has terminated processing.
10/18/06 15:46:10 FmcTOMNotFoundException, KeyValues= BIID=OID(00000001153f00030000000000000000),OID(00000001415647490000000000000000)
10/18/06 15:46:10 FMC12240E Execution server instance(s) stopped with an error.
10/18/06 15:46:11 FMC10500I Execution server instance started.
10/18/06 15:46:11 FMC10500I Execution server instance started.
10/18/06 15:46:11 FMC10500I Execution server instance started.
Deadlocks:
----------
10/18/06 16:40:40 FMC31100W The message ResumeProcInst could not be processed because of a database deadlock or timeout. The message will be retried.
10/18/06 16:40:42 FMC31100W The message ResumeProcInst could not be processed because of a database deadlock or timeout. The message will be retried.
10/18/06 16:40:44 FMC31100W The message ResumeProcInst could not be processed because of a database deadlock or timeout. The message will be retried.
10/18/06 16:40:46 FMC31100W The message ResumeProcInst could not be processed because of a database deadlock or timeout. The message will be retried.
10/18/06 16:40:58 FMC31100W The message ResumeProcInst could not be processed because of a database deadlock or timeout. The message will be retried.
10/18/06 16:41:05 FMC31100W The message ResumeProcInst could not be processed because of a database deadlock or timeout. The message will be retried.
10/18/06 16:41:07 FMC31100W The message ResumeProcInst could not be processed because of a database deadlock or timeout. The message will be retried.
10/18/06 16:41:09 FMC31100W The message ResumeProcInst could not be processed because of a database deadlock or timeout. The message will be retried.
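For anyone who wants to tally entries like the ones above, here is a minimal Python sketch. It is not part of MQ Workflow; the function name tally_codes and the regular expression are my own assumptions, based only on the "date time FMCnnnnnX" layout visible in these excerpts. It scans the whole text rather than line by line, so it should also catch an entry whose timestamp is run together with the end of the previous line.

```python
import re
from collections import Counter

# Assumed entry layout, taken from the excerpts above:
#   MM/DD/YY HH:MM:SS FMCnnnnnX <text>
# where X is a severity letter such as E, W or I.
ENTRY = re.compile(r"(\d{2}/\d{2}/\d{2} \d{2}:\d{2}:\d{2}) (FMC\d{5}[A-Z])")


def tally_codes(log_text):
    """Count how often each FMC message code appears in log_text."""
    return Counter(code for _timestamp, code in ENTRY.findall(log_text))


# A few entries copied from the log excerpts in this post.
sample = (
    "10/18/06 15:46:07 FMC12170I The processing of hold queue messages started.\n"
    "10/18/06 15:46:08 FMC31050E An error has occurred which has terminated processing.\n"
    "10/18/06 15:46:08 FmcTOMNotFoundException, KeyValues= BIID=OID(00000001153f00030000000000000000),"
    "OID(00000001415647490000000000000000)"
    "10/18/06 15:46:09 FMC12240E Execution server instance(s) stopped with an error.\n"
    "10/18/06 16:40:40 FMC31100W The message ResumeProcInst could not be processed "
    "because of a database deadlock or timeout. The message will be retried.\n"
    "10/18/06 16:40:42 FMC31100W The message ResumeProcInst could not be processed "
    "because of a database deadlock or timeout. The message will be retried.\n"
)

if __name__ == "__main__":
    for code, count in tally_codes(sample).most_common():
        print(code, count)
```

To run it against a real log, read the file contents into a string and pass that to tally_codes in place of sample.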
Can anyone tell me why the execution server instances restarted?
How can I clear the messages from the hold queue?
What would happen if I deleted the messages from the hold queue using fmcautil?
Production server environment details:
-------------------------------------
AIX-4.3
MQ Workflow-3.3.2.0
MQ Series Server-5.2.0.0
DB2-06.01.0000
Thanks & Regards,
BMCA.