2007-1-23 09:53
xingyu820424
WAS keeps hanging and stops responding
WAS 6.1 is installed on AIX 5.3 ML04.
--------------------------------------------------------------------------------
Name IBM WebSphere Application Server - ND
Version 6.1.0.3
ID ND
Build Level cf30646.29
Build Date 11/15/06
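It was installed at the end of last September and went live at the end of October. There were no problems at first, but around mid-January this year WAS died: the application's web pages would not open, and the Admin console pages could not be reached either.
I checked the logs under /<install path>/AppServer/logs and could not see anything wrong.
The system errpt shows no errors either, and the hardware is all fine.
The machine is an H80 with 2 CPUs and 2 GB of memory.
Please help: how should I go about analysing this and finding the cause? Thanks.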
2007-1-23 09:56
xingyu820424
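Also, there is a backup PC (set up at the same time as the B80) running Windows. That machine is now acting as the server and has not shown this problem; it just keeps catching viruses. Could this be related to the WAS configuration?
As far as I remember, the tuning parameters were left at their defaults during installation.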
2007-1-23 10:00
xgj0123
What does catching viruses have to do with the WAS configuration? The viruses lately have been unusually rampant....
2007-1-23 10:36
xingyu820424
SystemErr.log output:
[1/22/07 8:52:02:579 GMT+08:00] 0000003d SystemErr R at com.ibm.ws.http.channel.inbound.impl.HttpInboundLink.ready(HttpInboundLink.java:274)
[1/22/07 8:52:02:606 GMT+08:00] 0000003d SystemErr R at com.ibm.ws.tcp.channel.impl.NewConnectionInitialReadCallback.sendToDiscriminators(NewConnectionInitialReadCallback.java:214)
[1/22/07 8:52:02:606 GMT+08:00] 0000003d SystemErr R at com.ibm.ws.tcp.channel.impl.NewConnectionInitialReadCallback.complete(NewConnectionInitialReadCallback.java:113)
[1/22/07 8:52:02:607 GMT+08:00] 0000003d SystemErr R at com.ibm.ws.tcp.channel.impl.AioReadCompletionListener.futureCompleted(AioReadCompletionListener.java:152)
[1/22/07 8:52:02:607 GMT+08:00] 0000003d SystemErr R at com.ibm.io.async.AbstractAsyncFuture.invokeCallback(AbstractAsyncFuture.java:213)
[1/22/07 8:52:02:607 GMT+08:00] 0000003d SystemErr R at com.ibm.io.async.AsyncChannelFuture$1.run(AsyncChannelFuture.java:163)
[1/22/07 8:52:02:607 GMT+08:00] 0000003d SystemErr R at com.ibm.ws.util.ThreadPool$Worker.run(ThreadPool.java:1469)
[1/22/07 8:52:02:609 GMT+08:00] 0000003d SystemErr R java.net.ConnectException: A remote host refused an attempted connect operation.
[1/22/07 8:52:02:618 GMT+08:00] 0000003d SystemErr R at java.net.PlainSocketImpl.socketConnect(Native Method)
[1/22/07 8:52:02:619 GMT+08:00] 0000003d SystemErr R at java.net.PlainSocketImpl.doConnect(PlainSocketImpl.java:372)
...................................................
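When SystemErr.log is pasted from an 80-column terminal, records can be split mid-token, which makes the actual exception hard to spot among the stack frames. The following is a minimal Python sketch, not an IBM tool, that rejoins such fragments and lists only the non-frame records (for example the java.net.ConnectException in this excerpt). The function names and the assumption that every record starts with a bracketed timestamp are illustrative.

```python
import re

# Start of a record, e.g. "[1/22/07 8:52:02:579 GMT+08:00] 0000003d SystemErr R ..."
RECORD_START = re.compile(
    r"^\[\d{1,2}/\d{1,2}/\d{2} \d{1,2}:\d{2}:\d{2}:\d{3} [^\]]+\] "
)
# Everything after the "SystemErr R" columns is the message text.
SYSTEM_ERR = re.compile(r"SystemErr\s+R\s+(.*)$")


def unwrap_records(lines):
    """Join terminal-wrapped continuation lines back onto their record."""
    records = []
    for line in lines:
        line = line.rstrip("\n")
        if RECORD_START.match(line) or not records:
            records.append(line)
        else:
            # Assumption: a line without a timestamp is a wrapped tail.
            records[-1] += line
    return records


def exception_headers(records):
    """Return SystemErr messages that are not 'at ...' stack frames."""
    headers = []
    for record in records:
        match = SYSTEM_ERR.search(record)
        if match and not match.group(1).startswith("at "):
            headers.append(match.group(1))
    return headers
```

Running `exception_headers(unwrap_records(open("SystemErr.log")))` would give a short list of exception lines to start from; the frames around each one can then be read in the full log.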
2007-1-23 10:52
xingyu820424
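[quote]Originally posted by [i]xgj0123[/i] at 2007-1-23 10:00
What does catching viruses have to do with the WAS configuration? The viruses lately have been unusually rampant.... [/quote]
What I meant by the viruses is that we cannot keep running on Windows; it is not safe, so we need to move back to AIX.
What I meant about the configuration is this: the problem only appeared after several months of running on AIX, whereas Windows has only been running for a few days, so a problem would not show up yet.
I am worried it has to do with the configuration of the various pools.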
2007-1-23 11:17
void
Check the WAS logs!
2007-1-23 11:22
hgh25emus
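You never said whether you restarted WAS. It probably has little to do with the hardware, right?
Look at the errors reported when WAS starts, and look at the WAS logs.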
2007-1-23 11:25
hgh25emus
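Call IBM, though the 800 support line is not much use. Here, at least, the 800 line has never solved a single problem; after a bit of back and forth they pass it to the lab. Usually they ask for your logs and then say they cannot see any problem.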
2007-1-23 11:42
yddll
If everything looks normal, what else is there to look at?
Does restarting WAS fix it?
2007-1-23 11:48
老农
For a WAS problem, how much use are the system logs?
2007-1-23 12:48
qiaolan
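Reply to #1, xingyu820424's post
You need to look at how exactly it dies: is it unresponsive because CPU and memory are exhausted, or have the WAS processes themselves died (which would normally show up in the logs)? If it is the former, look into the deployed applications. Java is like that; it is very resource-hungry.
Your WAS is the ND edition. I don't know whether you are running a cluster; if so, looking only at the logs under the app server may not give the full picture, since there are logs under several paths to check. You can refer to the following information:
WebSphere Application Server logs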
You need to look at the WebSphere Application Server log files when diagnosing
system management problems.
Stand-alone server
In a stand-alone server, the administrative console application and the
administrative MBeans run in the server process. So, you need to look at the logs
for that application server. The log files are:
<WAS_install_root>/profiles/<profile>/logs/<server>/SystemOut.log
<WAS_install_root>/profiles/<profile>/logs/<server>/SystemErr.log
Deployment manager
In a Network Deployment installation, system management involves more than
one application server process. So, you need to look at the logs for each
component. The deployment manager logs are:
<WAS_install_root>/profiles/<profile>/logs/dmgr/SystemOut.log
<WAS_install_root>/profiles/<profile>/logs/dmgr/SystemErr.log
Node agent
Node agent log files are:
<WAS_install_root>/profiles/<profile>/logs/nodeagent/SystemOut.log
<WAS_install_root>/profiles/<profile>/logs/nodeagent/SystemErr.log
(Quoted from "WebSphere Application Server V6: System Management Problem Determination")
Application server
With some problem types, you might also need to look at the logs for the
application server that you are trying to manage. These log files are:
<WAS_install_root>/profiles/<profile>/logs/<server>/SystemOut.log
<WAS_install_root>/profiles/<profile>/logs/<server>/SystemErr.log
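The path pattern quoted above is the same for every process type; only the directory under logs changes. As a rough illustration, assuming nothing beyond that pattern, a small Python helper can list the files to check for a given profile. The function name and the example arguments are hypothetical.

```python
from pathlib import PurePosixPath


def was_log_paths(install_root, profile, servers):
    """Build SystemOut.log/SystemErr.log paths for each process of a profile.

    Follows the pattern
    <WAS_install_root>/profiles/<profile>/logs/<server>/SystemOut.log
    where <server> is a server name, "dmgr", or "nodeagent".
    """
    base = PurePosixPath(install_root) / "profiles" / profile / "logs"
    paths = []
    for server in servers:
        for name in ("SystemOut.log", "SystemErr.log"):
            paths.append(str(base / server / name))
    return paths


if __name__ == "__main__":
    # Placeholder values; substitute the real install root, profile, and servers.
    for path in was_log_paths("/opt/WebSphere/AppServer", "AppSrv01",
                              ["server1", "nodeagent"]):
        print(path)
```

On a stand-alone profile the list would contain only the application server's own directory; in a Network Deployment cell the dmgr and nodeagent directories live under their own profiles.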
2007-1-23 12:56
xingyu820424
After restarting WAS, it hangs again within half an hour to an hour.
WAS itself starts up normally.
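Since the hang reportedly appears 30 to 60 minutes after a restart, it may help to record exactly when the server stops answering, so the logs can be read around that time. Below is a minimal Python sketch of such a watcher. The URL (port 9080 is assumed to be the HTTP port in use), the polling interval, and the failure threshold are all assumptions to adjust.

```python
import http.client
import time
import urllib.error
import urllib.request


def http_ok(url, timeout=10):
    """Return True if the server answers with any HTTP status in time."""
    try:
        with urllib.request.urlopen(url, timeout=timeout):
            return True
    except urllib.error.HTTPError:
        return True   # the server answered, even if with an error status
    except (OSError, http.client.HTTPException):
        return False  # refused, timed out, unreachable, or garbled reply


def watch(probe, interval=60, failures_needed=3,
          sleep=time.sleep, now=time.time):
    """Poll until `failures_needed` probes fail in a row.

    Returns the time of the first failure in that final run.
    """
    consecutive = 0
    first_failure = None
    while consecutive < failures_needed:
        if probe():
            consecutive = 0
            first_failure = None
        else:
            if consecutive == 0:
                first_failure = now()
            consecutive += 1
        if consecutive < failures_needed:
            sleep(interval)
    return first_failure


if __name__ == "__main__":
    # Hypothetical URL; point it at a page the application actually serves.
    started = watch(lambda: http_ok("http://localhost:9080/"))
    print("Stopped responding around", time.ctime(started))
```

Requiring several consecutive failures avoids treating one slow response as a hang.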
2007-1-23 13:05
xingyu820424
Startup log:
************ Start Display Current Environment ************
Host Operating System is AIX, version 5.3
Java version = J2RE 1.5.0 IBM J9 2.3 AIX ppc-32 j9vmap3223-20060504 (JIT enabled)
J9VM - 20060501_06428_bHdSMR
JIT - 20060428_1800_r8
GC - 20060501_AA, Java Compiler = j9jit23, Java VM name = IBM J9 VM
was.install.root = /opt/Web/AppServer
user.install.root = /opt/Web/AppServer/profiles/AppSrv01
Java Home = /opt/Web/AppServer/java/jre
ws.ext.dirs = /opt/Web/AppServer/java/lib:/opt/Web/AppServer/classes:/opt/Web/AppServer/lib:/opt/Web/AppServer/installedChannels:/opt/Web/AppServer/lib/ext:/opt/Web/AppServer/web/help:/opt/Web/AppServer/deploytool/itp/plugins/com.ibm.etools.ejbdeploy/runtime
Classpath = /opt/Web/AppServer/profiles/AppSrv01/properties:/opt/Web/AppServer/properties:/opt/Web/AppServer/lib/startup.jar:/opt/Web/AppServer/lib/bootstrap.jar:/opt/Web/AppServer/lib/j2ee.jar:/opt/Web/AppServer/lib/lmproxy.jar:/opt/Web/AppServer/lib/urlprotocols.jar:/opt/Web/AppServer/java/lib/tools.jar
Java Library path = /opt/Web/AppServer/java/jre/bin:/opt/Web/AppServer/java/jre/bin/j9vm:/opt/Web/AppServer/java/jre/bin:/opt/Web/AppServer/bin::/usr/lib
Current trace specification = *=info
************* End Display Current Environment *************
[1/21/07 22:29:21:055 GMT+08:00] 0000000a ManagerAdmin I TRAS0017I: The startup trace state is *=info.
[1/21/07 22:29:21:320 GMT+08:00] 0000000a AdminTool A ADMU0128I: Starting tool with the AppSrv01 profile
[1/21/07 22:29:21:324 GMT+08:00] 0000000a AdminTool A ADMU3100I: Reading configuration for server: server1
[1/21/07 22:29:33:435 GMT+08:00] 0000000a AdminTool A ADMU3200I: Server launched. Waiting for initialization status.
[1/21/07 22:30:49:156 GMT+08:00] 0000000a AdminTool A ADMU3000I: Server server1 open for e-business; process id is 29474
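The startup log carries two useful facts: how long the server took to initialise and which process id it got. As a small illustration, the Python sketch below pulls both out of log lines in this format. The function names are hypothetical, time zones are ignored because only differences within one log are computed, and records are assumed to be on one line each.

```python
import re
from datetime import datetime

STAMP = re.compile(
    r"^\[(\d{1,2}/\d{1,2}/\d{2} \d{1,2}:\d{2}:\d{2}:\d{3}) [^\]]+\]"
)
MESSAGE_ID = re.compile(r"\b([A-Z]{4,5}\d{4}[A-Z]):")
PROCESS_ID = re.compile(r"process id is (\d+)")


def parse_stamp(line):
    """Parse the leading '[M/D/YY H:MM:SS:mmm TZ]' timestamp, or return None."""
    match = STAMP.match(line)
    if not match:
        return None
    return datetime.strptime(match.group(1), "%m/%d/%y %H:%M:%S:%f")


def summarize_start(lines):
    """Return (seconds from ADMU3200I to ADMU3000I, process id)."""
    events = {}
    pid = None
    for line in lines:
        stamp = parse_stamp(line)
        message = MESSAGE_ID.search(line)
        if stamp and message:
            events[message.group(1)] = stamp
        found = PROCESS_ID.search(line)
        if found:
            pid = int(found.group(1))
    seconds = None
    if "ADMU3200I" in events and "ADMU3000I" in events:
        delta = events["ADMU3000I"] - events["ADMU3200I"]
        seconds = delta.total_seconds()
    return seconds, pid
```

For the log above, the interval between "Server launched" (22:29:33:435) and "open for e-business" (22:30:49:156) is about 76 seconds, and the process id of 29474 is the one to inspect with operating-system tools when the server later stops responding.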
2007-1-23 13:07
xingyu820424
Please help, everyone. Thank you.
This is my first time dealing with a WAS problem. Please give me some pointers.
2007-1-23 13:09
xingyu820424
Contents of stopServer.log:
************ Start Display Current Environment ************
Host Operating System is AIX, version 5.3
Java version = J2RE 1.5.0 IBM J9 2.3 AIX ppc-32 j9vmap3223-20060504 (JIT enabled)
J9VM - 20060501_06428_bHdSMR
JIT - 20060428_1800_r8
GC - 20060501_AA, Java Compiler = j9jit23, Java VM name = IBM J9 VM
was.install.root = /opt/Web/AppServer
user.install.root = /opt/Web/AppServer/profiles/AppSrv01
Java Home = /opt/Web/AppServer/java/jre
ws.ext.dirs = /opt/Web/AppServer/java/lib:/opt/Web/AppServer/classes:/opt/Web/AppServer/lib:/opt/Web/AppServer/installedChannels:/opt/Web/AppServer/lib/ext:/opt/Web/AppServer/web/help:/opt/Web/AppServer/deploytool/itp/plugins/com.ibm.etools.ejbdeploy/runtime
Classpath = /opt/Web/AppServer/profiles/AppSrv01/properties:/opt/Web/AppServer/properties:/opt/Web/AppServer/lib/startup.jar:/opt/Web/AppServer/lib/bootstrap.jar:/opt/Web/AppServer/lib/j2ee.jar:/opt/Web/AppServer/lib/lmproxy.jar:/opt/Web/AppServer/lib/urlprotocols.jar:/opt/Web/AppServer/java/lib/tools.jar
Java Library path = /opt/Web/AppServer/java/jre/bin:/opt/Web/AppServer/java/jre/bin/j9vm:/opt/Web/AppServer/java/jre/bin:/opt/Web/AppServer/bin::/usr/lib
Current trace specification = *=info
************* End Display Current Environment *************
[1/19/07 22:37:19:525 GMT+08:00] 0000000a ManagerAdmin I TRAS0017I: The startup trace state is *=info.
[1/19/07 22:37:19:900 GMT+08:00] 0000000a AdminTool A ADMU0128I: Starting tool with the AppSrv01 profile
[1/19/07 22:37:19:937 GMT+08:00] 0000000a AdminTool A ADMU3100I: Reading configuration for server: server1
[1/19/07 22:37:30:835 GMT+08:00] 0000000a SSLConfig W CWPKI0041W: One or more key stores are using the default password.
[1/19/07 22:37:30:865 GMT+08:00] 0000000a SSLConfigMana I CWPKI0027I: Disabling default hostname verification for HTTPS URL connections.
[1/19/07 22:37:32:727 GMT+08:00] 0000000a AdminTool A ADMU3201I: Server stop request issued. Waiting for stop status.
2007-1-23 20:07
dtbdtbdtb
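The hardware is very weak.
You don't know how to manage the software.
You don't know how to narrow the problem down.
In the end,
you need to read the documentation very carefully.
Your posts show that you really do not understand WAS.
If you understood it reasonably well,
I would just tell you.
But as things stand,
there is too much to say,
so I can't say it.
[[i] Last edited by dtbdtbdtb at 2007-1-23 20:08 [/i]]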
2007-1-24 12:06
xingyu820424
Heh, you are quite right, I don't understand WAS. Sorry about that.
My questions have made you angry.
2007-1-24 19:28
老农
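Angry, probably not, but it really is very hard to explain things to someone who doesn't know the product.
There ought to be someone knowledgeable in charge of this.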
2007-1-26 00:23
dtbdtbdtb
Hope you won't be angry,
at least not at me.
You know, it is really very hard to explain.
There is so much to say, to collect, to analyse, to isolate:
SystemOut.log
gc.log
HTTP log
Best of all, capture them at the moment of the hang.
Collect all the logs at that moment.
Collecting correctly is the first step.
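The advice to collect everything at the moment of the hang can be scripted, so that nothing is overwritten by a later restart. Here is a minimal Python sketch under stated assumptions: the directory names are placeholders, the helper names are hypothetical, and `request_thread_dump` relies on the general JVM convention that SIGQUIT (kill -3) triggers a thread dump (a javacore file on IBM JVMs) without stopping the process.

```python
import os
import shutil
import signal
import time
from pathlib import Path


def snapshot_logs(log_dirs, dest_root, label=None):
    """Copy every file in each log directory into one timestamped folder."""
    label = label or time.strftime("%Y%m%d-%H%M%S")
    dest = Path(dest_root) / f"hang-{label}"
    copied = []
    for log_dir in map(Path, log_dirs):
        if not log_dir.is_dir():
            continue  # skip directories that do not exist on this node
        target = dest / log_dir.name
        target.mkdir(parents=True, exist_ok=True)
        for entry in sorted(log_dir.iterdir()):
            if entry.is_file():
                shutil.copy2(entry, target / entry.name)  # keeps mtimes
                copied.append(target / entry.name)
    return dest, copied


def request_thread_dump(pid):
    """Send SIGQUIT (kill -3) so the JVM writes a thread dump (Unix only)."""
    os.kill(pid, signal.SIGQUIT)


if __name__ == "__main__":
    # Placeholder paths and pid; replace with the real profile and process.
    request_thread_dump(12345)
    time.sleep(30)  # give the JVM time to finish writing the dump
    dest, files = snapshot_logs(
        ["/opt/WebSphere/AppServer/profiles/AppSrv01/logs/server1"],
        "/tmp/was-snapshots",
    )
    print(f"Copied {len(files)} files to {dest}")
```

Taking the snapshot before restarting the server matters, because the interesting entries are the ones written just before and during the hang.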