Message Boards Message Boards

0
|
5056 Views
|
0 Replies
|
0 Total Likes
View groups...
Share
Share this post:

How to configure TimedOut limit

Posted 10 years ago
I wrote launch command for parallel computing on RPi cluster.But,I have some problems yet.
Environment:Master:Win7Slave:Raspberry Pi *4
LaunchSlave["root@192.168.1.100", "ssh -n `1` \"math -mathlink -linkprotocol TCP -linkmode Connect -linkname `2` >&/dev/null &\"", ConnectionType -> LinkCreate]LaunchSlave["root@192.168.1.101", "ssh -n `1` \"math -mathlink -linkprotocol TCP -linkmode Connect -linkname `2` >&/dev/null &\"", ConnectionType -> LinkCreate]LaunchSlave["root@192.168.1.102", "ssh -n `1` \"math -mathlink -linkprotocol TCP -linkmode Connect -linkname `2` >&/dev/null &\"", ConnectionType -> LinkCreate]LaunchSlave["root@192.168.1.103", "ssh -n `1` \"math -mathlink -linkprotocol TCP -linkmode Connect -linkname `2` >&/dev/null &\"", ConnectionType -> LinkCreate]
For example,run "LaunchKernels[RemoteMachine["PiCluster"]]"
Mathematica reply
In[2]:= LaunchKernels[RemoteMachine["PiCluster"]]
\:8A55\:4FA1\:4E2D\:FF1AIn[2]:= LaunchRemote::rsh: Command rsh hh21246-01 -n -l Yukihiro "math -mathlink -linkmode Connect -linkprotocol TCPIP -linkname 51449@192.168.1.30,51450@192.168.1.30 -subkernel -noinit >& /dev/null &" may have failed (exit code 1).
\:8A55\:4FA1\:4E2D\:FF1AIn[2]:= LaunchRemote::rsh: Command rsh hh21246-01 -n -l Yukihiro "math -mathlink -linkmode Connect -linkprotocol TCPIP -linkname 51451@192.168.1.30,51452@192.168.1.30 -subkernel -noinit >& /dev/null &" may have failed (exit code 1).
\:8A55\:4FA1\:4E2D\:FF1AIn[2]:= SubKernels`Protected`kernelFlush::time: Operation LinkWrite timed out after 15.` seconds.
\:8A55\:4FA1\:4E2D\:FF1AIn[2]:= KernelObject::rdead: Subkernel connected through remote[hh21246-01] appears dead. >>
\:8A55\:4FA1\:4E2D\:FF1AIn[2]:= SubKernels`Protected`kernelFlush::time: Operation LinkWrite timed out after 15.` seconds.
\:8A55\:4FA1\:4E2D\:FF1AIn[2]:= KernelObject::rdead: Subkernel connected through remote[hh21246-01] appears dead. >>
\:8A55\:4FA1\:4E2D\:FF1AIn[2]:= LaunchRemote::rsh: Command LaunchSlave["root@192.168.1.100", "ssh -n raspberrypi100 \"math -mathlink -linkprotocol TCP -linkmode Connect -linkname 51455@192.168.1.30,51456@192.168.1.30 >&/dev/null &\"", ConnectionType -> LinkCreate] may have failed (exit code 1).
\:8A55\:4FA1\:4E2D\:FF1AIn[2]:= General::stop: ???????LaunchRemote::rsh????????????????? >>
\:8A55\:4FA1\:4E2D\:FF1AIn[2]:= SubKernels`Protected`kernelFlush::time: Operation LinkWrite timed out after 15.` seconds.
\:8A55\:4FA1\:4E2D\:FF1AIn[2]:= General::stop: ???????SubKernels`Protected`kernelFlush::time????????????????? >>
\:8A55\:4FA1\:4E2D\:FF1AIn[2]:= KernelObject::rdead: Subkernel connected through remote[raspberrypi100] appears dead. >>
\:8A55\:4FA1\:4E2D\:FF1AIn[2]:= General::stop: ???????KernelObject::rdead????????????????? >>
Out[2]= $Failed
I think most important problem is about time out.I'm trying to solve the TimedOut problem.

May be this problem relateĀ 

http://mathematica.stackexchange.com/questions/612/connecting-to-and-disconnecting-from-a-continuously-running-kernel-on-demand/708#708

In particular Operation LinkWrite timed out after 15.` seconds.

Who Know change timed out from15 to 60 seconds.
Please suggest and help me.

Thank you.
POSTED BY: Yukihiro KOBUNA
Reply to this discussion
Community posts can be styled and formatted using the Markdown syntax.
Reply Preview
Attachments
Remove
or Discard

Group Abstract Group Abstract