Re: problem with ceph init scripts

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Sorry, we kind of lost this in the shuffle!

Did you manage to resolve this issue? From what I see in your steps below, it looks like you're doing things in the wrong order -- mkcephfs should always be the first thing you do. If you really have nodes running before you do mkcephfs I suspect you just got your cluster into an inconsistent state and the (fairly fragile) methods it uses to shut down all the daemons got broken.
-Greg
On Friday, February 18, 2011 at 6:11 AM, Upendra Moturi wrote: 
> Hi Colin
> 
> I am using the ubuntu 11.04 (32 bit) and got the ceph package from apt-get
> i am using the default init script.(Found at /etc/init.d/ceph)
> 
> Regarding issue 1)
> 
> On the osd node I tried to start that osd but it did not work
> Steps followed
> 
> 1) Started 3 nodes(ceph.conf is same as i sent earlier)
> 
> 2) mkcephfs -c /etc/ceph/ceph.conf -a --mkbtrfs -k /etc/ceph/keyring.bin
> 
> 3)/etc/init.d/ceph start osd0 (Did this on first osd)
> 
> 4) ps -ef | grep ceph or ps -ef | grep cosd ---- does not show any process
> 
> 5)/etc/init.d/ceph -a start ---- shows all process on all nodes
> 
> Regading Issue 2
> It says -x option is not available
> 
> Tried /etc/init.d/ceph -ax stop and /etc/init.d/ceph -x stop and even
> tried /etc/init.d/ceph -x -a stop
> but nothing worked.
> 
> Then tried with /etc/init.d/ceph -a -v stop this also did not stop
> ceph on all nodes but got the output as
> 
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 0 -t mon "auto start" ""
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 0 -t mon "user" ""
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 0 -t mon "ssh path" "/etc/ceph"
> === mon.0 ===
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 0 -t mon "pid file"
> "/var/run/ceph/mon.0.pid"
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 0 -t mon "log dir" "/var/log/ceph"
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 0 -t mon "log sym dir" ""
> --- ssh ceph0 "cd /etc/ceph ; ulimit -c unlimited ; mkdir -p /var/log/ceph"
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 0 -t mon "lock file"
> "/var/lock/subsys/ceph"
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 0 -t mon "pre stop command" ""
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 0 -t mon "post stop command" ""
> Stopping Ceph mon.0 on ceph0...--- ssh ceph0 "cd /etc/ceph ; ulimit
> -c unlimited ; while [ 1 ]; do
>  [ -e /var/run/ceph/mon.0.pid ] || break
>  pid=`cat /var/run/ceph/mon.0.pid`
>  while [ -e /proc/$pid ] && grep -q cmon /proc/$pid/cmdline ; do
>  cmd="kill $pid"
>  echo -n $cmd...
>  $cmd
>  sleep 1
>  continue
>  done
>  break
>  done"
> kill 1668...done
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 1 -t mon "auto start" ""
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 1 -t mon "user" ""
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 1 -t mon "ssh path" "/etc/ceph"
> === mon.1 ===
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 1 -t mon "pid file"
> "/var/run/ceph/mon.1.pid"
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 1 -t mon "log dir" "/var/log/ceph"
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 1 -t mon "log sym dir" ""
> --- ssh ceph1 "cd /etc/ceph ; ulimit -c unlimited ; mkdir -p /var/log/ceph"
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 1 -t mon "lock file"
> "/var/lock/subsys/ceph"
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 1 -t mon "pre stop command" ""
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 1 -t mon "post stop command" ""
> Stopping Ceph mon.1 on ceph1...--- ssh ceph1 "cd /etc/ceph ; ulimit
> -c unlimited ; while [ 1 ]; do
>  [ -e /var/run/ceph/mon.1.pid ] || break
>  pid=`cat /var/run/ceph/mon.1.pid`
>  while [ -e /proc/$pid ] && grep -q cmon /proc/$pid/cmdline ; do
>  cmd="kill $pid"
>  echo -n $cmd...
>  $cmd
>  sleep 1
>  continue
>  done
>  break
>  done"
> done
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 2 -t mon "auto start" ""
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 2 -t mon "user" ""
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 2 -t mon "ssh path" "/etc/ceph"
> === mon.2 ===
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 2 -t mon "pid file"
> "/var/run/ceph/mon.2.pid"
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 2 -t mon "log dir" "/var/log/ceph"
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 2 -t mon "log sym dir" ""
> --- ssh ceph2 "cd /etc/ceph ; ulimit -c unlimited ; mkdir -p /var/log/ceph"
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 2 -t mon "lock file"
> "/var/lock/subsys/ceph"
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 2 -t mon "pre stop command" ""
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 2 -t mon "post stop command" ""
> Stopping Ceph mon.2 on ceph2...--- ssh ceph2 "cd /etc/ceph ; ulimit
> -c unlimited ; while [ 1 ]; do
>  [ -e /var/run/ceph/mon.2.pid ] || break
>  pid=`cat /var/run/ceph/mon.2.pid`
>  while [ -e /proc/$pid ] && grep -q cmon /proc/$pid/cmdline ; do
>  cmd="kill $pid"
>  echo -n $cmd...
>  $cmd
>  sleep 1
>  continue
>  done
>  break
>  done"
> done
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 0 -t mds "auto start" ""
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 0 -t mds "user" ""
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 0 -t mds "ssh path" "/etc/ceph"
> === mds.0 ===
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 0 -t mds "pid file"
> "/var/run/ceph/mds.0.pid"
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 0 -t mds "log dir" "/var/log/ceph"
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 0 -t mds "log sym dir" ""
> --- ssh ceph0 "cd /etc/ceph ; ulimit -c unlimited ; mkdir -p /var/log/ceph"
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 0 -t mds "lock file"
> "/var/lock/subsys/ceph"
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 0 -t mds "pre stop command" ""
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 0 -t mds "post stop command" ""
> Stopping Ceph mds0 on ceph0...--- ssh ceph0 "cd /etc/ceph ; ulimit -c
> unlimited ; while [ 1 ]; do
>  [ -e /var/run/ceph/mds.0.pid ] || break
>  pid=`cat /var/run/ceph/mds.0.pid`
>  while [ -e /proc/$pid ] && grep -q cmds /proc/$pid/cmdline ; do
>  cmd="kill $pid"
>  echo -n $cmd...
>  $cmd
>  sleep 1
>  continue
>  done
>  break
>  done"
> kill 1844...done
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 1 -t mds "auto start" ""
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 1 -t mds "user" ""
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 1 -t mds "ssh path" "/etc/ceph"
> === mds.1 ===
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 1 -t mds "pid file"
> "/var/run/ceph/mds.1.pid"
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 1 -t mds "log dir" "/var/log/ceph"
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 1 -t mds "log sym dir" ""
> --- ssh ceph1 "cd /etc/ceph ; ulimit -c unlimited ; mkdir -p /var/log/ceph"
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 1 -t mds "lock file"
> "/var/lock/subsys/ceph"
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 1 -t mds "pre stop command" ""
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 1 -t mds "post stop command" ""
> Stopping Ceph mds1 on ceph1...--- ssh ceph1 "cd /etc/ceph ; ulimit -c
> unlimited ; while [ 1 ]; do
>  [ -e /var/run/ceph/mds.1.pid ] || break
>  pid=`cat /var/run/ceph/mds.1.pid`
>  while [ -e /proc/$pid ] && grep -q cmds /proc/$pid/cmdline ; do
>  cmd="kill $pid"
>  echo -n $cmd...
>  $cmd
>  sleep 1
>  continue
>  done
>  break
>  done"
> done
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 0 -t osd "auto start" ""
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 0 -t osd "user" ""
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 0 -t osd "ssh path" "/etc/ceph"
> === osd.0 ===
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 0 -t osd "pid file"
> "/var/run/ceph/osd.0.pid"
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 0 -t osd "log dir" "/var/log/ceph"
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 0 -t osd "log sym dir" ""
> --- ssh ceph0 "cd /etc/ceph ; ulimit -c unlimited ; mkdir -p /var/log/ceph"
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 0 -t osd "osd data" ""
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 0 -t osd "btrfs path" "/data/osd0"
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 0 -t osd "btrfs devs" ""
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 0 -t osd "lock file"
> "/var/lock/subsys/ceph"
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 0 -t osd "pre stop command" ""
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 0 -t osd "post stop command" ""
> Stopping Ceph osd0 on ceph0...--- ssh ceph0 "cd /etc/ceph ; ulimit -c
> unlimited ; while [ 1 ]; do
>  [ -e /var/run/ceph/osd.0.pid ] || break
>  pid=`cat /var/run/ceph/osd.0.pid`
>  while [ -e /proc/$pid ] && grep -q cosd /proc/$pid/cmdline ; do
>  cmd="kill $pid"
>  echo -n $cmd...
>  $cmd
>  sleep 1
>  continue
>  done
>  break
>  done"
> kill 2033...done
> Unmounting Btrfs on ceph0:/data/osd0
> --- ssh root@ceph0 "cd /etc/ceph ; ulimit -c unlimited ; umount
> /data/osd0 || true"
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 1 -t osd "auto start" ""
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 1 -t osd "user" ""
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 1 -t osd "ssh path" "/etc/ceph"
> === osd.1 ===
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 1 -t osd "pid file"
> "/var/run/ceph/osd.1.pid"
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 1 -t osd "log dir" "/var/log/ceph"
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 1 -t osd "log sym dir" ""
> --- ssh ceph1 "cd /etc/ceph ; ulimit -c unlimited ; mkdir -p /var/log/ceph"
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 1 -t osd "osd data" ""
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 1 -t osd "btrfs path" "/data/osd1"
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 1 -t osd "btrfs devs" ""
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 1 -t osd "lock file"
> "/var/lock/subsys/ceph"
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 1 -t osd "pre stop command" ""
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 1 -t osd "post stop command" ""
> Stopping Ceph osd1 on ceph1...--- ssh ceph1 "cd /etc/ceph ; ulimit -c
> unlimited ; while [ 1 ]; do
>  [ -e /var/run/ceph/osd.1.pid ] || break
>  pid=`cat /var/run/ceph/osd.1.pid`
>  while [ -e /proc/$pid ] && grep -q cosd /proc/$pid/cmdline ; do
>  cmd="kill $pid"
>  echo -n $cmd...
>  $cmd
>  sleep 1
>  continue
>  done
>  break
>  done"
> done
> Unmounting Btrfs on ceph1:/data/osd1
> --- ssh root@ceph1 "cd /etc/ceph ; ulimit -c unlimited ; umount
> /data/osd1 || true"
> umount: /data/osd1: device is busy.
>  (In some cases useful info about processes that use
>  the device is found by lsof(8) or fuser(1))
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 2 -t osd "auto start" ""
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 2 -t osd "user" ""
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 2 -t osd "ssh path" "/etc/ceph"
> === osd.2 ===
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 2 -t osd "pid file"
> "/var/run/ceph/osd.2.pid"
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 2 -t osd "log dir" "/var/log/ceph"
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 2 -t osd "log sym dir" ""
> --- ssh ceph2 "cd /etc/ceph ; ulimit -c unlimited ; mkdir -p /var/log/ceph"
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 2 -t osd "osd data" ""
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 2 -t osd "btrfs path" "/data/osd2"
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 2 -t osd "btrfs devs" ""
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 2 -t osd "lock file"
> "/var/lock/subsys/ceph"
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 2 -t osd "pre stop command" ""
> /usr/bin/cconf -c /etc/ceph/ceph.conf -i 2 -t osd "post stop command" ""
> Stopping Ceph osd2 on ceph2...--- ssh ceph2 "cd /etc/ceph ; ulimit -c
> unlimited ; while [ 1 ]; do
>  [ -e /var/run/ceph/osd.2.pid ] || break
>  pid=`cat /var/run/ceph/osd.2.pid`
>  while [ -e /proc/$pid ] && grep -q cosd /proc/$pid/cmdline ; do
>  cmd="kill $pid"
>  echo -n $cmd...
>  $cmd
>  sleep 1
>  continue
>  done
>  break
>  done"
> done
> Unmounting Btrfs on ceph2:/data/osd2
> --- ssh root@ceph2 "cd /etc/ceph ; ulimit -c unlimited ; umount
> /data/osd2 || true"
> umount: /data/osd2: device is busy.
>  (In some cases useful info about processes that use
>  the device is found by lsof(8) or fuser(1))
> 
> 
> 
> 
> On Fri, Feb 18, 2011 at 12:35 PM, Colin McCabe <cmccabe@xxxxxxxxxxxxxx> wrote:
> > Hi Upendra,
> > 
> > Are you running init-ceph from the source directory? If you do that,
> > it will use the ceph.conf in the source directory itself, which is
> > probably not what you want. So it might be good to double-check that.
> > 
> > If all else fails, running init-ceph with -x will show you exactly
> > what the script is doing. If all goes well, its exit status should be
> > 0. Are you getting exit status 0?
> > 
> > Colin
> > 
> > 
> > On Thu, Feb 17, 2011 at 1:55 AM, Upendra Moturi <upendra.m@xxxxxxxxxxxx> wrote:
> > > Hi Colin
> > > 
> > > Here is my ceph.conf :
> > > 
> > > [global]
> > >  pid file = /var/run/ceph/$name.pid
> > >  debug ms = 1
> > > [mon]
> > > mon data = /data/mon$id
> > > [mon.0]
> > > host = ceph0
> > > mon addr = 192.168.155.5:6789
> > > [mon.1]
> > > host = ceph1
> > > mon addr = 192.168.155.6:6789
> > > [mon.2]
> > > host = ceph2
> > > mon addr = 192.168.155.7:6789
> > > [mds]
> > > 
> > > [mds0]
> > > host = ceph0
> > > [mds1]
> > > host = ceph1
> > > 
> > > [osd]
> > > sudo = true
> > > osd data = /data/osd$id
> > > osd journal = /data/osd$id/journal
> > > osd journal size = 512
> > > osd use stale snap = true
> > > [osd0]
> > > host = ceph0
> > > btrfs devs = /dev/sdb
> > > [osd1]
> > > host = ceph1
> > > btrfs devs = /dev/sdb
> > > [osd2]
> > > host = ceph2
> > > btrfs devs = /dev/sdb
> > > 
> > > 
> > > 
> > > 
> > > On Thu, Feb 17, 2011 at 1:06 PM, Colin McCabe <cmccabe@xxxxxxxxxxxxxx> wrote:
> > > > I'm using head of line from the master branch. But that particular
> > > > code hasn't changed since January, which is 0.24.2 is from.
> > > > 
> > > > In my ceph.conf, I just had an osd that was on a remote machine, and
> > > > everything else local.
> > > > 
> > > > If you could post your ceph.conf here or in IRC, perhaps we might spot
> > > > an issue that's causing the problems that you see.
> > > > 
> > > > Colin
> > > > 
> > > > 
> > > > On Wed, Feb 16, 2011 at 11:01 PM, Upendra Moturi <upendra.m@xxxxxxxxxxxx> wrote:
> > > > > Hi Colin
> > > > > I am using
> > > > > ceph version 0.24.2 commit:f7572de5cb87eb7157217be4975ae66d90831bb7
> > > > > ubuntu 11.04 32 bit with upgraded kernal of 2.6.38-2-generic
> > > > > 
> > > > > Installed ceph form apt source.
> > > > > 
> > > > > With above configurations i still able yo reproduce.
> > > > > Can you please share me ur configurations?
> > > > > 
> > > > > On Thu, Feb 17, 2011 at 3:31 AM, Colin McCabe <cmccabe@xxxxxxxxxxxxxx> wrote:
> > > > > > On Wed, Feb 16, 2011 at 1:41 PM, Colin McCabe <cmccabe@xxxxxxxxxxxxxx> wrote:
> > > > > > > On Wed, Feb 16, 2011 at 6:44 AM, Upendra Moturi <upendra.m@xxxxxxxxxxxx> wrote:
> > > > > > > > But if we want to start a particular osd or mon or mds ,its not
> > > > > > > > working and there is no error
> > > > > > > > eg:/etc/init.d/ceph start osd1 does not start osd1 and don't get any error
> > > > > > > 
> > > > > > > That is expected, unless you are running init-ceph on the same node as
> > > > > > > osd1 is on.
> > > > > > > 
> > > > > > > It might be nice to have some kind of interface like "run command X on
> > > > > > > osd1", but init-ceph is not that.
> > > > > > > 
> > > > > > > > /etc/init.d/ceph -a stop also does not stop ceph on all nodes.It stops
> > > > > > > > on current node only
> > > > > > > > where as
> > > > > > > > /etc/init.d/ceph -a killall works fine.
> > > > > > > 
> > > > > > > That sounds like a bug. I'll see if I can fix it.
> > > > > > 
> > > > > > I'm afraid I can't reproduce this.
> > > > > > 
> > > > > > I ran /etc/init.d/ceph -a stop
> > > > > > 
> > > > > > and it stopped ceph daemons running on remote nodes too. Looking at
> > > > > > the code, it looks correct.
> > > > > > 
> > > > > > Colin
> > > > > 
> > > > > 
> > > > > 
> > > > > --
> > > > > Thanks and Regards,
> > > > > Upendra.M
> > > 
> > > 
> > > 
> > > --
> > > Thanks and Regards,
> > > Upendra.M
> 
> 
> 
> -- 
> Thanks and Regards,
> Upendra.M
> --
> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
> the body of a message to majordomo@xxxxxxxxxxxxxxx
> More majordomo info at http://vger.kernel.org/majordomo-info.html
> 

--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html


[Index of Archives]     [CEPH Users]     [Ceph Large]     [Information on CEPH]     [Linux BTRFS]     [Linux USB Devel]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]
  Powered by Linux