Hi Sage, 2010/9/23 Sage Weil <sage@xxxxxxxxxxxx>: > On Wed, 22 Sep 2010, Sage Weil wrote: >> On Wed, 22 Sep 2010, cang lin wrote: >> > > When I follow the instruction of >> > > http://ceph.newdream.net/wiki/Monitor_cluster_expansion to expand a >> > > monitor to ceph02, the following error occurred: >> > > > >> > > > root@ceph02:~# /etc/init.d/ceph start mon1 >> > > > [/etc/ceph/fetch_config/tmp/fetched.ceph.conf.14210] ceph.conf 100% 2565 >> > > 2.5KB/s 00:00 >> > > > === mon.1 === >> > > > Starting Ceph mon1 on ceph02... >> > > > ** WARNING: Ceph is still under heavy development, and is only suitable >> > > for ** >> > > > ** testing and review. Do not trust it with important data. ** >> > > > terminate called after throwing an instance of 'std::logic_error' >> > > > what(): basic_string::_S_construct NULL not valid >> > > > Aborted (core dumped) >> > > > failed: ' /usr/bin/cmon -i 1 -c /tmp/fetched.ceph.conf.14210 ' >> > > >> > > I haven't seen that crash, but it looks like a std::string constructor is >> > > being passed a NULL pointer. Do you have a core dump (to get a >> > > backtrace)? Which version are you running (`cmon -v`)? >> > > >> > >> > The cmon version is v0.21.1 when the crash happen and been updated to >> > v0.21.2. >> > >> > The following backtrace is from v0.21.2: >> >> Thanks, we'll see if we can reproduce and fix this one! > > Ok, this one is fixed by commit 79b6f2f9e9dd70704644338c968f9ad070e5a8f8 > in the testing branch. It actually should be printing an error that the > 'magic' file is missing from the mon data directory. Maybe you skipped > step #3 on the monitor cluster expansion page? > > Thanks- > sage > Yes, I understood wrong to the step 3 in the first test , not the host name included in the path: root@ceph01:/ # rsync -av /mnt/ceph/data/mon0/ /mnt/ceph/data/mon1 root@ceph01:/ # echo 1 > /mnt/ceph/data/mon1/whoami After finding the mistake, I added the host name in the path: root@ceph01:/ # rsync -av /mnt/ceph/data/mon0/ ceph02:/mnt/ceph/data/mon1 I forget to do the following command on the ceph02: root@ceph02:/ # echo 1 > /mnt/ceph/data/mon1/whoami The third test was modified to: root@ceph01:/ # rsync -av /mnt/ceph/data/mon0/ ceph02:/mnt/ceph/data/mon1 root@ceph02:/ # echo 1 > /mnt/ceph/data/mon1/whoami But still error. I do not know the practice, or the system to remember the wrong path? Thinks! lin -- To unsubscribe from this list: send the line "unsubscribe ceph-devel" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html