=?eucgb2312_cn?q?=BB=D8=B8=B4=3A_Cephfs_-_MDS_all_up=3Astandby=2C_not_becoming_up=3Aactive?=

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



> Did you run the command I suggested before or after you executed `rmfailed` below?

I run “rmfailed” before reading your mail. Then I got MON crashed. I fixed the crash by setting max_mds=2. Then I tried the command you suggested.

By reading the code[1], I think I really need to undo the “rmfailed” to get my MDS out of standby state.

> I will compile an addfailed command in a branch but you'll need to download the packages and run it.

Recompile can be hard, I’m not familiar with the procedure. Now I’m going to modify the logic of[2] a little bit with gdb to insert the failed rank.

> Please be careful running hidden/debugging commands.

I will definitely be more careful in the future. Thanks again for your help.

[1]: https://github.com/ceph/ceph/blob/v16.2.6/src/mon/MDSMonitor.cc#L2238
[2]: https://github.com/ceph/ceph/blob/v16.2.6/src/mds/FSMap.cc#L1031

> What was the crash?

ceph version 16.2.6 (ee28fb57e47e9f88813e24bbf4c14496ca299d31) pacific (stable)
1: /lib64/libpthread.so.0(+0x12b20) [0x7f7e9e32cb20]
2: gsignal()
3: abort()
4: /lib64/libstdc++.so.6(+0x9009b) [0x7f7e9d94509b]
5: /lib64/libstdc++.so.6(+0x9653c) [0x7f7e9d94b53c]
6: /lib64/libstdc++.so.6(+0x96597) [0x7f7e9d94b597]
7: /lib64/libstdc++.so.6(+0x967f8) [0x7f7e9d94b7f8]
8: /lib64/libstdc++.so.6(+0x9204b) [0x7f7e9d94704b]
9: (MDSMonitor::maybe_resize_cluster(FSMap&, int)+0xb7f) [0x558c2f6d72ff]
10: (MDSMonitor::tick()+0x161) [0x558c2f6d9cd1]
11: (MDSMonitor::on_active()+0x2c) [0x558c2f6c34bc]
12: (PaxosService::_active()+0x1f5) [0x558c2f5fd865]
13: (Context::complete(int)+0xd) [0x558c2f4eaead]
14: (void finish_contexts<std::__cxx11::list<Context*, std::allocator<Context*> > >(ceph::common::CephContext*, …
15: (Paxos::finish_round()+0x169) [0x558c2f5f4139]
16: (Paxos::commit_finish()+0x8c0) [0x558c2f5f6c60]
17: (C_Committed::finish(int)+0x45) [0x558c2f5fa885]
18: (Context::complete(int)+0xd) [0x558c2f4eaead]
19: (MonitorDBStore::C_DoTransaction::finish(int)+0x98) [0x558c2f5fa5a8]
20: (Context::complete(int)+0xd) [0x558c2f4eaead]
21: (Finisher::finisher_thread_entry()+0x1a5) [0x7f7ea068d6d5]
22: /lib64/libpthread.so.0(+0x814a) [0x7f7e9e32214a]
23: clone()
_______________________________________________
ceph-users mailing list -- ceph-users@xxxxxxx
To unsubscribe send an email to ceph-users-leave@xxxxxxx




[Index of Archives]     [Information on CEPH]     [Linux Filesystem Development]     [Ceph Development]     [Ceph Large]     [Ceph Dev]     [Linux USB Development]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]     [xfs]


  Powered by Linux