On Thu, Aug 17, 2017 at 2:27 AM, Neha Ojha <nojha@xxxxxxxxxx> wrote: > Hi John, > > I am working on the following tracker issue: > http://tracker.ceph.com/issues/20995. > I was wondering if you could help me figure out a couple of things - > > 1. What should be the expected output of only using ceph -w? Adding > --watch-channel=* logs things properly, but as for ceph -w alone, I am > unable to understand why I see very few log messages(in some cases > none). "ceph -w" just follows the cluster log at "info" severity, so if there is nothing going to the cluster log, the command doesn't output anything. It used to also include the audit log, but that is now filtered out by default, because it's usually not interesting (the administrator doesn't need to be told what they just typed), and because it's very hard to read (the log lines are basically JSON dumps). All that content is still going to the log file, it's just not visible by default in ceph -w. This is causing some confusion, because historically we had code that wrote the PG status to the cluster log continuously (every 5 seconds or so), so people are accustomed to always seeing something. Also the old documentation talked about this command "watching the cluster's events" which created the impression that this was something other than a log tailing command. > 2. For which cases would it show log messages? Like for example, for > "ceph osd out" it does, but not for "ceph osd in", "ceph osd pool > create" etc. In general you'll see messages about bad or unexpected things (every failing health check now comes with log messages). For actions that are done by the administrator, the audit log is still present (if hidden by default). If you look at the log at "debug" level of severity (--watch-debug), you can still see the summary prints of the cluster maps (excluding the pg map) when they change. The motivation behind all this is to have a cluster log that tells a somewhat human-readable story about failures and recovery. There is certainly scope for adding in some more logging at carefully chosen points, such as: - if a user marks an OSD out which was down, we could log that to indicate that the user pre-empted the timeout-based "out" path. - when a pool is created we could print a message to say that it is being created, and then when all its PGs are out of "creating" we could print another message to that effect. The key thing is to make sure the messages are intelligible to non-expert users and that they are not overwhelming: there are a set of guidelines here http://docs.ceph.com/docs/master/dev/logging/ John > I would really appreciate your help on this issue. > > Thanks, > Neha -- To unsubscribe from this list: send the line "unsubscribe ceph-devel" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html