> -----Original Message----- > From: Christoph Hellwig <hch@xxxxxx> > Sent: Monday, October 14, 2024 9:47 AM > To: Javier Gonzalez <javier.gonz@xxxxxxxxxxx> > Cc: Christoph Hellwig <hch@xxxxxx>; Jens Axboe <axboe@xxxxxxxxx>; Keith Busch > <kbusch@xxxxxxxxxx>; Martin K. Petersen <martin.petersen@xxxxxxxxxx>; Kanchan > Joshi <joshi.k@xxxxxxxxxxx>; hare@xxxxxxx; sagi@xxxxxxxxxxx; > brauner@xxxxxxxxxx; viro@xxxxxxxxxxxxxxxxxx; jack@xxxxxxx; jaegeuk@xxxxxxxxxx; > bcrl@xxxxxxxxx; dhowells@xxxxxxxxxx; bvanassche@xxxxxxx; > asml.silence@xxxxxxxxx; linux-nvme@xxxxxxxxxxxxxxxxxxx; linux- > fsdevel@xxxxxxxxxxxxxxx; io-uring@xxxxxxxxxxxxxxx; linux-block@xxxxxxxxxxxxxxx; > linux-aio@xxxxxxxxx; gost.dev@xxxxxxxxxxx; vishak.g@xxxxxxxxxxx > Subject: Re: [PATCH v7 0/3] FDP and per-io hints > > On Mon, Oct 14, 2024 at 07:02:11AM +0000, Javier Gonzalez wrote: > > > And exactly that is the problem. For file systems we can't support > > > that sanely. So IFF you absolutely want the per-I/O hints we need > > > an opt in by the file operations. I've said that at least twice > > > in this discussion before, but as everyone likes to have political > > > discussions instead of technical ones no one replied to that. > > > > Is it a way forward to add this in a new spin of the series - keeping the > > temperature mapping on the NVMe side? > > What do you gain from that? NVMe does not understand data temperatures, > so why make up that claim? I expressed this a couple of times in this thread. It is no problem to map temperatures to a protocol that does not understand the semantics. It is not perfect, and in time I agree that we would benefit from exposing the raw hints without semantics from Block layer upwards. But the temperatures are already there. We are not adding anything new. And thanks to this, we can enable existing users with _minimal_ changes to user-space. As pointed on the XFS zoned discussions, this is very similar to what you guys are doing (exactly to re-use what it is already there), and it works. On top of this, applications that already understand temperatures (e.g., RocksDB) will be able to leverage FDP SSDs without changes. > Especially as it directly undermindes any file system work to actually make use of it. I do not think it does. If a FS wants to use the temperatures, then they would be able to leverage FDP besides SCSI. And if we come up with a better interface later on, we can make the changes then. I really do not see the issue. If we were adding a temperature abstraction now, I would agree with You that we would need to cover the use-case you mention for FSs from the beginning, but this Is already here. Seems like a fair compromise to support current users. > > > If not, what would be acceptable for a first version, before getting into adding > > a new interface to expose agnostic hints? > > Just iterate on Kanchan's series for a block layer (and possible user level, > but that's less urgent) interface for stream separation? We can work on this later, but it is not that easy. Look at the mail I forwarded Form Christian. We now can store hints in the same structure as the temperatures. I believe that we would be able to support 128 hints. If we are serious about supporting hints as a general interface, 128 hints is not nearly enough. Moreover: - How do we convince VFS folks to give us more space for hints at this point? - How do we make agnostic hints and temperature co-exist? Do we use a bit to select and create a protocol? This seems over-complicated - When we are exposing general hints to the block layer, how do we support N:N FS:Block_Devices? and DMs? This is not obvious, and it goes well beyond what we need today, but we probably need to think about it. And these are just questions that we know. You see that this is a big lift (which we can work together on!), but will delay current users by months. And honestly, I do not feel comfortable doing this alone; we need application and FS folks like you to help guide this so that it is really usable. I say it again: Your idea about properly supporting hints in FS is good, and I think we should work on it. But I do not believe that the current patches (with added things like opt in file ops) get in the way of this. So if the disagreement is if these patches are harmful, let's talk about this explicitly and try to agree; we do not need to go over the need for a new hint interface. We already agree on this.