On Sat, Feb 16, 2019 at 04:31:33PM +1100, Dave Chinner wrote:
> On Fri, Feb 15, 2019 at 10:57:12AM +0100, Johannes Thumshirn wrote:
> > (This is a joint proposal with Hannes Reinecke)
> >
> > Servers with NV-DIMM are slowly emerging in data centers but one key feature
> > for reliability of these systems hasn't been addressed up to now, data
> > redundancy.
> >
> > While it would be best to solve this issue in the memory controller of the CPU
> > itself, I don't see this coming in the next few years. This puts the burden on
> > us as the OS to create the redundant copies of data for the users.
> >
> > If we leave off DAX support, Linux's software RAID implementations (MD,
> > device-mapper and BTRFS RAID) do already work on top of pmem devices, but they
> > are incompatible with DAX.
> >
> > In this session Hannes and I would like to discuss possible ways in which we
> > as an operating system can mitigate these issues for our users.
>
> We've supported this since mid 2018 and commit ba23cba9b3bd ("fs:
> allow per-device dax status checking for filesystems"). That is,
> we can have DAX on the XFS RT device independently of the data device.
>
> That is, you set up pmem in three segments - two small identical
> segments that get mirrored with RAID1 as the data device, and
> the remainder as a block device that is dax capable set up as the
> XFS realtime device. Set the RTINHERIT bit on the root directory at
> mkfs time ("-d rtinherit=1") and then all the data goes to the DAX
> capable realtime device, and all the metadata goes to the software
> raided pmem block devices that aren't DAX capable.
>
> Problem already solved, yes?

Sorry, this was meant to be a reply to Dan's email commenting about
some people needing mirrored metadata, not the parent that was
talking about whole device RAID...

i.e. mirrored metadata w/ FS-DAX for data should already be a solved
problem...

Cheers,

Dave.
> Cheers,
>
> Dave.
> --
> Dave Chinner
> david@xxxxxxxxxxxxx

--
Dave Chinner
david@xxxxxxxxxxxxx
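
For reference, the three-segment setup described above could be sketched
roughly as follows. This is only an illustration, not a tested recipe: the
device names (/dev/pmem0, /dev/pmem1, /dev/pmem2), the array name /dev/md0
and the mount point are assumptions, as is the use of mdadm for the RAID1
mirror. The "dax" mount option is the spelling I would expect on kernels of
that era; newer kernels may want a different one. By default the script only
prints the commands it would run.

```shell
#!/bin/sh
# Sketch only: all device names, the array name and the mount point are
# hypothetical. With DRY_RUN=1 (the default) commands are printed, not run.
set -eu

META_A=${META_A:-/dev/pmem0}   # small segment, RAID1 leg 1 (assumed name)
META_B=${META_B:-/dev/pmem1}   # small segment, RAID1 leg 2 (assumed name)
RT_DEV=${RT_DEV:-/dev/pmem2}   # remainder, DAX capable (assumed name)
MD_DEV=${MD_DEV:-/dev/md0}     # mirrored data device (assumed name)
MNT=${MNT:-/mnt/pmem}          # mount point (assumed)
DRY_RUN=${DRY_RUN:-1}

# Print the command in dry-run mode, otherwise execute it.
run() {
    if [ "$DRY_RUN" = 1 ]; then
        echo "$*"
    else
        "$@"
    fi
}

setup_mirrored_meta_dax() {
    # Mirror the two small segments; the result is the XFS data device,
    # which ends up holding the metadata.
    run mdadm --create "$MD_DEV" --level=1 --raid-devices=2 "$META_A" "$META_B"
    # rtinherit=1 sets RTINHERIT on the root directory, so new files
    # place their data on the realtime device.
    run mkfs.xfs -d rtinherit=1 -r rtdev="$RT_DEV" "$MD_DEV"
    # Mount with the realtime device attached and DAX requested.
    run mount -o rtdev="$RT_DEV",dax "$MD_DEV" "$MNT"
}

setup_mirrored_meta_dax
```

Running it with DRY_RUN=0 as root would actually create the array and the
filesystem, so check the device names first.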