On Sun, Feb 04, 2018 at 12:10:58PM +0100, Dmitry Vyukov wrote: > > > > To get memory corruption it's actually sufficient just to submit "1-byte" reads; > > there's no need for the SG_NEXT_CMD_LEN ioctl or anything: > > > > #include <fcntl.h> > > #include <unistd.h> > > > > int main() > > { > > int fd = open("/dev/sg0", O_RDWR); > > char buf[43] = { [36] = 0x08 /* READ_6 */ }; > > > > for (;;) > > write(fd, buf, sizeof(buf)); > > } > > > > (where /dev/sg0 is the default QEMU disk type, "82371SB PIIX3 IDE") > > > > The SCSI command descriptor block is the 6 bytes at indices 36-41, so index 42 > > is the only data byte. > > > > Also this is a different bug from the crash in ata_bmdma_fill_sg() which is > > fixed by "libata: fix length validation of ATAPI-relayed SCSI commands". > > > > I'm guessing the driver is DMA'ing to somewhere it shouldn't be... > > It would be good to add KASAN checks to the DMA code that issues > transfers. This is another case where a silent memory corruption > causes dozens of assorted crashes all over the kernel. If we add > checks, KASAN would pinpoint the exact stack that issues the bad > command. This may be the simplest way to debug this bug as well. I've > filed https://bugzilla.kernel.org/show_bug.cgi?id=198661 for this. It seems the problem is related to the fact that in the PRD (Physical Region Descriptor) list for the DMA transfer for "BMDMA" ATA disks, the disk (emulated in QEMU here: https://github.com/qemu/qemu/blob/master/hw/ide/pci.c#L89) ignores the low bit in the lengths, causing a length of 1 (byte) to be interpreted as 0. But, at the same time there is also a special case where a length of 0 is interpreted as 65536 bytes. So the disk will DMA up to 65536 bytes into a 1-byte buffer, causing massive memory corruption. I'm not sure what the best fix is, but probably it needs to be required that the lengths in the sglist have the alignment needed for the disk. KASAN would not have helped here unfortunately. Even if there were KASAN checks when mapping the sglist for DMA or when filling in the PRD list, the kernel would not have known that the disk would actually interpret "1" as "65536". - Eric