On Tue, Aug 14, 2018 at 08:29:04PM +0200, Johannes Sixt wrote: > Am 14.08.2018 um 00:37 schrieb Jeff King: > > And then you can do something like: > > > > for size in 4097 8193 16385 32769 65537 131073 262145 524289 1048577; do > > >out ;# clean up from last run > > echo "Trying $size..." > > timeout 5 ./write $size out > > if ! ./check $size <out; then > > echo "$size failed" > > break > > fi > > done > > > > On my Linux system, each of those seems to write several gigabytes > > without overlapping. I did manage to hit some failing cases, but they > > were never sheared writes, but rather cases where there was an > > incomplete write at the end-of-file. > > I used your programs with necessary adjustments (as fork() is not > available), and did similar tests with concurrent processes. With packet > sizes 1025, 4093, 7531 (just to include some odd number), and 8193 I did not > observe any overlapping or short writes. > > I'm now very confident that we are on the safe side for our purposes. Great, thanks for testing! Re-reading what I wrote about end-of-file above and thinking about the conversation with Ævar elsewhere in the thread, I suspect it _is_ easy to get overlapping writes if the processes are receiving signals (since clearly the TERM signal caused a partial write). My experiment doesn't simulate that at all. I suppose the parent process could send SIGUSR1 to the child in each loop, and the child would catch it but keep going. Hmm, that was easy enough to do (programs below for reference), but surprisingly it didn't fail for me (except for the normal end-of-file truncation). It's like the OS is willing to truncate the write of a dying program but not one for a signal that is getting handled. Which is great for us, since it's exactly what we want, but makes me even more suspicious that a non-Linux kernel might behave completely differently. I still think we're fine in practice, as I'd expect any kernel to be atomic under the page size. So this was mostly just for my own edification. -Peff -- >8 -- /* check.c, with separate short-read reporting */ #include <stdlib.h> #include <stdio.h> #include <unistd.h> int main(int argc, const char **argv) { int size = atoi(argv[1]); int block = 0; char *buf; buf = malloc(size); while (1) { int i; /* assume atomic reads */ int r = read(0, buf, size); if (!r) break; if (r < size) { fprintf(stderr, "short read\n"); return 1; } for (i = 1; i < size; i++) { if (buf[i] != buf[0]) { fprintf(stderr, "overlap in block %d\n", block); return 1; } } block++; } } -- >8 -- -- >8 -- /* write.c with signals; you can also confirm via strace that each write is atomic */ #include <stdlib.h> #include <string.h> #include <stdio.h> #include <sys/types.h> #include <fcntl.h> #include <unistd.h> #include <signal.h> void handle_signal(int sig) { /* do nothing */ } static void doit(int size, const char *fn, char c, pid_t pid) { int fd; char *buf; fd = open(fn, O_WRONLY|O_APPEND|O_CREAT, 0666); if (fd < 0) { perror("open"); return; } buf = malloc(size); memset(buf, c, size); while (1) { if (pid) kill(pid, SIGUSR1); write(fd, buf, size); } } int main(int argc, const char **argv) { int size = atoi(argv[1]); pid_t pid; signal(SIGUSR1, handle_signal); pid = fork(); if (pid) doit(size, argv[2], '1', pid); else doit(size, argv[2], '2', pid); return 0; } -- >8 --