patch getaddrinfo.3 with "protocol independent" information

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Michael,

I'm one of the instructors for the Linux Foundation. Jerry Cooperstein posted a note to the instructor's mailing list that he's going to update the network programming chapters of LFD312. A couple of us suggested that he retire the exercises that use the obsolete library calls gethostbyname(3) and friends, replacing with with code using getaddrinfo(3).

I noted that a lot of example code out there, including (sigh) early editions of the Richard Stevens book "UNIX Network Programming, Vol. 1" fail to accomplish what Stevens says is a goal of network programming: "protocol independence."

A "protocol independent" server exposes the server's socket at ALL of the server's IP addresses: IPv4 and IPv6, local and remote.

The current getaddrinfo(3) man page gets PART of the way to instructing how to do this.

The missing part is that after setting the "hints" struct to ai_family = AF_UNSPEC and ai_flags = AI_PASSIVE, the code needs to CHECK the "ai_family" of each candidate address in the list returned by getaddrinfo(3) to preferentially choose AF_INET6 over AF_INET.

If the server (preferentially) binds to an IPv6 address, qualified as "AF_UNSPEC" and "AI_PASSIVE" in the hints, then the resulting socket will be visible at ANY and ALL of the server's IP addresses. If the server binds to an IPv4 address candidate (because the address candidates in the list aren't checked before trying to bind), then the socket is unavailable to clients trying any of the server's IPv6 addresses. This is arguably an inferior result.

I attach a patch file against the top of the man-pages tree (version 4.05) with modifications to two parts of man3/getaddrinfo.3:

-> the section describing AF_UNSPEC is expanded with a "programming note" describing how to write a "protocol independent" server.

-> the "server" example code has about 6 lines of code added, PLUS an expansion of the block comment before the loop that walks the list returned by getaddrinfo(3).

Thanks for considering this patch. Please let me know what I've done horribly wrong, and I'll try to fix it. I think this is somewhat important, as writing "protocol independent" servers is fairly important, and MOST of the code examples out on the Web and elsewhere don't CHECK the address candidates when walking the list returned by getaddrinfo(3). That's all that needs to be done to make network servers better.

 == Bill Kerr
        bilker@xxxxxxx
        503 781-7946

diff -Naur man-pages-orig/man3/getaddrinfo.3 man-pages/man3/getaddrinfo.3
--- man-pages-orig/man3/getaddrinfo.3	2016-02-25 16:12:10.918563230 -0800
+++ man-pages/man3/getaddrinfo.3	2016-02-25 08:52:27.049057533 -0800
@@ -154,7 +154,20 @@
 (either IPv4 or IPv6, for example) that can be used with
 .I node
 and
-.IR service .
+.IR service .  
+.I Programming Note:  
+in order to create a server socket that can be reached at
+.I both
+the server's IPv4 and IPv6 addresses, you should
+.I preferentially
+bind with AF_INET6 addresses returned by
+.BR getaddrinfo () ,
+as binding with an AF_INET address will create a socket incapable
+of accepting connections on the server's IPv6 addresses.  In other words,
+as you walk the list of candidate addresses, you should prefer
+binding with those whose
+.I ai_family
+is AF_INET6, rather than AF_INET.
 .TP
 .I ai_socktype
 This field specifies the preferred socket type, for example
@@ -675,6 +688,7 @@
     socklen_t peer_addr_len;
     ssize_t nread;
     char buf[BUF_SIZE];
+    int loops;
 
     if (argc != 2) {
         fprintf(stderr, "Usage: %s port\\n", argv[0]);
@@ -697,22 +711,39 @@
     }
 
     /* getaddrinfo() returns a list of address structures.
-       Try each address until we successfully bind(2).
-       If socket(2) (or bind(2)) fails, we (close the socket
-       and) try the next address. */
-
-    for (rp = result; rp != NULL; rp = rp\->ai_next) {
-        sfd = socket(rp\->ai_family, rp\->ai_socktype,
-                rp\->ai_protocol);
-        if (sfd == \-1)
-            continue;
+       Pass through the list UP TO twice.  On the first pass,
+       SKIP addresses that are NOT AF_INET6.  If socket(2) and
+       bind(2) both succeed on a candidate, we are done.  Else,
+       close the socket and try the next address.  If we didn't
+       find an IPv6 address on the first pass, try a second pass,
+       this time try ALL candidates in the list.  If we get an
+       IPv6 address, the "hints" AF_UNSPEC and AI_PASSIVE will
+       ensure that the socket can be reached by all client programs
+       trying ANY of our IP addresses:  v4 or v6, local or remote.
+       This is an useful and important property of network
+       programming called "protocol independence."  See Richard
+       Stevens, "UNIX  Network Programming, Vol. 1". */
+       
+    for (loops = 0; loops < 2; loops++) {
+        for (rp = result; rp != NULL; rp = rp\->ai_next) {
+
+            /* SKIP all but IPv6 candidates on 1st pass */
+            if (loops == 0 && rp\->ai_family != AF_INET6)
+                    continue;
+
+            sfd = socket(rp\->ai_family, rp\->ai_socktype,
+                    rp\->ai_protocol);
+            if (sfd == \-1)
+                continue;
 
-        if (bind(sfd, rp\->ai_addr, rp\->ai_addrlen) == 0)
-            break;                  /* Success */
+            if (bind(sfd, rp\->ai_addr, rp\->ai_addrlen) == 0)
+                goto out;                  /* Success */
 
-        close(sfd);
+            close(sfd);
+        }
     }
 
+out:
     if (rp == NULL) {               /* No address succeeded */
         fprintf(stderr, "Could not bind\\n");
         exit(EXIT_FAILURE);

[Index of Archives]     [Kernel Documentation]     [Netdev]     [Linux Ethernet Bridging]     [Linux Wireless]     [Kernel Newbies]     [Security]     [Linux for Hams]     [Netfilter]     [Bugtraq]     [Yosemite News]     [MIPS Linux]     [ARM Linux]     [Linux RAID]     [Linux Admin]     [Samba]

  Powered by Linux