On Thu, Jun 10, 2021 at 1:30 PM Olga Kornievskaia <olga.kornievskaia@xxxxxxxxx> wrote: > > On Thu, Jun 10, 2021 at 12:36 PM Trond Myklebust > <trondmy@xxxxxxxxxxxxxxx> wrote: > > > > On Thu, 2021-06-10 at 12:14 -0400, Olga Kornievskaia wrote: > > > On Thu, Jun 10, 2021 at 10:56 AM Trond Myklebust > > > <trondmy@xxxxxxxxxxxxxxx> wrote: > > > > > > > > On Thu, 2021-06-10 at 10:31 -0400, Olga Kornievskaia wrote: > > > > > On Thu, Jun 10, 2021 at 10:13 AM Trond Myklebust > > > > > <trondmy@xxxxxxxxxxxxxxx> wrote: > > > > > > > > > > > > On Thu, 2021-06-10 at 13:56 +0000, Chuck Lever III wrote: > > > > > > > > > > > > > > > > > > > > > > On Jun 10, 2021, at 9:34 AM, Trond Myklebust < > > > > > > > > trondmy@xxxxxxxxxxxxxxx> wrote: > > > > > > > > > > > > > > > > On Thu, 2021-06-10 at 13:30 +0000, Chuck Lever III wrote: > > > > > > > > > > > > > > > > > > > > > > > > > > > > On Jun 9, 2021, at 5:53 PM, Olga Kornievskaia < > > > > > > > > > > olga.kornievskaia@xxxxxxxxx> wrote: > > > > > > > > > > > > > > > > > > > > From: Olga Kornievskaia <kolga@xxxxxxxxxx> > > > > > > > > > > > > > > > > > > > > This option will control up to how many xprts can the > > > > > > > > > > client > > > > > > > > > > establish to the server. This patch parses the value > > > > > > > > > > and > > > > > > > > > > sets > > > > > > > > > > up structures that keep track of max_connect. > > > > > > > > > > > > > > > > > > > > Signed-off-by: Olga Kornievskaia <kolga@xxxxxxxxxx> > > > > > > > > > > --- > > > > > > > > > > fs/nfs/client.c | 1 + > > > > > > > > > > fs/nfs/fs_context.c | 8 ++++++++ > > > > > > > > > > fs/nfs/internal.h | 2 ++ > > > > > > > > > > fs/nfs/nfs4client.c | 12 ++++++++++-- > > > > > > > > > > fs/nfs/super.c | 2 ++ > > > > > > > > > > include/linux/nfs_fs_sb.h | 1 + > > > > > > > > > > 6 files changed, 24 insertions(+), 2 deletions(-) > > > > > > > > > > > > > > > > > > > > diff --git a/fs/nfs/client.c b/fs/nfs/client.c > > > > > > > > > > index 330f65727c45..486dec59972b 100644 > > > > > > > > > > --- a/fs/nfs/client.c > > > > > > > > > > +++ b/fs/nfs/client.c > > > > > > > > > > @@ -179,6 +179,7 @@ struct nfs_client > > > > > > > > > > *nfs_alloc_client(const > > > > > > > > > > struct nfs_client_initdata *cl_init) > > > > > > > > > > > > > > > > > > > > clp->cl_proto = cl_init->proto; > > > > > > > > > > clp->cl_nconnect = cl_init->nconnect; > > > > > > > > > > + clp->cl_max_connect = cl_init->max_connect ? > > > > > > > > > > cl_init- > > > > > > > > > > > max_connect : 1; > > > > > > > > > > > > > > > > > > So, 1 is the default setting, meaning the "add another > > > > > > > > > transport" > > > > > > > > > facility is disabled by default. Would it be less > > > > > > > > > surprising > > > > > > > > > for > > > > > > > > > an admin to allow some extra connections by default? > > > > > > > > > > > > > > > > > > > > > > > > > > > > clp->cl_net = get_net(cl_init->net); > > > > > > > > > > > > > > > > > > > > clp->cl_principal = "*"; > > > > > > > > > > diff --git a/fs/nfs/fs_context.c b/fs/nfs/fs_context.c > > > > > > > > > > index d95c9a39bc70..cfbff7098f8e 100644 > > > > > > > > > > --- a/fs/nfs/fs_context.c > > > > > > > > > > +++ b/fs/nfs/fs_context.c > > > > > > > > > > @@ -29,6 +29,7 @@ > > > > > > > > > > #endif > > > > > > > > > > > > > > > > > > > > #define NFS_MAX_CONNECTIONS 16 > > > > > > > > > > +#define NFS_MAX_TRANSPORTS 128 > > > > > > > > > > > > > > > > > > This maximum seems excessive... again, there are > > > > > > > > > diminishing > > > > > > > > > returns to adding more connections to the same server. > > > > > > > > > what's > > > > > > > > > wrong with re-using NFS_MAX_CONNECTIONS for the maximum? > > > > > > > > > > > > > > > > > > As always, I'm a little queasy about adding yet another > > > > > > > > > mount > > > > > > > > > option. Are there real use cases where a whole-client > > > > > > > > > setting > > > > > > > > > (like a sysfs attribute) would be inadequate? Is there a > > > > > > > > > way > > > > > > > > > the client could figure out a reasonable maximum without > > > > > > > > > a > > > > > > > > > human intervention, say, by counting the number of NICs > > > > > > > > > on > > > > > > > > > the system? > > > > > > > > > > > > > > > > Oh, hell no! We're not tying anything to the number of > > > > > > > > NICs... > > > > > > > > > > > > > > That's a bit of an over-reaction. :-) A little more > > > > > > > explanation > > > > > > > would be welcome. I mean, don't you expect someone to ask > > > > > > > "How > > > > > > > do I pick a good value?" and someone might reasonably answer > > > > > > > "Well, start with the number of NICs on your client times 3" > > > > > > > or > > > > > > > something like that. > > > > > > > > > > > > > > IMO we're about to add another admin setting without > > > > > > > understanding > > > > > > > how it will be used, how to select a good maximum value, or > > > > > > > even > > > > > > > whether this maximum needs to be adjustable. In a previous e- > > > > > > > mail > > > > > > > Olga has already demonstrated that it will be difficult to > > > > > > > explain > > > > > > > how to use this setting with nconnect=. > > > > > > > > > > > > > > Thus I would favor a (moderate) soldered-in maximum to start > > > > > > > with, > > > > > > > and then as real world use cases arise, consider adding a > > > > > > > tuning > > > > > > > mechanism based on actual requirements. > > > > > > > > > > > > It's not an overreaction. It's insane to think that counting > > > > > > NICs > > > > > > gives > > > > > > you any notion whatsoever about the network topology and > > > > > > connectivity > > > > > > between the client and server. It doesn't even tell you how > > > > > > many of > > > > > > those NICs might potentially be available to your application. > > > > > > > > > > > > We're not doing any automation based on that kind of layering > > > > > > violation. > > > > > > > > > > I'm not suggesting to programmatically determine the number of > > > > > NIC to > > > > > determine the value of max_connect. > > > > > > > > > > > > > > No, but that's what Chuck appeared to be suggesting in order to > > > > avoid > > > > the need for the mount option. > > > > > > > > To me, the main reason for the mount option is to allow the user to > > > > limit the number of new IP addresses being added so that if the DNS > > > > server is configured to hand out lots of different addresses for > > > > the > > > > same servername, the user can basically say 'no, I just want to use > > > > the > > > > one IP address that I'm already connected to' (i.e. max_connect=1). > > > > I > > > > can imagine that some clustered setups might need that ability in > > > > order > > > > to work efficiently. > > > > > > > > I'm fine with the idea of nconnect setting the number of > > > > connections > > > > per IP address, but that would need some plumbing in > > > > rpc_clnt_test_and_add_xprt() to allow us to add up to 'nconnect' > > > > copies > > > > of a given transport. > > > > Presumably rpc_xprt_switch_has_addr() would need to return a count > > > > of > > > > the number of copies of the transport that are already present so > > > > that > > > > we can decide whether or not we should add a new one. > > > > > > I think the last paragraph is what I'm asking for. But I would like > > > to > > > again confirm if you still mean "max_connect" to be the total number > > > of connections since you say we could/will allow for nconnect number > > > of connections per IP address. Would max_connect need to be a > > > multiple > > > of nconnect (max_connect = X *nconnect)? > > > > No. Your suggestion to make the two independent is growing on me, > > however in that case we do want to ensure that if nconnect=X, then we > > always add X transports when we add a new IP address. > > Ok. I'm glad to hear independ idea still has life. Are you still > thinking "max_connect" is the right name for it? I guess if we explain > the feature in the man pages the name doesn't matter so much. I would > have still liked it to be something like "max_session_xprts". > > > > Actually when I said supporting (or rather allowing for) nconnect * > > > max_connect transport, is that correct? Given how the code works now > > > this is going to be nconnect + max_connect (only if 1st mount had > > > nconnect option). We can't "add" nconnect connections to the new > > > mounts (but with my patch we can add a single trunk connection). By > > > that I mean: say the first was "mount IP1:/vol1 /mnt1" (1 connection > > > to IP2). Now the client is doing "mount IP2:/vol2 /mnt2". IP1 and IP2 > > > are trunkable addresses of the same server so we add a trunk. We > > > currently don't allow for doing "mount -o nconnec=2 IP2:vol2 /mnt2" > > > and then also add "nconnect" connections to IP2 along with a trunk. > > > In > > > the 2nd example, we'd have 1 connections to IP1, then 2 connections > > > to > > > IP2. Can we allow for that (with needed code change)? If not, then > > > we > > > really need to commit to only support nconnect (16) connections + > > > some > > > number of trunkable connections. > > > > > > I think we want to have nconnect be server-global. i.e. nconnect > > entries of each IP address. After doing more thinking, I'm not sure I like imposing nconnect connections on a mount that didn't ask for it when a mount is done to a trunkable address. It feels like we are going from where we were conserving resources to creating extra when it wasn't asked for. Note, I'm not arguing (yet) against "having nconnect be server-global". I don't have an alternative suggestion. > Thank you both, Trond and Chuck. > > I'll work on v3. > > > > > > -- > > Trond Myklebust > > Linux NFS client maintainer, Hammerspace > > trond.myklebust@xxxxxxxxxxxxxxx > > > >