@@ -64,6 +64,92 @@ fio for 20 secs with directio, qd of 8, 1 libaio thread:
6464 128K read: IOPS=24.4k, BW=3050MiB/s (3198MB/s)(59.6GiB/20001msec)
6565 128K write: IOPS=11.4k, BW=1430MiB/s (1500MB/s)(27.9GiB/20001msec)
6666
67+ FAQ
68+ ===
69+
70+ 1. What are the use cases for LOCALIO?
71+
72+ a. Workloads where the NFS client and server are on the same host
73+ realize improved IO performance. In particular, it is common when
74+ running containerised workloads for jobs to find themselves
75+ running on the same host as the knfsd server being used for
76+ storage.
77+
78+ 2. What are the requirements for LOCALIO?
79+
80+ a. Bypass use of the network RPC protocol as much as possible. This
81+ includes bypassing XDR and RPC for open, read, write and commit
82+ operations.
83+ b. Allow client and server to autonomously discover if they are
84+ running local to each other without making any assumptions about
85+ the local network topology.
86+ c. Support the use of containers by being compatible with relevant
87+ namespaces (e.g. network, user, mount).
88+ d. Support all versions of NFS. NFSv3 is of particular importance
89+ because it has wide enterprise usage and pNFS flexfiles makes use
90+ of it for the data path.
91+
92+ 3. Why doesn’t LOCALIO just compare IP addresses or hostnames when
93+ deciding if the NFS client and server are co-located on the same
94+ host?
95+
96+ Since one of the main use cases is containerised workloads, we cannot
97+ assume that IP addresses will be shared between the client and
98+ server. This sets up a requirement for a handshake protocol that
99+ needs to go over the same connection as the NFS traffic in order to
100+ identify that the client and the server really are running on the
101+ same host. The handshake uses a secret that is sent over the wire,
102+ and can be verified by both parties by comparing with a value stored
103+ in shared kernel memory if they are truly co-located.
104+
105+ 4. Does LOCALIO improve pNFS flexfiles?
106+
107+ Yes, LOCALIO complements pNFS flexfiles by allowing it to take
108+ advantage of NFS client and server locality. Policy that initiates
109+ client IO as closely to the server where the data is stored naturally
110+ benefits from the data path optimization LOCALIO provides.
111+
112+ 5. Why not develop a new pNFS layout to enable LOCALIO?
113+
114+ A new pNFS layout could be developed, but doing so would put the
115+ onus on the server to somehow discover that the client is co-located
116+ when deciding to hand out the layout.
117+ There is value in a simpler approach (as provided by LOCALIO) that
118+ allows the NFS client to negotiate and leverage locality without
119+ requiring more elaborate modeling and discovery of such locality in a
120+ more centralized manner.
121+
122+ 6. Why is having the client perform a server-side file OPEN, without
123+ using RPC, beneficial? Is the benefit pNFS specific?
124+
125+ Avoiding the use of XDR and RPC for file opens is beneficial to
126+ performance regardless of whether pNFS is used. Especially when
127+ dealing with small files its best to avoid going over the wire
128+ whenever possible, otherwise it could reduce or even negate the
129+ benefits of avoiding the wire for doing the small file I/O itself.
130+ Given LOCALIO's requirements the current approach of having the
131+ client perform a server-side file open, without using RPC, is ideal.
132+ If in the future requirements change then we can adapt accordingly.
133+
134+ 7. Why is LOCALIO only supported with UNIX Authentication (AUTH_UNIX)?
135+
136+ Strong authentication is usually tied to the connection itself. It
137+ works by establishing a context that is cached by the server, and
138+ that acts as the key for discovering the authorisation token, which
139+ can then be passed to rpc.mountd to complete the authentication
140+ process. On the other hand, in the case of AUTH_UNIX, the credential
141+ that was passed over the wire is used directly as the key in the
142+ upcall to rpc.mountd. This simplifies the authentication process, and
143+ so makes AUTH_UNIX easier to support.
144+
145+ 8. How do export options that translate RPC user IDs behave for LOCALIO
146+ operations (eg. root_squash, all_squash)?
147+
148+ Export options that translate user IDs are managed by nfsd_setuser()
149+ which is called by nfsd_setuser_and_check_port() which is called by
150+ __fh_verify(). So they get handled exactly the same way for LOCALIO
151+ as they do for non-LOCALIO.
152+
67153RPC
68154===
69155
0 commit comments