On 2/4/23 13:34, Richard W.M. Jones wrote:
This experimental series changes the way that the curl plugin deals
with libcurl handles. It also changes the thread model of the plugin
from SERIALIZE_REQUESTS to PARALLEL.
Currently one NBD connection opens one libcurl handle. This also
implies one TCP connection to the web server. If you want to open
multiple libcurl handles (and multiple TCP connections), the client
must open multiple NBD connections, e.g. using multi-conn.
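For illustration, a client-side multi-conn setup might look like this
(a sketch; the URL is a placeholder, and it assumes the plugin
advertises multi-conn so that nbdcopy really opens several NBD
connections):

  # each NBD connection gets its own libcurl handle / TCP connection
  nbdkit -r -U - curl https://example.com/fedora-36.img \
         --run 'nbdcopy --connections=4 -p "$uri" null:'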
After this series, there is a pool of libcurl handles shared across
all NBD connections. The pool defaults to 4 handles, but this can be
changed using the connections=N parameter.
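As a sketch (the parameter name is the one described above; the image
path matches the test further down), the pool size is then chosen on
the nbdkit command line rather than by the client:

  # 8 pooled libcurl handles shared by all NBD connections
  nbdkit -r -U - curl file:/var/tmp/fedora-36.img connections=8 \
         --run 'nbdcopy --no-extents -p "$uri" null:'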
Previously the plugin relied on nbdkit SERIALIZE_REQUESTS to ensure
that a curl handle could not be used from multiple threads at the same
time (https://curl.se/libcurl/c/threadsafe.html). After this change
it is possible to use the PARALLEL thread model. This change is quite
valuable because it means we can use filters like readahead and scan.
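For example (a sketch, reusing the test file from below), the
readahead filter can now simply be layered on top of the plugin:

  # filters like readahead benefit from the PARALLEL thread model
  nbdkit -r -U - curl file:/var/tmp/fedora-36.img --filter=readahead \
         --run 'nbdcopy --no-extents -p "$uri" null:'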
Anyway, this all seems to work, but it actually reduces performance :-(
In particular this simple test slows down quite substantially:
time ./nbdkit -r -U - curl file:/var/tmp/fedora-36.img \
     --run 'nbdcopy --no-extents -p "$uri" null:'
(where /var/tmp/fedora-36.img is a 10G file).
I've been looking at flamegraphs all morning and I can't really see
what the problem is (except that lots more time is spent with libcurl
calling sigaction?!?)
I'm wondering if it might be a locality issue, since curl handles are
now being scattered randomly across threads. (In the file: case it
might mean that Linux kernel readahead becomes ineffective.) I can't
easily see a way to change the implementation to encourage handles to
be reused by the same thread.
I believe the result is expected with a local "file:", and that it's
precisely due to the reason you name.
A good test case could be a long-distance http(s) download, or a
download over a somewhat noisy (but not necessarily congested) WiFi
link. IOW, scenarios where a single TCP connection doesn't perform
supremely:
- In the former case (long distance), because the bandwidth may indeed
be limited, shared with many other TCP streams that don't terminate on
your host, and TCP "plays nice" with others. By having multiple
connections, you might carve out a larger proportion of the bandwidth.
(Unless traffic shaping rules "up-stream" thwarted that.) And here the
assumption is that the total bandwidth is negligible in comparison to
what the disk on the remote end can sustain; IOW the same locality issue
will not be hit on the remote server.
- In the latter case (WiFi), because TCP mistakes packet loss for
congestion, and slows down unjustifiably, even if the loss is
extremely short-lived/transient. By having multiple streams, some
streams could "bridge over" the "congestion" perceived by another
stream.
The second case should be possible to emulate locally, with the "tc"
or the "iptables" utility:
https://stackoverflow.com/questions/614795/simulate-delayed-and-dropped-p...
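For instance, netem via "tc" can approximate such a link on the
loopback device (a sketch; the delay / loss figures are invented, and
the qdisc has to be removed again afterwards):

  # add artificial delay and packet loss on loopback
  tc qdisc add dev lo root netem delay 20ms loss 1%
  # ... run the nbdkit / nbdcopy test against a local web server ...
  # undo the emulation
  tc qdisc del dev lo root netem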
Laszlo