On Wed, Jun 30, 2021 at 05:11:45PM +0200, Martin Kletzander wrote:
I am preparing more patches for CI to run check-valgrind and fix
ongoing
errors but there are two issues I can not identify the reason why they
are failing.
- On debian-10 the info/info-can.sh started failing and the error
message is just one of those I saw earlier in other places:
libnbd: debug: nbd1: nbd_opt_abort: leave: error="nbd_opt_abort:
invalid state: READY: the handle must be negotiating: Invalid
argument"
I think it is probably not the error since it happens during normal
output. The error must be something else. Is this when using
valgrind or normally? Do you have the complete log?
- On Fedora rawhide I hit a random issue where a port in a URI was
translated to its name and looking at the code I can not find how this
could have happened. Until this is fixed the test suite is unreliable
and notification fatigue will cause everyone to start ignoring any
future failures.
/builds/nertpinx/libnbd/tests/.libs/aio-connect: actual URI
nbd://127.0.0.1:altova-lm/ != expected URI nbd://127.0.0.1:35355/
That is likely a real bug. In libnbd 1.8 we added a new API which
creates the URI for connecting back to the server:
https://libguestfs.org/nbd_get_uri.3.html
When this particular test is run, the following code path is followed:
https://gitlab.com/nbdkit/libnbd/-/blob/c6109b5d37c08a430b3a55691dff72321...
which uses getnameinfo. I thought that by using NI_NUMERICHOST it
wouldn't translate the port number using /etc/services. Apparently
that's not the case.
A simple reproducer would be the one attached, except that it
segfaults :-( which means now we've got 2 bugs ...
$ nbdkit null --run './uri.py'
nbdkit: external command was killed by signal 6
The stack trace is rather long so I attached that too.
Patches welcome for both issues.
- Both openSUSE builds are failing to run check-valgrind and it
looks
like it might be unrelated to libnbd, although it would be nice for
someone else to confirm that. For now I have disabled check-valgrind
on those platforms in my branch.
What's the failure?
- Similarly to openSUSE Ubuntu 20.04 fails in valgrind tests, but
somewhere down the GnuTLS rabbit hole, which I presume is unrelated
too, so I disabled check-valgrind on that one as well.
I will send the patches once they are cleaned up, but I wanted to let
everyone know what the current status is because eliminating all random
issues is essential to properly consuming CI results.
Thanks,
Rich.
--
Richard Jones, Virtualization Group, Red Hat
http://people.redhat.com/~rjones
Read my programming and virtualization blog:
http://rwmj.wordpress.com
Fedora Windows cross-compiler. Compile Windows programs, test, and
build Windows installers. Over 100 libraries supported.
http://fedoraproject.org/wiki/MinGW