[PATCH nbdkit 0/2] common: Add checked-overflow macros


Richard W.M. Jones

Tuesday, 9 November 2021

11:49 a.m.

In common/utils/vector.c use the GCC/Clang built-in overflow operators.

The first patch is a neutral change which adds comments. The second patch is the actual change: it adds a new header "checked-overflow.h" whose purpose is to isolate the use of the built-ins in one file (in case we need to add a new compiler later), then uses it in generic_vector_reserve.

I tested this with GCC 11 and Clang 13. I verified by disassembly that "jo" (jump if overflow) / "jno" is used where it was not used previously, by both compilers.

Rich.


Richard W.M. Jones

Tuesday, 9 November

11:49 a.m.

New subject: [PATCH nbdkit 1/2] common/utils/vector: Add comments to generic_vector_reserve

This commit makes no changes, it simply adds comments and breaks out
the multiplication into a local variable.
---
 common/utils/vector.c | 19 +++++++++++++++----
 1 file changed, 15 insertions(+), 4 deletions(-)

diff --git a/common/utils/vector.c b/common/utils/vector.c
index d7120399..dff051e9 100644
--- a/common/utils/vector.c
+++ b/common/utils/vector.c
@@ -42,20 +42,31 @@ int
 generic_vector_reserve (struct generic_vector *v, size_t n, size_t itemsize)
 {
   void *newptr;
-  size_t reqcap, newcap;
+  size_t reqcap, reqbytes, newcap, newbytes;

+  /* New capacity requested.  We must allocate this minimum (or fail). */
   reqcap = v->cap + n;
-  if (reqcap * itemsize < v->cap * itemsize) {
+  reqbytes = reqcap * itemsize;
+  if (reqbytes < v->cap * itemsize) {
     errno = ENOMEM;
     return -1; /* overflow */
   }

+  /* However for the sake of optimization, scale buffer by 3/2 so that
+   * repeated reservations don't call realloc often.
+   */
   newcap = v->cap + (v->cap + 1) / 2;
+  newbytes = newcap * itemsize;

-  if (newcap * itemsize < reqcap * itemsize)
+  if (newbytes < reqbytes) {
+    /* If that either overflows or is less than the minimum requested,
+     * fall back to the requested capacity.
+     */
     newcap = reqcap;
+    newbytes = reqbytes;
+  }

-  newptr = realloc (v->ptr, newcap * itemsize);
+  newptr = realloc (v->ptr, newbytes);
   if (newptr == NULL)
     return -1;
   v->ptr = newptr;
-- 
2.32.0
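As a side note on the check that this patch documents: comparing the wrapped product against v->cap * itemsize catches many overflows but not every one, which is one reason a built-in based check is attractive. The sketch below is an illustration rather than part of the posted series. The function names are hypothetical, and uint64_t stands in for size_t so that the arithmetic is the same on every platform.

```c
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical helper: the comparison-style test from the code above,
 * written with uint64_t in place of size_t.  Returns true if the test
 * reports an overflow.
 */
bool
comparison_check_reports_overflow (uint64_t cap, uint64_t n, uint64_t itemsize)
{
  uint64_t reqcap = cap + n;               /* may wrap silently */
  uint64_t reqbytes = reqcap * itemsize;   /* may wrap silently */
  return reqbytes < cap * itemsize;
}

/* Hypothetical helper: the same question answered with the GCC/Clang
 * built-ins, which report whether the exact result fits.
 */
bool
builtin_check_reports_overflow (uint64_t cap, uint64_t n, uint64_t itemsize)
{
  uint64_t reqcap, reqbytes;
  return __builtin_add_overflow (cap, n, &reqcap) ||
         __builtin_mul_overflow (reqcap, itemsize, &reqbytes);
}
```

With cap = 1, n = 2^60 + 1 and itemsize = 16, the exact byte count is 2^64 + 32, which wraps to 32. Since 32 is not less than 16, the comparison-style test stays silent, while the built-in version reports the overflow.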

Eric Blake

1:38 p.m.

New subject: [PATCH nbdkit 1/2] common/utils/vector: Add comments to generic_vector_reserve

On Tue, Nov 09, 2021 at 05:49:17PM +0000, Richard W.M. Jones wrote:

...

ACK.

-- 
Eric Blake, Principal Software Engineer
Red Hat, Inc. +1-919-301-3266
Virtualization: qemu.org | libvirt.org

Laszlo Ersek

Wednesday, 10 November

7:26 a.m.

New subject: [PATCH nbdkit 1/2] common/utils/vector: Add comments to generic_vector_reserve

On 11/09/21 18:49, Richard W.M. Jones wrote:

...

Acked-by: Laszlo Ersek <lersek(a)redhat.com>

Richard W.M. Jones

Tuesday, 9 November

11:49 a.m.

New subject: [PATCH nbdkit 2/2] common: Add checked-overflow macros and use for safe vector extension

---
 common/include/Makefile.am        |  1 +
 common/utils/Makefile.am          |  3 +-
 common/include/checked-overflow.h | 61 +++++++++++++++++++++++++++++++
 common/utils/vector.c             | 26 +++++++++----
 4 files changed, 82 insertions(+), 9 deletions(-)

diff --git a/common/include/Makefile.am b/common/include/Makefile.am
index a7d0d026..52d97216 100644
--- a/common/include/Makefile.am
+++ b/common/include/Makefile.am
@@ -37,6 +37,7 @@ EXTRA_DIST = \
 	ascii-ctype.h \
 	ascii-string.h \
 	byte-swapping.h \
+	checked-overflow.h \
 	exit-with-parent.h \
 	isaligned.h \
 	ispowerof2.h \
diff --git a/common/utils/Makefile.am b/common/utils/Makefile.am
index 55415535..012a5c25 100644
--- a/common/utils/Makefile.am
+++ b/common/utils/Makefile.am
@@ -52,6 +52,7 @@ libutils_la_SOURCES = \
 	$(NULL)
 libutils_la_CPPFLAGS = \
 	-I$(top_srcdir)/include \
+	-I$(top_srcdir)/common/include \
 	$(NULL)
 libutils_la_CFLAGS = \
 	$(WARNINGS_CFLAGS) \
@@ -101,7 +102,7 @@ test_quotes_CPPFLAGS = -I$(srcdir)
 test_quotes_CFLAGS = $(WARNINGS_CFLAGS)

 test_vector_SOURCES = test-vector.c vector.c vector.h bench.h
-test_vector_CPPFLAGS = -I$(srcdir)
+test_vector_CPPFLAGS = -I$(srcdir) -I$(top_srcdir)/common/include
 test_vector_CFLAGS = $(WARNINGS_CFLAGS)

 bench: test-vector
diff --git a/common/include/checked-overflow.h b/common/include/checked-overflow.h
new file mode 100644
index 00000000..b571e2c6
--- /dev/null
+++ b/common/include/checked-overflow.h
@@ -0,0 +1,61 @@
+/* nbdkit
+ * Copyright (C) 2013-2021 Red Hat Inc.
+ *
+ * Redistribution and use in source and binary forms, with or without
+ * modification, are permitted provided that the following conditions are
+ * met:
+ *
+ * * Redistributions of source code must retain the above copyright
+ * notice, this list of conditions and the following disclaimer.
+ *
+ * * Redistributions in binary form must reproduce the above copyright
+ * notice, this list of conditions and the following disclaimer in the
+ * documentation and/or other materials provided with the distribution.
+ *
+ * * Neither the name of Red Hat nor the names of its contributors may be
+ * used to endorse or promote products derived from this software without
+ * specific prior written permission.
+ *
+ * THIS SOFTWARE IS PROVIDED BY RED HAT AND CONTRIBUTORS ''AS IS'' AND
+ * ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO,
+ * THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A
+ * PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL RED HAT OR
+ * CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL,
+ * SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT
+ * LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF
+ * USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND
+ * ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY,
+ * OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT
+ * OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF
+ * SUCH DAMAGE.
+ */
+
+/* This header file defines functions for checking overflow in common
+ * integer arithmetic operations.
+ *
+ * It uses GCC/Clang built-ins: a possible future enhancement is to
+ * provide fallbacks in plain C or for other compilers.  The only
+ * purpose of having a header file for this is to have a single place
+ * where we would extend this in future.
+ */
+
+#ifndef NBDKIT_CHECKED_OVERFLOW_H
+#define NBDKIT_CHECKED_OVERFLOW_H
+
+#if !defined(__GNUC__) && !defined(__clang__)
+#error "this file may need to be ported to your compiler"
+#endif
+
+/* Add two uint64_t values.  Returns true if overflow happened. */
+#define ADD_UINT64_T_OVERFLOW(a, b, r) __builtin_add_overflow((a), (b), (r))
+
+/* Multiply two uint64_t values.  Returns true if overflow happened. */
+#define MUL_UINT64_T_OVERFLOW(a, b, r) __builtin_mul_overflow((a), (b), (r))
+
+/* Add two size_t values.  Returns true if overflow happened. */
+#define ADD_SIZE_T_OVERFLOW(a, b, r) __builtin_add_overflow((a), (b), (r))
+
+/* Multiply two size_t values.  Returns true if overflow happened. */
+#define MUL_SIZE_T_OVERFLOW(a, b, r) __builtin_mul_overflow((a), (b), (r))
+
+#endif /* NBDKIT_CHECKED_OVERFLOW_H */
diff --git a/common/utils/vector.c b/common/utils/vector.c
index dff051e9..e4ea7f3f 100644
--- a/common/utils/vector.c
+++ b/common/utils/vector.c
@@ -36,6 +36,7 @@
 #include <stdlib.h>
 #include <errno.h>

+#include "checked-overflow.h"
 #include "vector.h"

 int
@@ -44,21 +45,30 @@ generic_vector_reserve (struct generic_vector *v, size_t n, size_t itemsize)
   void *newptr;
   size_t reqcap, reqbytes, newcap, newbytes;

-  /* New capacity requested.  We must allocate this minimum (or fail). */
-  reqcap = v->cap + n;
-  reqbytes = reqcap * itemsize;
-  if (reqbytes < v->cap * itemsize) {
+  /* New capacity requested.  We must allocate this minimum (or fail).
+   *   reqcap = v->cap + n
+   *   reqbytes = reqcap * itemsize
+   */
+  if (ADD_SIZE_T_OVERFLOW (v->cap, n, &reqcap) ||
+      MUL_SIZE_T_OVERFLOW (reqcap, itemsize, &reqbytes)) {
     errno = ENOMEM;
-    return -1; /* overflow */
+    return -1;
   }

   /* However for the sake of optimization, scale buffer by 3/2 so that
    * repeated reservations don't call realloc often.
+   *   newcap = v->cap + (v->cap + 1) / 2
+   *   newbytes = newcap * itemsize
    */
-  newcap = v->cap + (v->cap + 1) / 2;
-  newbytes = newcap * itemsize;
-
+  if (ADD_SIZE_T_OVERFLOW (v->cap, 1, &newcap))
+    goto fallback;
+  newcap /= 2;
+  if (ADD_SIZE_T_OVERFLOW (v->cap, newcap, &newcap))
+    goto fallback;
+  if (MUL_SIZE_T_OVERFLOW (newcap, itemsize, &newbytes))
+    goto fallback;
   if (newbytes < reqbytes) {
+  fallback:
     /* If that either overflows or is less than the minimum requested,
      * fall back to the requested capacity.
      */
-- 
2.32.0
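The patch defines ADD_UINT64_T_OVERFLOW and MUL_UINT64_T_OVERFLOW but only uses the size_t pair in vector.c. As a rough sketch of how the uint64_t pair might be used, the hypothetical function below computes the end of a byte range, offset + count * blocksize, and refuses to answer if either step overflows. The macro definitions are copied from the patch; range_end and its parameters are made up for this example.

```c
#include <errno.h>
#include <stdint.h>

/* Copied from the posted checked-overflow.h (GCC/Clang only). */
#define ADD_UINT64_T_OVERFLOW(a, b, r) __builtin_add_overflow((a), (b), (r))
#define MUL_UINT64_T_OVERFLOW(a, b, r) __builtin_mul_overflow((a), (b), (r))

/* Hypothetical example: compute *end = offset + count * blocksize.
 * Returns 0 on success, or -1 with errno = ERANGE if the exact result
 * does not fit in uint64_t.  *end is only written on success.
 */
int
range_end (uint64_t offset, uint64_t count, uint64_t blocksize, uint64_t *end)
{
  uint64_t length, result;

  if (MUL_UINT64_T_OVERFLOW (count, blocksize, &length) ||
      ADD_UINT64_T_OVERFLOW (offset, length, &result)) {
    errno = ERANGE;
    return -1;
  }

  *end = result;
  return 0;
}
```

For instance, range_end (4096, 8, 512, &end) succeeds and stores 8192, whereas range_end (UINT64_MAX, 1, 1, &end) fails and leaves end unchanged.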

Nir Soffer

12:53 p.m.

New subject: [PATCH nbdkit 2/2] common: Add checked-overflow macros and use for safe vector extension

On Tue, Nov 9, 2021 at 7:49 PM Richard W.M. Jones <rjones(a)redhat.com> wrote:

...

This should explain the next lines? I'm not sure about it, it makes the code more complicated and the commented code can get out of sync with the actual code.

...

> +  if (ADD_SIZE_T_OVERFLOW (v->cap, n, &reqcap) ||
> +      MUL_SIZE_T_OVERFLOW (reqcap, itemsize, &reqbytes)) {

Is order guaranteed? I think it will be more clear as separate if blocks, even if we need to have 2 blocks for returning ENOMEM.

...

>      errno = ENOMEM;
> -    return -1; /* overflow */
> +    return -1;
>    }
>
>    /* However for the sake of optimization, scale buffer by 3/2 so that
>     * repeated reservations don't call realloc often.
> +   *   newcap = v->cap + (v->cap + 1) / 2
> +   *   newbytes = newcap * itemsize
>     */
> -  newcap = v->cap + (v->cap + 1) / 2;
> -  newbytes = newcap * itemsize;
> -
> +  if (ADD_SIZE_T_OVERFLOW (v->cap, 1, &newcap))
> +    goto fallback;
> +  newcap /= 2;
> +  if (ADD_SIZE_T_OVERFLOW (v->cap, newcap, &newcap))
> +    goto fallback;

This probably works but adding v->cap and newcap and storing back in newcap is pretty confusing. I would use a temporary:

    if (ADD_SIZE_T_OVERFLOW (v->cap, 1, &extracap))
      goto fallback;

    extracap /= 2;
    if (ADD_SIZE_T_OVERFLOW (v->cap, extracap, &newcap))
      goto fallback;

...

> +  if (MUL_SIZE_T_OVERFLOW (newcap, itemsize, &newbytes))
> +    goto fallback;
>    if (newbytes < reqbytes) {
> +  fallback:

Jumping inside an if block is evil.

I would try to extract the code to compute the new capacity into a helper function:

    if (next_capacity(v->cap, n, itemsize, &newcap))
        return -1;

This function can return early instead of jumping around, or fail if we cannot reserve n items. In the worst case this function will only hide the overflow macros.

Nir

Richard W.M. Jones

1:27 p.m.

New subject: [PATCH nbdkit 2/2] common: Add checked-overflow macros and use for safe vector extension

On Tue, Nov 09, 2021 at 08:53:28PM +0200, Nir Soffer wrote:

...

> On Tue, Nov 9, 2021 at 7:49 PM Richard W.M. Jones <rjones(a)redhat.com> wrote:
> > -  /* New capacity requested.  We must allocate this minimum (or fail). */
> > -  reqcap = v->cap + n;
> > -  reqbytes = reqcap * itemsize;
> > -  if (reqbytes < v->cap * itemsize) {
> > +  /* New capacity requested.  We must allocate this minimum (or fail).
> > +   *   reqcap = v->cap + n
> > +   *   reqbytes = reqcap * itemsize
> > +   */
>
> This should explain the next lines? I'm not sure about it, it makes
> the code more complicated and the commented code can get out
> of sync with the actual code.

I really do think it makes it clearer. I agree that it makes it possible for the code to get out of step though.

...

> > +  if (ADD_SIZE_T_OVERFLOW (v->cap, n, &reqcap) ||
> > +      MUL_SIZE_T_OVERFLOW (reqcap, itemsize, &reqbytes)) {
>
> Is order guaranteed?
>
> I think it will be more clear as separate if blocks, even if we need
> to have 2 blocks for returning ENOMEM.

Order is definitely guaranteed since || is a sequence point:

https://en.wikipedia.org/wiki/Sequence_point#Sequence_points_in_C_and_C++

(point 1). (It could also short-circuit, but we don't care.)

I could add an overflow: label, split the two statements, and jump here I suppose?
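To make the evaluation-order point concrete, here is a small sketch (all names are hypothetical) that records the order in which the two operands of || run. It relies only on the language rule that the left operand is fully evaluated before the right one, and that the right one is skipped when the left is true.

```c
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical trace buffer recording which operand ran, in order. */
static char trace[8];
static size_t trace_len;

/* Record that operand "name" ran, then return the given result. */
static bool
step (char name, bool result)
{
  if (trace_len < sizeof trace - 1)
    trace[trace_len++] = name;
  trace[trace_len] = '\0';
  return result;
}

/* Evaluate "A || B" where A and B return the given values, and return
 * a string listing the operands that actually ran.
 */
const char *
trace_logical_or (bool a_result, bool b_result)
{
  bool either;

  trace_len = 0;
  trace[0] = '\0';
  either = step ('A', a_result) || step ('B', b_result);
  (void) either;
  return trace;
}
```

When A is false the trace is "AB", and when A is true it is just "A". Applied to the patch, reqcap is always written by ADD_SIZE_T_OVERFLOW before MUL_SIZE_T_OVERFLOW reads it, and the multiplication is skipped if the addition already overflowed.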

...

Agreed.

...

>     if (ADD_SIZE_T_OVERFLOW (v->cap, 1, &extracap))
>       goto fallback;
>
>     extracap /= 2;
>     if (ADD_SIZE_T_OVERFLOW (v->cap, extracap, &newcap))
>       goto fallback;
>
> > +  if (MUL_SIZE_T_OVERFLOW (newcap, itemsize, &newbytes))
> > +    goto fallback;
> >    if (newbytes < reqbytes) {
> > +  fallback:
>
> Jumping inside an if block is evil.

...

> I would try to extract the code to compute the new capacity into a helper
> function:
>
>     if (next_capacity(v->cap, n, itemsize, &newcap))
>         return -1;
>
> This function can return early instead of jumping around or fail
> if we cannot reserve n items. In the worst case this function will
> only hide the overflow macros.

OK

Rich.

-- 
Richard Jones, Virtualization Group, Red Hat http://people.redhat.com/~rjones
Read my programming and virtualization blog: http://rwmj.wordpress.com
virt-p2v converts physical machines to virtual machines. Boot with a
live CD or over the network (PXE) and turn machines into KVM guests.
http://libguestfs.org/virt-v2v

Nir Soffer

1:48 p.m.

New subject: [PATCH nbdkit 2/2] common: Add checked-overflow macros and use for safe vector extension

On Tue, Nov 9, 2021 at 9:27 PM Richard W.M. Jones <rjones(a)redhat.com> wrote:

...

> On Tue, Nov 09, 2021 at 08:53:28PM +0200, Nir Soffer wrote:
> > On Tue, Nov 9, 2021 at 7:49 PM Richard W.M. Jones <rjones(a)redhat.com> wrote:
> > > +  if (ADD_SIZE_T_OVERFLOW (v->cap, n, &reqcap) ||
> > > +      MUL_SIZE_T_OVERFLOW (reqcap, itemsize, &reqbytes)) {
> >
> > Is order guaranteed?
> >
> > I think it will be more clear as separate if blocks, even if we need
> > to have 2 blocks for returning ENOMEM.
>
> Order is definitely guaranteed since || is a sequence point
> https://en.wikipedia.org/wiki/Sequence_point#Sequence_points_in_C_and_C++
> (point 1). (It could also short-circuit, but we don't care).
>
> I could add an overflow: label, split the two statements, and jump
> here I suppose?

overflow label can be nice.
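A possible shape for the split version with a shared overflow: label, as a sketch only. The function name and signature are hypothetical, and the macros are copied from the patch so that the fragment compiles on its own with GCC or Clang.

```c
#include <errno.h>
#include <stddef.h>

/* Copied from the posted checked-overflow.h (GCC/Clang only). */
#define ADD_SIZE_T_OVERFLOW(a, b, r) __builtin_add_overflow((a), (b), (r))
#define MUL_SIZE_T_OVERFLOW(a, b, r) __builtin_mul_overflow((a), (b), (r))

/* Hypothetical helper: compute the minimum capacity and byte count
 * needed to hold n more items.  Returns 0 on success, or -1 with
 * errno = ENOMEM if either step overflows.
 */
int
required_size (size_t cap, size_t n, size_t itemsize,
               size_t *reqcap, size_t *reqbytes)
{
  /* reqcap = cap + n */
  if (ADD_SIZE_T_OVERFLOW (cap, n, reqcap))
    goto overflow;

  /* reqbytes = reqcap * itemsize */
  if (MUL_SIZE_T_OVERFLOW (*reqcap, itemsize, reqbytes))
    goto overflow;

  return 0;

 overflow:
  errno = ENOMEM;
  return -1;
}
```

Each check then sits on its own line, so a debugger has a distinct place to stop for each condition, and the error handling is still written only once.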

...

> > >      errno = ENOMEM;
> > > -    return -1; /* overflow */
> > > +    return -1;
> > >    }
> > >
> > >    /* However for the sake of optimization, scale buffer by 3/2 so that
> > >     * repeated reservations don't call realloc often.
> > > +   *   newcap = v->cap + (v->cap + 1) / 2
> > > +   *   newbytes = newcap * itemsize
> > >     */
> > > -  newcap = v->cap + (v->cap + 1) / 2;
> > > -  newbytes = newcap * itemsize;
> > > -
> > > +  if (ADD_SIZE_T_OVERFLOW (v->cap, 1, &newcap))
> > > +    goto fallback;
> > > +  newcap /= 2;
> > > +  if (ADD_SIZE_T_OVERFLOW (v->cap, newcap, &newcap))
> > > +    goto fallback;
> >
> > This probably works but adding v->cap and newcap and storing
> > back in newcap is pretty confusing. I would use a temporary:
>
> Agreed.
>
> >     if (ADD_SIZE_T_OVERFLOW (v->cap, 1, &extracap))
> >       goto fallback;
> >
> >     extracap /= 2;
> >     if (ADD_SIZE_T_OVERFLOW (v->cap, extracap, &newcap))
> >       goto fallback;
> >
> > > +  if (MUL_SIZE_T_OVERFLOW (newcap, itemsize, &newbytes))
> > > +    goto fallback;
> > >    if (newbytes < reqbytes) {
> > > +  fallback:
> >
> > Jumping inside an if block is evil.
>
> ?!

Maybe evil is not the right word :-)

...

> > I would try to extract the code to compute the new capacity into a helper
> > function:
> >
> >     if (next_capacity(v->cap, n, itemsize, &newcap))
> >         return -1;
> >
> > This function can return early instead of jumping around or fail
> > if we cannot reserve n items. In the worst case this function will
> > only hide the overflow macros.
>
> OK
>
> Rich.
>
> -- 
> Richard Jones, Virtualization Group, Red Hat http://people.redhat.com/~rjones
> Read my programming and virtualization blog: http://rwmj.wordpress.com
> virt-p2v converts physical machines to virtual machines. Boot with a
> live CD or over the network (PXE) and turn machines into KVM guests.
> http://libguestfs.org/virt-v2v

Richard W.M. Jones

2:54 p.m.

New subject: [PATCH nbdkit 2/2] common: Add checked-overflow macros and use for safe vector extension

On Tue, Nov 09, 2021 at 07:27:12PM +0000, Richard W.M. Jones wrote:

...

> On Tue, Nov 09, 2021 at 08:53:28PM +0200, Nir Soffer wrote:
> > I would try to extract the code to compute the new capacity into a helper
> > function:
> >
> >     if (next_capacity(v->cap, n, itemsize, &newcap))
> >         return -1;
> >
> > This function can return early instead of jumping around or fail
> > if we cannot reserve n items. In the worst case this function will
> > only hide the overflow macros.
>
> OK

While I think this is a good idea, when I tried to make it work the function wasn't very elegant. The problem is trying to make the output "atomic", i.e. only updating *newcap once.

What was worse, reading the disassembly, Clang managed to produce something that was less efficient, even though it inlined the function.

Rich.

-- 
Richard Jones, Virtualization Group, Red Hat http://people.redhat.com/~rjones
Read my programming and virtualization blog: http://rwmj.wordpress.com
virt-builder quickly builds VMs from scratch
http://libguestfs.org/virt-builder.1.html

Laszlo Ersek

Wednesday, 10 November

7:47 a.m.

New subject: [PATCH nbdkit 2/2] common: Add checked-overflow macros and use for safe vector extension

On 11/09/21 21:54, Richard W.M. Jones wrote:

...

> On Tue, Nov 09, 2021 at 07:27:12PM +0000, Richard W.M. Jones wrote:
>> On Tue, Nov 09, 2021 at 08:53:28PM +0200, Nir Soffer wrote:
>>> I would try to extract the code to compute the new capacity into a helper
>>> function:
>>>
>>>     if (next_capacity(v->cap, n, itemsize, &newcap))
>>>         return -1;
>>>
>>> This function can return early instead of jumping around or fail
>>> if we cannot reserve n items. In the worst case this function will
>>> only hide the overflow macros.
>>
>> OK
>
> While I think this is a good idea, when I tried to make it work the
> function wasn't very elegant. The problem is trying to make the output
> "atomic", ie. only updating *newcap once.

We could always use a temporary (local) variable for that, and only assign it to the output parameter at the very end. (Either way, if the helper function fails, it's OK to have the output param(s) with indeterminate value. Assuming we document that.) The compiler can still keep the local variable in a register.
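A sketch of what such a helper could look like, combining the early-return idea with local temporaries that are copied to the output parameters only at the end. The name next_capacity comes from the suggestion earlier in the thread; the extra newbytes output, the exact signature and the static helper grown_size are assumptions made for this example and do not come from the posted series.

```c
#include <errno.h>
#include <stdbool.h>
#include <stddef.h>

/* Copied from the posted checked-overflow.h (GCC/Clang only). */
#define ADD_SIZE_T_OVERFLOW(a, b, r) __builtin_add_overflow((a), (b), (r))
#define MUL_SIZE_T_OVERFLOW(a, b, r) __builtin_mul_overflow((a), (b), (r))

/* Try the 3/2 growth policy: cap + (cap + 1) / 2 items.  Returns true
 * and fills in the outputs only if nothing overflowed.
 */
static bool
grown_size (size_t cap, size_t itemsize, size_t *cap_out, size_t *bytes_out)
{
  size_t extracap, grown, bytes;

  if (ADD_SIZE_T_OVERFLOW (cap, 1, &extracap))
    return false;
  extracap /= 2;
  if (ADD_SIZE_T_OVERFLOW (cap, extracap, &grown))
    return false;
  if (MUL_SIZE_T_OVERFLOW (grown, itemsize, &bytes))
    return false;

  *cap_out = grown;
  *bytes_out = bytes;
  return true;
}

/* Compute the capacity (in items) and size (in bytes) to allocate so
 * that n more items fit.  Returns 0 on success.  Returns -1 with
 * errno = ENOMEM, leaving the outputs untouched, if the request
 * itself overflows.
 */
int
next_capacity (size_t cap, size_t n, size_t itemsize,
               size_t *newcap, size_t *newbytes)
{
  size_t reqcap, reqbytes, growcap, growbytes;

  if (ADD_SIZE_T_OVERFLOW (cap, n, &reqcap) ||
      MUL_SIZE_T_OVERFLOW (reqcap, itemsize, &reqbytes)) {
    errno = ENOMEM;
    return -1;
  }

  /* Prefer the grown size, but only if it is computable and large
   * enough; otherwise fall back to exactly what was requested.
   */
  if (grown_size (cap, itemsize, &growcap, &growbytes) &&
      growbytes >= reqbytes) {
    *newcap = growcap;
    *newbytes = growbytes;
  }
  else {
    *newcap = reqcap;
    *newbytes = reqbytes;
  }
  return 0;
}
```

Because every intermediate value lives in a local, each output is assigned exactly once, and no goto is needed to reach the fallback.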

...

> What was worse, reading the disassembly Clang managed to produce
> something that was less efficient, even though it inlined the
> function.

I think clarity / safety around integer overflows beats "performance of generated assembly", even in a hot path (which I understand this function *not* to be in). The C source code we end up including here should remain undisturbed for a long time; the generated assembly will change as compiler versions come and go. The patch does use compiler builtins, so we've done the expected (= trusted the compiler with generating the best possible assembly).

(Trying to steer assembly output via C code manipulation (let alone OCaml code manipulation) is something I don't understand in general. For how many architectures are we willing to eyeball and tweak the generated assembly? How stable is the generated assembly over different versions of the same compiler? If the assembly code is so important, we should *code* the logic in assembly; perhaps by involving arch-specific assemblers in the makefiles, or by using inline assembly with #ifdefs. (Some projects do the former regularly, for example OpenSSL and edk2.))

Thanks,
Laszlo

Eric Blake

Tuesday, 9 November

1:56 p.m.

New subject: [PATCH nbdkit 2/2] common: Add checked-overflow macros and use for safe vector extension

On Tue, Nov 09, 2021 at 08:53:28PM +0200, Nir Soffer wrote:

...

> On Tue, Nov 9, 2021 at 7:49 PM Richard W.M. Jones <rjones(a)redhat.com> wrote:
> > +++ b/common/include/checked-overflow.h

...

> > +
> > +/* This header file defines functions for checking overflow in common
> > + * integer arithmetic operations.
> > + *
> > + * It uses GCC/Clang built-ins: a possible future enhancement is to
> > + * provide fallbacks in plain C or for other compilers.  The only
> > + * purpose of having a header file for this is to have a single place
> > + * where we would extend this in future.
> > + */

gnulib has such checks ported to other compilers (in intprops.h), but with an incompatible license that we can't copy.

...

> > +
> > +#ifndef NBDKIT_CHECKED_OVERFLOW_H
> > +#define NBDKIT_CHECKED_OVERFLOW_H
> > +
> > +#if !defined(__GNUC__) && !defined(__clang__)
> > +#error "this file may need to be ported to your compiler"
> > +#endif
> > +
> > +/* Add two uint64_t values.  Returns true if overflow happened. */
> > +#define ADD_UINT64_T_OVERFLOW(a, b, r) __builtin_add_overflow((a), (b), (r))

I was able to figure out that this performs '*r = a + b' by guessing from the macro parameter names; I'm not sure if adding that to the comment would help other readers less familiar with the compiler builtin.
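For readers less familiar with the built-ins, a small sketch of the semantics: the macro stores the (possibly wrapped) sum through r and returns true if the exact sum did not fit. The macro definition is copied from the patch; the wrapper function is hypothetical and exists only so that the behaviour can be called and checked.

```c
#include <stdbool.h>
#include <stdint.h>

/* Copied from the posted checked-overflow.h (GCC/Clang only). */
#define ADD_UINT64_T_OVERFLOW(a, b, r) __builtin_add_overflow((a), (b), (r))

/* Hypothetical wrapper: performs *r = a + b and reports overflow.
 * On overflow *r still receives the sum truncated to 64 bits, as
 * documented for the GCC/Clang built-in.
 */
bool
add_u64 (uint64_t a, uint64_t b, uint64_t *r)
{
  return ADD_UINT64_T_OVERFLOW (a, b, r);
}
```

So add_u64 (2, 3, &r) returns false with r set to 5, while add_u64 (UINT64_MAX, 1, &r) returns true with r set to 0. A short "computes *r = a + b" note next to each macro would state the same thing without requiring the reader to know the built-in.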

...

> > +
> > +/* Multiply two uint64_t values.  Returns true if overflow happened. */
> > +#define MUL_UINT64_T_OVERFLOW(a, b, r) __builtin_mul_overflow((a), (b), (r))
> > +
> > +/* Add two size_t values.  Returns true if overflow happened. */
> > +#define ADD_SIZE_T_OVERFLOW(a, b, r) __builtin_add_overflow((a), (b), (r))
> > +
> > +/* Multiply two size_t values.  Returns true if overflow happened. */
> > +#define MUL_SIZE_T_OVERFLOW(a, b, r) __builtin_mul_overflow((a), (b), (r))

gcc also has forced-type builtins, but using __builtin_uaddll_overflow() would be wrong (it's harder to prove whether uint64_t is 'unsigned long' or 'unsigned long long'), so using the generic macro is what we want anyways. The fact that you have two different macros (for UINT64_T and SIZE_T) that expand to the same thing is a side-effect of gcc's implementation; if we later have to code up a C language fallback for other compilers, that fallback may indeed need different implementations between the two macros. So I'm fine with what looks like duplication here.
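As a sketch of what a plain C fallback for another compiler might look like (this is an assumption about a possible future port, not part of the patch), unsigned overflow can be detected with standard arithmetic alone. The function names are hypothetical. Because each fallback has to name a concrete type and limit, this also illustrates why the UINT64_T and SIZE_T macros could end up with different implementations.

```c
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical fallback: *r = a + b for size_t, true on overflow.
 * Unsigned addition wraps, and it wrapped exactly when the result is
 * smaller than one of the operands.
 */
bool
add_size_t_overflow_fallback (size_t a, size_t b, size_t *r)
{
  *r = a + b;
  return *r < a;
}

/* Hypothetical fallback: *r = a * b for size_t, true on overflow.
 * The exact product exceeds SIZE_MAX exactly when a != 0 and
 * b > SIZE_MAX / a.
 */
bool
mul_size_t_overflow_fallback (size_t a, size_t b, size_t *r)
{
  *r = a * b;
  return a != 0 && b > SIZE_MAX / a;
}

/* The uint64_t variants have the same shape but use a different type
 * and a different limit.
 */
bool
add_uint64_t_overflow_fallback (uint64_t a, uint64_t b, uint64_t *r)
{
  *r = a + b;
  return *r < a;
}

bool
mul_uint64_t_overflow_fallback (uint64_t a, uint64_t b, uint64_t *r)
{
  *r = a * b;
  return a != 0 && b > UINT64_MAX / a;
}
```

The multiplication test divides instead of multiplying, so the check itself can never overflow.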

...

> > +++ b/common/utils/vector.c
> > @@ -36,6 +36,7 @@
> >  #include <stdlib.h>
> >  #include <errno.h>
> >
> > +#include "checked-overflow.h"
> >  #include "vector.h"
> >
> >  int
> > @@ -44,21 +45,30 @@ generic_vector_reserve (struct generic_vector *v, size_t n, size_t itemsize)
> >    void *newptr;
> >    size_t reqcap, reqbytes, newcap, newbytes;
> >
> > -  /* New capacity requested.  We must allocate this minimum (or fail). */
> > -  reqcap = v->cap + n;
> > -  reqbytes = reqcap * itemsize;
> > -  if (reqbytes < v->cap * itemsize) {
> > +  /* New capacity requested.  We must allocate this minimum (or fail).
> > +   *   reqcap = v->cap + n
> > +   *   reqbytes = reqcap * itemsize
> > +   */
>
> This should explain the next lines? I'm not sure about it, it makes
> the code more complicated and the commented code can get out
> of sync with the actual code.
>
> > +  if (ADD_SIZE_T_OVERFLOW (v->cap, n, &reqcap) ||
> > +      MUL_SIZE_T_OVERFLOW (reqcap, itemsize, &reqbytes)) {

Yes, the code in the comment is pseudocode for the macro use; I agree that the duplication is a slight risk of losing sync, but it's a weak argument (the comment is quite close to the code, and this file is not likely to be frequently rewritten). I'm fine keeping it.

...

Is order guaranteed?

Yes, because of ||.
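To spell that out: the || operator evaluates its left operand first, has a sequence point after it, and only evaluates the right operand if the left one was false. So reqcap is always written before the multiplication reads it, and the multiplication is skipped if the addition overflowed. A minimal sketch (the function name is hypothetical, the two builtins are the ones the patch's macros expand to):

```c
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper, for illustration only.
 * Computes *reqbytes = (cap + n) * itemsize.
 * Returns true if either step overflowed.
 */
static bool
required_bytes_overflow (size_t cap, size_t n, size_t itemsize,
                         size_t *reqbytes)
{
  size_t reqcap;

  /* Left operand runs first and stores reqcap.  The right operand
   * runs only if the addition did not overflow, so it never reads a
   * stale or wrapped reqcap.
   */
  return __builtin_add_overflow (cap, n, &reqcap) ||
         __builtin_mul_overflow (reqcap, itemsize, reqbytes);
}
```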

...

I think it will be more clear as separate if blocks, even if we need to have 2 blocks for returning ENOMEM.

2 blocks is useful if you ever expect to be in a gdb session trying to figure out which of the two conditions failed. But for this one, I'm fine with one.

...

>      errno = ENOMEM;
> -    return -1; /* overflow */
> +    return -1;
>    }
>
>    /* However for the sake of optimization, scale buffer by 3/2 so that
>     * repeated reservations don't call realloc often.
> +   *   newcap = v->cap + (v->cap + 1) / 2
> +   *   newbytes = newcap * itemsize
>     */
> -  newcap = v->cap + (v->cap + 1) / 2;
> -  newbytes = newcap * itemsize;
> -
> +  if (ADD_SIZE_T_OVERFLOW (v->cap, 1, &newcap))
> +    goto fallback;
> +  newcap /= 2;
> +  if (ADD_SIZE_T_OVERFLOW (v->cap, newcap, &newcap))
> +    goto fallback;

This probably works but adding v->cap and newcap and storing back in newcap is pretty confusing. I would use a temporary:

    if (ADD_SIZE_T_OVERFLOW (v->cap, 1, &extracap))
      goto fallback;
    extracap /= 2;
    if (ADD_SIZE_T_OVERFLOW (v->cap, extracap, &newcap))
      goto fallback;

> +  if (MUL_SIZE_T_OVERFLOW (newcap, itemsize, &newbytes))
> +    goto fallback;
>    if (newbytes < reqbytes) {
> +  fallback:

Jumping inside an if block is evil.

Not the first time our code base has done it. It's not always the cleanest, but it is more compact than a number of other alternatives without too much hassle.

...

That is indeed an option which may improve legibility for the next reader, even if it costs a few more lines of code now.

-- 
Eric Blake, Principal Software Engineer
Red Hat, Inc. +1-919-301-3266
Virtualization: qemu.org | libvirt.org

Laszlo Ersek

Wednesday, 10 November

7:36 a.m.

New subject: [PATCH nbdkit 2/2] common: Add checked-overflow macros and use for safe vector extension

Two comments only:

On 11/09/21 18:49, Richard W.M. Jones wrote:

...

(1) Typo: this too should be "Multiply".

...

+#define MUL_SIZE_T_OVERFLOW(a, b, r) __builtin_mul_overflow((a), (b), (r))
+
+#endif /* NBDKIT_CHECKED_OVERFLOW_H */
diff --git a/common/utils/vector.c b/common/utils/vector.c
index dff051e9..e4ea7f3f 100644
--- a/common/utils/vector.c
+++ b/common/utils/vector.c
@@ -36,6 +36,7 @@
 #include <stdlib.h>
 #include <errno.h>

+#include "checked-overflow.h"
 #include "vector.h"

 int
@@ -44,21 +45,30 @@ generic_vector_reserve (struct generic_vector *v, size_t n, size_t itemsize)
   void *newptr;
   size_t reqcap, reqbytes, newcap, newbytes;

-  /* New capacity requested. We must allocate this minimum (or fail). */
-  reqcap = v->cap + n;
-  reqbytes = reqcap * itemsize;
-  if (reqbytes < v->cap * itemsize) {
+  /* New capacity requested. We must allocate this minimum (or fail).
+   *   reqcap = v->cap + n
+   *   reqbytes = reqcap * itemsize
+   */
+  if (ADD_SIZE_T_OVERFLOW (v->cap, n, &reqcap) ||
+      MUL_SIZE_T_OVERFLOW (reqcap, itemsize, &reqbytes)) {
     errno = ENOMEM;
-    return -1; /* overflow */
+    return -1;
   }

   /* However for the sake of optimization, scale buffer by 3/2 so that
    * repeated reservations don't call realloc often.
+   *   newcap = v->cap + (v->cap + 1) / 2
+   *   newbytes = newcap * itemsize
    */
-  newcap = v->cap + (v->cap + 1) / 2;
-  newbytes = newcap * itemsize;
-
+  if (ADD_SIZE_T_OVERFLOW (v->cap, 1, &newcap))
+    goto fallback;
+  newcap /= 2;
+  if (ADD_SIZE_T_OVERFLOW (v->cap, newcap, &newcap))
+    goto fallback;
+  if (MUL_SIZE_T_OVERFLOW (newcap, itemsize, &newbytes))
+    goto fallback;
   if (newbytes < reqbytes) {
+  fallback:
     /* If that either overflows or is less than the minimum requested,
      * fall back to the requested capacity.
      */

(2) I have to agree with Nir here; the "goto" into the "if" body is unbearable. :) How about:

  if (ADD_SIZE_T_OVERFLOW (v->cap, 1, &newcap) ||
      ADD_SIZE_T_OVERFLOW (v->cap, newcap / 2, &newcap) ||
      MUL_SIZE_T_OVERFLOW (newcap, itemsize, &newbytes) ||
      newbytes < reqbytes) {
    /* If that either overflows or is less than the minimum requested,
     * fall back to the requested capacity.
     */
    ...
  }

In this case, I've moved the halving of "newcap" into ADD_SIZE_T_OVERFLOW.

If we need more complex expressions, such that are difficult to chain within a logical expression -- modulo the ugly "comma" operator! --, or even if we just want to encapsulate this better, we can still use a helper function. It's perfectly fine (IMO) to use a number of early "returns" in a helper function.

...

Huh, next_capacity() is exactly what Nir suggested too. Then: "I agree". :)

Thanks
Laszlo
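For illustration, a helper along those lines might look roughly like the sketch below. Nir's actual next_capacity() is elided from this thread, so the signature, the output parameters and the return convention here are all assumptions. Only the two macros and the 3/2 growth rule come from the patch.

```c
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Same expansions as in the patch under review. */
#define ADD_SIZE_T_OVERFLOW(a, b, r) __builtin_add_overflow((a), (b), (r))
#define MUL_SIZE_T_OVERFLOW(a, b, r) __builtin_mul_overflow((a), (b), (r))

/* Hypothetical helper (signature is an assumption, not from the thread).
 * Computes the capacity to allocate when growing from 'cap' items by at
 * least 'n' items of 'itemsize' bytes.  Returns 0 and fills *newcap and
 * *newbytes on success, or -1 if even the minimum request overflows.
 */
static int
next_capacity (size_t cap, size_t n, size_t itemsize,
               size_t *newcap, size_t *newbytes)
{
  size_t reqcap, reqbytes, extracap, growcap, growbytes;

  /* Minimum we must allocate: reqcap = cap + n. */
  if (ADD_SIZE_T_OVERFLOW (cap, n, &reqcap) ||
      MUL_SIZE_T_OVERFLOW (reqcap, itemsize, &reqbytes))
    return -1;

  /* Start from the minimum; every early return below keeps it. */
  *newcap = reqcap;
  *newbytes = reqbytes;

  /* Try to scale by 3/2: growcap = cap + (cap + 1) / 2. */
  if (ADD_SIZE_T_OVERFLOW (cap, 1, &extracap))
    return 0;
  extracap /= 2;
  if (ADD_SIZE_T_OVERFLOW (cap, extracap, &growcap) ||
      MUL_SIZE_T_OVERFLOW (growcap, itemsize, &growbytes))
    return 0;
  if (growbytes < reqbytes)
    return 0;

  *newcap = growcap;
  *newbytes = growbytes;
  return 0;
}
```

With this shape the caller would set errno = ENOMEM on -1 and otherwise pass *newbytes to realloc, with neither a goto nor a jump into an if body. The separate extracap temporary follows the suggestion made earlier in the review.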

Richard W.M. Jones

7:45 a.m.

New subject: [PATCH nbdkit 2/2] common: Add checked-overflow macros and use for safe vector extension

On Wed, Nov 10, 2021 at 02:36:18PM +0100, Laszlo Ersek wrote:

...

How about:

  if (ADD_SIZE_T_OVERFLOW (v->cap, 1, &newcap) ||
      ADD_SIZE_T_OVERFLOW (v->cap, newcap / 2, &newcap) ||
      MUL_SIZE_T_OVERFLOW (newcap, itemsize, &newbytes) ||
      newbytes < reqbytes) {
    /* If that either overflows or is less than the minimum requested,
     * fall back to the requested capacity.
     */
    ...
  }

Oh that's a lot better. Let me do that instead ...

Rich.

-- 
Richard Jones, Virtualization Group, Red Hat http://people.redhat.com/~rjones
Read my programming and virtualization blog: http://rwmj.wordpress.com
virt-top is 'top' for virtual machines. Tiny program with many powerful monitoring features, net stats, disk stats, logging, etc.
http://people.redhat.com/~rjones/virt-top

Eric Blake

Tuesday, 9 November

1:45 p.m.

On Tue, Nov 09, 2021 at 05:49:16PM +0000, Richard W.M. Jones wrote:

...

That's the modern edge of the spectrum; will we be interfering with compilation on older distros with older compilers? I guess CI testing can help find that out quickly.

...

I verified by disassembly that "jo" (jump overflow) / "jno" is used where it was not used previously, by both compilers.

Nice, and an argument in favor of doing this.

-- 
Eric Blake, Principal Software Engineer
Red Hat, Inc. +1-919-301-3266
Virtualization: qemu.org | libvirt.org


guestfs@lists.libguestfs.org


participants (4)

Eric Blake
Laszlo Ersek
Nir Soffer
Richard W.M. Jones
