[RFC] Refactoring OpenWrt's build infra

Christian Marangi ansuelsmth at gmail.com
Sun Oct 16 05:16:45 PDT 2022


On Sun, Oct 16, 2022 at 02:07:05PM +0200, Baptiste Jonglez wrote:
> Hi,
> 
> On 05-10-22, Thibaut wrote:
> > Hi,
> > 
> > Following an earlier conversation on IRC with Petr, I’m willing to work on refactoring our buildbot setup as follows:
> > 
> > - single master for each stage (images and packages)
> > - latent workers attached to either master, thus able to build opportunistically from either master or release branches as needed / as work becomes available
> 
> This is a good idea, but I see one main downside: we would probably have
> to use the same buildbot worker image for all releases.
> 
> From what I remember, when the worker image was updated from Debian 9 to
> Debian 10, this seriously broke 19.07 builds.  Maybe Petr or Jow will
> remember the details better.
> 
> I see two ways to address this:
> 
> - either buildbot can run latent workers with a different Docker image
>   depending on the build
>

IMHO, this would be the safest and better solution to the problem. But
this means that we will have to support 2 thing instead of having one
centrilized container.

> - otherwise, we have to think early about the update strategy.  Maybe use
>   the shared buildbot instance for master branch + most recent release
>   only, and move older releases back to a dedicated buildbot instance?
> 
> > The main upside is that all buildslaves could be pooled, improving overall throughput and reducing wasted « idle time », thus lowering build times and operating costs.
> > 
> > Petr also suggested that extra release workers could be spawned at will (through e.g. cloud VMs) when a new release is to be tagged; tagged release could be scheduled only to release workers: this would still work within this « single master » build scheme.
> > 
> > NB: I’m aware of the potential performance penalty of having buildslaves randomly switching between branches, so I would try to come up with a reasonably smart solution to this issue if it doesn’t conflict with the main goals.
> 
> One thing to look for is disk space usage.  Full disks is a common cause
> of build failures.  If a single worker goes through builds for different
> branches, I would expect disk usage to be higher (e.g. more different
> versions of software in dl/).
> 

Would be ideal to have one centrilized dl/ dir where each runner can go
and take the file. We already support that in openwrt (to have a
different dl dir) and there isn't any problem with having different
release tar for the same package.

-- 
	Ansuel



More information about the openwrt-devel mailing list