Commit graph

599 commits

Author SHA1 Message Date
Matt Sturgeon b780d50910
ci/github-script/bot: fix concurrency limit (#459207) 2025-11-06 17:59:26 +00:00
Wolfgang Walther 1311ce348c
ci/github-script/merge: add hint about stuck GitHub (#459122) 2025-11-06 17:11:28 +00:00
Wolfgang Walther a146035a2b
ci/github-script/bot: fix concurrency limit
This was introduced as part of the hotfix PR to avoid hitting API rate
limits - but the condition was wrong. It was supposed to trigger in all
PR contexts, not only for the Test workflow.
2025-11-06 17:46:56 +01:00
Wolfgang Walther cd7f83638e
ci/github-script/bot: limit concurrency in PR runs
This lead to reaching secondary API limits in a treewide recently, so we
better limit it to where we actually need it.
2025-11-06 16:17:22 +01:00
Wolfgang Walther 17199e5ff6
ci/github-script/reviewers: add TODO about future optimization
We still use a few too many API requests by checking team members for
collaborator status - we can improve on that in the future.
2025-11-06 16:17:17 +01:00
Wolfgang Walther 9efe926863
ci/github-script/reviewers: exit early for treewides
When hitting a treewide, we would previously find the username for each
user and then check all of them for collaborator status - only to then
realize that this results in more than 15 reviewers and exit.

We can put a simple stop-gap in, even before de-duplicating the combined
lists of maintainers and owners as safe guard. We could still hit huge
numbers of code owners, but in practice we don't nearly as many as
maintainers, so this will be sufficient for now.
2025-11-06 16:17:12 +01:00
Wolfgang Walther 51acc56dcb
ci/github-script/merge: ignore PRs with >= 100 files
We use the files endpoint to get a list of all *names* of files touched
in the PR - but this endpoint will also actually download the files /
their diff, too. That's pointless and actually takes quite some time for
huge treewides.

We're just putting in a stop-gap for now, so that we're not burning more
than 1 API requests on this and don't spend so much time on it either. A
limit of 99 files will be more than enough for quite some time - we will
only need to raise this when we're able to represent package sets in
by-name properly and have "package set maintainers", who are not
committers.
2025-11-06 16:17:08 +01:00
Wolfgang Walther d086c6c6b3
ci/github-script/merge: add hint about stuck GitHub
Unfortunately it still happens frequently that, after enabling
auto-merge, GitHub is stuck even though all checks have passed, and
doesn't merge the PR. Any contributor can trigger GitHub again with an
approval of the PR - this will then immediately queue the PR for merge.

Adding a hint to the posted comment, should help users through this
without my intervention.
2025-11-06 15:18:01 +01:00
Wolfgang Walther 4658d0d5a3
ci/github-script/bot: fix needs reviewer label
The recent change to use the result of requesting reviewers for setting
the `needs: reviewer` label caused a regression: It would not set the
label for PRs where no reviewers were requested, because *too many were
eligible*. Still - these PRs don't have reviewers, so they need
attention otherwise - via the label.
2025-11-06 15:10:58 +01:00
Wolfgang Walther d76ffa4136
ci/github-script/bot: fix collaborator warning
This was introduced shortly before merge of the reviewers.js file, but
not actually tested - I thought it was not easy to find a PR triggering
this warning. However, the scheduled run told me otherwise: The
staging-next PR is the perfect candidate.
2025-11-06 10:20:48 +01:00
Wolfgang Walther c4548e58fb
ci/github-script/bot: fix scheduled bot with older artifacts
We only recently introduced the owners.txt file to the comparison
artifact, so once the bot runs on a schedule it will it older artifacts
very quickly - and then can't find the owners file.

We can fallback to an empty owners list in this case, because an older
artifact also means an older workflow run previously, so this will have
pinged owners already.
2025-11-06 09:53:29 +01:00
Wolfgang Walther e68b0aef13
ci/github-script/reviewers: improve "needs: reviewers" label
This should fix the bug where the "needs: reviewer" label was set too
early, just to be removed immediately, because reviewers were then
requested.
2025-11-05 21:59:02 +01:00
Wolfgang Walther a23d0ab24c
ci/github-script/bot: request reviewers
This migrates the bash code to request reviewers to github-script. This
will allow multiple nice improvements later on, but at this stage it's
mostly a reduction in code and complexity.
2025-11-05 21:58:56 +01:00
Wolfgang Walther df6a9a739d
ci/github-script/bot: disregard bot and ghost approvals
We technically counted bot approvals and approvals by deleted users for
the approval labels as well. The former don't exist, yet, but if they
were, I don't think we'd count them. The latter should arguably *not* be
counted, because we can't tell anymore *who* approved, so we can't put
any weight on it as reviewers.

This simplifies the logic, too.
2025-11-05 21:42:28 +01:00
Matt Sturgeon ef3dca70a6
ci/treefmt: disable biome settings validation
The treefmt-nix `biome.settings` validation uses inputs that are liable
to hash mismatch.

See https://github.com/numtide/treefmt-nix/pull/430
2025-11-05 19:41:21 +01:00
Matt Sturgeon 7f7f879f92
Revert "ci/treefmt: disable biome for now"
This reverts commit 66260cc8c4.
2025-11-05 19:41:20 +01:00
Wolfgang Walther 2b819576cb
ci/pinned: update
This gives us a bugfix in treefmt-nix for biome.

From the nixpkgs-unstable channel:
https://hydra.nixos.org/build/312625570#tabs-buildinputs

Changes for treefmt-nix:
f56b1934f5...4ef3dfdbb5
2025-11-05 19:41:18 +01:00
Matt Sturgeon 66260cc8c4
ci/treefmt: disable biome for now
Disable biome due to a hash mismatch with validation for the
`settings.formatter.biome.options` option.

See https://github.com/numtide/treefmt-nix/pull/430
2025-11-05 00:22:15 +00:00
Matt Sturgeon ae90bb6238
Revert "wprkflows/bot: increase frequency to every 5 minutes" (#458570) 2025-11-04 20:16:42 +00:00
Wolfgang Walther 12c1f0253a
ci/github-script/merge: improve merge operation and error messages (#458412) 2025-11-04 19:54:02 +00:00
Wolfgang Walther 1e6124a504
ci/github-script/merge: list eligible users in comment
When a user tries to merge a PR, but is not allowed to, it is helpful to
explicitly list the users who *are* allowed. This helps explaining *why*
the merge-eligible label was set.

I objected to this proposal before, because it would incur too many API
requests. But after we have restructured the checklist, this is not
actually true anymore - we can now sensibly run this only when a comment
is posted and not whenever we check a PR for eligibility.
2025-11-04 20:50:41 +01:00
Wolfgang Walther 74d6ba3ab4
Revert "wprkflows/bot: increase frequency to every 5 minutes"
This partially reverts commit 1197fe48da.

GitHub just doesn't schedule these narrow intervals. 10 minutes is
alright in practice.
2025-11-04 19:49:07 +01:00
Wolfgang Walther 58a1fe4761
ci/github-script/bot: move getTeamMembers cache into main file
This allows re-using this elsewhere with a shared cache.
2025-11-04 16:33:16 +01:00
Wolfgang Walther 1197fe48da
wprkflows/bot: increase frequency to every 5 minutes
This makes reactions to merge comments and all the labeling a bit
quicker. Lower the number of backlog items to process per run
accordingly, so that we don't really need more API requests for it.
2025-11-04 16:13:41 +01:00
Wolfgang Walther 810b9ba51d
ci/github-script/bot: improve parallelism
We used to employ the worst strategy for parallelism possibly: The rate
limiter capped us at one concurrent request per second, while 100+ items
were handled in parallel. This lead to every item taking the full
duration of the job to proceed, making the data fetched at the beginning
of the job stale at the end. This leads to smaller hiccups when
labeling, or to the merge-bot posting comments after the PR has already
been closed.

GitHub allows 100 concurrent requests, but considers it a best practice
to serialize them. Since serializing all of them causes problems for us,
we should try to go higher.

Since other jobs are running in parallel, we use a conservative value of
20 concurrent requests here. We also introduce the same number of
workers going through the list of items, to make sure that each item is
handled in the shortest time possible from start to finish, before
proceeding to the next. This gives us roughly 2.5 seconds per individual
item - but speeds up the overall execution of the scheduled job to 20-30
seconds from 3-4 minutes before.
2025-11-04 16:13:40 +01:00
Wolfgang Walther 2d6602908b
ci/github-script/merge: improve testability
By only ignoring already-handled comments when running non-dry, it's
much easier to look at existing PRs, for which the merge bot already
commented, and iterate on them locally.

It's dry mode anyway, so it won't hurt to get a few more merge comments
in the console output.
2025-11-04 15:41:50 +01:00
Wolfgang Walther 747d9e2d34
ci/github-script/merge: switch order of merge operations
We previously used auto-merge first and then enqueued explicitly on the
assumption that auto-merge would fail if the PR was actually in
mergeable state already. This turned out to be false.

Instead, we currently face the problem of auto-merge sometimes getting
stuck. This seems to happen when, at the time of enabling auto-merge,
the required status checks already passed and the PR would be ready to
go - but sometimes GitHub doesn't do it. This *can* be unblocked by
approving the PR again, which seems to run the internal "let's check
whether we can merge this" procedures on the GitHub side again.

However, we can probably also solve this by just explicitly trying to
enqueue the PR first - and only if that fails, fall back to auto-merge.
I previously argued against that, based on a potential race condition,
in which a PR could become ready to merge between these two requests -
at which point the auto-merge operation would fail, if the original
assumption was true. But since we don't observe this, we might as well
switch.
2025-11-04 10:06:36 +01:00
Wolfgang Walther c768b4243e
ci/github-script/bot: fix infinite labeling cycle
When we recently refactored the code to use the maintainer map for
related labels, we made a mistake: When a PR has no packages with
maintainers returned from eval, the label would internally be set to `0`
instead of `false`.

The code would then go on compare the before and after labels with
strict equality - and assume a difference, because `0 !== false`. Thus,
it seemed like new labels needed to be set, so the PUT request was
actually sent. Of course, the labels were actually the same - when
filtering the labels to be set, the `0` would also be treated as falsy,
so the label would not be set. This would result in no visible change in
the PR, but internall GitHub would replace the `updated_at` timestamp
for that PR - after all we replaced all labels.

Repeatedly updating *all* PRs we're looking at quickly causes problems,
because we are going to look at the same PRs *again* in the next cycle -
essentially causing infinite recursion. The bot became slower and slower
over time, because it had to process more and more PRs each run.

Simply casting this to a proper Boolean, should get us out of the mess
soon.
2025-11-03 19:28:43 +01:00
Wolfgang Walther 6ad16e0620
ci/github-script/merge: fix with deleted users (#458074) 2025-11-03 11:19:29 +00:00
Wolfgang Walther 43f3fcc555
ci/github-script/merge: fix with deleted users
When a deleted user had approved a PR, this will cause the merge-bot to
fail.
2025-11-03 12:17:19 +01:00
Wolfgang Walther 5407abeb7d
ci/github-script/merge: unify terms for authoring and creating PRs
I didn't like r-ryantm "authoring"; so I changed that to "created"
earlier. Arguably, using "opened" is more consistent with what is
actually checked and can consistently be used for both.
2025-11-03 11:59:13 +01:00
Wolfgang Walther e0c0b2c54c
ci/github-script/merge: improve feedback for by-name check
The by-name check would previously be green when the
`pkgs/by-name/README.md` file was changed. This would still not mean the
maintainer was able to merge the PR, because there'd be no maintainer
for that file, but the feedback was not 100% accurate.
2025-11-03 11:59:08 +01:00
Michael Daniels 41a3c23cdc
treewide: drop figsoda as maintainer (part 4)
These were done manually by me, either due to not matching the regexes in the previous ones, or because of nixf-diagnose, which I have as a pre-commit hook.
2025-11-02 20:16:11 -05:00
Wolfgang Walther ffdc8205e5
workflows/bot: allow maintainer merges after committer approval
This allows committers to approve PRs with additional, optional nits
that the author-maintainer can either address or merge immediately
without these changes.

It also allows committers to approve a PR for merge, while still waiting
for other maintainers to give their feedback - they can then merge the
PR directly instead of passing it back to the committer.
2025-11-02 19:35:33 +01:00
Wolfgang Walther 9a637aa7a4
ci/github-script/merge: restructure head SHA check
While it was already the case that only merge comments *after* the
latest push were acted on, the logic wasn't easy to understand. This
change should make it more obvious, specially in combination with the
next commit, that all steps (comments, approvals, merge) must happen on
the same SHA - the current head SHA of the PR.
2025-11-02 19:35:32 +01:00
Wolfgang Walther 37b7773907
workflows/bot: allow maintainers to merge backports (#451324) 2025-11-02 18:11:52 +00:00
Wolfgang Walther c0b6cc9387
ci/eval/compare: fix without owners
Even without relevant owners, the owners.txt file must be created,
otherwise the next job will fail.
2025-11-02 17:30:46 +01:00
Wolfgang Walther 91c4d9236b
workflows/bot: allow maintainers to merge backports
All other conditions equal, there is no reason to prevent maintainers
from backporting changes to their packages. Maintainers are probably in
the *best* position to tell whether a certain change is backportable or
not - because they know the package well.
2025-11-02 17:26:01 +01:00
Wolfgang Walther 008ea3df2c
ci/request-reviews: fix request-reviewers.sh
We recently moved some code around and forgot to adjust the log line
here.
2025-11-02 17:10:07 +01:00
Wolfgang Walther 99750f21e0
ci/github-script/merge: various improvements (#457652) 2025-11-02 15:42:18 +00:00
Wolfgang Walther 1774ef870d
ci/request-reviews: untangle owner-related bash code (#457503) 2025-11-02 15:41:16 +00:00
Wolfgang Walther 84d6678f3b
ci/github-script/merge: support OR conditions
This supports AND on the first and OR on the second level, which is
needed for some follow up work like backports, approval based merges or
trusted maintainers.
2025-11-02 16:36:14 +01:00
Wolfgang Walther 6848f93842
ci/github-script/merge: add TODO about second merge method
We have not observed this merge method being used in practice, yet. Not
in the new bot, not in the old bot. It seems like auto-merge works for
all cases.
2025-11-02 16:36:06 +01:00
Wolfgang Walther db8f50b4de
ci/github-script/merge: improve wording 2025-11-02 16:36:01 +01:00
Wolfgang Walther 2d0a8791fe
ci/github-script/merge: improve maintainer check 2025-11-02 16:35:56 +01:00
Wolfgang Walther 6a3c294f6f
ci/github-script/merge: move all conditions into runChecklist
No special casing anymore, all conditions are in the same place. This
also has the benefit of hiding the "has maintainers eligible for merge"
condition from comments, because it is only really relevant for
labeling.
2025-11-02 16:35:51 +01:00
Wolfgang Walther 7ea127c83a
ci/github-script/merge: move API requests out of runChecklist
This makes runChecklist mostly a pure function (except for logging) to
allow calling it repeatedly later.
2025-11-02 16:35:48 +01:00
Wolfgang Walther c7766c637f
ci/github-script/merge: improve caching of team members
This removes the need to `await` committers further down in the function
and allows re-using the cache for other teams later.
2025-11-02 16:35:16 +01:00
Matt Sturgeon 830653ddac
ci/README: document nixpkgs-merge-bot
Based on the README on the old nixpkgs-merge-bot repo[1], but updated to
reflect the current reality.

[1]: https://github.com/NixOS/nixpkgs-merge-bot
2025-11-01 23:12:03 +00:00
Wolfgang Walther 1aa72502fb
workflows/bot: fix permission in test workflow (#457575) 2025-11-01 17:57:59 +00:00