GRUB Out-Of-Memory Error Test Day - Second Part #862

Open
opened 2026-01-20 18:07:05 +00:00 by lsandova · 17 comments

Test day proposal

We have a new fix [1] and hopefully this one will be final. A brief summary
of this peculiar issue: the OOM error is seen on systems with a limited memory pool,
so we tried to increase the memory pool for grub_malloc [2], but that hit other (unexpected) issues [3], so
[2] was reverted.

Test Cases

There are two testing scenarios below. For both, please use test hardware (NEVER your working desktop/laptop) and temporarily DISABLE Secure Boot (the GRUB binaries are unsigned for the moment).

  • Option 1: ISO install (for those that cannot install Fedora due to OOM issues)
    0. Disable Secure boot

    1. Download the ISO [5], which contains the fix, and flash it to a USB stick
    2. Boot your hardware from it. This is the point where the OOM error has been seen on affected machines
    3. If possible and time allows, please install Fedora (use 'On the Network' in the Installation Source menu
      in case auto-detection of the installation media fails). We are asking for a full installation because last time we did not
      ask testers to do this step and we ended up with [3].
    4. Reboot your new Fedora System
    5. Follow instructions below (Option 2).
    6. Share your results.
  • Option 2: RPM install (for those already running a Fedora rawhide system)
    0. Disable Secure boot

    1. Boot your system
    2. Download the RPMs [4], e.g. koji download-task --arch=x86_64 --arch=noarch 141497105
    3. Install the RPMs, e.g. sudo dnf install *.rpm
    4. Reboot
    5. Share results
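
The Option 2 steps can be chained into a small script. This is only a sketch: the `run` helper, the `DRY_RUN` variable and the scratch directory name are hypothetical conveniences, while the `koji` and `dnf` invocations are the ones quoted in the steps above. With `DRY_RUN=1` it only prints the commands instead of running them.

```shell
#!/bin/sh
# Sketch of Option 2, steps 2-3. TASK_ID is the Koji task from reference [4].
TASK_ID=141497105

# Hypothetical helper: print the command when DRY_RUN=1, otherwise run it.
run() {
    if [ "${DRY_RUN:-0}" = "1" ]; then
        echo "+ $*"
    else
        "$@"
    fi
}

# Runs in a subshell, so the caller's working directory is left untouched.
install_test_rpms() (
    workdir=${1:-./grub2-oom-rpms}   # hypothetical scratch directory
    mkdir -p "$workdir" && cd "$workdir" &&
    run koji download-task --arch=x86_64 --arch=noarch "$TASK_ID" &&
    # ./*.rpm expands to whatever packages are in the scratch directory
    run sudo dnf install ./*.rpm &&
    echo "Next: reboot and share your results."
)
```

Calling `install_test_rpms` with no argument uses the default scratch directory; the reboot is left as a manual step.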

[1] https://src.fedoraproject.org/rpms/grub2/pull-request/207
[2] https://src.fedoraproject.org/rpms/grub2/pull-request/198
[3] https://bugzilla.redhat.com/show_bug.cgi?id=2427945
[4] https://koji.fedoraproject.org/koji/taskinfo?taskID=141497105
[5] https://people.redhat.com/lsandova/oom/boot-efi-alloc-on-verifiers.iso

Prerequisite for the Test Day

  • A bare metal machine (no VM)
  • Download [5] and write it to a USB stick with Fedora Media Writer
  • Make sure Secure Boot is disabled (mokutil --sb-state)
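
The prerequisites can be checked from a running system before rebooting into the test build. A minimal sketch, assuming `mokutil --sb-state` prints a line starting with "SecureBoot enabled" or "SecureBoot disabled", and assuming `systemd-detect-virt` (a systemd tool not mentioned in this ticket) is available and prints "none" on bare metal. The function names are hypothetical.

```shell
#!/bin/sh
# Map the text printed by `mokutil --sb-state` to a single word.
sb_state() {
    case "$1" in
        *"SecureBoot disabled"*) echo disabled ;;
        *"SecureBoot enabled"*)  echo enabled ;;
        *)                       echo unknown ;;
    esac
}

# Map the text printed by `systemd-detect-virt` to a single word;
# that tool prints "none" when no virtualization is detected.
machine_kind() {
    case "$1" in
        none) echo bare-metal ;;
        "")   echo unknown ;;
        *)    echo virtualized ;;
    esac
}

# Hypothetical wrapper: query the running system and report readiness.
# Succeeds only on bare metal with Secure Boot disabled.
preflight() {
    sb=$(sb_state "$(mokutil --sb-state 2>&1)")
    kind=$(machine_kind "$(systemd-detect-virt 2>/dev/null)")
    echo "Secure Boot: $sb, machine: $kind"
    [ "$sb" = "disabled" ] && [ "$kind" = "bare-metal" ]
}
```

Running `preflight` prints both findings and returns a non-zero status when either prerequisite is not met, so it can gate the rest of a test script.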

Time Frame

To-be-defined

Author

Sorry, I forgot to mention this before:

For ownership, I believe I need help from the QA team for hosting/organizing the test day.

Owner

Metadata Update from @kparal:

  • Custom field story_points adjusted to 2
  • Issue set to the milestone: Fedora 44
  • Issue tagged with: test days
Owner

Metadata Update from @jgroman:

  • Issue assigned to jgroman
Owner

Hi @lsandova

I'll prepare a wiki page and a testdays results page for you.

Regarding available dates - how would next week (Jan 26 - Feb 2) work for you?
There are already KDE and Gnome test weeks planned after that. It is not a big problem to have
two testdays share dates, so we could do that as well.

Thanks!

Author

Hi @jgroman ,

> Hi @lsandova
>
> I'll prepare a wiki page and a testdays results page for you.
>
> Regarding available dates - how would next week (Jan 26 - Feb 2) work for you?

I talked to the rest of the team and we agreed that we need to wait a bit longer until we have a fix, and then test both scenarios: the incoming fix and the one mentioned above.

Is it OK if we keep this ticket open for some time? Hopefully it will not take long.

> There are already KDE and Gnome test weeks planned after that. It is not a big problem to have
> two testdays share dates, so we could do that as well.
>
> Thanks!

kparal added this to the Sprint 1 project 2026-01-23 12:46:35 +00:00
Author

@jgroman I have updated the test day description: we now basically have a solid fix.

I have updated several Bugzilla tickets asking for testing so we can get some feedback. If all seems OK, e.g. testers are booting fine, then we set a date. If anyone can't boot, then I need to revisit the issue again :( but I do not expect this to happen.

adamwill modified the project from Sprint 1 to Sprint 2 2026-01-27 17:38:07 +00:00
Author

@jgroman @kparal unfortunately my plan of asking some impacted users (before the test day starts) to run the same test I am proposing for this test day did not work: no one has tested it, or at least nothing has been reported.

Let's try to set a date for this test day and see if we get some traction. Any upcoming week is fine; whichever week you think is best works for me.

Let me know if you need anything more from me or the team.

Thanks for your help.

Owner

I've pinged the bug report, explicitly NEEDINFO'ing the reporters, and also posted to the discourse thread about the problem (https://discussion.fedoraproject.org/t/updating-rawhide-today-broke-my-boot-grub/179256/12). Hopefully that'll shake some testers loose.

Owner

So we did have one person say the new build works for them (https://bugzilla.redhat.com/show_bug.cgi?id=2427945#c36) - is that enough to go ahead and schedule the event?

Author

@adamwill wrote in #862 (comment):

> So we did have one person say the new build works for them - is that enough to go ahead and schedule the event?

There is another tester who booted successfully [2263643#c107].

I agree with @adamwill, time to move on and organize the test day.

@jgroman let me know if something is missing from my side.

[2263643#c107] https://bugzilla.redhat.com/show_bug.cgi?id=2263643#c107

Owner

Hi @lsandova , would next week (2026-02-09 to 2026-02-15) work for you?
There is a 3 day overlap with GNOME test days but I guess that is not a big problem.
We could also do 2026-02-16 to 2026-02-22 if you wish.

Author

@jgroman wrote in #862 (comment):

> Hi @lsandova , would next week (2026-02-09 to 2026-02-15) work for you?

Yes, next week is just fine!

> There is a 3 day overlap with GNOME test days but I guess that is not a big problem.

It is not. I believe these are two types of testers (user-space UI versus bootloader testers), so I doubt the dates would interfere.

> We could also do 2026-02-16 to 2026-02-22 if you wish.

Owner

Hi Leo @lsandova

I have prepared:

  • Test Days wiki page: https://fedoraproject.org/wiki/Test_Day:2026-02-09_GRUB_out_of_memory_fix_verification_part_2
  • Test Days App results page: https://testdays.fedoraproject.org/testday/15
  • Fedora Calendar Event: https://calendar.fedoraproject.org/QA/2026/2/9/#m11292

Please update the wiki page as necessary; I just copied over the "part 1" page for now.
If the testdays results page needs a test case refresh as well, I can update that page for you.

Author

@jgroman wrote in #862 (comment):

Hi @jgroman ,

> Hi Leo @lsandova
>
> I have prepared
>
>   • Test Days wiki page: https://fedoraproject.org/wiki/Test_Day:2026-02-09_GRUB_out_of_memory_fix_verification_part_2

This page looks identical(?) to the first test day page

https://fedoraproject.org/wiki/Test_Day:2025-12-15_GRUB_out_of_memory_verification

but this time we are testing a different fix and a different test scenario. Perhaps you pasted the wrong link?

>   • Test Days App results page: https://testdays.fedoraproject.org/testday/15
>   • Fedora Calendar Event: https://calendar.fedoraproject.org/QA/2026/2/9/#m11292

Great, thanks

> Please update the wiki page as necessary, I just copied over "part 1" page for now. If the testdays result page needs testcase refresh as well I can update that page for you.

Owner

> This page looks identical(?) to the first test day page
> but this time we are testing a different fix and a different test scenario. Perhaps you pasted the wrong link?

Yes, I just copy-pasted the first page's content in there as a placeholder.
Please update this page as necessary to fit your current scenario.

Author

@jgroman wrote in #862 (comment):

> > This page looks identical(?) to the first test day page
> > but this time we are testing a different fix and a different test scenario. Perhaps you pasted the wrong link?
>
> Yes, I just copy-pasted the first page's content in there as a placeholder. Please update this page as necessary to fit your current scenario.

Got it! Sure, let me update it now.

Author

Wiki updated: https://fedoraproject.org/wiki/Test_Day:2026-02-09_GRUB_out_of_memory_fix_verification_part_2

Please review it and let me know if something is not clear.

kparal modified the project from Sprint 2 to Sprint 3 2026-02-10 15:04:47 +00:00