vmhost-p09-copr ipv6 woes #13085

Closed
opened 2026-01-26 17:42:21 +00:00 by kevin · 3 comments
Owner

Description of request

The vmhost-p09-copr01/02/03/04 machines in rdu3 have an IPv6 network set up so that the copr backend can ssh in to manage them and get build artifacts off them.

Connectivity to the actual hypervisors seems to be working just fine.
However, VMs on those builders come up and then sporadically become unreachable via IPv6.

It may be some bridge/bond issue, but everything is set up exactly like many of our other machines.
Or it could be some IPv6 / firewall bug.
Need to gather more info and do more debugging.

Author
Owner

I updated all the vmhost-p09-copr* machines to the latest today and rebooted them.

The problem still seems to be there.

Pinging a builder from my home network, I notice:

```
22:39:07.153445 IP6 2620:52:6:1161:dead:beef:cafe:c110.llmnr > vmhost-p09-copr02.rdu3.fedoraproject.org.57324: Flags [S.], seq 3245253773, ack 1703004485, win 64260, options [mss 1440,sackOK,TS val 249750629 ecr 3268020716,nop,wscale 10], length 0
22:39:07.154235 IP6 2620:52:6:1161::2 > 2620:52:6:1161:dead:beef:cafe:c110: ICMP6, time exceeded in-transit for vmhost-p09-copr02.rdu3.fedoraproject.org, length 88
```

Will see if networking has any ideas there...
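
For anyone sifting through longer captures, here is a rough sketch of how the "time exceeded in-transit" lines could be counted per reporting router. It only assumes tcpdump's default text output as pasted above; the function name `count_time_exceeded` and the `SAMPLE` list are made up for illustration and are not part of any existing tooling.

```python
import re
from collections import Counter

# Matches tcpdump text lines such as:
#   <ts> IP6 <reporter> > <dst>: ICMP6, time exceeded in-transit for <target>, length N
TIME_EXCEEDED = re.compile(
    r"^(?P<ts>\S+) IP6 (?P<reporter>\S+) > (?P<dst>\S+): "
    r"ICMP6, time exceeded in-transit for (?P<target>\S+), length \d+"
)


def count_time_exceeded(lines):
    """Count ICMP6 time-exceeded reports, keyed by (reporter, target)."""
    counts = Counter()
    for line in lines:
        m = TIME_EXCEEDED.match(line.strip())
        if m:
            counts[(m["reporter"], m["target"])] += 1
    return counts


# The two lines from the capture in this comment.
SAMPLE = [
    "22:39:07.153445 IP6 2620:52:6:1161:dead:beef:cafe:c110.llmnr > "
    "vmhost-p09-copr02.rdu3.fedoraproject.org.57324: Flags [S.], seq 3245253773, "
    "ack 1703004485, win 64260, options [mss 1440,sackOK,TS val 249750629 "
    "ecr 3268020716,nop,wscale 10], length 0",
    "22:39:07.154235 IP6 2620:52:6:1161::2 > 2620:52:6:1161:dead:beef:cafe:c110: "
    "ICMP6, time exceeded in-transit for vmhost-p09-copr02.rdu3.fedoraproject.org, "
    "length 88",
]

if __name__ == "__main__":
    for (reporter, target), n in count_time_exceeded(SAMPLE).items():
        print(f"{reporter} reported time exceeded for {target}: {n}")
```

A reporter that shows up over and over for the same target would be consistent with a forwarding loop near that hop, though the count alone does not prove it.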

Author
Owner

They said that they found the same thing, and it looks like there's a loop on one port where the packet bounces back and forth until it hits time exceeded.

They want me to get tcpdumps from both interfaces on the server and also from inside the guest builder.
We will need copr folks to get tcpdump available in the guest for us to do that part.
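
A minimal sketch of what those captures might look like, written as a small helper that prints one tcpdump command per host interface. The interface names `eno1` and `eno2` are placeholders (the real bond members are not named in this ticket), and the guest address is simply the one seen in the capture above; `-n`, `-i`, `-w` and the `ip6 and host` filter are standard tcpdump options.

```python
import shlex

# Hypothetical values: swap in the real interfaces and the builder's address.
HOST_INTERFACES = ["eno1", "eno2"]
GUEST_ADDR = "2620:52:6:1161:dead:beef:cafe:c110"


def capture_command(interface, peer_addr, outfile):
    """Build a tcpdump argv that saves IPv6 traffic to/from peer_addr."""
    return [
        "tcpdump", "-n",      # do not resolve names
        "-i", interface,      # capture on this interface only
        "-w", outfile,        # write raw packets to a pcap file
        "ip6", "and", "host", peer_addr,
    ]


if __name__ == "__main__":
    for iface in HOST_INTERFACES:
        cmd = capture_command(iface, GUEST_ADDR, f"{iface}.pcap")
        print(shlex.join(cmd))
```

The same filter could be run inside the guest once tcpdump is available there, so the three captures can be lined up by timestamp.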

Author
Owner

Thanks to a bunch of work from @praiskup and puk from the networking team, this was tracked down to MAC caching in the switching layer. ;(

It may now be fixed, but if it comes up again we should know the bug and can report it upstream.

Great work!

kevin closed this issue 2026-02-13 17:27:07 +00:00
Reference
infra/tickets#13085