Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Failing test] TestServiceExternalIP/testExternalIPAccess failing in test-e2e-encap-non-default #6992

Open
antoninbas opened this issue Feb 12, 2025 · 2 comments · May be fixed by #6994
Open
Assignees
Labels
kind/bug Categorizes issue or PR as related to a bug. kind/failing-test Categorizes issue or PR as related to a consistently or frequently failing test.

Comments

@antoninbas
Copy link
Contributor

Describe the bug
TestServiceExternalIP/testExternalIPAccess has been failing consistently for the test-e2e-encap-non-default job ever since #6903, which promotes the ServiceExternalIP feature gate to Beta, has been merged.

test-e2e-encap-non-default runs test with proxyAll enabled, kube-proxy removed, and DSR mode for LoadBalancer Services. It seems that one of these non-default configurations interferes with the ServiceExternalIP feature. Prior to #6903 and the promotion to Beta, TestServiceExternalIP test cases would have been skipped as part of this job (as the feature was disabled).

Here are the test logs for reference:

2025-02-12T20:09:51.7893261Z === RUN   TestServiceExternalIP/testExternalIPAccess/IPv4_cluster/ExternalTrafficPolicy_setting_to_Cluster
2025-02-12T20:11:22.0914250Z     service_externalip_test.go:812: 
2025-02-12T20:11:22.0915461Z         	Error Trace:	/home/runner/work/antrea/antrea/test/e2e/service_externalip_test.go:812
2025-02-12T20:11:22.0916408Z         	Error:      	Received unexpected error:
2025-02-12T20:11:22.0954432Z         	            	timed out waiting for the condition, Pod.Status: &PodStatus{Phase:Running,Conditions:[]PodCondition{PodCondition{Type:PodReadyToStartContainers,Status:True,LastProbeTime:0001-01-01 00:00:00 +0000 UTC,LastTransitionTime:2025-02-12 20:09:53 +0000 UTC,Reason:,Message:,},PodCondition{Type:Initialized,Status:True,LastProbeTime:0001-01-01 00:00:00 +0000 UTC,LastTransitionTime:2025-02-12 20:09:52 +0000 UTC,Reason:,Message:,},PodCondition{Type:Ready,Status:False,LastProbeTime:0001-01-01 00:00:00 +0000 UTC,LastTransitionTime:2025-02-12 20:09:52 +0000 UTC,Reason:ContainersNotReady,Message:containers with unready status: [agnhost],},PodCondition{Type:ContainersReady,Status:False,LastProbeTime:0001-01-01 00:00:00 +0000 UTC,LastTransitionTime:2025-02-12 20:09:52 +0000 UTC,Reason:ContainersNotReady,Message:containers with unready status: [agnhost],},PodCondition{Type:PodScheduled,Status:True,LastProbeTime:0001-01-01 00:00:00 +0000 UTC,LastTransitionTime:2025-02-12 20:09:52 +0000 UTC,Reason:,Message:,},},Message:,Reason:,HostIP:172.18.0.4,PodIP:172.18.0.4,StartTime:2025-02-12 20:09:52 +0000 UTC,ContainerStatuses:[]ContainerStatus{ContainerStatus{Name:agnhost,State:ContainerState{Waiting:nil,Running:&ContainerStateRunning{StartedAt:2025-02-12 20:09:52 +0000 UTC,},Terminated:nil,},LastTerminationState:ContainerState{Waiting:nil,Running:nil,Terminated:nil,},Ready:false,RestartCount:0,Image:registry.k8s.io/e2e-test-images/agnhost:2.40,ImageID:docker.io/library/import-2025-02-12@sha256:be8f46f5a255d8ed47c6d61977ea3c75efc1ce07434b667392f6f69d2a1c5e66,ContainerID:containerd://6b229d43336a54176da04a672c3df1db023758e3a2494b5bb867030907da6292,Started:*true,AllocatedResources:ResourceList{},Resources:nil,VolumeMounts:[]VolumeMountStatus{},User:nil,AllocatedResourcesStatus:[]ResourceStatus{},},},QOSClass:BestEffort,InitContainerStatuses:[]ContainerStatus{},NominatedNodeName:,PodIPs:[]PodIP{PodIP{IP:172.18.0.4,},},EphemeralContainerStatuses:[]ContainerStatus{},Resize:,ResourceClaimStatuses:[]PodResourceClaimStatus{},HostIPs:[]HostIP{HostIP{IP:172.18.0.4,},},}
2025-02-12T20:11:22.0971035Z         	Test:       	TestServiceExternalIP/testExternalIPAccess/IPv4_cluster/ExternalTrafficPolicy_setting_to_Cluster
2025-02-12T20:11:22.1367551Z === RUN   TestServiceExternalIP/testExternalIPAccess/IPv4_cluster/ExternalTrafficPolicy_setting_to_Local
2025-02-12T20:11:22.2658112Z     service_externalip_test.go:787: 
2025-02-12T20:11:22.2659345Z         	Error Trace:	/home/runner/work/antrea/antrea/test/e2e/service_externalip_test.go:787
2025-02-12T20:11:22.2660303Z         	Error:      	Received unexpected error:
2025-02-12T20:11:22.2661342Z         	            	pods "eth-ipv4" already exists
2025-02-12T20:11:22.2662132Z         	Test:       	TestServiceExternalIP/testExternalIPAccess/IPv4_cluster/ExternalTrafficPolicy_setting_to_Local
2025-02-12T20:11:24.2985206Z === RUN   TestServiceExternalIP/testExternalIPAccess/IPv6_cluster
2025-02-12T20:11:24.2990248Z     fixtures.go:111: Skipping test as it requires IPv6 addresses but the IPv6 network CIDR is not set
2025-02-12T20:11:24.2991343Z === RUN   TestServiceExternalIP/testServiceSharingLoadBalancerIP
2025-02-12T20:11:24.3074065Z === RUN   TestServiceExternalIP/testServiceSharingLoadBalancerIP/services-allowing-shared-ip
2025-02-12T20:11:25.8694263Z === RUN   TestServiceExternalIP/testServiceSharingLoadBalancerIP/services-not-allowing-shared-ip
2025-02-12T20:11:27.4770164Z === NAME  TestServiceExternalIP
2025-02-12T20:11:27.4772944Z     fixtures.go:352: Exporting test logs to '/home/runner/work/antrea/antrea/log/TestServiceExternalIP/beforeTeardown.Feb12-20-11-27'
2025-02-12T20:11:30.8809843Z     fixtures.go:523: Deleting 'testserviceexternalip-hnti292d' K8s Namespace
2025-02-12T20:11:30.8852982Z I0212 20:11:30.884705   24121 framework.go:863] Deleting Namespace testserviceexternalip-hnti292d took 3.946251ms
2025-02-12T20:11:30.8853963Z --- FAIL: TestServiceExternalIP (123.68s)
2025-02-12T20:11:30.8854733Z     --- PASS: TestServiceExternalIP/testServiceWithExternalIPCRUD (1.80s)
2025-02-12T20:11:30.8855605Z         --- PASS: TestServiceExternalIP/testServiceWithExternalIPCRUD/single_matching_Node (0.32s)
2025-02-12T20:11:30.8856814Z         --- SKIP: TestServiceExternalIP/testServiceWithExternalIPCRUD/single_matching_Node_with_IPv6_range (0.00s)
2025-02-12T20:11:30.8857732Z         --- PASS: TestServiceExternalIP/testServiceWithExternalIPCRUD/two_matching_Nodes (0.48s)
2025-02-12T20:11:30.8858545Z         --- PASS: TestServiceExternalIP/testServiceWithExternalIPCRUD/no_matching_Node (1.00s)
2025-02-12T20:11:30.8859172Z     --- PASS: TestServiceExternalIP/testServiceUpdateExternalIP (2.81s)
2025-02-12T20:11:30.8859847Z         --- PASS: TestServiceExternalIP/testServiceUpdateExternalIP/same_Node (1.41s)
2025-02-12T20:11:30.8860880Z         --- PASS: TestServiceExternalIP/testServiceUpdateExternalIP/different_Nodes (1.40s)
2025-02-12T20:11:30.8861778Z         --- SKIP: TestServiceExternalIP/testServiceUpdateExternalIP/different_Nodes_in_IPv6_cluster (0.00s)
2025-02-12T20:11:30.8862462Z     --- PASS: TestServiceExternalIP/testServiceExternalTrafficPolicyLocal (3.62s)
2025-02-12T20:11:30.8863241Z         --- PASS: TestServiceExternalIP/testServiceExternalTrafficPolicyLocal/endpoint_created (1.80s)
2025-02-12T20:11:30.8864101Z         --- SKIP: TestServiceExternalIP/testServiceExternalTrafficPolicyLocal/endpoint_created_IPv6 (0.00s)
2025-02-12T20:11:30.8864946Z         --- PASS: TestServiceExternalIP/testServiceExternalTrafficPolicyLocal/endpoint_changed (1.82s)
2025-02-12T20:11:30.8865790Z         --- SKIP: TestServiceExternalIP/testServiceExternalTrafficPolicyLocal/endpoint_changed_IPv6 (0.00s)
2025-02-12T20:11:30.8866424Z     --- PASS: TestServiceExternalIP/testServiceNodeFailure (7.40s)
2025-02-12T20:11:30.8867030Z         --- PASS: TestServiceExternalIP/testServiceNodeFailure/IPv4_cluster (7.40s)
2025-02-12T20:11:30.8867667Z         --- SKIP: TestServiceExternalIP/testServiceNodeFailure/IPv6_cluster (0.00s)
2025-02-12T20:11:30.8868182Z     --- FAIL: TestServiceExternalIP/testExternalIPAccess (98.70s)
2025-02-12T20:11:30.8868766Z         --- FAIL: TestServiceExternalIP/testExternalIPAccess/IPv4_cluster (98.70s)
2025-02-12T20:11:30.8869724Z             --- FAIL: TestServiceExternalIP/testExternalIPAccess/IPv4_cluster/ExternalTrafficPolicy_setting_to_Cluster (90.35s)
2025-02-12T20:11:30.8872138Z             --- FAIL: TestServiceExternalIP/testExternalIPAccess/IPv4_cluster/ExternalTrafficPolicy_setting_to_Local (0.14s)

See https://github.com/antrea-io/antrea/actions/runs/13293162829/job/37118872179

@antoninbas antoninbas added kind/bug Categorizes issue or PR as related to a bug. kind/failing-test Categorizes issue or PR as related to a consistently or frequently failing test. labels Feb 12, 2025
@antoninbas
Copy link
Contributor Author

cc @luolanzone @xliuxu

We should investigate the issue before the release

@antoninbas antoninbas added this to the Antrea v2.3 release milestone Feb 12, 2025
@xliuxu xliuxu self-assigned this Feb 13, 2025
xliuxu added a commit to xliuxu/antrea that referenced this issue Feb 13, 2025
xliuxu added a commit to xliuxu/antrea that referenced this issue Feb 13, 2025
@xliuxu
Copy link
Contributor

xliuxu commented Feb 13, 2025

The test fails because the Node hosting the LB endpoints may not have routes to the client. The route is required in DSR mode otherwise the reply path will be broken.

I have created #6994 to fix the issue.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
kind/bug Categorizes issue or PR as related to a bug. kind/failing-test Categorizes issue or PR as related to a consistently or frequently failing test.
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants