Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Improve the reliability of the pod pending test case #2176

Open
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

johscheuer
Copy link
Member

Description

Improve the reliability of the pod pending test case. We sometimes see that the test is timing out and it seems like the test is failing because the reconciliation is requeue:

2024/11/26 01:41:00 Failed (unscheduled) Pod: operator-test-fdnhokqf-stateless-42898 , Pod to replace: operator-test-fdnhokqf-stateless-93723
...
{"level":"debug","ts":"2024-11-26T01:47:47Z","logger":"events","msg":"cannot: exclude processes, clusters last recovery was 0.00 seconds ago, wait until the last recovery was 1 seconds ago" ....}
# Roughly 4 minute break
{"level":"debug","ts":"2024-11-26T01:51:08Z","logger":"controller","msg":"Deleting pod","namespace":"nightly-2002-operator-test-irgo4uxk","cluster":"operator-test-fdnhokqf","traceID":"9ba8d86f-3d1d-4b79-ba60-09fef6771a49","reconciler":"controllers.removeProcessGroups","name":"operator-test-fdnhokqf-stateless-93723"}

Type of change

Please select one of the options below.

  • Bug fix (non-breaking change which fixes an issue)

Discussion

Testing

CI will run the e2e test.

Documentation

Follow-up

@johscheuer johscheuer added the bug Something isn't working label Nov 27, 2024
@foundationdb-ci
Copy link

Result of fdb-kubernetes-operator-pr on Linux CentOS 7

  • Commit ID: c732be2
  • Duration 3:08:02
  • Result: ❌ FAILED
  • Error: Error while executing command: if $fail_test; then exit 1; fi. Reason: exit status 1
  • Build Log terminal output (available for 30 days)
  • Build Workspace zip file of the working directory (available for 30 days)

@johscheuer johscheuer closed this Nov 27, 2024
@johscheuer johscheuer reopened this Nov 27, 2024
@johscheuer
Copy link
Member Author

Result of fdb-kubernetes-operator-pr on Linux CentOS 7

  • Commit ID: c732be2
  • Duration 3:08:02
  • Result: ❌ FAILED
  • Error: Error while executing command: if $fail_test; then exit 1; fi. Reason: exit status 1
  • Build Log terminal output (available for 30 days)
  • Build Workspace zip file of the working directory (available for 30 days)
Summarizing 1 Failure:
  [FAIL] Operator HA Upgrades when no remote storage processes are restarted [It] Upgrade from 7.1.63 to 7.3.43 [e2e, pr]
  /codebuild/output/src990390057/src/github.com/FoundationDB/fdb-kubernetes-operator/e2e/fixtures/ha_fdb_cluster.go:314

unrelated failure.

@foundationdb-ci
Copy link

Result of fdb-kubernetes-operator-pr on Linux CentOS 7

  • Commit ID: c732be2
  • Duration 2:47:48
  • Result: ❌ FAILED
  • Error: Error while executing command: if $fail_test; then exit 1; fi. Reason: exit status 1
  • Build Log terminal output (available for 30 days)
  • Build Workspace zip file of the working directory (available for 30 days)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants