We’re experiencing significant test flakiness (30% failure rate) in our Harness CI pipelines pulling tests from ALM environment management in mf-25.3. The same tests pass consistently when run directly in ALM, but fail intermittently in Harness.
Investigation shows that environment configurations aren’t syncing properly before test execution starts. Tests begin running with stale config data, causing failures related to database connections, API endpoints, and feature flags. We’ve tried adding wait periods, but that’s unreliable and extends pipeline time significantly.
Looking for solutions around environment verification, config version pinning, or health check endpoints that could ensure environment readiness before test execution. The 30% flakiness is destroying team confidence in our CI pipeline.
Warm-up periods are underrated. Even after environment health checks pass, the first few test executions can fail due to cold caches, lazy-loaded connections, and service mesh routing updates. Add a warm-up step that runs a lightweight smoke test suite before the main test execution; this primes the environment and absorbs the initial instability. Running 3-5 basic API calls as a warm-up significantly improved reliability for us.
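A minimal sketch of that warm-up step, runnable as a pipeline step before the main test stage. The fetch callable and the paths are placeholders you'd replace with real calls against your environment; the retry loop is what absorbs cold-start failures:

```python
import time


def warm_up(fetch, paths, attempts=3, delay=2.0):
    """Prime the environment with lightweight smoke calls.

    fetch(path) performs one API call and returns True on success.
    Each path is retried so cold-cache / lazy-connection failures
    are absorbed here instead of surfacing in the main test suite.
    """
    for path in paths:
        for attempt in range(1, attempts + 1):
            if fetch(path):
                break  # this path is warm, move to the next one
            if attempt == attempts:
                raise RuntimeError(
                    f"warm-up call to {path} failed {attempts} times"
                )
            time.sleep(delay)


# Example wiring (hypothetical endpoints for your qa-env):
# warm_up(lambda p: do_http_get(p) == 200,
#         ["/api/ping", "/api/version", "/api/feature-flags"])
```

The key design choice is that a path failing once is expected and silent, while a path failing every attempt fails the step loudly, so genuinely broken environments are caught before tests run.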
For health check endpoints, ALM mf-25.3 added /api/env-health/{env-name}, which returns environment status including database connectivity, API availability, and config sync status. Poll this endpoint at a 5-second interval until it returns status: ready before executing tests, and set a 2-minute timeout to fail fast if the environment doesn’t stabilize. This cut our flakiness from 25% to under 5%.
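The polling loop described above can be sketched like this. It's a generic poll-with-timeout; fetch_status is a placeholder for whatever client call GETs /api/env-health/{env-name} and extracts the status field, since the exact response shape isn't shown here:

```python
import time


def wait_for_env_ready(fetch_status, timeout=120.0, interval=5.0):
    """Block until fetch_status() reports "ready", or fail fast.

    fetch_status() should GET the env-health endpoint and return
    the status string. Raises TimeoutError if the environment does
    not stabilize within `timeout` seconds (default 2 minutes).
    """
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        if fetch_status() == "ready":
            return  # environment is ready, safe to start tests
        time.sleep(interval)
    raise TimeoutError(
        f"environment not ready after {timeout:.0f}s, aborting pipeline"
    )
```

Using time.monotonic for the deadline avoids wall-clock jumps, and raising instead of returning a flag means a stuck environment fails the pipeline step immediately rather than letting stale-config tests run.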
Thanks for the version pinning suggestion. I enabled config snapshots but I’m not seeing how to reference specific version IDs in the Harness YAML. Is there a parameter I should be passing to the alm-runner command? Also, how do I determine which config version corresponds to our qa-env environment?