feat(ci): monitors marker + three-way Better Stack heartbeat split by ari-nz · Pull Request #641 · aignostics/python-sdk

ari-nz · 2026-05-12T08:36:56Z

Summary

- Adds a monitors pytest marker so scheduled tests can declare which system they monitor (he-tme, test-app, platform-api). Tests without the marker are considered SDK-layer health checks.
- Splits the hourly scheduled workflow from one pytest run into three independent runs, each feeding its own Better Stack heartbeat:
  - SDK (not monitors and not monitors_platform_api): token management, service wiring
  - Platform API (monitors_platform_api): health check, application listing, run listing
  - HE-TME / applications (monitors and not monitors_platform_api): HE-TME and test-app processing tests
- Adds monitors_platform_api as a companion boolean marker (alongside monitors("platform-api")) so the workflow can filter with standard -m expressions; pytest's -m syntax cannot filter on positional marker argument values.
- New secrets BETTERSTACK_HEARTBEAT_URL_PLATFORM_API_{STAGING,PRODUCTION} are optional; heartbeat steps skip gracefully when absent.
- All heartbeat steps use if: always() so they fire even if an upstream step errors unexpectedly, because a missing heartbeat would cause a false "down" alert.
- Exit code capture uses a || '1' fallback per step output, so a step that never ran is reported as failed instead of producing a false-healthy heartbeat.
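
The three selections listed above can be modelled in a few lines of Python. This is an illustrative sketch only, not code from the PR: `route` is a hypothetical helper, and the group names `sdk`, `platform_api`, and `applications` are labels chosen here. It takes the set of marker names on a test and returns the run whose -m expression would select it.

```python
def route(markers: set[str]) -> str:
    """Return the heartbeat group whose -m expression selects a test.

    Hypothetical helper for illustration; it mirrors the three marker
    expressions from the summary, which are mutually exclusive.
    """
    if "monitors_platform_api" in markers:
        # -m "monitors_platform_api"
        return "platform_api"
    if "monitors" in markers:
        # -m "monitors and not monitors_platform_api"
        return "applications"
    # -m "not monitors and not monitors_platform_api"
    return "sdk"


print(route({"scheduled"}))
print(route({"scheduled", "monitors"}))
print(route({"scheduled", "monitors", "monitors_platform_api"}))
```

A platform-api test carries both monitors and monitors_platform_api, which is why the applications expression has to exclude the companion marker explicitly.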

Test plan

- uv run pytest --collect-only -m "monitors_platform_api" → 3 tests
- uv run pytest --collect-only -m "monitors and not monitors_platform_api" → 6 tests (3 he-tme + 3 test-app)
- uv run pytest --collect-only -m "(scheduled or scheduled_only) and not monitors and not monitors_platform_api" → SDK-only scheduled tests
- Better Stack heartbeat steps each skip gracefully when URL secrets are absent
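
A minimal sketch of the skip-and-fallback behaviour described in the summary and test plan, written in Python for illustration (the workflow itself does this in shell steps). `heartbeat_target` is a hypothetical helper, and appending the exit code to the heartbeat URL as a path segment is an assumption about how the result is reported; the PR text only says the exit code is sent to Better Stack.

```python
from typing import Optional


def heartbeat_target(url: Optional[str], exit_code: Optional[str]) -> Optional[str]:
    """Build the URL a heartbeat step would call, or None to skip.

    Hypothetical helper. Assumes the exit code is appended as a path
    segment; adjust to whatever the real heartbeat endpoint expects.
    """
    if not url:
        # Secret not configured: skip instead of failing the job.
        return None
    # Mirror the `|| '1'` fallback: an empty or missing step output means
    # the test step never ran, so report failure, not a false "healthy".
    code = exit_code if exit_code else "1"
    return f"{url.rstrip('/')}/{code}"
```

The point of the fallback is the second branch: an absent exit code is treated as a failure, while an absent URL is treated as "nothing to report".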

Posted by Claude claude-sonnet-4-6 via Claude Code on behalf of ari@aignostics.com

Copilot

Pull request overview

This PR introduces a monitors(...) pytest marker intended to tag scheduled E2E tests by the Better Stack monitor they should feed, and updates the hourly scheduled CI workflow to split test execution and emit separate heartbeats (SDK vs. application-specific).

Changes:

Added @pytest.mark.monitors("he-tme" | "test-app") to selected platform E2E scheduled tests.
Registered the new monitors marker in pyproject.toml pytest configuration.
Split the hourly scheduled workflow into multiple pytest invocations and added a dedicated Better Stack heartbeat URL secret for HE-TME (propagated via the staging/production wrapper workflows).

Reviewed changes

Copilot reviewed 5 out of 5 changed files in this pull request and generated 4 comments.

File	Description
`tests/aignostics/platform/e2e_test.py`	Tags scheduled E2E tests with `monitors(...)` to support routing results/heartbeats by monitored system.
`pyproject.toml`	Registers the new pytest marker so `--strict-markers` runs remain valid.
`.github/workflows/_scheduled-test-hourly.yml`	Splits scheduled tests into separate runs and sends separate Better Stack heartbeats (SDK + HE-TME), plus combined status to Sentry.
`.github/workflows/scheduled-testing-staging-hourly.yml`	Passes through the new HE-TME Better Stack heartbeat secret to the reusable workflow.
`.github/workflows/scheduled-testing-production-hourly.yml`	Passes through the new HE-TME Better Stack heartbeat secret to the reusable workflow.

+      - name: Test / scheduled / he-tme
+        id: test_he_tme
        env:
-          BETTERSTACK_HEARTBEAT_URL: "${{ inputs.platform_environment == 'staging' && secrets.BETTERSTACK_HEARTBEAT_URL_STAGING || secrets.BETTERSTACK_HEARTBEAT_URL_PRODUCTION }}"
          SENTRY_DSN: ${{ secrets.SENTRY_DSN }}
        shell: bash
        run: |
+          # set +e so a test failure does not abort the step — we capture the exit code
+          # manually and send it to Better Stack regardless of outcome.
          set +e
-          make test_scheduled
-          EXIT_CODE=$?
+          XDIST_WORKER_FACTOR=1 uv run --all-extras nox -s test -- \
+            -m "(scheduled or scheduled_only) and monitors and not stress_only" \
+            --junit-xml=reports/junit_he_tme.xml
+          echo "exit_code=$?" >> $GITHUB_OUTPUT
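
For readers less familiar with step outputs: the echo line above appends a name=value pair to the file that GitHub Actions exposes through the GITHUB_OUTPUT environment variable, and later steps read it as a step output. The same pattern as a Python sketch, where `run_and_record` is a hypothetical helper and not part of the PR:

```python
import subprocess


def run_and_record(cmd: list[str], output_path: str) -> int:
    """Run cmd, append its exit code as a step output, and return it.

    Hypothetical helper; output_path stands in for the file named by
    the GITHUB_OUTPUT environment variable on an Actions runner.
    """
    # check=False plays the role of `set +e`: a failing command does not
    # raise, so the exit code can still be recorded.
    result = subprocess.run(cmd, check=False)
    with open(output_path, "a", encoding="utf-8") as fh:
        fh.write(f"exit_code={result.returncode}\n")
    return result.returncode
```

On a runner this would be called with `os.environ["GITHUB_OUTPUT"]` as `output_path`; the file is opened in append mode because other outputs may already have been written to it.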


    "unit: Solitary unit tests - test a layer of a module in isolation with all dependencies mocked, except interaction with shared utils and the systems module. Unit tests must be able to pass offline, i.e. not calls to external services. The timeout should not be bigger than the default 10s, and must be <5 min.",
    "integration: Sociable integration tests - test interactions across architectural layers (e.g. CLI/GUI→Service, Service→Utils) or between modules (e.g. Application→Platform), using real SDK collaborators, real file I/O, real subprocesses, and real Docker containers. Integration test must be able to pass offline, i.e. mock external services (Aignostics Platform API, Auth0, S3/GCS buckets, IDC). The timeout should not be bigger than the default 10s, and must be <5 min.",
    "e2e: End-to-end tests - test complete workflows with real external network services (Aignostics Platform API, cloud storage, IDC, etc). If the test timeout is >= 5 min and < 60 min, additionally mark as `long_running`, if >= 60min mark as 'very_long_running'.",
+    "monitors: Tag a scheduled test with the application it monitors, e.g. @pytest.mark.monitors('he-tme'). Tests without this marker are considered SDK-layer health checks. Used to route Better Stack heartbeats to the correct monitor.",


+            reports/junit_sdk.xml
+            reports/junit_he_tme.xml


codecov · 2026-05-12T09:13:13Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ All tests successful. No failed tests found.

❌ Your project check has failed because the head coverage (65.75%) is below the target coverage (70.00%). You can increase the head coverage or adjust the target coverage.

❗ There is a different number of reports uploaded between BASE (71ff4b0) and HEAD (0683245). HEAD has 3 uploads less than BASE (4 uploads on BASE, 1 on HEAD).

see 22 files with indirect coverage changes

Add @pytest.mark.monitors("he-tme") marker to tag tests that exercise the HE-TME application end-to-end. Tests without the marker are SDK health checks (auth, listing, connectivity). The hourly scheduled workflow now runs two separate pytest invocations and sends two independent Better Stack heartbeats so an HE-TME outage no longer pollutes the SDK monitor and vice versa. The HE-TME heartbeat URLs are optional secrets; the step skips gracefully until the monitors are created in Better Stack.

Extends the monitors marker to the three test-app e2e tests so they route to a test-app Better Stack monitor independently from the SDK health checks.

…lications

Tests are now routed to three separate Better Stack heartbeat monitors:
- SDK (no monitors marker): token management, service wiring
- Platform API (monitors_platform_api): auth, listing, connectivity
  - test_cli_health_json (system)
  - test_cli_application_list_verbose (application)
  - test_cli_run_list_limit_10 (application)
- HE-TME / applications (monitors): application processing
  - existing he-tme and test-app tests (unchanged)

Adds monitors_platform_api pytest marker alongside the existing monitors("platform-api") string for pytest -m expression filtering. Adds BETTERSTACK_HEARTBEAT_URL_PLATFORM_API_* secrets (optional, heartbeat step skips gracefully when absent).

sonarqubecloud · 2026-05-12T13:42:47Z

Quality Gate failed

Failed conditions
Security Rating on New Code: E (required ≥ A)


Copilot AI review requested due to automatic review settings May 12, 2026 08:36

ari-nz requested review from a team and helmut-hoffer-von-ankershoffen as code owners May 12, 2026 08:36

ari-nz added the skip:test:long_running Skip long-running tests (≥5min) label May 12, 2026

Copilot started reviewing on behalf of ari-nz May 12, 2026 08:37

Copilot AI reviewed May 12, 2026

ari-nz changed the title ~~feat(ci): split hourly heartbeat by concern and add monitors marker~~ feat(ci): monitors marker + three-way Better Stack heartbeat split May 12, 2026

ari-nz added 3 commits May 12, 2026 15:39

test(e2e): add monitors("test-app") tag to test-app scheduled tests

d7d10bc


ari-nz force-pushed the feat/monitors-marker-split branch from 129c9aa to 0683245 May 12, 2026 13:41

ari-nz wants to merge 3 commits into main from feat/monitors-marker-split
