
Fix API contract validation workflow backend startup race condition#3

Merged
cliff-de-tech merged 3 commits into main from copilot/update-api-contracts-workflow
Jan 26, 2026

Conversation

Contributor

Copilot AI commented Jan 26, 2026

The API contract validation workflow fails with ECONNREFUSED ::1:8000 because the FastAPI backend isn't ready when the health check runs, and no logs are captured to debug startup failures or crashes.

Changes

  • Capture backend logs - Redirect uvicorn output to ${{ github.workspace }}/backend.log for post-mortem debugging
  • Extend startup grace period - 15s → 30s sleep before health check
  • Validate process liveness - Check backend PID still exists before attempting health check
  • Increase retry budget - 5 retries @ 2s → 10 retries @ 3s delays
  • Fail with context - Print backend logs when process dies or health check fails

Implementation

- name: Start FastAPI backend
  run: |
    cd backend
    BACKEND_LOG="${{ github.workspace }}/backend.log"
    
    # Start with logging
    python -m uvicorn app:app --host 0.0.0.0 --port 8000 > "$BACKEND_LOG" 2>&1 &
    BACKEND_PID=$!
    
    sleep 30
    
    # Detect crashes early
    if ! ps -p $BACKEND_PID > /dev/null; then
      echo "❌ Backend process died! Logs:"
      cat "$BACKEND_LOG"
      exit 1
    fi
    
    # Health check with retries
    curl --retry 10 --retry-delay 3 --retry-connrefused http://localhost:8000/health || {
      echo "❌ Backend health check failed! Logs:"
      cat "$BACKEND_LOG"
      exit 1
    }

This addresses both timing issues (insufficient startup time) and observability gaps (silent failures).

Original prompt

Problem

The API Contract Validation workflow is failing with connection refused errors when trying to connect to the FastAPI backend at http://localhost:8000. The job logs show:

❌ Error: connect ECONNREFUSED ::1:8000
💡 Make sure the backend is running: cd backend && python app.py

Job URL: https://github.com/cliff-de-tech/Post-Bot/actions/runs/21345140930/job/61431194379

Root Cause

The backend is started in the background but either:

  1. Takes longer than 15 seconds to fully start
  2. Crashes silently without proper logging
  3. May have dependency or configuration issues

Required Changes

Update .github/workflows/api-contracts.yml to fix the backend startup issues:

  1. Add backend logging - Redirect uvicorn output to a log file so we can see what's happening
  2. Increase wait time - Change from 15 to 30 seconds to give the backend more time to start
  3. Add retry logic - Increase curl retries from 5 to 10 with longer delays
  4. Add better error handling - Print backend logs if the health check fails
  5. Verify backend is actually running - Check the process is still alive before proceeding

Specific changes needed in the "Start FastAPI backend" step (lines 74-90):

- name: Start FastAPI backend
  run: |
    cd backend
    # Start backend with logging
    python -m uvicorn app:app --host 0.0.0.0 --port 8000 > ../backend.log 2>&1 &
    BACKEND_PID=$!
    echo "Backend started with PID: $BACKEND_PID"
    
    # Wait for backend to be ready
    echo "Waiting for backend to start..."
    sleep 30
    
    # Check if process is still running
    if ! ps -p $BACKEND_PID > /dev/null; then
      echo "❌ Backend process died! Logs:"
      cat ../backend.log
      exit 1
    fi
    
    # Verify backend is responding with more retries
    echo "Checking backend health..."
    curl --retry 10 --retry-delay 3 --retry-connrefused http://localhost:8000/health || {
      echo "❌ Backend health check failed! Logs:"
      cat ../backend.log
      exit 1
    }
    echo "✅ Backend is ready!"
  env:
    LINKEDIN_CLIENT_ID: test_client_id
    LINKEDIN_CLIENT_SECRET: test_client_secret
    GROQ_API_KEY: test_groq_key
    DATABASE_URL: sqlite+aiosqlite:///./test.db
    ENCRYPTION_KEY: Ag45Scx9q_Q6w3xF8Lz5j2p7n9v0k1m3b5v7c9x1z3m=
    CLERK_ISSUER: https://test-clerk.accounts.dev

This will provide better debugging information and increase the reliability of the backend startup process.

This pull request was created from Copilot chat.



Copilot AI and others added 2 commits January 26, 2026 03:25
- Add backend logging to backend.log for debugging
- Increase wait time from 15 to 30 seconds
- Add process validation to check if backend is still running
- Increase curl retries from 5 to 10 with 3-second delays
- Add error handling to print backend logs on failure

Co-authored-by: cliff-de-tech <137389025+cliff-de-tech@users.noreply.github.com>
- Define BACKEND_LOG variable to avoid duplication
- Use absolute path with github.workspace for better reliability
- Quote variable references for safety

Co-authored-by: cliff-de-tech <137389025+cliff-de-tech@users.noreply.github.com>
Copilot AI changed the title from [WIP] Update API contract validation workflow for backend startup issues to Fix API contract validation workflow backend startup race condition on Jan 26, 2026
Copilot AI requested a review from cliff-de-tech January 26, 2026 03:28
@cliff-de-tech cliff-de-tech marked this pull request as ready for review January 26, 2026 03:34
Copilot AI review requested due to automatic review settings January 26, 2026 03:34
@cliff-de-tech cliff-de-tech merged commit 84e13ae into main Jan 26, 2026
7 of 8 checks passed

Copilot AI left a comment


Pull request overview

This PR addresses a race condition in the API Contract Validation workflow where the FastAPI backend wasn't ready when health checks ran, causing ECONNREFUSED errors. The changes improve startup reliability and debugging capabilities.

Changes:

  • Capture backend logs to a file for post-mortem debugging
  • Increase startup grace period from 15s to 30s
  • Add process liveness check before health check attempt
  • Increase curl retry budget from 5 to 10 attempts with 3-second delays
  • Print backend logs when process dies or health check fails

# Verify backend is responding with more retries
echo "Checking backend health..."
curl --retry 10 --retry-delay 3 --retry-connrefused http://localhost:8000/health || {

Copilot AI Jan 26, 2026


The curl command is missing the --fail flag. By default, curl returns success (exit code 0) even when receiving HTTP error codes like 404 or 500. Add --fail to ensure that only 2xx HTTP responses are treated as successful. Without this flag, the health check could pass even if the backend is returning error responses, masking startup issues.

Suggested change
curl --retry 10 --retry-delay 3 --retry-connrefused http://localhost:8000/health || {
curl --fail --retry 10 --retry-delay 3 --retry-connrefused http://localhost:8000/health || {


# Verify backend is responding with more retries
echo "Checking backend health..."
curl --retry 10 --retry-delay 3 --retry-connrefused http://localhost:8000/health || {

Copilot AI Jan 26, 2026


The curl command uses 'localhost' which can resolve to either IPv4 (127.0.0.1) or IPv6 (::1). The original error message showed connection refused to ::1:8000 (IPv6). While the backend binds to 0.0.0.0, consider explicitly using http://127.0.0.1:8000/health instead of localhost to avoid any IPv6 resolution issues and ensure consistent IPv4 connection attempts.

Suggested change
curl --retry 10 --retry-delay 3 --retry-connrefused http://localhost:8000/health || {
curl --retry 10 --retry-delay 3 --retry-connrefused http://127.0.0.1:8000/health || {


# Verify backend is responding with more retries
echo "Checking backend health..."
curl --retry 10 --retry-delay 3 --retry-connrefused http://localhost:8000/health || {

Copilot AI Jan 26, 2026


Consider adding the --max-time flag to curl to prevent indefinite hangs if the backend responds but doesn't complete the request. For example: curl --retry 10 --retry-delay 3 --retry-connrefused --max-time 5 http://localhost:8000/health. This ensures each individual attempt times out after 5 seconds, preventing the health check from hanging indefinitely on a partially responsive backend.

Suggested change
curl --retry 10 --retry-delay 3 --retry-connrefused http://localhost:8000/health || {
curl --retry 10 --retry-delay 3 --retry-connrefused --max-time 5 http://localhost:8000/health || {

Comment on lines 74 to +102
  - name: Start FastAPI backend
    run: |
      cd backend
-     python -m uvicorn app:app --host 0.0.0.0 --port 8000 &
+     BACKEND_LOG="${{ github.workspace }}/backend.log"
+
+     # Start backend with logging
+     python -m uvicorn app:app --host 0.0.0.0 --port 8000 > "$BACKEND_LOG" 2>&1 &
      BACKEND_PID=$!
      echo "Backend started with PID: $BACKEND_PID"

      # Wait for backend to be ready
      echo "Waiting for backend to start..."
-     sleep 15
-     # Verify backend is responding
-     curl --retry 5 --retry-delay 2 --retry-connrefused http://localhost:8000/health || exit 1
-     echo "Backend is ready!"
+     sleep 30
+
+     # Check if process is still running
+     if ! ps -p $BACKEND_PID > /dev/null; then
+       echo "❌ Backend process died! Logs:"
+       cat "$BACKEND_LOG"
+       exit 1
+     fi
+
+     # Verify backend is responding with more retries
+     echo "Checking backend health..."
+     curl --retry 10 --retry-delay 3 --retry-connrefused http://localhost:8000/health || {
+       echo "❌ Backend health check failed! Logs:"
+       cat "$BACKEND_LOG"
+       exit 1
+     }
+     echo "✅ Backend is ready!"

Copilot AI Jan 26, 2026


The backend logs are printed to the console on failure, but they are not uploaded as a GitHub Actions artifact. Consider adding a separate step with if: failure() to upload the backend.log file as an artifact, similar to how generated types are uploaded on lines 160-168. This would preserve logs even if the console output is truncated and make debugging easier. The step should use actions/upload-artifact@v4 with the backend.log path and should be added after this "Start FastAPI backend" step.
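
A minimal sketch of such a step, assuming the ${{ github.workspace }}/backend.log path used in this PR (the step and artifact names here are illustrative, not taken from the workflow):

- name: Upload backend logs
  if: failure()
  uses: actions/upload-artifact@v4
  with:
    name: backend-logs                              # illustrative artifact name
    path: ${{ github.workspace }}/backend.log
    if-no-files-found: ignore                       # don't fail the upload if the backend never wrote a log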

# Verify backend is responding
curl --retry 5 --retry-delay 2 --retry-connrefused http://localhost:8000/health || exit 1
echo "Backend is ready!"
sleep 30

Copilot AI Jan 26, 2026


The fixed 30-second sleep always waits the full duration even if the backend becomes ready sooner. Consider using a polling loop instead that checks the backend health every few seconds and breaks early when ready. This would make the workflow faster when the backend starts quickly while still handling slow starts. The current approach guarantees a minimum 30-second delay regardless of actual startup time.
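
A rough sketch of such a polling loop replacing the fixed sleep, reusing the BACKEND_PID and BACKEND_LOG variables from this step (the 30-iteration / 2-second values and the 5-second per-request timeout are illustrative assumptions, not values from this PR):

READY=0
for i in $(seq 1 30); do
  # Bail out immediately if uvicorn has already crashed
  if ! ps -p "$BACKEND_PID" > /dev/null; then
    echo "❌ Backend process died! Logs:"
    cat "$BACKEND_LOG"
    exit 1
  fi
  # Probe the health endpoint; --fail treats non-2xx responses as errors
  if curl --fail --silent --max-time 5 http://127.0.0.1:8000/health > /dev/null; then
    READY=1
    echo "✅ Backend is ready after ~$((i * 2))s"
    break
  fi
  sleep 2
done
if [ "$READY" -ne 1 ]; then
  echo "❌ Backend never became healthy! Logs:"
  cat "$BACKEND_LOG"
  exit 1
fi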

@cliff-de-tech cliff-de-tech deleted the copilot/update-api-contracts-workflow branch January 26, 2026 04:03