Get Started

AI Interview System

Build, run, and manage AI-powered interview workflows at scale.

The AI Interview System is a multi-tenant platform for creating, running, and evaluating AI-powered technical interviews. It provides a complete pipeline from interview creation to candidate assessment with human-in-the-loop approval at key stages.

Teamcast acts as a targeted qualification validation layer for integration partners. You send recruiter intent as natural-language qualification statements — Teamcast maps them internally to skills, generates the interview plan, conducts the session via the AI Hiring Assistant, and delivers a structured assessment back via signed webhooks. Candidate PII is optional and can be minimized or purged on demand.

LayerResponsibility
API GatewayREST integration endpoints, webhook notifications, Platform API key + X-Tenant-ID authentication
Agno AgentsQualification validation, interview plan generation, real-time interview conduction, post-interview assessment
HITL WorkflowOptional recruiter approval gates for plan generation and assessment verdicts. Auto-approve mode available.

What You Can Build

The AI Interview System powers real workflows built from the same primitives:

  • ATS Integration — Push interview requests from your Applicant Tracking System via REST, using qualification statements instead of a skills taxonomy
  • Webhook-driven pipelines — React to state changes (plan generated, approved, completed) via HMAC-signed webhooks to your registered endpoint
  • PII-minimal workflows — Use candidateRef instead of candidateName/Email to avoid sending personal data; purge PII on demand after assessment
  • Custom HITL interfaces — Build recruiter review tools against the approval workflow API, or enable auto-approve for fully automated flows
  • Assessment automation — Receive structured AI assessments with qualification evaluations and pipe them into your HR systems

Start Building

The system is designed to scale with you from a single API call to production multi-tenant deployments.

  • Quickstart — Create your first interview with a qualification-first request in under 5 minutes
  • Integration API — Integrate your ATS or HR platform via REST using your platform API key
  • Webhooks — Register your callbackUrl once per tenant to receive real-time state-change events
  • HITL Workflow — Enable auto-approve for zero-touch flows or build custom recruiter interfaces
bash
# Create your first interview — qualification-first, no PII required
curl -X POST https://mayaapi.teamcast.ai/api/v1/integration/interviews \
  -H "X-API-Key: your_api_key" \
  -H "X-Tenant-ID: your_tenant_id" \
  -H "Content-Type: application/json" \
  -d '{
    "candidateRef": "li_app_a1b2c3d4",
    "position": "Senior Engineer",
    "level": "SENIOR",
    "qualifications": [
      "5+ years production TypeScript experience",
      "Distributed systems design at scale"
    ]
  }'

# Response: { "runId": "run_...", "state": "VALIDATING_SKILLS" }

AI-Assisted Integration

Use our MCP (Model Context Protocol) server with Claude Code, Cursor, or Windsurf to get instant API integration help. Your AI coding assistant can search endpoints, read schemas, and access integration guides directly — no context-switching to docs.

Server URL

https://mayaapi.teamcast.ai/mcp

The server uses the Streamable HTTP transport. Add it to your MCP configuration:

json
{
  "mcpServers": {
    "teamcast-docs": {
      "url": "https://mayaapi.teamcast.ai/mcp"
    }
  }
}
Cursor: Add to ~/.cursor/mcp.json or project .cursor/mcp.json. Claude Code: Run claude mcp add --transport http teamcast-docs https://mayaapi.teamcast.ai/mcp. VS Code: Add to .vscode/mcp.json.

Once configured, ask your AI assistant questions like:

  • "How do I create an interview via the API?"
  • "What fields does CreateInterviewDto require?"
  • "Show me the webhook payload for interview.completed"
  • "What permissions does the workflow approve endpoint need?"
ToolWhat it does
search_endpointsFind API endpoints by keyword (e.g. "interview", "webhook")
get_endpoint_detailsFull endpoint details — params, request body, response schema, auth
list_schemasList all DTO/model types defined in the API
get_schemaGet the complete definition of a specific DTO
list_docsList available integration guides
get_docRead a specific integration guide (A2A, webhooks, permissions, etc.)
The MCP server only exposes external integration APIs and guides. Internal architecture, infrastructure details, and credentials are never shared with AI tools.

Built for Production

AI Interview System runs in your infrastructure, not ours.

  • Multi-tenant with row-level security at the PostgreSQL level
  • Platform → Tenant hierarchy — one API key per platform, X-Tenant-ID per request
  • Horizontally scalable — memory-based HPA on the Interviewer, CPU-based on Planner and Assessor
  • Event-driven via Google Managed Kafka (SASL_SSL) — 100 partitions per audio topic
  • Full audit trail via structured logging and Kafka event history
  • HMAC-signed webhooks with exponential backoff retry
  • Candidate PII purge API with immutable audit log
  • Configurable retention windows per tenant
New to the platform? Start with the Authentication guide then follow the Quick Start to create your first interview.

Production Performance

Benchmarked on the production GKE cluster — 2 interviewer pods, 3x n2-standard-8 nodes (24 vCPU / 96GB total). All numbers are from live cluster runs against real Kafka, Redis, Google STT, Vertex AI LLM, and Google TTS — no mocks.

MetricResult
Session creation throughput54 req/s at 100 concurrent — 0% errors
Mixed load throughput117 req/s at 200 concurrent — 0% errors
Session creation p50 / p95380ms / 1400ms
WebSocket connect latency600ms median (10 concurrent sessions)
Greeting latency — single session2.9s median (LLM + TTS + Kafka + WebSocket)
Greeting latency — 10 concurrent3.5s median / 4.1s p95 — 100% success
Audio round-trip per conversational turn1.4s – 3.2s (STT + LLM + TTS, end-to-end)
CPU under full audio load3–5% per pod (IO-bound — waiting on STT/LLM/TTS APIs)
Max simultaneous live interviews~540–700 (20 pods × 27–35 sessions/pod at HPA trigger)
Was this page helpful?