Get Started

AI Interview System

Build, run, and manage AI-powered interview workflows at scale.

The AI Interview System is a multi-tenant platform for creating, running, and evaluating AI-powered technical interviews. It provides a complete pipeline from interview creation to candidate assessment with human-in-the-loop approval at key stages.

Teamcast acts as a targeted qualification validation layer for integration partners. You send recruiter intent as natural-language qualification statements — Teamcast maps them internally to skills, generates the interview plan, conducts the session via the AI Hiring Assistant, and delivers a structured assessment back via signed webhooks. Candidate PII is optional and can be minimized or purged on demand.

Layer	Responsibility
API Gateway	REST integration endpoints, webhook notifications, Platform API key + X-Tenant-ID authentication
Agno Agents	Qualification validation, interview plan generation, real-time interview conduction, post-interview assessment
HITL Workflow	Optional recruiter approval gates for plan generation and assessment verdicts. Auto-approve mode available.

What You Can Build

The AI Interview System powers real workflows built from the same primitives:

ATS Integration — Push interview requests from your Applicant Tracking System via REST, using qualification statements instead of a skills taxonomy
Webhook-driven pipelines — React to state changes (plan generated, approved, completed) via HMAC-signed webhooks to your registered endpoint
PII-minimal workflows — Use candidateRef instead of candidateName/Email to avoid sending personal data; purge PII on demand after assessment
Custom HITL interfaces — Build recruiter review tools against the approval workflow API, or enable auto-approve for fully automated flows
Assessment automation — Receive structured AI assessments with qualification evaluations and pipe them into your HR systems

API Reference

Complete REST endpoint documentation with request/response schemas.

Integration API

REST endpoint reference and webhook integration guide for partner systems.

Architecture

System design, Kafka events, multi-tenancy, and agent integration.

Start Building

The system is designed to scale with you from a single API call to production multi-tenant deployments.

Quickstart — Create your first interview with a qualification-first request in under 5 minutes
Integration API — Integrate your ATS or HR platform via REST using your platform API key
Webhooks — Register your callbackUrl once per tenant to receive real-time state-change events
HITL Workflow — Enable auto-approve for zero-touch flows or build custom recruiter interfaces

bash

# Create your first interview — qualification-first, no PII required
curl -X POST https://mayaapi.teamcast.ai/api/v1/integration/interviews \
  -H "X-API-Key: your_api_key" \
  -H "X-Tenant-ID: your_tenant_id" \
  -H "Content-Type: application/json" \
  -d '{
    "candidateRef": "li_app_a1b2c3d4",
    "position": "Senior Engineer",
    "level": "SENIOR",
    "qualifications": [
      "5+ years production TypeScript experience",
      "Distributed systems design at scale"
    ]
  }'

# Response: { "runId": "run_...", "state": "VALIDATING_SKILLS" }

AI-Assisted Integration

Use our MCP (Model Context Protocol) server with Claude Code, Cursor, or Windsurf to get instant API integration help. Your AI coding assistant can search endpoints, read schemas, and access integration guides directly — no context-switching to docs.

Server URL

https://mayaapi.teamcast.ai/mcp

The server uses the Streamable HTTP transport. Add it to your MCP configuration:

json

{
  "mcpServers": {
    "teamcast-docs": {
      "url": "https://mayaapi.teamcast.ai/mcp"
    }
  }
}

Cursor: Add to ~/.cursor/mcp.json or project .cursor/mcp.json. Claude Code: Run claude mcp add --transport http teamcast-docs https://mayaapi.teamcast.ai/mcp. VS Code: Add to .vscode/mcp.json.

Once configured, ask your AI assistant questions like:

"How do I create an interview via the API?"
"What fields does CreateInterviewDto require?"
"Show me the webhook payload for interview.completed"
"What permissions does the workflow approve endpoint need?"

Tool	What it does
search_endpoints	Find API endpoints by keyword (e.g. "interview", "webhook")
get_endpoint_details	Full endpoint details — params, request body, response schema, auth
list_schemas	List all DTO/model types defined in the API
get_schema	Get the complete definition of a specific DTO
list_docs	List available integration guides
get_doc	Read a specific integration guide (A2A, webhooks, permissions, etc.)

The MCP server only exposes external integration APIs and guides. Internal architecture, infrastructure details, and credentials are never shared with AI tools.

Built for Production

AI Interview System runs in your infrastructure, not ours.

Multi-tenant with row-level security at the PostgreSQL level
Platform → Tenant hierarchy — one API key per platform, X-Tenant-ID per request
Horizontally scalable — memory-based HPA on the Interviewer, CPU-based on Planner and Assessor
Event-driven via Google Managed Kafka (SASL_SSL) — 100 partitions per audio topic
Full audit trail via structured logging and Kafka event history
HMAC-signed webhooks with exponential backoff retry
Candidate PII purge API with immutable audit log
Configurable retention windows per tenant

New to the platform? Start with the Authentication guide then follow the Quick Start to create your first interview.

Production Performance

Benchmarked on the production GKE cluster — 2 interviewer pods, 3x n2-standard-8 nodes (24 vCPU / 96GB total). All numbers are from live cluster runs against real Kafka, Redis, Google STT, Vertex AI LLM, and Google TTS — no mocks.

Metric	Result
Session creation throughput	54 req/s at 100 concurrent — 0% errors
Mixed load throughput	117 req/s at 200 concurrent — 0% errors
Session creation p50 / p95	380ms / 1400ms
WebSocket connect latency	600ms median (10 concurrent sessions)
Greeting latency — single session	2.9s median (LLM + TTS + Kafka + WebSocket)
Greeting latency — 10 concurrent	3.5s median / 4.1s p95 — 100% success
Audio round-trip per conversational turn	1.4s – 3.2s (STT + LLM + TTS, end-to-end)
CPU under full audio load	3–5% per pod (IO-bound — waiting on STT/LLM/TTS APIs)
Max simultaneous live interviews	~540–700 (20 pods × 27–35 sessions/pod at HPA trigger)

Was this page helpful?

NextQuick Start →