Get Started
AI Interview System
Build, run, and manage AI-powered interview workflows at scale.
The AI Interview System is a multi-tenant platform for creating, running, and evaluating AI-powered technical interviews. It provides a complete pipeline from interview creation to candidate assessment with human-in-the-loop approval at key stages.
Teamcast acts as a targeted qualification validation layer for integration partners. You send recruiter intent as natural-language qualification statements — Teamcast maps them internally to skills, generates the interview plan, conducts the session via the AI Hiring Assistant, and delivers a structured assessment back via signed webhooks. Candidate PII is optional and can be minimized or purged on demand.
| Layer | Responsibility |
|---|---|
| API Gateway | REST integration endpoints, webhook notifications, Platform API key + X-Tenant-ID authentication |
| Agno Agents | Qualification validation, interview plan generation, real-time interview conduction, post-interview assessment |
| HITL Workflow | Optional recruiter approval gates for plan generation and assessment verdicts. Auto-approve mode available. |
What You Can Build
The AI Interview System powers real workflows built from the same primitives:
- ATS Integration — Push interview requests from your Applicant Tracking System via REST, using qualification statements instead of a skills taxonomy
- Webhook-driven pipelines — React to state changes (plan generated, approved, completed) via HMAC-signed webhooks to your registered endpoint
- PII-minimal workflows — Use candidateRef instead of candidateName/Email to avoid sending personal data; purge PII on demand after assessment
- Custom HITL interfaces — Build recruiter review tools against the approval workflow API, or enable auto-approve for fully automated flows
- Assessment automation — Receive structured AI assessments with qualification evaluations and pipe them into your HR systems
Start Building
The system is designed to scale with you from a single API call to production multi-tenant deployments.
- Quickstart — Create your first interview with a qualification-first request in under 5 minutes
- Integration API — Integrate your ATS or HR platform via REST using your platform API key
- Webhooks — Register your callbackUrl once per tenant to receive real-time state-change events
- HITL Workflow — Enable auto-approve for zero-touch flows or build custom recruiter interfaces
# Create your first interview — qualification-first, no PII required
curl -X POST https://mayaapi.teamcast.ai/api/v1/integration/interviews \
-H "X-API-Key: your_api_key" \
-H "X-Tenant-ID: your_tenant_id" \
-H "Content-Type: application/json" \
-d '{
"candidateRef": "li_app_a1b2c3d4",
"position": "Senior Engineer",
"level": "SENIOR",
"qualifications": [
"5+ years production TypeScript experience",
"Distributed systems design at scale"
]
}'
# Response: { "runId": "run_...", "state": "VALIDATING_SKILLS" }AI-Assisted Integration
Use our MCP (Model Context Protocol) server with Claude Code, Cursor, or Windsurf to get instant API integration help. Your AI coding assistant can search endpoints, read schemas, and access integration guides directly — no context-switching to docs.
Server URL
The server uses the Streamable HTTP transport. Add it to your MCP configuration:
{
"mcpServers": {
"teamcast-docs": {
"url": "https://mayaapi.teamcast.ai/mcp"
}
}
}~/.cursor/mcp.json or project .cursor/mcp.json. Claude Code: Run claude mcp add --transport http teamcast-docs https://mayaapi.teamcast.ai/mcp. VS Code: Add to .vscode/mcp.json.Once configured, ask your AI assistant questions like:
- "How do I create an interview via the API?"
- "What fields does CreateInterviewDto require?"
- "Show me the webhook payload for interview.completed"
- "What permissions does the workflow approve endpoint need?"
| Tool | What it does |
|---|---|
| search_endpoints | Find API endpoints by keyword (e.g. "interview", "webhook") |
| get_endpoint_details | Full endpoint details — params, request body, response schema, auth |
| list_schemas | List all DTO/model types defined in the API |
| get_schema | Get the complete definition of a specific DTO |
| list_docs | List available integration guides |
| get_doc | Read a specific integration guide (A2A, webhooks, permissions, etc.) |
Built for Production
AI Interview System runs in your infrastructure, not ours.
- Multi-tenant with row-level security at the PostgreSQL level
- Platform → Tenant hierarchy — one API key per platform, X-Tenant-ID per request
- Horizontally scalable — memory-based HPA on the Interviewer, CPU-based on Planner and Assessor
- Event-driven via Google Managed Kafka (SASL_SSL) — 100 partitions per audio topic
- Full audit trail via structured logging and Kafka event history
- HMAC-signed webhooks with exponential backoff retry
- Candidate PII purge API with immutable audit log
- Configurable retention windows per tenant
Production Performance
Benchmarked on the production GKE cluster — 2 interviewer pods, 3x n2-standard-8 nodes (24 vCPU / 96GB total). All numbers are from live cluster runs against real Kafka, Redis, Google STT, Vertex AI LLM, and Google TTS — no mocks.
| Metric | Result |
|---|---|
| Session creation throughput | 54 req/s at 100 concurrent — 0% errors |
| Mixed load throughput | 117 req/s at 200 concurrent — 0% errors |
| Session creation p50 / p95 | 380ms / 1400ms |
| WebSocket connect latency | 600ms median (10 concurrent sessions) |
| Greeting latency — single session | 2.9s median (LLM + TTS + Kafka + WebSocket) |
| Greeting latency — 10 concurrent | 3.5s median / 4.1s p95 — 100% success |
| Audio round-trip per conversational turn | 1.4s – 3.2s (STT + LLM + TTS, end-to-end) |
| CPU under full audio load | 3–5% per pod (IO-bound — waiting on STT/LLM/TTS APIs) |
| Max simultaneous live interviews | ~540–700 (20 pods × 27–35 sessions/pod at HPA trigger) |