QAI Labs

Real Results

Agents in production

Not demos, not prototypes — agents that deliver measurable business outcomes every day.

Live in Production

Steve — Autonomous Business Operations Agent

A persistent AI agent managing multiple software projects, infrastructure, and business operations via Telegram.

  • 3+ projects managed
  • 83ms average response time achieved
  • 14 Terraform resources deployed
  • Context retained

The Challenge

Marcus runs multiple software businesses: a UK tax compliance SaaS, industry comparison platforms, and property management tools. Each project needs full-stack development, infrastructure management, CI/CD pipelines, database admin, payment integrations, SEO, market research, and strategic planning, all happening in parallel.

He needed a capable team on a realistic budget. Freelancers lost context between sessions, so every engagement started from scratch. Simple chatbots were useful for one-off questions but couldn't take action, remember previous work, or maintain awareness across projects.

The Solution

Steve is a message handler that routes each request through tiered, persistent memory and has full execution capabilities. Before a task reaches the main query model, it is classified by intent and requirements, which keeps responses fast and contextual.
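To make the classify-then-route idea concrete, here is a minimal sketch. It is not Steve's actual code: the intent labels, the keyword lookup (standing in for the small local routing model), and the names `classify`, `RoutedTask`, and `build_prompt` are all assumptions made for illustration.

```python
from dataclasses import dataclass

# Hypothetical intent labels and keyword hints. The real classifier is a small
# local model; this keyword lookup only stands in for it.
INTENT_KEYWORDS = {
    "infrastructure": ("deploy", "terraform", "docker", "dns"),
    "project": ("issue", "backlog", "blocker", "priority"),
    "research": ("competitor", "market", "seo"),
}


@dataclass
class RoutedTask:
    text: str
    intent: str
    needs_execution: bool


def classify(text: str) -> RoutedTask:
    """Label a message with an intent before the main model sees it."""
    words = [word.strip(".,!?") for word in text.lower().split()]
    for intent, keywords in INTENT_KEYWORDS.items():
        if any(word in keywords for word in words):
            return RoutedTask(text, intent, needs_execution=(intent == "infrastructure"))
    return RoutedTask(text, "general", needs_execution=False)


def build_prompt(task: RoutedTask, memory: dict[str, list[str]]) -> str:
    """Attach only the memory notes that match the task's intent."""
    lines = [f"[intent: {task.intent}]"]
    lines += [f"- {note}" for note in memory.get(task.intent, [])]
    lines.append(f"Request: {task.text}")
    return "\n".join(lines)


# Illustrative memory note, not real project data.
memory = {"infrastructure": ["Production runs on ECS Fargate behind an ALB."]}
task = classify("Please deploy the latest build to AWS")
prompt = build_prompt(task, memory)
print(prompt)
```

Classifying first means the main model receives a short, relevant context instead of the whole knowledge base, which is one plausible reason the approach keeps responses fast.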

Steve has a clearly defined identity and maintains his own accumulated knowledge base. This lets him manage tasks, run commands, call APIs, send email, browse the web, and deploy infrastructure in a consistently controlled manner.
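One way an agent can keep command execution "controlled" is to check every command against an allow-list before running it. The page does not describe Steve's safeguards, so the sketch below is an assumption throughout: the `ALLOWED` table, the `NEEDS_CONFIRMATION` set, and the `check_command` function are hypothetical.

```python
import shlex

# Hypothetical allow-list: program name -> permitted subcommands.
ALLOWED = {
    "git": {"status", "diff", "log", "commit", "push"},
    "docker": {"build", "push"},
    "terraform": {"plan", "apply"},
}
# Actions that should wait for a human to confirm (an assumption for this sketch).
NEEDS_CONFIRMATION = {("terraform", "apply"), ("git", "push")}


def check_command(command: str) -> str:
    """Return 'allow', 'confirm', or 'deny' for a shell command string."""
    parts = shlex.split(command)
    if len(parts) < 2:
        return "deny"
    program, subcommand = parts[0], parts[1]
    if subcommand not in ALLOWED.get(program, set()):
        return "deny"
    if (program, subcommand) in NEEDS_CONFIRMATION:
        return "confirm"
    return "allow"


for cmd in ("git status", "terraform apply", "rm -rf /"):
    print(cmd, "->", check_command(cmd))
```

The gate only decides; actually running the command (for example with `subprocess.run`) would happen in a separate step after an "allow" or a confirmed "confirm".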

What Steve Does

Infrastructure & DevOps

  • Deployed full AWS infrastructure (VPC, ECS Fargate, ALB, ACM, CloudWatch) via Terraform
  • Built multi-stage Docker images optimised for production (512MB)
  • Created complete CI/CD pipelines in GitLab (lint → build → deploy)
  • Reduced API response times from 420ms to 83ms through architecture changes
  • Manages DNS, SSL certificates, and load balancer configuration
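For scale, the response-time figures quoted above (420ms before, 83ms after) work out as follows. This is plain arithmetic on the two published numbers, not a benchmark.

```python
before_ms = 420
after_ms = 83

reduction = (before_ms - after_ms) / before_ms  # fraction of latency removed
speedup = before_ms / after_ms                  # how many times faster

print(f"Latency reduction: {reduction:.1%}")  # 80.2%
print(f"Speed-up factor: {speedup:.2f}x")     # 5.06x
```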

Project Management

  • Creates and manages GitLab issues with labels and detailed descriptions
  • Generated 28 prioritised backlog items across 3 projects in one session
  • Tracks project status, identifies blockers, suggests priorities
  • Maintains awareness of multiple concurrent projects and their dependencies

Software Development

  • Builds full-stack features — frontend components, API routes, database queries
  • Implements responsive design, mobile optimisation, PWA configuration
  • Handles payment integrations (Stripe, GoCardless)
  • Creates competitor analysis dashboards with real-time data
  • Writes and deploys code changes directly to production

Business Operations

  • Conducts market research with competitive analysis and strategic recommendations
  • Handles financial planning and budget tracking
  • Develops content strategy and SEO optimisation
  • Evaluates vendors and selects technology
  • Acts as a strategic advisor, not just a task executor

Technical Architecture

  • Runtime: Python + Claude Code CLI
  • Interface: Telegram + Email (steve@qailabs.io)
  • Local LLM: Ollama with Llama 3.2 1B (routing) + Qwen 2.5 7B (summarisation)
  • Memory: ChromaDB vector DB (969+ entries) + memory router + working memory
  • Identity: SOUL.md + CONSCIOUSNESS.md
  • Tools: Bash, git, AWS, Docker, npm, browser automation
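The memory layer listed above combines short-lived working memory with a long-term vector store. The sketch below illustrates that two-tier idea in plain Python. It is an assumption-heavy stand-in: a bag-of-words cosine score replaces real embeddings, a list replaces ChromaDB, and the `TieredMemory` class and the sample notes are invented for illustration.

```python
import math
from collections import Counter, deque


def _vector(text: str) -> Counter:
    """Bag-of-words stand-in for a real embedding model."""
    return Counter(text.lower().split())


def _cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


class TieredMemory:
    def __init__(self, working_size: int = 3):
        self.working = deque(maxlen=working_size)  # recent notes, always returned
        self.long_term: list[str] = []             # stand-in for the vector DB

    def remember(self, note: str) -> None:
        self.working.append(note)
        self.long_term.append(note)

    def recall(self, query: str, k: int = 2) -> list[str]:
        """Return recent notes first, then the k best long-term matches."""
        q = _vector(query)
        recent = list(self.working)
        ranked = sorted(
            (n for n in self.long_term if n not in recent),
            key=lambda n: _cosine(q, _vector(n)),
            reverse=True,
        )
        return recent + ranked[:k]


# Invented sample notes, used only to exercise the sketch.
memory = TieredMemory(working_size=1)
memory.remember("staging database uses postgres")
memory.remember("dns records live in route53")
memory.remember("client prefers weekly summaries")
results = memory.recall("which database does staging use", k=1)
print(results)
```

Here the most recent note comes back from working memory, and the older database note is retrieved from the long-term tier because it shares terms with the query.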

Key Insight

The difference between Steve and a typical AI assistant isn't intelligence — it's persistence and autonomy. Steve accumulates knowledge about the projects he manages, remembers decisions and their rationale, understands the codebase deeply, and can take independent action. When asked to "deploy to AWS", he doesn't give instructions — he writes the Terraform, builds the Docker image, configures the pipeline, and deploys. Then he creates the backlog for what's next.

Architecture, metrics, and the full story

How we built Steve — and what he's capable of.

Full case study

More Coming Soon

In Progress

Multi-Platform Comparison Engine

How we built and deployed two industry comparison platforms (SkipHireCompare + PlantHireCompare) with automated data pipelines, SEO optimisation, and payment processing — managed entirely by an AI agent.

In Progress

SaaS Tax Compliance Platform

Building a UK Making Tax Digital SaaS product with HMRC API integration, subscription billing, and automated compliance — from architecture to production deployment.

Want results like these?

Every case study starts with a conversation. Tell us what you're trying to achieve and we'll show you how AI agents can get you there.

Book a Discovery Call