AI Training Specialist
at UnifiedBeez Ltd
Position: AI Training Specialist (Conversation
Tagging)
Company: UnifiedBeez Ltd, London, England
Duration: October 2025 - Present
Work Mode: Full-time Remote
About the Role
As an AI Training Specialist at UnifiedBeez Ltd, I lead the design, implementation, and maintenance of
the company's intelligent conversation-tagging platform. UnifiedBeez is building smart, scalable
conversation intelligence solutions for businesses across Africa and beyond.
Core Responsibilities
- Conversation Tagging System: End-to-end ownership of UnifiedBeez's multi-layer
tagging system using Rules → Classifier → LLM fallback architecture, from design to production
operation.
- Rules Layer: Design and maintain human-editable YAML/JSON rules files with
keywords, regex patterns, and negative phrases. Implement versioning, change reviews, and
comprehensive audit trails.
- ML Classifier Development: Build and maintain lightweight multi-label classifiers
using sentence embeddings with One-vs-Rest logistic regression/SVM. Set per-tag thresholds and
calibrate probabilities for optimal performance.
- LLM Fallback Implementation: Implement cost-controlled LLM fallback systems using
GPT-4o-mini and Gemini with strict JSON schema/function calling, caching, batching, and
confidence-based escalation.
- Data Engineering & Active Learning: Bootstrap datasets with weak labels from rules,
run active learning loops, curate gold standard datasets, and manage dataset versioning for
reproducible training and evaluation.
- API Development & Integration: Design, expose, and maintain the /tag endpoint and
related services. Ensure backward compatibility, performance SLAs, and seamless integration with
backend and frontend teams.
- Automation Engine: Define tag-triggered automations with event schemas, implement
webhooks/queues with retries and backoff, map tags to workflows (assign, escalate, notify,
autorespond, CRM updates), and monitor automation success rates.
- Monitoring & Metrics: Build comprehensive dashboards for coverage, per-tag
precision/recall/F1, LLM usage percentage, latency, and cost per message. Ensure every applied tag
logs a complete audit trail (rule ID, model version, confidence, LLM trace).
- Performance Optimization: Optimize inference latency and throughput, implement load
testing, rate limiting, graceful degradation paths, and plan model version rollouts using blue/green
and canary deployment strategies.
- Security & Compliance: Enforce access controls, implement encryption for data in
transit and at rest, apply data minimization and redaction techniques, and adhere to company
security policies and applicable laws.
- Documentation & Collaboration: Maintain clear documentation for rules, models,
endpoints, automations, and operational runbooks. Produce weekly summaries of progress, blockers,
and priorities. Work closely with Product, Backend, Frontend, and PM teams.
Key Technologies & Tools
- Machine Learning: Sentence Embeddings, One-vs-Rest Classification, SVM, Logistic Regression, Model
Calibration
- LLMs: GPT-4o-mini, Gemini, Function Calling, JSON Schema Validation
- Backend: Python, RESTful APIs, Webhooks, Queue Systems
- Data Engineering: YAML/JSON processing, Active Learning, Dataset Versioning
- DevOps: Blue/Green Deployment, Canary Releases, Load Testing, Performance Monitoring
- Cloud Services: AWS, Caching Systems, Database Management
- Security: PII Redaction, Data Encryption, Access Control, Compliance
Key Performance Metrics
The role involves meeting stringent performance targets including:
- Precision ≥ 85% and Recall ≥ 70% for top revenue-relevant tags
- ≥ 75% coverage through Rules + Classifier without LLM usage
- LLM usage rate ≤ 10% (rolling 30-day average)
- Latency p95 ≤ 120ms per request at 250 rps
- Cost per message ≤ $0.0010 including cache hits
- Cache hit rate ≥ 75% for LLM calls
- 99.9% monthly uptime for tagging services
Impact & Value
This role is critical to UnifiedBeez's mission of providing intelligent conversation intelligence to
businesses. By developing a robust, scalable, and cost-effective tagging system, I'm helping
organizations automatically categorize and understand customer conversations, enabling automated
workflows, better insights, and improved customer experiences across various industries in Africa and
beyond.