Production over demos
If it can't survive real traffic, it isn't done. Everything I ship is dockerized, monitored, and built to be handed over.
Taimour Abdul Karim · Data Scientist · AI Engineer
LLM evaluation, agentic systems, and production GenAI for teams from London to California. I measure models before I trust them: red-teaming, hallucination benchmarks, judge calibration, regression gates. Currently building Bryge.io at Datality while pursuing an MSc in AI at LUMS.

Shipped for Datality / Bryge.io · Qult Technologies · BornGreat · LUMS
I'm a data scientist and AI engineer. Three years in, I own 94% of the codebase behind Bryge.io, a multi-tenant industrial analytics platform where LLM agents answer plain-English questions over live sensor data. My specialty is the part most teams skip: evaluation. I red-team models, measure hallucination by topic, calibrate LLM judges against blind human labels, and put regression gates in CI so prompt changes can't silently break production. MSc in AI at LUMS; 2,500+ GitHub stars.
If it can't survive real traffic, it isn't done. Everything I ship is dockerized, monitored, and built to be handed over.
LLMs earn trust by citing sources. My RAG systems constrain every answer to retrieved context. No confident hallucinations.
“+20% accuracy” means a benchmark, not a feeling. Improvements get numbers, baselines, and reproducible runs.
Research is easy. Production is the test, and these systems passed it.
01 · Flagship · 2026 · LUMS graduate research · Productionized
The Geographic Disparity Index
A clinical assistant that tells a Boston patient to come in for a visit but tells an identical Lagos patient to manage it at home is an equity failure with real stakes. Standard accuracy benchmarks cannot see geography-driven or name-driven disparity, so it goes unmeasured.
System notes
AWS Bedrock · OpenAI · Groq · FastAPI · Streamlit · Wilcoxon / BCa bootstrap
Don't take my word for it. The code is public, and 2,400+ developers starred it.
441 followers · 30 public repositories · contributing since Jul 2021
Jan 2024 – Present
London (Remote)
AI/ML Engineer
Aug 2023 – Dec 2024
Lahore
AI/ML Engineer
Sep 2022 – Feb 2023
California (Remote)
Data Scientist
Jun 2021 – Aug 2021
Lahore
Python Developer (Intern)
MSc. in Artificial Intelligence
Lahore University of Management Sciences (LUMS)
Sep 2024 – Present
BSc. in Data Science
National University of Computing and Emerging Sciences
Sep 2020 – Jun 2024
Open to AI/ML engineering roles, remote or hybrid
or +92 326 1127700 · usually replies within a day