AI Engineer · Data Scientist · Makkah, KSA

I build production ML systems that explain themselves.

Arabic NLP
AI Engineer & Data Scientist focused on Explainable AI, privacy-preserving systems, and deterministic, well-documented production pipelines.

yousefammarabdullah.a@gmail.com github.com/ysfxjo55 Data Science · Umm Al-Qura University
0
Concurrent AI roles
Qanoniah · Moasherat · Rakaya
0
Live LLM tools
Rakaya Arabic copilot
0
Shipped systems
production · research · live
01 Selected work

Systems built to be trusted in production.

// Each one ships with honest benchmarks, graceful degradation, and a clear boundary between what decides and what explains.

02 More builds

Agents, pipelines & learning in public.

// Tooling and research I build to learn fast and ship faster.

03 Experience

Three AI roles, one Hajj season, zero shortcuts.

// Currently holding three concurrent AI roles while conducting active research.

CURRENT
Remote · Makkah
AI Engineer
Rakaya — Refada operations platform
Led the enhancement of «مساعد ركايا» — an Arabic admin copilot (Laravel + OpenAI function calling) that answers operational questions over live Hajj/Umrah data through 40+ database-backed tools.
Designed the Arabic intent-routing layer so multi-entity questions hit the right backend — no cross-entity hallucinations — and hardened the tool loop with conversation memory, retries, and Arabic error handling.
CURRENT
Hybrid · Makkah
Data Scientist Intern
Qanoniah — Legal Tech
Built an Arabic legal corpus pipeline — scraped and structured 500+ Saudi law documents into a JSONL corpus with metadata for NLP evaluation.
Evaluated Arabic embedding models via the MTEB leaderboard (2024–2026 releases), producing a ranked shortlist for production retrieval.
Built an OCR pipeline (AlOCR API) with Arabic normalization and WER/CER evaluation, plus a cached Balsam leaderboard scraper as a FastAPI service.
CURRENT
On-site · Makkah
AI Engineer Intern
Moasherat
Built an Umrah pilgrim permit extraction system (v2): PaddleOCR Arabic OCR + a FastAPI field parser pulling name, nationality, permit ID and dates from scanned permits.
Developed an Arabic analytics chatbot — a FastMCP server bridging Cube.dev + Directus into LibreChat for natural-language querying of live operational data.
HAJJ 2025
On-site · Makkah
Data Analyst — Hajj Season
Moasherat
Monitored real-time dashboards and KPIs across Hajj 2025 operations.
Analyzed performance irregularities and managed satisfaction-survey data under high-pressure, time-critical conditions.
04 Toolkit

What I reach for.

ML / NLP
HuggingFaceMARBERTAraBERTPyTorchscikit-learngensim
Explainable & Federated
SHAPLIMEPyMCFedAvgIID / non-IID
Vision / OCR
InsightFacePaddleOCROpenCVDeepFaceFAISS
Infrastructure
FastAPIDockerQdrantPostgreSQLMinIORailwayRender
Languages
PythonJavaScriptHTML / CSSSQL
Specialties
Arabic NLPExplainable AIFederated LearningPrivacy by DesignProduction ML
05 Credentials

Certifications & training.

// Continuous, structured learning — KAUST Academy, DeepLearning.AI, AWS.

2026
Advanced AI — Computer Vision
KAUST Academy
2026
Artificial Intelligence — Introduction
KAUST Academy
2025
Deep Learning Specialization
DeepLearning.AI
2025
Mathematics for ML & Data Science Specialization
DeepLearning.AI
2025
eJDS — Junior Data Scientist
INE
2025
Generative AI with AWS · ML Foundations
Udacity · AWS Educate
06 Get in touch

Let's build something that explains itself.

Open to AI / ML engineering and data science opportunities. The fastest way to reach me is email — or browse the source on GitHub.