mouadja02/README.md

Mouad Jaouhari — Data & AI Engineer


Portfolio · LinkedIn · Email


~/ whoami  ·  the manifest, as an agent would read it

# mouad.agent.yaml — system manifest
identity:
  name:    Mouad Jaouhari          # alias: MJ
  role:    Data & AI Engineer
  company: ATOS
  motto:   "From signals to systems — now with a mind of their own."

mission: >
  Turn raw, messy data into autonomous systems that
  retrieve, reason, and act — not just dashboards that report.

operating_loop: [ plan, retrieve, act, observe, reflect ]

stack:
  data:    [ Snowflake, Spark, Airflow, Kafka, dbt, Iceberg ]
  agentic: [ LangGraph, LangChain, RAG, CAG ]
  memory:  [ Qdrant, Pinecone ]
  tooling: [ Claude Code, OpenAI Codex, Cursor ]
  cloud:   [ AWS, GCP, Cloudflare ]

currently_building: [ dacli, dag-doctor, CAG-Lab ]
status: "open to agentic-AI & data-platform collaborations"

Agent Orchestration Graph


How I think about agents — the reasoning loop

I don't build chatbots; I build systems that plan, pull their own context, take actions, and self-correct. Every agent I ship runs some version of this loop:

%%{init: {'theme':'dark','themeVariables':{'primaryColor':'#0d1117','primaryTextColor':'#e7e5e4','primaryBorderColor':'#00f0ff','lineColor':'#a855f7','fontFamily':'monospace','fontSize':'15px'}}}%%
flowchart LR
    U([ prompt ]) --> P[plan]
    P --> R["retrieve<br/>Qdrant · Pinecone"]
    R --> A["act<br/>tools · APIs · SQL"]
    A --> O[observe]
    O --> D{goal met?}
    D -->|no| Rf[reflect] --> P
    D -->|yes| Out([ answer / action ])
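A minimal sketch of the loop in the diagram, in plain Python. It is an illustration only, not the code behind any project listed here: `AgentState`, `run_agent`, the `toy_*` stubs, and the in-memory `DOCS` store are all hypothetical names, and a keyword match stands in for a real vector search in Qdrant or Pinecone.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Optional


@dataclass
class AgentState:
    """What the loop carries between iterations (hypothetical structure)."""
    prompt: str
    notes: List[str] = field(default_factory=list)  # reflections fed back into planning
    trace: List[str] = field(default_factory=list)  # ordered log of the stages that ran
    answer: Optional[str] = None


def run_agent(
    prompt: str,
    plan: Callable[[AgentState], str],
    retrieve: Callable[[str], List[str]],
    act: Callable[[str, List[str]], str],
    goal_met: Callable[[str], bool],
    reflect: Callable[[str], str],
    max_steps: int = 5,
) -> AgentState:
    """Run plan -> retrieve -> act -> observe, reflecting and replanning until the goal is met."""
    state = AgentState(prompt=prompt)
    for _ in range(max_steps):  # hard cap so a failing agent cannot loop forever
        step = plan(state)
        state.trace.append("plan")
        context = retrieve(step)
        state.trace.append("retrieve")
        observation = act(step, context)
        state.trace.append("act")
        state.trace.append("observe")
        if goal_met(observation):
            state.answer = observation
            return state
        state.notes.append(reflect(observation))
        state.trace.append("reflect")
    return state  # answer stays None if the step budget runs out


# Toy stand-ins so the sketch runs without any external service.
DOCS = {
    "airflow": "Airflow schedules DAGs.",
    "kafka": "Kafka streams events.",
}


def toy_plan(state: AgentState) -> str:
    # First attempt uses the raw prompt; later attempts use the latest reflection.
    return state.notes[-1] if state.notes else state.prompt


def toy_retrieve(query: str) -> List[str]:
    # Keyword match standing in for a vector search.
    return [text for key, text in DOCS.items() if key in query.lower()]


def toy_act(step: str, context: List[str]) -> str:
    # "Acting" here just returns the first retrieved passage, or nothing.
    return context[0] if context else ""


def toy_goal_met(observation: str) -> bool:
    return observation != ""


def toy_reflect(observation: str) -> str:
    # A real agent would ask a model what went wrong; this stub rewrites the query.
    return "kafka"


result = run_agent(
    "How do we stream events?",
    toy_plan, toy_retrieve, toy_act, toy_goal_met, toy_reflect,
)
print(result.answer)
print(result.trace)
```

The first pass retrieves nothing because the prompt contains no store key, so the loop reflects once and replans with the rewritten query. The `max_steps` cap is a design choice worth keeping in any real version: it bounds cost when the goal check never passes.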

Data to Intelligence Pipeline


Technical Arsenal

Agentic AI · LLM Engineering

LangChain LangGraph RAG Qdrant Pinecone Claude OpenAI Codex Cursor Hugging Face n8n

Data Engineering

Snowflake Apache Spark Airflow Kafka dbt Iceberg PostgreSQL

Languages · ML

Python PyTorch TensorFlow Pandas C CUDA

Cloud · DevOps

GCP AWS Cloudflare Docker Terraform Bash


Capability Matrix


Featured Builds — where Data meets Agents

dacli  ·  An intelligent CLI agent that builds production-grade data warehouses from scratch on your chosen data platform.

Agentic · LangGraph · Snowflake

dag-doctor  ·  An AI assistant that explains failed Airflow DAGs, pinpoints the root cause, and suggests safe fixes.

AI Ops · Airflow · LLM

CAG-Lab  ·  Benchmarking Cache-Augmented Generation (CAG) against RAG for real-world knowledge systems.

Research · RAG vs CAG · Vector DB

skills  ·  A curated collection of 149 agent skills for Claude Code & Cursor, covering engineering craft, AI agents, prompt engineering, and more.

149 skills · Claude Code · Cursor

bitcoin-technical-indicators-dataset  ·  Bitcoin hourly OHLCV with 70+ technical indicators, a daily-updated dataset for ML & trading analysis.

Dataset · Pandas · ML

end2end-datawarehouse-project  ·  End-to-end data platform with real-time streaming and cloud processing, built on Kafka, Spark, AWS Glue, Snowflake, and Iceberg.

Streaming · Kafka · Spark

also brewing  ·  murmur (local-LLM prompt overlay)  ·  mnemex (LLM memory vault)  ·  neural-network-cuda (NN from scratch → CUDA)


By the Numbers



Activity Graph


Let's build systems that think.

From raw signals → reasoning systems.

Portfolio LinkedIn Email

Pinned

  1. bitcoin-technical-indicators-dataset (Public)

    Bitcoin Hourly OHLCV with 70+ Technical Indicators | Daily Updated Dataset for ML & Trading Analysis

    Python 26 4

  2. skills (Public)

    A curated collection of 149 agent skills for Claude Code and Cursor — engineering craft, AI agents, prompt engineering, design, growth marketing, C-suite advisors, DevOps, and more.

    Python 2

  3. bitcoin-news-data (Public)

    Comprehensive dataset of Bitcoin news articles and Fear & Greed Index data collected hourly from major cryptocurrency publications (May 2025 - June 2025)

    Python 8

  4. end2end-datawarehouse-project (Public)

    End-to-end data engineering pipeline with real-time streaming, cloud processing, and analytics. Built with Apache Kafka, Spark, AWS Glue, and Snowflake using Apache Iceberg tables.

    Python 4

  5. airflow-self-hosted (Public)

    A production-ready Apache Airflow deployment system for personal data engineering projects, optimized for Raspberry Pi and self-hosted environments with automated CI/CD pipeline.

    Python 2

  6. murmur (Public)

    A desktop overlay that sits on top of everything, listens when you tap a hotkey, transcribes your voice locally, runs the transcription through a local LLM with a chosen prompt-engineering "skill",…

    TypeScript