rrahimi-uci/README.md

Reza Rahimi

AI/ML Engineering Manager · Architect and Builder

Building and scaling trustworthy, production-grade AI/ML products: agentic AI, LLMs, LLM safety & guardrails, evaluation systems, and scalable ML infrastructure (MLOps / AgentOps).

🔭 Currently: exploring how to build, evaluate, calibrate, scale, and safely deploy LLM applications and AI agents efficiently.

💞️ Open to collaborating on open-source work in AI/ML: LLMs · generative AI · agentic workflows · AI evaluation · AgenticOps.

📫 GitHub · Pronouns: He/Him


🚀 Featured Projects

A tiny, fast SLM safety guardrail for LLMs & agents — screens prompts, tool calls, and outputs before they reach your model. Fine-tuned + RL-tuned (GRPO), benchmarked against GPT-4o-mini, with a training studio and an honest scoreboard. LLM safety · guardrails · prompt-injection · jailbreak detection · content moderation · small language models · MLflow

Open-source MLflow plugin for AI agents and agentic workflows — prompts, tools, skills, MCP servers, RAG knowledge bases, evaluation, deployment, and observability. MLflow · LLMOps · AI agents · evaluation · observability · MCP · RAG

ACE (Agentic Context Engineering): evolving, self-improving context playbooks for LLM agents. Faithful ICLR 2026 implementation with OpenAI Agents SDK support. context engineering · self-improving agents · in-context learning · agent memory

A clean reference implementation of the Agent-to-Agent (A2A) protocol — specialized AI agents that discover each other and collaborate over JSON-RPC 2.0. Python · FastAPI · Pydantic. multi-agent systems · agent interoperability · A2A · FastAPI

Enterprise compliance automation — turn compliance documents into queryable knowledge graphs via a multi-agent AI pipeline, with an interactive graph explorer. knowledge graphs · compliance · RegTech · multi-agent · JanusGraph

Reinforcement learning (PPO/A2C/DQN) that dynamically tunes AML risk-scoring weights per case — Gymnasium env, FastAPI backend, React training dashboard. reinforcement learning · AML · RegTech · PPO · risk scoring

Domain-agnostic, single-node tabular AutoML pipeline (Dagster + FLAML + MLflow + FastAPI) with drift monitoring and an online feature store. AutoML · MLflow · Dagster · drift detection · tabular ML

AI-powered mock-interview assistant for ML engineering, leadership/behavioural, and coding interviews. Gradio · LangChain · OpenAI · Whisper. interview prep · LangChain · speech-to-text · generative AI


🛠️ Focus Areas

Agentic AI · LLM safety & guardrails · prompt-injection / jailbreak detection · LLM & agent evaluation · MLflow / LLMOps / MLOps · RAG · reinforcement learning (RLHF/GRPO/PPO) · fine-tuning (SFT/LoRA) · knowledge graphs · multi-agent systems

Python · PyTorch · Transformers · TRL · MLflow · FastAPI · LangChain · React


⭐ If any of these are useful, a star helps others find them too.

Pinned

  1. agentic-context-engineering Public

    ACE — Agentic Context Engineering: evolving, self-improving context playbooks for LLM agents. Faithful ICLR 2026 implementation with OpenAI Agents SDK support.

    Python 1

  2. a2a-poc Public

    A clean reference implementation of the Agent-to-Agent (A2A) protocol: specialized AI agents that discover each other and collaborate over JSON-RPC 2.0. Python · FastAPI · Pydantic.

    Python 1

  3. policy-to-knowledge Public

    Enterprise compliance automation: transform compliance documents into queryable knowledge graphs via a multi-agent AI pipeline, with an interactive graph explorer.

    HTML 1

  4. rl-anti-money-laundry Public

    Reinforcement learning (PPO/A2C/DQN) that dynamically tunes Anti-Money-Laundering risk-scoring weights per case — Gymnasium env, FastAPI backend, and a React training dashboard.

    Python 1

  5. interviewer-gpt Public

    Guru.AI — an AI-powered mock-interview assistant for ML engineering, leadership/behavioural, and coding interviews. Built with Gradio, LangChain, and the OpenAI API.

    Python 1

  6. buyer-stage-prediction Public

    Real-estate buyer-stage classifier on a domain-agnostic, single-node tabular AutoML pipeline (Dagster + FLAML + MLflow + FastAPI), with drift monitoring and an online store.

    Python 1