SAADMAN RAFAT - AI SYSTEMS ENGINEER

AI Systems Engineer focused on building the infrastructure for the agentic web. I specialize in productionizing LLMs, architecting MCP servers, and optimizing modern Python ecosystems.

From provisioning high-performance VMs to implementing real-time AI grounding with Google Gemini, I build the plumbing that makes intelligent applications scalable and secure.

EXPERTISE

AGENTIC AI (MCP) | GOOGLE GEMINI | SYSTEM ARCHITECTURE | MODERN PYTHON (UV) | VECTOR EMBEDDINGS | DEVOPS | DEEP LEARNING

RESEARCH PUBLICATIONS

Pruning Convolution Neural Networks Using Filter Clustering

Khan, Niaz Ashraf, and A. M. SAADMAN RAFAT. 2024.

"Pruning Convolution Neural Networks Using Filter Clustering Based on Normalized Cross-Correlation Similarity." Journal of Information and Telecommunication.

Optimizing Deep Learning Models for Resource-Constrained Environments

Khan, Niaz Ashraf, and A. M. SAADMAN RAFAT. 2025.

"Optimizing Deep Learning Models for Resource-Constrained Environments With Cluster-Quantized Knowledge Distillation." Engineering Reports.