
AI Engineer
I'm an AI Engineer with a solid track record of turning advanced AI models into real-world, production-ready systems. I began my career at FPT Software, developing scalable real-time video analytics and OCR systems for edge devices and servers using DeepStream, Triton, and microservices. There, I gained deep expertise in optimizing performance, accuracy, and deployment pipelines for AI solutions.
I now work at Success Software Services, focusing on large language models and conversational AI. My recent work includes building a Retrieval-Augmented Generation (RAG) platform with high-performance vector search and a real-time Voice Agent for healthcare and wellness using WebRTC and streaming inference. I’m passionate about combining high-performance engineering with natural, human-like AI experiences, following best practices in MLOps, observability, and scalability.

An AI operations platform for managing LLM deployments with advanced RAG capabilities.

Codewiki is an intelligent documentation tool that automatically generates wiki-style documentation for your GitHub or local repositories using AI.

Plug-and-Play Custom Parsers for AI Models in NVIDIA DeepStream SDK. Supported YOLOv11, D-FINE, SCRFD model.

Face recognition pipeline powered by Triton Inference Server.
Recognized for dedication and contributions to team success
A competition focused on real-world business challenges, and honing AI project deployment skills
Detecting Ships in Ports to Avoid Congestion and Manage Traffic
Placed second in Qualifying, advanced to the Finals, and finished in the top 4