Aditya Raj — AI Systems Engineer

// 02 — Selected Work

Systems built,
not prototyped.

Client Project · Private

🔒 Closed Source

01 — Legal AI Platform

AI Legal Intelligence Platform — US Law Firms

Full-stack AI legal platform engineered for US law firms, comprising two products.

Product 1, Document Comparison Tool: automated contract-vs-template diff engine. The user uploads a legal document; the engine classifies it (NDA, lease, sales contract) and generates a structured clause-level diff report in seconds using LLM-as-Judge reasoning, catching semantic deviations that a keyword diff misses.

Product 2, AI Legal Assistant Platform: agentic multi-role platform with a LangGraph triage → research → draft workflow for client intake, case management, and lawyer review queues. Supports either a public LLM API or a fully on-premise private SLM deployment, so privileged attorney-client data stays under the firm's control.
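The clause-level comparison can be sketched roughly as follows. This is a minimal illustration, not the production engine (which is closed source): the blank-line clause splitter, the `ClauseFinding` type, and the `normalized_judge` stand-in are all assumptions made for the example. In a real system the `judge` callable would wrap an LLM call that decides whether two clauses differ in meaning.

```python
import difflib
import re
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class ClauseFinding:
    template_clause: Optional[str]
    contract_clause: Optional[str]
    verdict: str  # "match", "deviation", "missing", or "added"


def split_clauses(text: str) -> List[str]:
    # Hypothetical splitter: one clause per blank-line-separated block.
    return [c.strip() for c in re.split(r"\n\s*\n", text) if c.strip()]


def normalized_judge(template_clause: str, contract_clause: str) -> str:
    # Stand-in for an LLM judge: only ignores case and whitespace differences.
    def norm(s: str) -> str:
        return " ".join(s.lower().split())
    return "match" if norm(template_clause) == norm(contract_clause) else "deviation"


def compare_clauses(
    template: str,
    contract: str,
    judge: Callable[[str, str], str] = normalized_judge,
) -> List[ClauseFinding]:
    t, c = split_clauses(template), split_clauses(contract)
    findings: List[ClauseFinding] = []
    # Align the two clause sequences, then judge only the pairs that differ.
    matcher = difflib.SequenceMatcher(None, t, c, autojunk=False)
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag == "equal":
            findings += [ClauseFinding(x, x, "match") for x in t[i1:i2]]
        elif tag == "delete":
            findings += [ClauseFinding(x, None, "missing") for x in t[i1:i2]]
        elif tag == "insert":
            findings += [ClauseFinding(None, y, "added") for y in c[j1:j2]]
        else:  # "replace": pair clauses by position inside the changed span
            tpl, con = t[i1:i2], c[j1:j2]
            for k in range(max(len(tpl), len(con))):
                a = tpl[k] if k < len(tpl) else None
                b = con[k] if k < len(con) else None
                if a is not None and b is not None:
                    findings.append(ClauseFinding(a, b, judge(a, b)))
                elif a is not None:
                    findings.append(ClauseFinding(a, None, "missing"))
                else:
                    findings.append(ClauseFinding(None, b, "added"))
    return findings
```

Keeping alignment separate from judging means the expensive judge only runs on clause pairs that already differ textually.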

LangGraph Agents · Graph-RAG / Neo4j · voyage-law-2 · LLM-as-Judge · Weaviate · Docling · Cohere Rerank · FastAPI · Celery + Redis · K8s / AWS

System Design Docs

DeepTech · In Development

Active Build

02 — Longevity AI Platform

Tessera — AI-Driven Longevity & Biomarker Platform

A deep-tech longevity platform that builds personalised health-extension programs from a 56-marker biomarker panel. The AI engine ingests blood work (metabolic, hormonal, inflammatory, haematological, cardiac markers) to calculate a client's PhenoAge biological age using the Levine formula, then outputs a four-lever protocol — nutrition, training, recovery, and supplementation — calibrated to their specific aging trajectory. Integrates an AI + doctor hybrid workflow: the AI handles intake (127 structured questions), biomarker interpretation, and protocol design; a licensed physician reviews and approves. Tiered programs (Foundations / Performance / Continuum) with 90-day tracking cycles and re-assessment loops.
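For readers unfamiliar with PhenoAge, the calculation can be sketched as below. This assumes the coefficients and units of the published Levine et al. (2018) model, which uses nine blood markers plus chronological age; how Tessera maps its 56-marker panel onto these inputs is not described here. The `PhenoAgeInputs` type and field names are hypothetical, and the constants should be checked against the original paper before any real use.

```python
import math
from dataclasses import dataclass

GAMMA = 0.0076927  # Gompertz shape parameter, as given in the published model


@dataclass
class PhenoAgeInputs:
    age_years: float
    albumin_g_l: float        # g/L
    creatinine_umol_l: float  # umol/L
    glucose_mmol_l: float     # mmol/L
    crp_mg_dl: float          # mg/dL, must be > 0 (log-transformed)
    lymphocyte_pct: float     # %
    mcv_fl: float             # fL
    rdw_pct: float            # %
    alp_u_l: float            # U/L
    wbc_1000_per_ul: float    # 1000 cells/uL


def phenoage(x: PhenoAgeInputs) -> float:
    # Linear predictor over the nine markers plus chronological age.
    xb = (
        -19.907
        - 0.0336 * x.albumin_g_l
        + 0.0095 * x.creatinine_umol_l
        + 0.1953 * x.glucose_mmol_l
        + 0.0954 * math.log(x.crp_mg_dl)
        - 0.0120 * x.lymphocyte_pct
        + 0.0268 * x.mcv_fl
        + 0.3306 * x.rdw_pct
        + 0.00188 * x.alp_u_l
        + 0.0554 * x.wbc_1000_per_ul
        + 0.0804 * x.age_years
    )
    # Modelled 10-year (120-month) mortality risk under a Gompertz hazard.
    mortality_10y = 1.0 - math.exp(
        -math.exp(xb) * (math.exp(120 * GAMMA) - 1.0) / GAMMA
    )
    # Convert the risk back into an age in years.
    return 141.50225 + math.log(-0.00553 * math.log(1.0 - mortality_10y)) / 0.090165
```

The output is an age in years; the gap between it and chronological age is the signal a protocol engine would act on.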

PhenoAge Algorithm · Biomarker Intelligence · 56-Marker Panel · AI Protocol Engine · LLM Interpretation · Doctor-AI Workflow · Longitudinal Tracking · Hyderabad Launch

Platform Rulebook

Research + Engineering

03 — Audio AI

Prosody-Content Disentangled Audio Deepfake Detector

Audio deepfake detection architecture targeting ICASSP/INTERSPEECH-level results. The core insight: existing detectors overfit to per-generator acoustic artifacts that change with every new spoofing model. This system trains a lightweight TCN/ResNet prosody encoder against a frozen Wav2Vec2 teacher via adversarial disentanglement: a Gradient Reversal Layer penalizes the encoder for encoding speech content, forcing it to generalize through prosodic cues alone. Trained on audio from 119+ spoofing models (MLAAD), interleaved 50/50 with LibriSpeech. At inference, the heavy Wav2Vec2 is discarded and only the compact, edge-deployable encoder remains.
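A Gradient Reversal Layer is small enough to show in full. The sketch below is the standard PyTorch formulation, not code from this project: it is the identity on the forward pass and flips (and scales) the gradient on the backward pass. The name `lambd` and the idea of placing a content-prediction head behind the layer are assumptions for illustration.

```python
import torch
from torch.autograd import Function


class GradReverse(Function):
    """Identity in the forward pass; multiplies gradients by -lambd in the backward pass."""

    @staticmethod
    def forward(ctx, x, lambd):
        ctx.lambd = lambd
        return x.view_as(x)

    @staticmethod
    def backward(ctx, grad_output):
        # One gradient per forward input: reversed for x, None for lambd.
        return -ctx.lambd * grad_output, None


def grad_reverse(x: torch.Tensor, lambd: float = 1.0) -> torch.Tensor:
    return GradReverse.apply(x, lambd)


# Assumed usage: an adversarial content head reads the prosody embedding
# through the reversal layer, so minimizing the content loss for the head
# pushes the encoder to remove content information from the embedding.
#   content_logits = content_head(grad_reverse(prosody_embedding, lambd=0.5))
```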

Wav2Vec2 · TCN/ResNet · Adversarial Training · GRL · MLAAD Dataset · EER Optimization

GitHub Repository

Production System

04 — AI Video Engine

VidSimplify — Manim AI Animation Engine

15K+

Minutes Generated

1K+

Users

4

Enterprise Clients

Full-scale AI video production engine that converts natural language prompts into polished Manim animations. Built on a Reflexion-style multi-step LLM reasoning loop: scene decomposition → code synthesis → self-critique → validation → render. Uses a cascade of models (DeepSeek for code generation, smaller specialist models for scene planning and error correction). Async GPU-backed rendering pipeline with job resumption, rate-limiting, and intelligent caching. Real-time editing capability on generated animations. Scaled to 4 enterprise clients and 15,000+ processed minutes.
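The synthesis → self-critique → validation part of that loop can be sketched as below. This is a simplified illustration under stated assumptions: `synthesize` and `critique` are hypothetical callables standing in for LLM calls, and validation is reduced to a syntax check with `ast.parse`, whereas a production pipeline would presumably also run the scene and inspect render errors.

```python
import ast
from typing import Callable, List, Optional, Tuple


def validate_python(code: str) -> Optional[str]:
    """Return None if the code parses, otherwise a short error description."""
    try:
        ast.parse(code)
    except SyntaxError as exc:
        return f"SyntaxError on line {exc.lineno}: {exc.msg}"
    return None


def reflexion_loop(
    prompt: str,
    synthesize: Callable[[str, List[str]], str],
    critique: Callable[[str, str], str],
    max_attempts: int = 3,
) -> Tuple[str, int]:
    """Generate code, validate it, and feed critiques back until it passes.

    Returns the accepted code and the number of attempts used.
    """
    reflections: List[str] = []
    for attempt in range(1, max_attempts + 1):
        # The generator sees every earlier critique, not just the latest one.
        code = synthesize(prompt, reflections)
        error = validate_python(code)
        if error is None:
            return code, attempt
        reflections.append(critique(code, error))
    raise RuntimeError(f"no valid code after {max_attempts} attempts")
```

Bounding the loop with `max_attempts` keeps a stubborn prompt from consuming GPU render budget indefinitely.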

Reflexion Agent · DeepSeek · Manim · LLM Orchestration · Async GPU Pipeline · Production Scale

VidSimplify.com · GitHub Repository

05 — High-Scale Video AI

AI Viral Clip Intelligence Engine

30K+

Minutes Processed

Production video intelligence backend that ingests hour-long videos and surfaces the highest-engagement clips using ML virality scoring. The pipeline: audio-visual feature extraction → engagement signal modeling → clip boundary detection → automated subtitle generation → multilingual dubbing with voice synthesis. Transcription engine handles speaker diarization and timestamp alignment. Built for scale — async job queuing, distributed workers, and persistent state management across long-running GPU tasks. Served 30,000+ minutes of processed content in production.
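Once per-second engagement scores exist, picking clips reduces to a windowing problem. The sketch below is one simple way to do it, assuming a fixed clip length and a greedy choice of non-overlapping windows; the function name and the scoring input are hypothetical, and the real engine's boundary detection is likely more involved.

```python
from typing import List, Tuple


def top_clips(scores: List[float], clip_len: int, k: int) -> List[Tuple[int, int, float]]:
    """Greedily pick up to k non-overlapping windows with the highest mean score.

    scores[i] is the engagement score for second i. Each result is
    (start, end, mean_score) with end exclusive, sorted by start time.
    """
    if k <= 0 or clip_len <= 0 or clip_len > len(scores):
        return []
    # Prefix sums make every window mean an O(1) lookup.
    prefix = [0.0]
    for s in scores:
        prefix.append(prefix[-1] + s)
    windows = [
        ((prefix[i + clip_len] - prefix[i]) / clip_len, i)
        for i in range(len(scores) - clip_len + 1)
    ]
    # Best mean first; ties go to the earlier window.
    windows.sort(key=lambda w: (-w[0], w[1]))
    chosen: List[Tuple[int, int, float]] = []
    for mean, start in windows:
        end = start + clip_len
        if all(end <= s or start >= e for s, e, _ in chosen):
            chosen.append((start, end, mean))
            if len(chosen) == k:
                break
    return sorted(chosen)
```

Greedy selection is not guaranteed to maximize the total score across all clips, but it is easy to reason about and always returns the single best window first.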

Virality Scoring · Whisper / ASR · Multilingual TTS · Video Segmentation · Distributed Workers · Async GPU

GitHub Repository

06 — AI Identity Synthesis

VidSimplify Cloner — AI Video Personality Engine

End-to-end AI video transformation platform enabling complete personality cloning, lip-sync dubbing, and multi-language localization. The system fuses three synchronized AI tracks: voice cloning (zero-shot speaker embedding extraction → neural vocoder synthesis), facial replication (identity-preserving face re-enactment with landmark-driven motion transfer), and lip synchronization (phoneme-to-viseme mapping with video-grade temporal alignment). Designed for professional content localization workflows — a single source video can be dubbed and visually cloned into any target language with no manual intervention.

Voice Cloning · Lip Sync · Face Re-enactment · Zero-Shot TTS · Global Dubbing · Video Synthesis

GitHub Repository

07 — EdTech AI Platform

Knewbit Max — Adaptive AI Learning Platform

Next-generation personalized learning platform powered by Google Gemini LearnLM. Implements Socratic tutoring methodology — the AI guides through questions rather than direct answers, with dynamic cognitive load management. Multi-modal learning stack: multilingual video dubbing pipeline for course content, auto-generated adaptive flashcards and quizzes from course material, skill-graph-aware course recommendation engine. The recommendation system infers learning trajectories from enrollment history to serve personalized learning paths. Built with H.264-optimized async video processing, YouTube URL ingestion, and full progress analytics.

Gemini LearnLM · Socratic AI · Multilingual Dubbing · Recommendation Engine · Adaptive Learning · EdTech

GitHub Repository

08 — Medical AI

Handwritten Prescription OCR & Clinical AI

Production-grade OCR pipeline targeting the notoriously difficult domain of handwritten Indian doctor prescriptions. Systematic multi-model benchmarking across MiniCPM-V, Gemini Vision, and LLaMA vision variants to identify optimal accuracy-latency tradeoffs. The pipeline extracts structured JSON (drug name, dosage, frequency, route) from low-quality scans and photographs. Extended into a full clinical intelligence layer: integrated HuaTuo-GPT for radiology report interpretation, lab value analysis, and diagnostic support. Evolved into Docmate — a full multimodal medical chatbot handling prescriptions, lab reports, and imaging.
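The structured-output step can be illustrated with a small validator. This is a sketch under assumptions: the JSON key names (`drug_name`, `dosage`, `frequency`, `route`) mirror the fields listed above but are not confirmed as the pipeline's actual schema, and `parse_model_output` is a hypothetical helper that tolerates extra prose around the JSON array in a vision model's reply.

```python
import json
from dataclasses import dataclass
from typing import List

REQUIRED_FIELDS = ("drug_name", "dosage", "frequency", "route")


@dataclass
class PrescriptionItem:
    drug_name: str
    dosage: str
    frequency: str
    route: str


def parse_model_output(raw: str) -> List[PrescriptionItem]:
    """Extract and validate a JSON array of prescription items from model text.

    Raises ValueError when the array is absent, malformed, or an item lacks a
    required field, so bad extractions fail loudly instead of passing through.
    """
    start, end = raw.find("["), raw.rfind("]")
    if start == -1 or end <= start:
        raise ValueError("no JSON array found in model output")
    try:
        data = json.loads(raw[start:end + 1])
    except json.JSONDecodeError as exc:
        raise ValueError(f"invalid JSON: {exc}") from exc

    items: List[PrescriptionItem] = []
    for i, entry in enumerate(data):
        if not isinstance(entry, dict):
            raise ValueError(f"item {i} is not an object")
        missing = [
            f for f in REQUIRED_FIELDS
            if entry.get(f) is None or not str(entry[f]).strip()
        ]
        if missing:
            raise ValueError(f"item {i} is missing fields: {missing}")
        items.append(
            PrescriptionItem(**{f: str(entry[f]).strip() for f in REQUIRED_FIELDS})
        )
    return items
```

Rejecting incomplete items outright is a deliberate choice for a medical setting: a missing dosage is safer surfaced as an error than silently defaulted.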

MiniCPM-V · Gemini Vision · LLaMA OCR · HuaTuo-GPT · Clinical NLP · Medical AI

Docmate — Vimeo Demo GitHub Repository

// 04 — Experience

Where the systems
went live.

Senior Software Engineer

Arcesium, Hyderabad

Designed and scaled compliance infra systems, reducing query latency 10x via backend re-architecture and data structure optimization.
Led AI adoption across compliance products — built reusable infra patterns now deployed across multiple product lines.
Built config-driven form infrastructure, AOP-based auth, and audit logging tools used across all compliance domains.
Resolved critical production incidents across Kubernetes, PostgreSQL, and SSL layers.

Jul 2025 – Present

Founder — Kalman Labs / VidSimplify

Hyderabad, India

Built and shipped VidSimplify.com — AI animation engine processing 15,000+ minutes for 1,000+ users and 4 enterprise clients.
Architected Docmate: multimodal medical AI chatbot for clinical summarization, prescription OCR, and imaging analysis.
Built cell-state simulation engine using Mamba-style SSMs + neural operators for drug perturbation trajectory prediction.
Designed and delivered AI Legal Intelligence Platform for US law firms — dual-product system covering document comparison and multi-agent legal assistant workflows.
Currently building Tessera — a deep-tech AI longevity platform using biomarker intelligence to personalise biological age reversal protocols.

Nov 2024 – Oct 2025

Software Engineer

Arcesium, Hyderabad

Owned development of major UI + backend features serving 100K+ records at scale.
Deployed tooling integrated with Kafka, SQS, KEDA, EC2, and role-based access systems.

Jul 2023 – Jun 2025

Deep Learning Researcher

Pucho Digital Health Inc., Remote

Researched Private AI in healthcare: Federated Learning + Homomorphic Encryption. Built signal-image preprocessing pipeline and deployed FastAPI backend on AWS EC2.

Jan – May 2022

Aditya Raj

Engineer who ships
at research depth.

Published work in
speech & signal ML.

Got a hard
AI problem?
