Nikhil Bindal | Portfolio

Just A Rather Very Intelligent System

J.A.R.V.I.S · v4.7 · online

Below: Prepared briefings · scroll to access

Prepared Briefings · Curated by J.A.R.V.I.S

▸ Live Mission · Active Engagement

› MISSION · NRG-001·Transmitting · Live·class::ALPHA

Jan 2026 → Present San Francisco, CA

▸ Subject · Neurologyca — AI Coaching Infrastructure (Contract)

Architecting a multi-agent AI coaching platform that fuses biometric signal analysis with persistent personal memory.

Owning system design end to end on GCP Cloud Run — a 6-agent orchestration pipeline (Kopernica), the Mnemosyne memory subsystem, and a parallel voice sidecar that runs sentiment / query / analytics off the hot path. Currently shipping the production-grade Python + JavaScript SDKs and the multi-tenant BYOD surface that exposes it all.

› AI Latency

9.5se2e

↓ 73% from 36s

› Pipeline DAG

6agents

Coordinator-orchestrated

› Mnemosyne Memory

5lanes

Context-scoped, per-user

› SDKs Shipped

32ops

Python + JavaScript

▸ Signal Flow›Voice + HCI→Coordinator · 6-agent DAG→Response · 9.5s·Mnemosyne · 5 lanes·Gemini Live async sidecar

▸ What I'm Building · Active Subsystems

6-Agent Orchestration

Re-architected a serial 6-agent pipeline into a coordinator-driven DAG with prompt consolidation, model tiering, and async parallel execution. End-to-end latency 36s → 9.5s.

Mnemosyne Memory

Context-scoped memory architecture with semantic retrieval and temporal-decay lifecycle (Core → Dream → Forgotten → Deleted), bucketed per user and domain so context cannot cross boundaries.

Voice Sidecar (Gemini Live)

Parallel voice sidecar running sentiment, query, and analytics pipelines asynchronously off the hot path — they never add to perceived response time.

Concurrency & Sessions

Redis-backed concurrency control with per-session sliding-window locks prevents state corruption when multiple real-time voice turns hit the same session simultaneously.

Python + JS SDKs · 32 ops

Shipped both SDKs covering orchestration, memory, analytics, sessions, and BYOD multi-tenant infrastructure — with CI/CD, audit logging, and signed webhook delivery.

Production Ops on GCP

CI/CD pipelines, audit logging, signed webhooks, circuit breakers, and graceful shutdown — all on Cloud Run + Cloud SQL + Memorystore behind a private VPC.

▸ Deployed Assets · Stack

PythonFastAPINode.jsGCP Cloud RunCloud SQL (Postgres)Memorystore (Redis)Vertex AI · GeminiGemini LiveCrewAILangGraphKubernetesPrometheusTerraform

UPLINK · STABLE · CONTRACT ENGAGEMENTMISSION ONGOING

What I build

Services I Offer

Four operational modules — focused capabilities backed by shipped production work.

› MODULE-01·OPEN

class::ALPHA

−73%

latency · 6-agent

AI Infrastructure & Agentic Systems

Multi-agent orchestration with persistent memory and pipeline tuning.

› Capabilities

Multi-agent orchestrationMemory architecturesLangGraph / CrewAI

› Deployed Assets

PythonFastAPICrewAIQdrantGCP

› Driven by

01·craft 02·vision 06·leverage

›Initiate Dialog

› MODULE-02·OPEN

class::ALPHA

<200ms

voice latency

Real-Time Voice & RAG

Sub-200ms conversational AI with multimodal retrieval and streaming STT/TTS.

› Capabilities

LiveKit / WebRTCMultimodal RAGStreaming STT/TTS

› Deployed Assets

LiveKitDeepgramCartesiaQdrant

› Driven by

01·craft 03·velocity 02·vision

›Initiate Dialog

› MODULE-03·OPEN

class::BETA

22K TPS

event throughput

Scalable Backend Engineering

Event-driven systems with exactly-once semantics and observability.

› Capabilities

Event-driven workflowsDistributed systemsCloud-native ops

› Deployed Assets

Node.jsKafkaPostgreSQLKubernetes

› Driven by

01·craft 04·ownership 03·velocity

›Initiate Dialog

› MODULE-04·LIMITED

class::ADVISORY

6+ yrs

shipping AI

AI Consulting & Architecture

Architecture reviews, AI roadmaps, and hands-on guidance for production teams.

› Capabilities

Architecture reviewAI strategyCode & design reviews

› Deployed Assets

StrategyArchitectureMentoring

› Driven by

05·judgment 06·leverage 04·ownership

›Initiate Dialog

Selected work

Featured Projects

A curated selection of recent production AI systems and full-stack work — each rendered as a JARVIS mission file.

View entire archive

› FILE-001

ACTIVE·class::OPS

RecoMe — Personalized Recommendation Engine

Personal interest-graph and agentic recommendation engine turning cross-platform activity into a typed Neo4j interest graph with SSE-streamed recommendations.

› Key Metric

15+CROSS-PLATFORM SOURCES INTO A TY

▸ domain: AI · Agentic · Recommendation Systems · Personalization

› Deployed Assets

TypeScriptHonoPrismaNeo4jQdrantRedisBullMQStripe+ 2 more

Repo

› FILE-002

ACTIVE·class::ALPHA

Donna — Voice Document Intelligence Platform

AI meeting-intelligence platform automating pre-meeting research, live in-meeting assistance, and post-meeting synthesis with real-time voice and hybrid RAG.

› Key Metric

15AGENT 3-PHASE ORCHESTRATION BEHI

▸ domain: AI · Voice AI · RAG · Multi-Agent

› Deployed Assets

PythonFastAPIPostgreSQLQdrantRedisLiveKitWebRTCDeepgram+ 2 more

Repo

› FILE-004

ACTIVE·class::ALPHA

Manifold Strata — Geometric Low-LLM Retrieval Engine

Knowledge-graph retrieval engine that minimises LLM calls by doing entity resolution and validation in embedding and rule space; concepts live in hyperbolic (Poincaré) geometry.

› Key Metric

3DGS" → "3D GAUSSIAN SPLATTING"):

▸ domain: AI · Knowledge Graph · RAG · Agentic

› Deployed Assets

TypeScriptHonoDrizzlePostgreSQLReactMCP Server

Repo

View entire archive