Projects | Aayush Kumar

Showing 11 of 11 projects

⭐ Featured

HyperSentry

An agentic security copilot that combines policy checks with retrieval to generate remediation PRs and accelerate incident analysis.

False positives under 3%
Auto-merged 30%+ of remediation PRs

Python, RAG, OPA/Rego, GitHub API, +2 more

Building · ML Systems

SchemaPulse

OpenAI-compatible inference layer for schema-valid structured outputs with streaming tool-call parsing.

Streaming incremental tool-call parsing
Adaptive guided decoding for schema validity

TypeScript, Python, vLLM, Pydantic, +1 more

Research · Research

EdgeJury

Multi-LLM council with cross-review for truthful QA on serverless edge inference.

Cross-reviewed small-model ensembles
Structured critique + synthesis

TypeScript, React, Cloudflare Workers, LLM APIs

In Development · Research

Ghosts in the Scale

Prompt injection via image resampling, and certified defenses for multimodal systems.

Black-box downscaler fingerprinting
ScaleJail-mini benchmark

Python, PyTorch, Pillow, OpenCV, +1 more

All Projects

Live · RAG/Agents

TariffWhisperer

End-to-end RAG system over CBP rulings for explainable HTS code recommendations.

Dense + BM25 hybrid retrieval
Sub-500ms retrieval latency

Python, FAISS, BM25, Hybrid Retrieval
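One common way to combine a dense ranking with a BM25 ranking is reciprocal rank fusion (RRF). The sketch below is illustrative only: the page does not say how TariffWhisperer fuses its two retrievers, and the function name, the constant `k=60`, and the ruling ids are assumptions made for the example.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of document ids into one list, best first.

    Each document scores 1 / (k + rank) per list it appears in; k=60 is a
    conventional default, not a value taken from the project.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties broken by id for determinism.
    return sorted(scores, key=lambda d: (-scores[d], d))


# Hypothetical ruling ids returned by each retriever, best first.
dense_hits = ["r3", "r1", "r7"]
bm25_hits = ["r1", "r9", "r3"]
fused = reciprocal_rank_fusion([dense_hits, bm25_hits])
print(fused)
```

Documents that both retrievers rank highly rise to the top, while documents found by only one retriever are still kept further down the list.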

Building · AI Security

Adversarial Phishing Detection

Simulated arms race in which an LLM attacker and a classifier defender co-evolve.

LLM-powered phishing generation
Continuous adversarial training

Python, BERT, RoBERTa, GPT-2, +1 more

In Development · Research

Lookahead RAG

Speculative retrieval planning for low-latency multi-hop QA.

Retrieval dependency graph planning
Parallel retrieval execution

Python, LangChain, vLLM

Building · Research

BudgetBench

Adaptive reasoning-budget control for cost-efficient LLM inference.

Accuracy–cost Pareto curves
Adaptive budget escalation

Python, Open Models, Colab/M1

Building · RAG/Agents

PaperChatbot

Structure-aware RAG for research papers with verifiable answers and an abstain option.

Section-structured retrieval
Every-claim-must-be-cited mode

Python, LangChain, PDF Parsing

In Development · Research

AdapterLLRD

Memory-free multilingual continual learning with PEFT + LLRD.

50+ hop sequences without replay
Privacy-compatible (no buffers)

Python, PyTorch, LoRA, mBERT, +1 more
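Layer-wise learning rate decay (LLRD) assigns smaller learning rates to layers closer to the input. A minimal sketch of the schedule follows; the function name, the default `base_lr`, and the default `decay` are assumptions for illustration, and the page does not state the values or the exact scheme AdapterLLRD uses.

```python
def llrd_learning_rates(num_layers, base_lr=1e-4, decay=0.9):
    """Return one learning rate per layer, ordered from bottom layer to top.

    The top layer trains at base_lr; each layer below it is scaled by
    another factor of `decay`, so lower layers change more slowly.
    """
    return [base_lr * decay ** (num_layers - 1 - i) for i in range(num_layers)]


# Toy values chosen so the result is easy to check by hand.
rates = llrd_learning_rates(num_layers=3, base_lr=1.0, decay=0.5)
print(rates)
```

In a PEFT setup, each layer's adapter parameters would presumably be placed in their own optimizer parameter group with the matching rate from this list.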

Live · Research

OOD-Aware Fairness

Selective classification for toxicity detection with fairness constraints.

OOD score inversion diagnosis
86% reduction in FPR gap

Python, PyTorch, RoBERTa
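Selective classification lets a classifier abstain when it is not confident enough. The sketch below shows the basic idea for a binary toxicity score; the function name and the `0.8` threshold are hypothetical, and the project's actual selection rule (including how OOD scores and fairness constraints enter) is not described on this page.

```python
def selective_predict(toxic_prob, threshold=0.8):
    """Return 1 (toxic), 0 (non-toxic), or None to abstain.

    Confidence is the probability of the more likely class; the model
    abstains whenever that confidence falls below `threshold`.
    """
    confidence = max(toxic_prob, 1.0 - toxic_prob)
    if confidence < threshold:
        return None  # abstain, e.g. defer to a human reviewer
    return 1 if toxic_prob >= 0.5 else 0


decisions = [selective_predict(p) for p in (0.95, 0.55, 0.10)]
print(decisions)
```

Raising the threshold trades coverage for accuracy: more inputs are deferred, and the remaining predictions are the ones the model is most sure about.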