Abhinandan Dubey

graphs, language models, and the occasional painting

About

Hello from New York. I like building things, looking at paintings, and reading books I'll never finish. I studied computer science, spent time in computer vision research, and ended up working on AI and security at Amazon Web Services, where I think about graphs, language models, and how to make cloud security less noisy.

Outside of work I build whatever catches my attention. Lately that's been an AI wardrobe app, a tool that harvests art from museum APIs, and a blog I write when something is worth writing about. I've been studying French for a few years now, and I read less than I probably should.

Experience
April 2021 · Present

Amazon Web Services

Security Analytics & AI Research, GuardDuty

February 2018 · April 2021

Nomura Securities

Data Management Technology

Education

Stony Brook University

M.S. Computer Science

Computer Vision & HCI · 2017

Birla Institute of Applied Sciences

B.Tech (Hons.) Computer Science

First Division with Honors · 2016

Projects
Bref · Python / LLM Tooling

Prompt compression and token cost optimizer for LLM APIs. Multi-pass entropy scoring, TF-IDF pruning, response caching, and model routing.

Airavat · Scala / Spark

Metric interceptor and job watchdog for Spark. Collects query plans, tracks disk spill and shuffle, kills resource-hogging jobs.

Segraph · Python / Vision

Builds graphs from SLIC superpixels for CRF-based image segmentation. Published on PyPI. Born out of cell segmentation research at Stony Brook.

DeepScore · Python / NLP

Automated essay scoring using RNNs and feature engineering. Semantic analysis, clause parsing, discourse flow.

SMOTE for Spark · Scala / ML

Synthetic Minority Over-sampling Technique implemented for SparkML. Handles class imbalance in large-scale distributed datasets.

Delacroix · Python / Art

Harvests openly licensed artwork from the Louvre, the Met, the Art Institute of Chicago, and the Rijksmuseum.

Nomade · iOS / SwiftUI

A language-learning app with a dark, editorial aesthetic. Share text from anywhere, extract vocabulary, quiz yourself.

Ligne · iOS / Vision

An AI wardrobe app. Scan your clothes, get AI descriptions, generate outfits. Minimal interface, editorial sensibility.

FusePod · Python / Biology

An RNA world simulator. Nucleotides bond, pair, and form compounds in a procedurally generated biome.

Torque · Scala / Avro

Serialize Spark DataFrames to and from Avro. A bridge between two serialization worlds.

Megaclite · Python / JupyterHub

Resource manager for JupyterHub. Also a moon of Jupiter.

Adaptive Malware Detection · Python / Security

Few-shot in-context learning for classifying malicious HTTP traffic. Fine-tunes BERT on packet captures from CTU-13 and WRCCDC datasets.

Reading
now · L'Étranger · Albert Camus
Le Petit Prince · Saint-Exupéry
Circe · Madeline Miller
The Truths We Hold · Kamala Harris
107 Days · Kamala Harris
A House in Fez · Suzanna Clarke
De la Terre à la Lune · Jules Verne
Zero · Charles Seife
The Miraculous True History of Nomi Ali · Uzma Aslam Khan
Principles · Ray Dalio
The Life We Bury · Allen Eskens
Hum · Helen Phillips
Lettres à Yves · Pierre Bergé
Notes from Underground · Dostoevsky
All the Light We Cannot See · Anthony Doerr
Giovanni's Room · James Baldwin
next · The Emperor of Gladness · Ocean Vuong
next · Co-Intelligence · Ethan Mollick
next · Bonjour Tristesse · Françoise Sagan
next · Classification Struggles · Pierre Bourdieu

A partial list. Always looking for recommendations.

Assemblage Parfumée

Notes on scent, memory, and the small rituals of wearing something well. Bilingual reviews of fragrances encountered in shops, on skin, and sometimes only in passing. Written in French and English, because some things lose themselves in translation.

Read on Substack

Observations

Places, light, and things that caught my attention.

Patents & Publications

Detection of Malicious Domains

US Patent No. 12,418,558 B1 · 2025

CNN Based Yeast Cell Segmentation in Multi-Modal Fluorescent Microscopy Data

CVMI, IEEE CVPR Workshop · 2017

Classification of CKD Cases Using MultiVariate K-Means Clustering

International Journal of Scientific & Research Publications · 2015

Contact