Software Engineer · AI Research · New York

Abhinandan
Dubey

graphs, language models, and the occasional painting

About

Hello from New York. I like building things, looking at paintings, and reading books I'll never finish. I studied computer science, spent time in computer vision research, and ended up working on AI and security at Amazon Web Services, where I think about graphs, language models, and how to make cloud security less noisy.

Outside of work I build whatever catches my attention. Lately that's been an AI wardrobe app, a tool that harvests art from museum APIs, and a blog I write when something is worth writing about. I've been studying French for a few years now, and I read less than I probably should.

Experience
April 2021 · Present

Amazon Web Services

Security Analytics & AI Research, GuardDuty

February 2018 · April 2021

Nomura Securities

Data Management Technology

Education

Stony Brook University

M.S. Computer Science

Computer Vision & HCI · 2017

Birla Institute of Applied Sciences

B.Tech (Hons.) Computer Science

First Division with Honors · 2016

Projects
Scala · Spark · Monitoring

Airavat

A metric interceptor and job watchdog for Spark. Collects query plans, tracks disk spill and shuffle, kills resource-hogging jobs before they take down your cluster. Ships with a React UI.

Python · Computer Vision · PyPI

Segraph

Builds graphs from SLIC superpixels for CRF-based image segmentation. Published on PyPI. Born out of the cell segmentation research at Stony Brook.

Python · NLP · Deep Learning

DeepScore

Automated essay scoring using RNNs and feature engineering. Semantic analysis, clause parsing, discourse flow. A two-fold study on grading with minimal training data.

Scala · Apache Spark

SMOTE for Spark

Synthetic Minority Over-sampling Technique implemented for SparkML. Handles class imbalance in large-scale distributed datasets.

Python · Museums · Art

Delacroix

Harvests openly licensed artwork from the Louvre, the Met, the Art Institute of Chicago, and the Rijksmuseum. Intelligent query building from a comprehensive art knowledge database.

iOS · SwiftUI · Language

Nomade

A language-learning app with a dark, editorial aesthetic. Share text from anywhere, extract vocabulary, quiz yourself. Built for quiet daily study.

iOS · SwiftUI · Computer Vision

Ligne

An AI wardrobe app. Scan your clothes, get AI descriptions, generate outfits. Minimal interface, editorial sensibility.

Python · Biology · Simulation

FusePod

An RNA world simulator. Nucleotides bond, pair, and form compounds in a procedurally generated biome. An experiment in origin-of-life chemistry.

Scala · Avro · Spark

Torque

Serialize Spark DataFrames to and from Avro. A bridge between two serialization worlds.

Python · GPT-4 · AWS Lambda

En Français

Daily French exercises delivered to your inbox. Translation drills, vocabulary, text extracts. Runs on Lambda for essentially $0.

Python · BERT · Network Security

Adaptive Malware Detection

Few-shot in-context learning for classifying malicious HTTP traffic. Fine-tunes BERT on packet captures from CTU-13 and WRCCDC datasets.

Python · PyTorch · Graph Neural Networks

Graph ML

Node classification, link prediction, and graph clustering with GNNs. GAT, GraphSAGE, and heterogeneous attention on CORA and ogbn-arxiv. Stanford CS224W coursework.

Reading
L'Étranger
Albert Camus · current
Le Petit Prince
Antoine de Saint-Exupéry
Circe
Madeline Miller
The Truths We Hold
Kamala Harris
107 Days
Kamala Harris
A House in Fez
Suzanna Clarke
De la Terre à la Lune
Jules Verne
Zero: The Biography of a Dangerous Idea
Charles Seife
The Miraculous True History of Nomi Ali
Uzma Aslam Khan
Principles: Life and Work
Ray Dalio
The Life We Bury
Allen Eskens
Hum
Helen Phillips
Lettres à Yves
Pierre Bergé
Notes from Underground
Fyodor Dostoevsky
All the Light We Cannot See
Anthony Doerr
Giovanni's Room
James Baldwin
The Emperor of Gladness
Ocean Vuong · to read
Co-Intelligence: Living and Working with AI
Ethan Mollick · to read
Bonjour Tristesse
Françoise Sagan · to read
Classification Struggles
Pierre Bourdieu · to read

A partial list. Always looking for recommendations.

Assemblage Parfumée

Notes on scent, memory, and the small rituals of wearing something well. Bilingual reviews of fragrances encountered in shops, on skin, and sometimes only in passing. Written in French and English, because some things lose themselves in translation.

Read on Substack

Sur Papier
coming soon
coming soon
coming soon

Sketches, paintings, and other things made by hand. More at art.abhinandandubey.com

Observations
coming soon
coming soon
coming soon
coming soon
coming soon
coming soon

Places, light, and things that caught my attention.

Patents & Publications

Detection of Malicious Domains

US Patent No. 12,418,558 B1 · 2025

CNN Based Yeast Cell Segmentation in Multi-Modal Fluorescent Microscopy Data

CVMI, IEEE CVPR Workshop · 2017

Classification of CKD Cases Using MultiVariate K-Means Clustering

International Journal of Scientific & Research Publications · 2015

Contact