Reza Esfandiarpoor
I am a Ph.D. candidate in Computer Science at Brown University, working with Stephen Bach. I am fortunate to have interned at Google (AdsAI team) and Microsoft (Office AI team). And, I am currently interning at Nvidia in the Nemo Retriever team, where I explore agentic information retrieval.
My research focuses on optimizing the interaction between large pretrained models to better understand their behavior and improve their performance across different domains. Here is a summary of my research:
-
AI Agents. I introduced a cheaper, faster, and more effective approach for building general-purpose agents using massive task-specific tool sets. I also created TheMCPCompany, a benchmark for evaluating enterprise agents with more than 18,000 tools and complex, realistic tasks, like managing Microsoft Azure resources.
-
Vision-Language Models. I introduced Extract and Explore, a novel analysis method that uses reinforcement learning to align an LLM with VLM preferences. By analyzing the aligned LLM outputs, I discovered that non-visual and even spurious information significantly impact VLM concept representations. I introduce Follow-up Differential Descriptions, a new method that adapts LLM-generated concept descriptions to each individual VLM to improve performance.
-
Information Retrieval. I introduced SyCL, a new synthetic data generation method that creates fine-grained training data with multi-level relevance labels for training dense retrievers. I also created Trove, an open-source toolkit for dense retrieval that simplifies IR experiments without sacrificing flexibility.
Before Ph.D., I completed my undergraduate studies with Shadrokh Samavi, where I worked on pruning and quantization methods to create efficient medical image analysis models for edge devices.