Gabriel Cha

i like to think, explain, build.

agentic harnesses that reason, search the web, and synthesize information fascinate me. so does fine-tuning llms on a100s to push quality on narrow tasks.

M.S. Data Science, Columbia

B.S. Data Science, UC San Diego

Paper

Accepted to ACM SIGCSE 2026

ContentGen: Improving LLM-Generated Educational Content for Data Science

Jiaen Yu*, Ylesia Wu*, Gabriel Cha, Ayush Shah, Sam Lau · UC San Diego

An experience report on prototyping a JupyterLab extension that generates programming practice questions inside the instructor's notebook — evaluated through evidence-based prompt engineering and a usability study with six data science instructors.

acm digital library

Selected Works

May 2025

Cloi: a local debugging agent for the terminal

An open-source CLI with a local LLM harness — runs on Ollama or Claude, retrieves relevant code via a CodeBERT + BM25 RAG, and proposes diffs for failing commands.

395 stars · writeup

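
To make the retrieval step concrete, here is a minimal sketch of the lexical (BM25) half of a hybrid retriever. It is not Cloi's actual code: the function name `bm25_scores`, the whitespace tokenizer, the `k1` and `b` defaults, and the sample snippets are all assumptions for illustration, and the CodeBERT embedding half is left out entirely.

```python
import math
from collections import Counter


def bm25_scores(query, documents, k1=1.5, b=0.75):
    """Score each document against the query with Okapi BM25.

    Hypothetical helper: tokenization is plain lowercase whitespace
    splitting, which a real code retriever would likely replace.
    """
    tokenized = [doc.lower().split() for doc in documents]
    n_docs = len(tokenized)
    avg_len = sum(len(tokens) for tokens in tokenized) / n_docs

    # Document frequency: in how many documents does each term appear?
    doc_freq = Counter()
    for tokens in tokenized:
        doc_freq.update(set(tokens))

    scores = []
    for tokens in tokenized:
        term_freq = Counter(tokens)
        score = 0.0
        for term in query.lower().split():
            if term not in term_freq:
                continue
            # Rare terms get a larger weight; the +1 keeps the idf positive.
            idf = math.log(1 + (n_docs - doc_freq[term] + 0.5) / (doc_freq[term] + 0.5))
            # Term frequency saturates via k1 and is length-normalized via b.
            norm = term_freq[term] + k1 * (1 - b + b * len(tokens) / avg_len)
            score += idf * term_freq[term] * (k1 + 1) / norm
        scores.append(score)
    return scores


# Made-up snippets standing in for indexed code chunks.
snippets = [
    "load config from json path",
    "open database connection with retry",
    "raise keyerror for missing config key",
]
scores = bm25_scores("missing config key", snippets)
best = max(range(len(snippets)), key=lambda i: scores[i])
```

In a hybrid setup, a score like this would typically be combined with an embedding similarity before ranking; how Cloi weights the two is not stated here.
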
Awarded UCSD HDSI Research Scholarship 2024

Speech emotion recognition with CNNs, SVMs, and ViTs

12,000 clips across RAVDESS, TESS, CREMA-D, and SAVEE — reaching 86% test accuracy with a CNN. Funded by UCSD's $6,500 HDSI Undergraduate Research Scholarship.

HDSI scholarship

Aug 2023

An N-Gram language model from first principles

Estimating sequence likelihoods from word counts and smoothing — no neural networks.
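
As a rough illustration of the idea, the sketch below builds a bigram model from counts and applies add-k smoothing. The function names, the `<s>` and `</s>` boundary tokens, the whitespace tokenizer, and the toy corpus are assumptions for the example, not the project's actual code, and the original may use a different n or smoothing method.

```python
import math
from collections import Counter


def train_bigram(sentences):
    """Count bigrams and their left contexts over whitespace-tokenized text."""
    bigram_counts = Counter()
    context_counts = Counter()
    vocab = set()
    for sentence in sentences:
        tokens = ["<s>"] + sentence.lower().split() + ["</s>"]
        # Predictable words: everything except the start marker.
        vocab.update(tokens[1:])
        for prev, word in zip(tokens, tokens[1:]):
            bigram_counts[(prev, word)] += 1
            context_counts[prev] += 1
    return bigram_counts, context_counts, vocab


def bigram_prob(prev, word, bigram_counts, context_counts, vocab, k=1.0):
    """Add-k smoothed estimate of P(word | prev)."""
    return (bigram_counts[(prev, word)] + k) / (context_counts[prev] + k * len(vocab))


def sentence_log_prob(sentence, bigram_counts, context_counts, vocab, k=1.0):
    """Sum of log conditional probabilities over the sentence's bigrams."""
    tokens = ["<s>"] + sentence.lower().split() + ["</s>"]
    return sum(
        math.log(bigram_prob(prev, word, bigram_counts, context_counts, vocab, k))
        for prev, word in zip(tokens, tokens[1:])
    )


# Toy corpus, made up for the example.
corpus = ["the cat sat", "the dog sat", "the cat ran"]
model = train_bigram(corpus)
seen = sentence_log_prob("the cat sat", *model)
shuffled = sentence_log_prob("sat the cat", *model)
```

Smoothing is what keeps `shuffled` finite: its bigrams never occur in the corpus, so unsmoothed counts would give a probability of zero. Words outside the training vocabulary would need an explicit unknown token, which this sketch omits.
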