Artificial intelligence · NLP · Learning tools

Researching efficient language models and tools for learning with data.

I am a fourth-year undergraduate at UC San Diego, double majoring in Data Science at the Halıcıoğlu Data Science Institute and Cognitive Science with a specialization in Machine Learning and Neural Computation.

Research

Current Projects

My work focuses on artificial intelligence research, especially machine learning algorithms, large language models, and computational approaches to natural language.

Publications

Single-Pass Document Scanning for Question Answering
Weili Cao*, Jianyou Wang*, Youze Zheng, Longtian Bao, Qirui Zheng, Taylor Berg-Kirkpatrick, Ramamohan Paturi, Leon Bergen
arXiv Preprint, 2025
TL;DR: We trained State Space Models (Mamba-2) for long-context Q&A, achieving performance comparable to GPT-4o on extremely long documents while being more computationally efficient.
@inproceedings{cao2025singlepassdocumentscanning, title={Single-Pass Document Scanning for Question Answering}, author={Cao, Weili and Wang, Jianyou and Zheng, Youze and Bao, Longtian and Zheng, Qirui and Berg-Kirkpatrick, Taylor and Paturi, Ramamohan and Bergen, Leon}, booktitle={Proceedings of the 2025 Conference on Language Modeling (COLM)}, year={2025}, }
How Novices Use Program Visualizations to Understand Code that Manipulates Data Tables
Ylesia Wu*, Qirui Zheng*, Sam Lau
Proceedings of the 56th ACM Technical Symposium on Computer Science Education V. 1 (SIGCSE TS 25), ACM, 2025
TL;DR: A case study on how novices use program visualizations to understand code manipulating data tables, revealing the importance of visualizations in learning programming.
@inproceedings{wu2025novices, title={How Novices Use Program Visualizations to Understand Code that Manipulates Data Tables}, author={Wu, Ylesia and Zheng, Qirui and Lau, Sam}, booktitle={Proceedings of the 56th ACM Technical Symposium on Computer Science Education V. 1}, pages={1267--1273}, year={2025} }

News