Hi! I am a final-year PhD candidate in Computer Science at Columbia University, advised by David Blei. My research is in the intersection of NLP and probabilistic machine learning.
I’m especially interested in:
During my PhD, I spent two summers interning at Meta in applied AI for ads recommendation. Previously, I completed my undergrad and masters at Penn and worked in Chris Callison-Burch’s lab. I also spent a summer at Columbia Business School working with Andrey Simonov.
email: carozheng at cs dot columbia dot edu
[google scholar] [github] [cv]
(* = equal contribution)
Hypothesis Testing the Circuit Hypothesis in LLMs
Claudia Shi*, Nicolas Beltran-Velez*, Achille Nazaret*, Carolina Zheng, Adrià Garriga-Alonso, Andrew Jesson, Maggie Makar, David Blei
International Conference on Machine Learning, 2024
Mechanistic Interpretability Workshop (Oral)
[paper] [code coming soon!]
Revisiting Topic-Guided Language Models
Carolina Zheng, Keyon Vafa, David Blei
Transactions on Machine Learning Research, 2023
[paper] [code]
An Invariant Learning Characterization of Controlled Text Generation
Carolina Zheng*, Claudia Shi*, Keyon Vafa, Amir Feder, David Blei
Proceedings of ACL, 2023
[paper] [code]
Do Suspense and Surprise Drive Entertainment Demand? Evidence from Twitch.tv
Andrey Simonov, Raluca Ursu, Carolina Zheng
Journal of Marketing Research, 2022
[paper]
Complexity-Weighted Loss and Diverse Reranking for Sentence Simplification
Reno Kriz, João Sedoc, Marianna Apidianaki, Carolina Zheng, Gaurav Kumar, Eleni Miltsakaki, Chris Callison-Burch
Proceedings of NAACL, 2019
[paper]