I am a first-year PhD student at Northeastern University, advised by Prof. Weiyan Shi. I obtained my Master’s degree in Computer Science at UMass Amherst where I was lucky to work with Prof. Andrew McCallum and Prof. Hong Yu. I finished my undergraduate study in Computer Science at HKUST, where I was a research assistant at SMART lab working on medical image analysis with Professor Hao Chen and an NLP research intern at IDEA.
Research Interests. I am especially interested in label-efficient learning / weakly-supervised learning and improving the robustness and interpretability of neural networks. I am particularly enthusiastic about applying the implication to biology and medicine where labeled data are scarce and trustworthiness is valued.
I currently work on large language models, especially focusing on safety alignment and interpretation. I also have experience of knowledge distillation from LLMs and reasoning. I am also broadly interested in the memory view of LLMs.
Publications
2024
Large Language Models are In-context Teachers for Knowledge Reasoning
Jiachen Zhao, Zonghai Yao, Zhichao Yang, Hong Yu
EMNLP 24 Findings.
[pdf]Learning and Forgetting Unsafe Examples in Large Language Models
Jiachen Zhao, Zhun Deng, David Madras, James Zou, Mengye Ren
ICML 24.
[pdf]Multistage Collaborative Knowledge Distillation from a Large Language Model for Semi-Supervised Sequence Generation
Jiachen Zhao, Wenlong Zhao*, Andrew Drozdov*, Benjamin Rozonoyer, Md Arafat Sultan, Jay-Yoon Lee, Mohit Iyyer, Andrew McCallum
ACL 24.
[pdf]
Adaptive Fusion of Deep Learning with Statistical Anatomical Knowledge for Robust Patella Segmentation from CT Images
Jiachen Zhao, Tianshu Jiang, Yi Lin, Justin Chan, Ping-Keung Lewis Chan, Chunyi Wen, Hao Chen
Journal of Biomedical & Health Informatics.
[code] [pdf]
2023
SELF-EXPLAIN: Teaching Large Language Models to Reason Complex Questions by Themselves
Jiachen Zhao, Zonghai Yao, Zhichao Yang, Hong Yu
Workshop on robustness of zero/few-shot learning in foundation models @NIPS 23
.
[pdf]
Student as an Inherent Denoiser of Noisy Teacher
Jiachen Zhao
3rd Workshop on Efficient Natural Language and Speech Processing @NIPS 23
.
[pdf]
In-Context Exemplars as Clues to Retrieving from Large Associative Memory
Jiachen Zhao
Neural Conversational AI @ICML 23
. Associative Memory & Hopfield Networks @NIPS 23
.
[pdf]
2022
- Trigger-free Event Detection via Derangement Reading Comprehension
Jiachen Zhao, Haiqin Yang
arXiv.
[pdf]