About Me
I am a Master's student in the Department of Computer Science and Engineering at the University of California, San Diego, where I work as a Graduate Research Assistant in the Hao AI Lab, advised by Prof. Hao Zhang.
I received my Bachelor's degree in Computer Science and Technology from ShanghaiTech University in June 2025, where I worked as an undergraduate researcher advised by Prof. Kewei Tu. During my undergraduate studies, I spent a semester as an exchange student at UC Berkeley.
Currently, I am working on efficient inference for video generation models at UCSD, applying quantization techniques like SVDQuant to accelerate the FastVideo project.
Research Interests
- Efficient Machine Learning Systems
- Long Context Modeling
- Natural Language Processing and Large Language Models
Education
University of California, San Diego
Sep 2025 - Jan 2027 (Expected)
Master of Science in Computer Science and Engineering
La Jolla, CA
University of California, Berkeley
Aug 2023 - Jan 2024
Exchange Student, EECS Department
Berkeley, CA
ShanghaiTech University
Sep 2021 - Jun 2025
Bachelor of Engineering in Computer Science and Technology
Shanghai, China
Publications
Efficient Feed-Forward Network Architecture for Large Language Models
Under Review, 2025
A novel feed-forward network architecture for large language models, designed to improve both model quality and computational efficiency. The proposed structure achieves competitive performance while delivering significant speedup and memory reduction compared to standard architectures.
Projects
Enhancing 3D Character Generation with ControlNet and LoRA
EECS 182/282A | Deep Neural Networks, UC Berkeley, 2023
A project exploring enhanced 3D character generation techniques using ControlNet and LoRA for improved control and quality in generative models.
CUDA/C++ Parallel Image Rendering
Personal Project, 2023
Built a C++ path tracer supporting Lambertian, metal, dielectric, and emissive materials. Implemented motion blur, depth of field, and volumetric effects. Accelerated rendering via CUDA parallelization and importance sampling, achieving a ~200× speedup over a single-threaded CPU baseline.
NeRF Neural Network
Personal Project, 2023
Built a NeRF rendering pipeline based on camera intrinsics and extrinsics and volumetric rendering. Trained and validated the neural model on an RTX 4090 using open-source multi-view image datasets.