
Minshen Zhang
Computer Science Student, Researcher, Developer
About Me
I am currently a Computer Science undergraduate student at ShanghaiTech University, expected to graduate in June 2025. I will be pursuing a Master's degree in Computer Science at UC San Diego starting Fall 2025.
My research interests include Machine Learning, Large Language Models, Natural Language Processing, and Computer Vision. I am particularly interested in developing innovative algorithms and applications that bridge these fields.
I'm open for academic and professional collaborations. If you are interested, please feel free to email me.
Education
UC San Diego
Master in Computer Science (CS75)
ShanghaiTech University
Bachelor of Computer Science and Technology
Overall GPA: 3.69/4.0; Major GPA: 3.8+/4.0
English Tests: TOEFL(100), GRE(320), CET6(602), CET4(607), Duolingo(135)
UC Berkeley Extension
Berkeley GLOBE Student Program
Overall GPA: 4.0/4.0
Research Experience
Shanghai Qizhi Institute
Machine Learning and Computer Graphics Research Group
Advisor: Prof. Tao Du (Tsinghua University)
Developed real-time machine learning algorithms for water surface reconstruction to assist robot control tasks. Based on Snell's law, derived the relationship between water surface normal vectors and visual information, proving the feasibility of machine learning approaches. Implemented core algorithms using Nvidia-Warp-Cuda, achieving breakthrough performance (from 500ms to 8ms per frame).
Shanghai Kewei Tu Lab
Natural Language Processing and Large Language Models Research Group
Advisor: Prof. Kewei Tu (ShanghaiTech University)
Investigating how Feed-Forward Networks (FFN) in Transformers can be viewed as modeling key-value pairs for a set of global concepts. Exploring the modeling of associations between these concepts, which gives rise to a new FFN structure highly similar to the self-attention mechanism.
Shanghai ElanTech Lab
Embodied Intelligence Research Group
Advisor: Prof. Lan Xu (ShanghaiTech University)
Implemented and improved network structures including ResNet, BiRNN, Diffusion, and Transformers. Built large-scale human motion capture facilities using OptiTrack and Z-cam camera arrays for data collection. Integrated existing algorithms with Motion Diffusion Model to reconstruct human motion using IMU sensor signals.
Projects
Arximia: LLM-Agent Based Scientific Collaboration Platform
12/2024 - Present
Building a comprehensive platform for global scientists to enhance reading, searching, discussing, referencing, and managing scientific papers. Using React (Frontend) + Python (Backend) to create an integrated service that leverages the latest LLM-Agent technology for core functionalities.
This project is currently under development, will be available later on arximia.com
Ray Tracing Rendering Engine
10/2023
Built a C++ ray tracing renderer implementing diffuse Lambertian bodies, full/semi-reflective metals, semi-transparent media, light sources, and other materials. Rewritten the entire codebase in Nvidia CUDA C++ and designed Importance Sampling to accelerate rendering, achieving a 200× speed improvement.
Skills & Interests
Technical Skills
- Machine Learning
- Python
- C/C++
- CUDA C++
- Unity/C#
Hobbies
- Piano (Level 10, Shanghai Conservatory of Music)
- Badminton
- Basketball
- Western Philosophy
Honors
- Outstanding Student, ShanghaiTech University (2021-2022, 2023-2024)
- Teaching Assistant, CS100 Computer Programming, ShanghaiTech University (02/2024-06/2024)