Projects

Distributed LLM Training Pipeline

Built distributed pretraining + fine-tuning pipeline using DeepSpeed and vLLM. Optimized inference latency and scaling across clusters.

SZOPGD-AM Optimization Algorithm

Designed a novel zeroth-order optimization algorithm with adaptive momentum. Achieved sublinear convergence rate O(logT / sqrt(T)).

Voice-Based Wheelchair Navigation

Deployed CNN + MFCC on ARM Cortex M4 for real-time speech control.

SmartCane for Visually Impaired

LSTM-based motion gesture recognition on embedded system (93% accuracy).