LLM Research Project “Residual Activation Steering in GPT”
Co-led a project using the open-source LLM "Qwen" to disable generation on select topics while maintaining high performance. Performed rigorous experimental design and analysis. Evaluated results quantitatively using GPT-4, the MMLU benchmark, and BLEU/ROUGE scores (Python, PyTorch, Jupyter, NumPy, Transformers).