About
SangHyeon Park
Cyber Security undergraduate at Ajou University, Republic of Korea.
I study AI Security with a focus on safe and robust behavior of language models and vision-language models.
Research Direction
My current work sits between Security for AI and AI for Security.
| Area | Current questions |
|---|---|
| VLM and LLM safety | How can harmful behaviors, refusal behavior, and safety capability be controlled or interpreted inside modern models? |
| Mechanistic interpretability | Which internal components, such as attention heads and activations, contribute to safe or unsafe responses? |
| Adversarial robustness | How can we find and reduce model failure modes caused by adversarial examples, jailbreaks, and misuse-oriented inputs? |
| AI for vulnerability discovery | How can LLMs improve fuzzing, cyber threat simulation, and automated security analysis? |
| Privacy and misuse prevention | How can AI systems be designed to reduce data leakage and deepfake misuse risk? |
Featured Work
VLM-CAST: Conditional Activation Steering for Safe Response Control
Research project on safe response control in vision-language models. The project explores activation-level steering, refusal behavior, and robustness-oriented evaluation for multimodal AI systems.
- Domain: VLM safety, activation steering, adversarial robustness
- Current status: private research project, public release planned
- Related study: attention-head safety, mechanistic interpretability, LLM safety papers
VXShield: Lightweight Voice Protection Against Deepfake Audio Generation
VXShield is a proactive defense system that adds imperceptible perturbations to Korean speech so that later zero-shot voice-cloning attempts degrade.
- Role: AI security research and implementation
- Core methods: PGD-based perturbation, speaker-encoder attack surface, perceptual and semantic quality evaluation
- Stack: PyTorch, FastAPI, CAM++, WavLM, ECAPA-TDNN, Whisper, Zeroth-Korean
LLM-based Fuzzing
Project and seminar work on using large language models for software testing and vulnerability discovery.
- Reviewed recent LLM-based fuzzing research and built a topic map around LLMs, fuzzing, and generated test cases
- Connected previous firmware/security experience with modern AI-assisted vulnerability discovery
- Presented the topic in a student security seminar
AI-based Cyber Threat Intelligence Profiling
Research project on using AI agents for cyber threat intelligence profiling and cyberpower-related information modeling.
- Role: AI agent developer
- Output: related conference paper in 2025
Cyber Threat Simulation Automation
Research project around LLM-assisted cyber threat simulation and BAS-style automation.
- Role: Blue Team technical analyst
- Output: related conference paper in 2025
UEFI Exploitation Fuzzer
Firmware security project from the Best of the Best program period.
- Role: project manager
- Output: one paper and two assigned CVEs
- CVEs: CVE-2023-30738, CVE-2023-27471
Publications
CA-BAS: PoC-Generative BAS Framework based on LLM
Autumn Annual Conference of IEIE, 2025, Gwangju, Republic of KoreaUser Information-based Cyberpower Related LLM Model
Autumn Annual Conference of IEIE, 2025, Gwangju, Republic of KoreaCyber Threat Response in DeFi: Volatility-based Approach for RugPull Detection
17th KIPS International Conference on Ubiquitous Information Technology and Applications, 2023, Nha Trang, VietnamRugPull Detection Method based on Volatility in DeFi
Conference on Information Security and Cryptography, Winter 2023, Seoul, Republic of KoreaDigital Healthcare Attack Scenario based on DeFi Security Vulnerability
Annual Conference of KIPS, 2023, Busan, Republic of KoreaSecurity Threat Trend based on Drone Embedded System and Network Protocol
Annual Spring Conference of KIPS, 2023, Seoul, Republic of KoreaThe Trend of UEFI Firmware Security
Conference on Information Security and Cryptography, Winter 2022, Seoul, Republic of Korea
Selected Writing
- On the Role of Attention Heads in Large Language Models Safety
- Large Language Model Based Fuzzing Techniques: A Survey
- Several Transformer Models
- Recurrent Neural Network based Language Model
- CCE 2025 PaperLibrary Write-up
Experience
Ajou University
B.S. in Cyber Security, 2021-2027
GPA: 3.89 / 4.50
Whois, Information Security Student Club
- President, 2025
- Vice President, 2023
- Financial Manager, 2022
Education
- Attack the Web Hacking Wargames, 2025
- Web Hacking: Basic to Intermediate, 2022-2023
- C Language Programming, 2022-2023
Study
- Learning AI: NLP to Vision Overview, 2025
- Capture the Flag Team, 2022-2023
- Basic Web Developing, 2021
Seminar
- Road to LLM-based Fuzzing, 2025
- What is Security Consulting?, 2023
- About UEFI Exploitation, 2022
KITRI Best of the Best 11th
Security Consulting Track trainee, 2022-2023
- Completed advanced education
- Worked on UEFI exploitation and firmware security
Republic of Korea Army
Signal Intelligence Specialist, 2024-2025
Skills
Language
- English, TOEFL iBT 84
- Japanese
Communication
- Presentation
- Team management
- Security consulting
Computer Science
- Computer architecture
- Algorithms
- Cyber security
Technical Keywords
AI Security VLM Safety LLM Safety Mechanistic Interpretability Activation Steering
Adversarial Robustness LLM Fuzzing Prompt Injection Cyber Threat Intelligence
UEFI Security DeFi Security Data Privacy and Management Privacy-Preserving AI