Minkyoo Song

Ph.D. Candidate in Electrical Engineering at the Network & System Security (NSS) Lab, KAIST
Daejeon, South Korea
Advisor: Prof. Seungwon Shin

I study the safety and security of large language models and data-driven security systems, with a focus on adversarial ML. My research identifies vulnerabilities in emerging AI paradigms and develops defenses for robust deployment, while applying AI to real-world cybersecurity problems.

Last updated: January 2026

Portrait of Minkyoo Song
Portrait of Minkyoo Song

πŸ“° Publications [C]: conference, [J]: journal, [U]: under review

  1. [C]

    J. Kim, M. Song, S. Shin, S. Son.

    SafeMoE: Safe Fine-Tuning for MoE LLMs by Aligning Harmful Input Routing.

    The Fourteenth International Conference on Learning Representations (ICLR 2026)

  2. [C]

    J. Kim, S.H. Na, M. Song, S. Shin, S. Son.

    MoEvil: Poisoning Expert to Compromise the Safety of Mixture-of-Experts LLMs.

    2025 Annual Computer Security Applications Conference (ACSAC 2025, Distinguished Paper)

  3. [C]

    M. Song, H. Kim, J. Kim, S. Shin, S. Son.

    Refusal Is Not an Option: Unlearning Safety Alignment of Large Language Models.

    34th USENIX Security Symposium (USENIX Security 2025)

  4. [C]

    H. Kim, M. Song, S.H. Na, S. Shin, K. Lee.

    When LLMs Go Online: The Emerging Threat of Web-Enabled LLMs.

    34th USENIX Security Symposium (USENIX Security 2025)

  5. [C]

    M. Song, H. Kim, J. Kim, Y. Jin, S. Shin.

    Claim-Guided Textual Backdoor Attack for Practical Applications.

    The 2025 Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics (NACCL 2025 Findings)

  6. [C]

    J. Kim, M. Song, S.H. Na, S. Shin.

    Obliviate: Neutralizing Task-Agnostic Backdoors within the Parameter-Efficient Fine-Tuning Paradigm.

    The 2025 Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics (NACCL 2025 Findings)

  7. [C]

    M. Song, E. Jang, J. Kim, S. Shin.

    Covering Cracks in Content Moderation: Delexicalized Distant Supervision for Illicit Drug Jargon Detection.

    31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2025)

  8. [C]

    J. Kim, M. Song, M. Seo, Y. Jin, S. Shin.

    PassREfinder: Credential Stuffing Risk Prediction by Representing Password Reuse between Websites on a Graph.

    2024 IEEE Symposium on Security and Privacy (SP) (S&P 2024)

  9. [J]

    J. Kim, M. Song, M. Seo, Y. Jin, S. Shin, J. Kim.

    PassREfinder-FL: Privacy-Preserving Credential Stuffing Risk Prediction via Graph-Based Federated Learning for Representing Password Reuse between Websites.

    Elsevier Expert Systems with Applications (ESWA)

  10. [J]

    J. Choi, J. Kim, M. Song, H. Kim, N. Park, M. Seo, Y. Jin, S. Shin.

    A Large-Scale Bitcoin Abuse Measurement and Clustering Analysis Utilizing Public Reports.

    IEICE Transactions on Information and Systems

  11. [U]

    K. Kim, J. Cui, M. Song, S. Shin.

    Exploring the Familiar Taste of Toxicity: A Causal Influence Analysis of Toxic Comments on Internet Forums.

    Invited to Major Revision at IEEE Transactions on Knowledge and Data Engineering (TKDE)

  12. [U]

    W. Choi*, M. Seo*, M. Song, H. Heo, S. Shin, M. You.

    PC^2: Politically Controversial Content Generation via Jailbreaking Attacks on GPT-based Text-to-Image Models.

    Submitted to 33rd ACM Conference on Computer and Communications Security (CCS 2026)

  13. [U]

    K. Kim, S.H. Na, M. Song, S. Shin.

    Global Meta-path-level Counterfactual Explanation for Heterogeneous Graph Neural Networks by Path Exclusion.

    Submitted to 32nd ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2026)

  14. [U]

    J. Kim, M. Seo, M. Song, S. Shin, J. Kim.

    To Make Each Account Count: Exploring Credential Data Breach Threats through Victim-driven Analysis.

    Submitted to IEEE Transactions on Information Forensics and Security (TIFS)

🌐 Professional Experience

  • S2W Β· Research Intern, AI Team

    Jul 2022 – Feb 2023 Β· South Korea

    • Developed an LLM-based content moderation framework for illicit drug jargon detection, capturing contextual and lexical cues.
  • Reviewer

    2024 – 2025

    • The Web Conference (WWW) 2024
    • ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD) 2024
    • ACL Rolling Review (ARR) 2024, 2025
  • External Reviewer

    2025

    • The Web Conference (WWW) 2025
    • ACM Conference on Computer and Communications Security (CCS) 2025
    • IEEE International Conference on Distributed Computing Systems (ICDCS) 2025
    • International Symposium on Research in Attacks, Intrusions and Defenses (RAID) 2025
    • Annual Computer Security Applications Conference (ACSAC) 2025
  • KAIST Data Intelligence Laboratory Β· Undergraduate Research Intern

    Jan 2020 – Jun 2020 Β· South Korea

    • Conducted big data mining on COVID-19 datasets to surface actionable insights.
  • KAIST Data Mining Laboratory Β· Undergraduate Research Intern

    Jul 2019 – Aug 2019 Β· South Korea

    • Investigated abnormal node detection in bipartite networks via butterfly counting.

πŸŽ“ Education

  • Korea Advanced Institute of Science and Technology (KAIST)

    Mar 2023 – Present Β· Daejeon, South Korea

    Ph.D. Student in Electrical Engineering, NSS Lab β€” Advisor: Prof. Seungwon Shin

  • Korea Advanced Institute of Science and Technology (KAIST)

    Mar 2021 – Feb 2023 Β· Daejeon, South Korea

    M.S. in Electrical Engineering, NSS Lab β€” Advisor: Prof. Seungwon Shin

  • Korea Advanced Institute of Science and Technology (KAIST)

    Mar 2016 – Feb 2021 Β· Daejeon, South Korea

    B.S. in Industrial & Systems Engineering; Double Major in Electrical Engineering

πŸ… Honors & Awards

  • 4th Prize, Cybersecurity Paper Competition

    Korean Association of Cybersecurity Studies, 2025

    Poisoning Expert to Compromise the Safety of Mixture-of-Experts LLMs

  • 2nd Prize, Cybersecurity Paper Competition

    Korean Association of Cybersecurity Studies, 2023

    Graph-based Deep Learning Framework for Credential Stuffing Risk Prediction

  • 4th Prize, Cybersecurity Paper Competition

    Korean Association of Cybersecurity Studies, 2023

    Delexicalized Distant Supervision for Illicit Drug Jargon Detection

  • 4th Prize, Cybersecurity Paper Competition

    Korean Association of Cybersecurity Studies, 2023

    Understanding the Occurrence and Impact of Credential Data Breach

  • Cum Laude

    KAIST, 2021

  • Academic Achievement Award: Salutatorian

    KAIST, Spring 2019

  • Dean’s List

    Industrial & Systems Engineering, KAIST, Spring 2019

🌐 Languages

  • Korean

    Native proficiency

  • English

    Fluent

βœ‰οΈ Contact

For collaborations or inquiries, please use the channels below.