Jiliang Li (Preferred Name: Eric, 中文:李济良) is a second-year master’s student in computer science at Stanford University. He is a researcher as well as a full-stack developer.

His research interest lies at the intersection of Machine Learning, Software Engineering, and Security. Most recently, he worked on AI cybersecurity agent benchmarking at SAIL. Before Stanford, he worked with Prof. Yu Huang and Prof. Kevin Leach at Vanderbilt University on AI for Software Engineering (AI4SE) and LLMs for code (LLM4Code).

As a full-stack developer, he is most proficient with the React ecosystem, integrated with versatile backends. He has previously worked as a software engineer intern at Meta and Wave Dynamics Inc., where he developed full-stack products across messaging, ads, and ticketing at scale.

Interests
  • Artificial Intelligence for Software Engineering (AI4SE)
  • LLM4Code
  • Full-Stack Web Development
Education
  • M.S. in Computer Science, 2026

    Stanford University

  • B.S. in Computer Science and Mathematics, 2024

    Vanderbilt University

A Two-Fold Adventure

My adventure in computer science has been defined by both research and full-stack development.
Research
Machine Learning

NLP, Graph ML, AI for Programming Languages (LLM4Code), Human-Centered AI

Software Engineering

Binary Code Analysis, Program Analysis, Code Comprehension, CS Education

Software Development
Full-Stack Web Development

React.js, Next.js, Node.js, REST API, Flask, Spring, Firebase, SQL, AWS, Git/GitHub

Android Development

Android Studio

Experience

 
 
 
 
 
Meta Platforms Inc.
Software Engineer Intern
Meta Platforms Inc.
June 2025 – September 2025 Menlo Park, CA

Designed and launched 7 features for Meta Business Messaging, connecting advertisers and customers via WhatsApp.

  • Implemented ad creation and messaging functionalities, including multi-WhatsApp-number chat support, ad format previews, advertiser options defaulting, WhatsApp input expansion in new bid workflows, ad delivery matcher updates, and more.
  • Built full-stack web and mobile solutions using React.js, Relay, and GraphQL, and backend queries and services in Hack.
  • Owned product lifecycles end-to-end by driving cross-functional design, collaboration, and experimental analysis.
  • Led product decisions on features reaching 300K+ total advertisers.
 
 
 
 
 
The Stanford Artificial Intelligence Laboratory (SAIL)
Research Assistant
The Stanford Artificial Intelligence Laboratory (SAIL)
January 2025 – June 2025 Stanford, CA

Developed BountyBench, a cybersecurity bug bounty benchmark for AI agents (Principal Investigator: Percy Liang).

  • Introduced the benchmark featuring 25 systems from complex, real-world codebases, and 40 bug bounties covering 9 of the OWASP Top 10 Risks, to assess LLM agents’ ability to exploit and patch security vulnerabilities.
  • Built an automated verification pipeline to validate LLM-generated patches that preserve invariant functionalities in code.
  • Designed end-to-end LLM agents capable of real-world security tasks, optimized with prompting, memories, and reflection.
 
 
 
 
 
Wave Dynamics Inc.
Software Engineer Intern
Wave Dynamics Inc.
May 2024 – September 2024 Nashvile, TN

Designed and developed the Wave Dashboard:

  • Developed a full-stack ticket sales management dashboard generating $120,000+ monthly revenue using Next.js and Node.js.
  • Created key features such as event-editing and real-time ticket tracking, leveraging Firebase, Google OAuth, and Cloud Run for scalable data storage, retrieval, and access management.
 
 
 
 
 
Vanderbilt University
Undergraduate Researcher
Vanderbilt University
May 2022 – September 2024 Nashville, TN

Supervised by Prof. Yu Huang and Prof. Kevin Leach:

  • Reinforcement Learning for LLMs for Code: Generating negative code samples using RL with compiler-in-the-loop. More details coming up soon.
  • Few-shot Malware Classification: Designed a semi-supervised malware classifier, pioneering efficient & semantics-aware malware data augmentation through information retrieval, invariance learning, and alignment techniques. The classifier achieved SOTA results with 5.60% improvement in few-shot settings and 11.44% improvement under concept-drift.
  • Merging Human & Transformer Attention: Implemented a Transformer-based NLP model whose self-attention weights are refined using human attention data captured through eye-tracking. The model is applied to neural code summarization tasks with a 29.91% performance improvement, showcasing the potential of integrating human attention into neural models.
  • LLM Feature Attribution Analysis: Comprehensively evaluated feature attribution in six LLMs (GPT, Llama 2, StarCoder, etc.) in code summarization through Shapley-value analysis; compared against human eye-tracking attention patterns.
 
 
 
 
 
University of Illinois Urbana-Champaign
Summer Research Intern
University of Illinois Urbana-Champaign
May 2023 – September 2023 Champaign, IL

Supervised by Prof. Lingming Zhang:

  • Designed a universal LLM-based fuzzer targeting compilers of any programming language, with LLM-mutated code inputs integrated in the fuzzing loop; tested on 7 languages with significantly improved coverage and uncovered 76 new bugs.
  • Specialized in JIT vulnerabilities within the V8 JavaScript engine; refined fuzz testing algorithms for targeted exploitation; offered expertise in instrumentation, auto-prompting, and compiler and systems programming for vulnerability pinpointing.
 
 
 
 
 
Centurium Capital
Software Engineer Intern
Centurium Capital
June 2021 – July 2021 Beijing, CN

Post-Investment Management Team:

  • Worked on the post-investment team to eliminate future manual processing of three portfolio companies’ operational data.
  • Implemented a full-stack web service in React JS, Flask, and MySQL for robust client data catalog collection and storage.
  • Automated the conversion of raw operational data into financial statements using OpenPyXL, Pandas, NumPy, and Seaborn.
  • Built an interactive sales performance visualization interface using D3 for real-time strategic insights.

Publications

(2025). BountyBench: Dollar Impact of AI Agent Attackers and Defenders on Real-World Cybersecurity Systems. In Proceedings of the 39th Conference on Neural Information Processing Systems (NeurIPS), 2025.

PDF Cite Code Source Document

(2025). MalMixer: Few-Shot Malware Classification with Retrieval-Augmented Semi-Supervised Learning. In Proceedings of the 10th IEEE European Symposium on Security and Privacy (Euro S&P), 2025..

PDF Cite

(2025). A Comparative Study on ChatGPT and Checklist as Support Tools for Unit Testing Education. In Proceedings of the 33rd ACM International Conference on the Foundations of Software Engineering (FSE), 2025.

PDF Cite

(2024). EyeTrans: Merging Human and Machine Attention for Neural Code Summarization. In Proceedings of the ACM International Conference on the Foundations of Software Engineering (FSE), 2024.

PDF Cite Code

(2024). Do Machines and Humans Focus on Similar Code? Exploring Explainability of Large Language Models in Code Summarization. In Proceedings of the 32nd IEEE/ACM International Conference on Program Comprehension (ICPC), 2024.

PDF Cite Slides

Projects

.js-id-machine-learning
Adversarial RL for Hard-Negative Code Generation
Generating hard negative code examples using PPO with a GAN-style discriminator.
Adversarial RL for Hard-Negative Code Generation
SIREN for Medical Imaging
Sinusoidal Representation Networks for EEG-fMRI Translation.
SIREN for Medical Imaging
News Summarization with Continual Learning
Combatting concept drift in news articles via continual learning.
News Summarization with Continual Learning

Teaching

Vanderbilt University
Principles of Software Engineering (CS4278)
Teaching Assistant | Spring 2024 | Instructor: Prof. Yu Huang.
Vanderbilt University
Programming and Problem Solving (CS1101)
Teaching Assistant | Fall 2022, Spring 2023, and Fall 2023 | Instructors: Prof. Robert Tairas, Prof. Gina Bai, and Prof. Csaba Biegl.

Contact

  • ericlij [-at-] stanford [-dot-] edu