Kuo Yang

About

I am a first-year PhD student and Research Assistant @Northeastern University College of Engineering, specializing in Computer Engineering under the guidance of Prof. Yun Raymond Fu @SmileLab. I am interested in Large Language Models (LLMs) & multimodal large language models (MLLMs), Graph Neural Networks (GNNs), and Generative AI.

Recent

🎉 Started PhD Internship at Sanofi, focusing on LLMs/MLLMs! 🎉

Last updated: May 15, 2025

Journey

2025

PhD Intern – LLM/MLLMs

Cambridge, MA

Sanofi

May 2025 - Dec 2025

Conducted in-depth research on long-context understanding, Retrieval-Augmented Generation (RAG), and hallucination in medical and pharmacy documents.
Utilized evaluation methods and fine-tuned LLM/MLLMs.

2024

PhD in Computer Engineering

Boston, MA

Northeastern University

Sep 2024 - Present

Supervisor: Prof. Yun Fu (Raymond)

2024

AI Infrastructure Engineer

San Francisco, CA

bitHuman Inc.

May 2024 - Sep 2024

Optimized LLM performance by refining Retrieval-Augmented Generation (RAG) pipelines and prompt design to improve response quality and relevance.
Designed and implemented real-time speech interruption mechanisms to enhance coherence in user-agent interactions.
Deployed scalable, containerized digital agents on Kubernetes, streamlining deployment and ensuring reliability across environments.

2023

AI Engineer

Boston, MA

AInnnovation Labs

Nov 2022 - Apr 2024

Human Body Measurement Project: Developed a robust anthropometric body measurement pipeline using the SMPL framework to extract precise measurements from 3D human mesh data.
Digital Human Project: Built a real-time digital agent by integrating TTS, STT, LLMs, and Speech-to-Image generation.
Sleep Apnea Project: Created a cross-platform Flutter app utilizing Firebase as backend to support couples managing sleep apnea.

2022

M.S. in Electrical and Computer Engineering

Boston, MA

Northeastern University

Sep 2022 - Apr 2024

Concentration: Machine Learning, Computer Vision, and Algorithms

2018

B.S. in Optoelectronic Information Science and Engineering

Shenzhen, China

Shenzhen University

Sep 2018 - Jul 2022

Activities: Class President, Officer of Academic Department
Honor: First Prize of Innovation and Entrepreneurship, First Prize of Outstanding Student Leader

Publications

Meta-Review Auto-Generation teaser image

Meta-Review Auto-Generation via Long-Context Modeling

Kuo Yang, Jianglin Lu, Shi Liang, Qihua Dong, Yun Fu

In Preparation for Submission to EMNLP 2025

Developed a novel approach for automated meta-review generation (MRAG) by fine-tuning LLMs. Proposed an advanced long-context training methodology designed to effectively manage and process extensive textual information inherent in MRAG tasks. Created MetaSum, the largest comprehensive dataset for meta-review generation, featuring automated updating and rigorous data cleaning policies, significantly improving the quality, coherence, and informativeness of auto-generated meta-reviews compared to state-of-the-art benchmarks.

[Paper] Coming Soon [Code] [Dataset]

Show abstract ▼

Meta-review auto-generation (MRAG) has emerged as a promising solution to accelerate academic publishing by automatically generating structured, concise meta-reviews with minimal human intervention. Although existing MRAG methods have shown feasibility, they are limited by data constraints, suboptimal modeling strategies, and inefficient training paradigms. In this paper, we propose a novel approach to MRAG that overcomes these limitations from three key perspectives. First, we introduce MetaSum, the largest and most up-to-date meta-review dataset, covering $31$ conferences, $225$ venues, and $52,993$ submissions across various research domains. Second, unlike prior methods that truncate lengthy reviews, we treat MRAG as a long-context modeling task to retain complete textual information. Third, we develop a multi-role, dialogue-style training framework with efficient end-to-end optimization, fully leveraging LLMs for meta-review generation. Extensive experiments demonstrate that our approach significantly outperforms existing MRAG baselines and state-of-the-art LLMs. Our dataset and code will be made open-source.

Elicit Spatial Reasoning in MLLM via Rule-based Reinforcement Fine-tuning

Qihua Dong, Kuo Yang, Handong Zhao, Yitian Zhang, Yun Fu

In Preparation for Submission to NeurIPS 2025

Investigated the phenomenon of "thinking collapse" in MLLMs when performing spatial reasoning tasks using Group-Relative Policy Optimization (GRPO) reinforcement learning. Proposed Reinforce with Spatial Reward (RSR), a novel method integrating spatially-grounded rewards and carefully designed prompts to maintain robust reasoning. Developed "Super-Ref," a new benchmark specifically designed to rigorously evaluate spatial reasoning abilities, demonstrating substantial improvements over existing methods.

[Paper] Coming Soon [Code] Coming Soon [Dataset] Coming Soon

Investigated the phenomenon of "thinking collapse" in MLLMs when performing spatial reasoning tasks using Grounded-Relative Policy Optimization (GRPO) reinforcement learning. Proposed Reinforce with Spatial Reward (RSR), a novel method integrating spatially-grounded rewards and carefully designed prompts to maintain robust reasoning. Developed "Super-Ref," a new benchmark specifically designed to rigorously evaluate spatial reasoning abilities, demonstrating substantial improvements over baseline methods.

Degree-Centric Structure Adaptation

Jianglin Lu, Hailing Wang, Yixuan Liu, Kuo Yang, Yun Fu

Submitted to ICML 2025

Proposed a novel data-centric framework, Degree-Centric Structure Adaptation (DCSA), to improve Graph Neural Network (GNN) performance on nodes with extreme degrees. Introduced supervision-guided attachment for isolated nodes and influence-altered detachment for high-degree super nodes, optimizing graph structures without altering model architectures or training parameters. Demonstrated through comprehensive analyses and experiments that this approach significantly enhances classification performance on both synthetic and real-world datasets.

[Paper] Coming Soon

Proposed a novel data-centric framework, Degree-Centric Structure Adaptation (DCSA), to improve Graph Neural Network (GNN) performance on nodes with various degrees. Introduced supervision-guided attachment for isolated nodes and influence-directed detachment for high-degree super nodes, optimizing graph structures without altering model architectures or training parameters. Demonstrated through comprehensive analyses and experiments that this approach significantly enhances classification performance on both synthetic and real-world datasets.

Projects

Human Body Measurements

Developed a robust anthropometric body measurement pipeline using the SMPL framework to extract precise measurements from 3D human mesh data. Built a Python module to calculate key body dimensions from control points, scaled according to height and refined using correction factors for improved accuracy and repeatability.

Real-time Digital Human

Built a real-time digital agent by integrating TTS, STT, LLMs, and Speech-to-Image generation, synchronizing all components to deliver a seamless, interactive user experience.

OurSleepKit

[iOS] [Android]

Created a cross-platform Flutter app utilizing Firebase as backend to support couples managing sleep apnea, promoting awareness and engagement while collecting valuable health data for research.

Kuo Yang

PhD | Research Assistant

About

Recent

Journey

PhD Intern – LLM/MLLMs

PhD in Computer Engineering

AI Infrastructure Engineer

AI Engineer

M.S. in Electrical and Computer Engineering

B.S. in Optoelectronic Information Science and Engineering

Publications

Projects