Qianyu He

Qianyu He

Ph.D. Candidate

Fudan University

Biography

Qianyu He (何千羽) is currently a third-year PhD candidate at Fudan University in the School of Computer Science, Shanghai, China. Her previous work focused on the Creative Generation of Language Models:

  • Creative Generation: Integrating linguistic theory and commonsense knowledge into language model understanding and generation, with an emphasis on figurative language. The aim is to enable language models to generate creative responses reflecting human cognition, such as similes and metaphors.

Currently, her interested research topics are mostly around the Enhancement of Large Language Models(LLMs)’ Instruction Following and Reasoning ability:

  • Instruction Following: Developing advanced methods for LLMs to understand complex instructions, particularly those with multiple constraints (e.g., format, semantic constraints). The aim is to enhance LLMs’ understanding of diverse and complex human instructions, enabling them to excel in practical applications beyond traditional NLP tasks.
  • Mathematical Reasoning: Improving LLMs’ reasoning ability fundamentally without external support, with an emphasis on mathematical reasoning. This involves enhancing foundational skills of complex reasoning tasks, such as numerical computation and unit conversion through training.

(Download my resumé. The last update was on 2024-04-30.)

Interests
  • Instruction Following
  • Creative Generation
  • Large Language Models
Education
  • PhD in CS, 2021-2026 (estimated)

    Fudan University

  • B.S. in CS, 2017-2021

    Fudan University

News

Awards

Intel Fellowship
Outstanding Academic Scholarship for Master Students
Outstanding Graduates in Shanghai
China National Scholarship

Recent Publications

Quickly discover relevant content by filtering publications.
(2023). Can Large Language Models Understand Real-World Complex Instructions?. In The 38th Annual AAAI Conference on Artificial Intelligence (AAAI 2024).

PDF Code

(2023). Enhancing Quantitative Reasoning Skills of Large Language Models through Dimension Perception. In The IEEE International Conference on Data Engineering (ICDE 2024).

PDF

(2023). HAUSER: Towards Holistic and Automatic Evaluation of Simile Generation. In The 61st Annual Meeting of the Association for Computational Linguistics (ACL 2023).

PDF Code

(2023). BBT-Fin: Comprehensive Construction of Chinese Financial Domain Pre-trained Language Model, Corpus and Benchmark. Preprint.

PDF Code

Projects

CuteGPT
An open-source conversational language model developed by the Knowledge Works Research Laboratory at Fudan University.
CuteGPT