Hello. I am a computer science Ph.D. candidate in the Courant Institute of Mathematical Sciences at New York University. I am a member of the Machine Learning for Language (ML²) group (subset of the CILVR group). I am advised by Prof. He He and Prof. Kyunghyun Cho. I also frequently collaborated with Prof. Sam Bowman. Starting in October 2022, I will be a Visiting Researcher at Meta AI (FAIR) mentored by Dr. Jason Weston and Dr. Stephen Roller.

My research focuses on natural langauge processing and machine learning. Specifically, recent interests include text generation, structured prediction, and long text understanding.

Prior to my Ph.D., I graduated from the University of Chicago (B.S. in mathematics and B.S. in computer science). In Chicago, my advisor was Prof. Kevin Gimpel at Toyota Technological Institute at Chicago (TTIC) and the University of Chicago. In Summer 2020, I was a Research Intern at Google Research New York; in Summer 2021, I was a Research Intern at Google Brain.

Research

Primary research subfields: text generation (including machine translation), structured prediction, language understanding, pretraining, and others. [semantic scholar] [google scholar] [dblp] [abbreviations]


Publications and preprints (2021-)

What Do NLP Researchers Believe? Results of the NLP Community Metasurvey
Julian Michael, Ari Holtzman, Alicia Parrish, Aaron Mueller, Alex Wang, Angelica Chen, Divyam Madaan, Nikita Nangia, Richard Yuanzhe Pang, Jason Phang, Samuel R. Bowman
Preprint, August 2022
[paper] [website] [bibtex]

Amortized Noisy Channel Neural Machine Translation
Richard Yuanzhe Pang, He He, Kyunghyun Cho
In Proceedings of INLG 2022; best presentation
tl;dr: amortizing the inference cost of "beam search and rerank" – learning to rerank without explicitly reranking
[paper] [talk] [poster] [bibtex]

SQuALITY: Building a Long-Document Summarization Dataset the Hard Way
Alex Wang, Richard Yuanzhe Pang, Angelica Chen, Jason Phang, Samuel R. Bowman
Preprint, May 2022
[paper] [code] [bibtex]

QuALITY: Question Answering with Long Input Texts, Yes!
Richard Yuanzhe Pang*, Alicia Parrish*, Nitish Joshi*, Nikita Nangia, Jason Phang, Angelica Chen, Vishakh Padmakumar, Johnny Ma, Jana Thompson, He He, Samuel R. Bowman
In Proceedings of NAACL 2022
[paper] [abstract] [data] [code] [leaderboard] [15-min live talk] [slides] [bibtex] | by others: [tfds] [forecast] [press] [scrolls]

Token Dropping for Efficient BERT Pretraining
Le Hou*, Richard Yuanzhe Pang*, Tianyi Zhou, Yuexin Wu, Xinying Song, Xiaodan Song, Denny Zhou
In Proceedings of ACL 2022
[paper] [abstract] [code] [talk] [bibtex]

AgreeSum: Agreement-Oriented Multi-Document Summarization
Richard Yuanzhe Pang*, Adam D. Lelkes*, Vinh Q. Tran*, Cong Yu
In Findings of ACL 2021
[paper] [abstract] [data] [short talk] [bibtex]

Comparing Test Sets with Item Response Theory
Clara Vania*, Phu Mon Htut*, William Huang*, Dhara Mungra, Richard Yuanzhe Pang, Jason Phang, Haokun Liu, Kyunghyun Cho, Samuel R. Bowman
In Proceedings of ACL 2021
[paper] [abstract] [code] [bibtex]

Text Generation by Learning from Demonstrations
Richard Yuanzhe Pang, He He
In Proceedings of ICLR 2021
tl;dr: a high-precision-generation training objective to address the train/test objective mismatch and history mismatch
[paper] [abstract] [openreview] [poster] [slides] [code] [discussion] [bibtex]

Publications (-2020)

Improving Joint Training of Inference Networks and Structured Prediction Energy Networks
Lifu Tu, Richard Yuanzhe Pang, Kevin Gimpel
In Proceedings of EMNLP 2020 Workshop on Structured Prediction for NLP (SPNLP); spotlight paper
tl;dr: improving fast approximate+amortized inference for energy-based models in NLP structured prediction
[paper] [abstract] [my slides] [bibtex]

Consistency of a Recurrent Language Model With Respect to Incomplete Decoding
Sean Welleck*, Ilia Kulikov*, Jaedeok Kim, Richard Yuanzhe Pang, Kyunghyun Cho
In Proceedings of EMNLP 2020
Also appearing in the non-archival DeepMath 2020
[paper] [abstract] [code] [bibtex]

ENGINE: Energy-Based Inference Networks for Non-Autoregressive Machine Translation
Lifu Tu, Richard Yuanzhe Pang, Sam Wiseman, Kevin Gimpel
In Proceedings of ACL 2020
tl;dr: a "soft" form of knowledge distillation for non-autoregressive MT
[paper] [abstract] [code] [bibtex]

Intermediate-Task Transfer Learning with Pretrained Language Models: When and Why Does It Work?
Yada Pruksachatkun*, Jason Phang*, Haokun Liu*, Phu Mon Htut*, Xiaoyi Zhang, Richard Yuanzhe Pang, Clara Vania, Katharina Kann, Samuel R. Bowman
In Proceedings of ACL 2020
[paper] [abstract] [bibtex]

Unsupervised Evaluation Metrics and Learning Criteria for Non-Parallel Textual Transfer
Richard Yuanzhe Pang, Kevin Gimpel
In Proceedings of EMNLP 2019 Workshop on Neural Generation and Translation (WNGT)
tl;dr: proposing more dimensions for textual transfer evaluation metrics, and losses that target them
[paper] [supplementals] [abstract] [poster] [bibtex]

The Daunting Task of Real-World Textual Style Transfer Auto-Evaluation
Richard Yuanzhe Pang
Extended abstract in EMNLP 2019 Workshop on Neural Generation and Translation (WNGT); abstract in Proceedings of the Workshop on Noisy User-generated Text (W-NUT)
tl;dr: an opinion piece arguing that the research on textual style transfer and its evaluation are going astray
[paper] [abstract] [poster] [bibtex]

More info: [semantic scholar] [google scholar] [dblp] [abbreviations]

Discussion


Discussion of GOLD [pdf]
June 2022
tl;dr: GOLD does not maximize the expected reward. It maximizes the expected reward of training examples only.

More research activities


As a reviewer / program committee member

  • AAAI (2023), ACL Rolling Review (10,11/2021; 01,02/2022), ACL (2021), EMNLP (2021, 2022), ICLR (2022), ICLR blog post track (2022), ICML (2022), NeurIPS (2021 — top 8% reviewer, 2022), Transactions on Machine Learning Research (TMLR; 2022)
  • Workshops: Novel Ideas in Learning-to-Learn through Interaction (NILLI 2021 at EMNLP 2021, NILLI 2022 at EMNLP 2022), Efficient Benchmarking in NLP (NLP Power at ACL 2022)
  • Other events: Mid-Atlantic Student Colloquium on Speech, Language, and Learning (2022)

Teaching


External

  • May 2022, Teaching Assistant / Lab Instructor (virtual), African Masters of Machine Intelligence (course: Deep Learning for NLP by Prof. Kyunghyun Cho and Prof. Duygu Ataman) [AMMI site]

At New York University

  • Spring 2022, Section Leader / Teaching Assistant (in-person), DS-GA 1012 / LING-GA 1012: Natural Language Understanding and Computational Semantics (Bowman; graduate-level) [syllabus]
  • January 2022, Co-Instructor / Teaching Assistant (virtual), NYU AI School 2022 [site]
  • Fall 2020, Section Leader (in-person), DS-GA 1008: Deep Learning (Cho, LeCun; graduate-level) [syllabus]

At the University of Chicago

  • Spring 2017, Course Assistant, MATH 15910: Intro to Proofs in Analysis
  • Winter 2017, Course Assistant, MATH 15910: Intro to Proofs in Analysis [sol]
  • Winter 2017, Grader, CMSC 15200: Intro to Computer Science II
  • Autumn 2016, Teaching Assistant, MATH 15300: Calculus III

Presentations


Selected presentations

  • Talk titled QuALITY: Question Answering with Long Input Texts, Yes!; NAACL 2022 in Seattle, USA; July 2022 [live talk]
  • Talk on question answering data collection; Apple; December 2021
  • Talk titled Text Generation by Learning from Demonstrations; Samsung workshop; June 2021 [based on this slide deck]
  • Talk on structured prediction; Bank of New York Mellon; September 2020 [based on this slide deck]
  • Talk titled Text Generation by Offline RL; Google Research New York; July 2020
  • Poster presentation on Learning Criteria and Evaluation Metrics for Textual Transfer between Non-Parallel Corpora; NAACL 2019 NeuralGen workshop in Minneapolis, USA; June 2019
  • Talk titled Learning Approximate Inference Networks and Structured Prediction Energy Networks; Midwest Speech and Language Days (MSLD) 2019 in Chicago, USA; May 2019
  • Poster presentation on Learning Criteria and Evaluation Metrics for Textual Transfer between Non-Parallel Corpora; UChicago STEM Research Symposium in Chicago, USA; October 2018

Other conference presentations with associated proceeding papers

    Please email for full CV.



Last updated: September 22, 2022. Contact: My NYU office is at 60 5th Ave. Get in touch at yzpang at _ dot edu (where _ is nyu or uchicago)!