Graham Neubig

Associate Professor, Carnegie Mellon University

Pittsburgh, PA

Graham Neubig's research is concerned with language and its role in human communication.

Contact

Carnegie Mellon University

Biography

Graham Neubig's research is concerned with language and its role in human communication. In particular, his long-term research goal is to break down barriers in human-human or human-machine communication through the development of natural language processing (NLP) technologies. This includes the development of technology for machine translation, which helps break down barriers in communication for people who speak different languages, and natural language understanding, which helps computers understand and respond to human language. Within this overall goal of breaking down barriers to human communication, he has focused on several aspects of language that both make it interesting as a scientific subject and hold potential for the construction of practical systems.

Areas of Expertise

Machine Learning

Natural Language Processing

Machine Translation

Spoken Language Processing

Media Appearances

AI isn't ready to do your job

Business Insider online

2025-04-22

AI agents aren't ready to do your job. Researchers at CMU staffed a fake company with AI agents, and the results were disastrous. "While agents may be used to accelerate some portion of the tasks that human workers are doing, they are likely not a replacement for all tasks at the moment," said Graham Neubig (School of Computer Science).

Where DeepL Beats ChatGPT in Machine Translation with Graham Neubig

Slator online

2023-07-14

In this week’s SlatorPod, we are joined by Graham Neubig, Associate Professor of Computer Science at Carnegie Mellon University, to discuss his research on multilingual natural language processing (NLP) and machine translation (MT).

Angry Bing chatbot just mimicking humans, say experts

ARY News online

2023-02-18

“I think this is basically mimicking conversations that it’s seen online,” said Graham Neubig, an associate professor at Carnegie Mellon University’s Language Technologies Institute.

Industry Expertise

Education/Learning

Research

Education

Kyoto University

Ph.D.

Informatics

2012

Kyoto University

M.S.

Informatics

2010

University of Illinois Urbana-Champaign

B.S.

Computer Science

2005

Articles

Divergences between Language Models and Human Brains

Advances in Neural Information Processing Systems

2024

Do machines and humans process language in similar ways? Recent research has hinted at the affirmative, showing that human neural activity can be effectively predicted using the internal representations of language models (LMs). Although such results are thought to reflect shared computational principles between LMs and human brains, there are also clear differences in how LMs and humans represent and use language. In this work, we systematically explore the divergences between human and machine language processing by examining the differences between LM representations and human brain responses to language as measured by Magnetoencephalography (MEG) across two datasets in which subjects read and listened to narrative stories. Using an LLM-based data-driven approach, we identify two domains that LMs do not capture well: social/emotional intelligence and physical commonsense.

Do LLMs Exhibit Human-Like Response Biases? A Case Study in Survey Design

Transactions of the Association for Computational Linguistics

2024

One widely cited barrier to the adoption of LLMs as proxies for humans in subjective tasks is their sensitivity to prompt wording—but interestingly, humans also display sensitivities to instruction changes in the form of response biases. We investigate the extent to which LLMs reflect human response biases, if at all. We look to survey design, where human response biases caused by changes in the wordings of “prompts” have been extensively explored in social psychology literature. Drawing from these works, we design a dataset and framework to evaluate whether LLMs exhibit human-like response biases in survey questionnaires. Our comprehensive evaluation of nine models shows that popular open and commercial LLMs generally fail to reflect human-like behavior, particularly in models that have undergone RLHF.

DIRE and its data: Neural decompiled variable renamings with respect to software class

ACM Transactions on Software Engineering and Methodology

2023

The decompiler is one of the most common tools for examining executable binaries without the corresponding source code. It transforms binaries into high-level code, reversing the compilation process. Unfortunately, decompiler output is far from readable because the decompilation process is often incomplete. State-of-the-art techniques use machine learning to predict missing information like variable names. While these approaches are often able to suggest good variable names in context, no existing work examines how the selection of training data influences these machine learning models. We investigate how data provenance and the quality of training data affect performance, and how well, if at all, trained models generalize across software domains. We focus on the variable renaming problem using one such machine learning model, DIRE.

Other Media Appearances

The Latest in Translation Devices

Other Articles

AmericasNLI: Machine translation and natural language inference systems for Indigenous languages of the Americas

Can we automate scientific reviewing?