Rayid Ghani

Distinguished Career Professor Carnegie Mellon University

  • Pittsburgh PA

Rayid Ghani is a reformed computer scientist who wants to increase the use of large-scale Machine Learning in solving large public policy.

Contact

Carnegie Mellon University

View more experts managed by Carnegie Mellon University

Biography

Rayid Ghani is a Distinguished Career Professor in the Machine Learning Department and the Heinz College of Information Systems and Public Policy at Carnegie Mellon University.

Rayid is a reformed computer scientist and wanna-be social scientist, but mostly just wants to increase the use of large-scale AI/Machine Learning/Data Science in solving large public policy and social challenges in a fair and equitable manner. Among other areas, Rayid works with governments and non-profits in policy areas such as health, criminal justice, education, public safety, economic development, and urban infrastructure. Rayid is also passionate about teaching practical data science and started the Data Science for Social Good Fellowship that trains computer scientists, statisticians, and social scientists from around the world to work on data science problems with social impact.

Before joining Carnegie Mellon University, Rayid was the Founding Director of the Center for Data Science & Public Policy, Research Associate Professor in Computer Science, and a Senior Fellow at the Harris School of Public Policy at the University of Chicago. Previously, Rayid was the Chief Scientist of the Obama 2012 Election Campaign where he focused on data, analytics, and technology to target and influence voters, donors, and volunteers. In his ample free time, Rayid obsesses over everything related to coffee and works with non-profits to help them with their data, analytics and digital efforts and strategy.

Areas of Expertise

Predictive Analytics
Machine Learning
Ethics
Public Policy
Information Systems

Media Appearances

A Disaster for American Innovation: The Trump administration is jeopardizing the AI boom.

The Atlantic  online

2025-04-11

Trump Administration cuts to science puts the U.S. at risk of losing ground as a leader in AI.“I don’t think anybody would seriously claim that these [AI breakthroughs] could have been done if the research universities in the U.S. didn’t exist at the same scale,” said Rayid Ghani (Heinz College).

View More

Artificial Intelligence Makes Energy Demand More Complex — And More Achievable

CMU News  online

2025-03-24

The work of individuals like Rayid Ghani, Distinguished Career Professor in the Machine Learning Department and the Heinz College of Information Systems and Public Policy, is one example of how this cross-disciplinary approach can look.

Ghani often looks at the applications of machine learning and artificial intelligence not exclusively in a climate context, but concerning a wide range of social and economic potential. His research primarily focuses on how to use the technology to promote social good in areas such as public health, economic development and urban infrastructure.

View More

The logic behind AI chatbots like ChatGPT is surprisingly basic

Popular Science  online

2023-08-22

Systems like ChatGPT can use only what they’ve gleaned from the web. “All it’s doing is taking the internet it has access to and then filling in what would come next,” says Rayid Ghani, a professor in the machine learning department at Carnegie Mellon University.

View More

Show All +

Social

Industry Expertise

Social Media
Education/Learning
Computer Software
Research

Accomplishments

Distinguished Young Alumni Award

2013

Sewanee - University of the South

American Statistical Association Harry V. Roberts Statistical Advocate of the Year Award

2017

Milbank Memorial Fund and AcademyHealth State and Local Innovation Prize

2018

Show All +

Education

Carnegie Mellon University

M.S.

Machine Learning

2001

University of the South

B.S (with Honors)

Computer Science, Mathematics

1999

Affiliations

  • ChangeLab Solutions : Board of Directors
  • The University of the South : Member Board of Regents
  • Hispanic Scholarship Fund : Technology Advisor
  • AI for Good Foundation : Steering Committee Member
  • Data Science for Social Good Foundation : Board Member
Show All +

Patents

Claims analytics engine

US8762180B2

2014-06-24

Methods and systems for processing claims (e.g., healthcare insurance claims) are described. For example, prior to payment of an unpaid claim, a prediction is made as to whether or not an attribute specified in the claim is correct. Depending on the prediction results, the claim can be flagged for an audit. Feedback from the audit can be used to update the prediction models in order to refine the accuracy of those models.

View more

User modification of generative model for determining topics and sentiments

US9015035B2

2013-01-17

A generative model is used to develop at least one topic model and at least one sentiment model for a body of text. The at least one topic model is displayed such that, in response, a user may provide user input indicating modifications to the at least one topic model. Based on the received user input, the generative model is used to provide at least one updated topic model and at least one updated sentiment model based on the user input. Thereafter, the at least one updated topic model may again be displayed in order to solicit further user input, which further input is then used to once again update the models. The at least one updated topic model and the at least one updated sentiment model may be employed to analyze target text in order to identify topics and associated sentiments therein.

View more

Classification-based redaction in natural language text

US8938386B2

2012-09-12

When redacting natural language text, a classifier is used to provide a sensitive concept model according to features in natural language text and in which the various classes employed are sensitive concepts reflected in the natural language text. Similarly, the classifier is used to provide an utility concepts model based on utility concepts. Based on these models, and for one or more identified sensitive concept and identified utility concept, at least one feature in the natural language text is identified that implicates the at least one identified sensitive topic more than the at least one identified utility concept. At least some of the features thus identified may be perturbed such that the modified natural language text may be provided as at least one redacted document. In this manner, features are perturbed to maximize classification error for sensitive concepts while simultaneously minimizing classification error in the utility concepts.

View more

Articles

A recommendation and risk classification system for connecting rough sleepers to essential outreach services

Data & Policy

2021

Rough sleeping is a chronic experience faced by some of the most disadvantaged people in modern society. This paper describes work carried out in partnership with Homeless Link (HL), a UK-based charity, in developing a data-driven approach to better connect people sleeping rough on the streets with outreach service providers. HL's platform has grown exponentially in recent years, leading to thousands of alerts per day during extreme weather events; this overwhelms the volunteer-based system they currently rely upon for the processing of alerts.

View more

An Empirical Comparison of Bias Reduction Methods on Real-World Problems in High-Stakes Policy Settings

ACM SIGKDD Explorations Newsletter

2021

Applications of machine learning (ML) to high-stakes policy settings - such as education, criminal justice, healthcare, and social service delivery - have grown rapidly in recent years, sparking important conversations about how to ensure fair outcomes from these systems. The machine learning research community has responded to this challenge with a wide array of proposed fairness-enhancing strategies for ML models, but despite the large number of methods that have been developed, little empirical work exists evaluating these methods in real-world settings.

View more

Empirical observation of negligible fairness–accuracy trade-offs in machine learning for public policy

Nature Machine Intelligence

2021

The growing use of machine learning in policy and social impact settings has raised concerns over fairness implications, especially for racial minorities. These concerns have generated considerable interest among machine learning and artificial intelligence researchers, who have developed new methods and established theoretical bounds for improving fairness, focusing on the source data, regularization and model training, or post-hoc adjustments to model scores.

View more

Show All +