Top Language papers

Find all the top Language papers. Links to PDFs, code repos, and demos are provided.

Language

7 months ago

The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding

Shunchi Zhang, Dit Yan Yeung

Investigates the understanding capabilities of large language models (LLMs) through a task called PHYSICO, designed to assess their comprehension of p...

Language

over 5 years ago

Evolution Strategies Converges to Finite Differences

John C. Raisbeck, Matthew Allen, Ralph Weissleder + 2 more

Since the debut of Evolution Strategies (ES) as a tool for Reinforcement Learning by Salimans et al. 2017, there has been interest in determining the ...

Language

Computer Vision

6 months ago

MLGym: A New Framework and Benchmark for Advancing AI Research Agents

Lovish Madaan, Yoram Bachrach, Nikolay Bashlykov + 4 more

MLGym is a novel framework and benchmark designed to evaluate and develop large language model (LLM) agents on diverse AI research tasks, providing a ...

Language

over 3 years ago

NeuraHealth: An Automated Screening Pipeline to Detect Undiagnosed Cognitive Impairment in Electronic Health Records with Deep Learning and Natural Language Processing

Tanish Tyagi, Colin G. Magdamo, Ayush Noori + 22 more

Dementia-related cognitive impairment (CI) is a neurodegenerative disorder, affecting over 55 million people worldwide and growing rapidly at the rate...

Language

almost 4 years ago

Using Deep Learning to Identify Patients with Cognitive Impairment in Electronic Health Records

Tanish Tyagi, Colin G. Magdamo, Ayush Noori + 22 more

Dementia is a neurodegenerative disorder that causes cognitive decline and affects more than 50 million people worldwide. Dementia is under-diagnosed ...

Language

about 1 year ago

Defection-Free Collaboration between Competitors in a Learning System

Michael I. Jordan, Mariel Werner, Sai Praneeth Karimireddy

We study collaborative learning systems in which the participants are competitors who will defect from the system if they lose revenue by collaboratin...

Language

about 1 year ago

Fairness-Aware Meta-Learning via Nash Bargaining

Michael I. Jordan, Yi Zeng, Xuelin Yang + 4 more

To address issues of group-level fairness in machine learning, it is natural to adjust model parameters based on specific fairness objectives over a s...

Language

over 1 year ago

Towards a Theoretical Understanding of the 'Reversal Curse' via Training Dynamics

Stuart J. Russell, Michael I. Jordan, Hanlin Zhu + 4 more

Auto-regressive large language models (LLMs) show impressive capacities to solve many complex reasoning tasks while struggling with some simple logica...

Language

over 1 year ago

Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference

Michael I. Jordan, Wei-Lin Chiang, Lianmin Zheng + 8 more

Introduces Chatbot Arena, an innovative open platform designed for evaluating large language models (LLMs) based on human preferences through a crowds...

#10

Language

over 1 year ago

Iterative Data Smoothing: Mitigating Reward Overfitting and Overoptimization in RLHF

Michael I. Jordan, Banghua Zhu, Jiantao Jiao

Reinforcement Learning from Human Feedback (RLHF) is a pivotal technique that aligns language models closely with human-centric values. The initial ph...

#11

Language

Computer Vision

Graphs

about 7 years ago

L-Shapley and C-Shapley: Efficient Model Interpretation for Structured Data

ICLR • Jianbo Chen, Martin J. Wainwright, Michael I. Jordan + 1 more

Presents L-Shapley and C-Shapley, two innovative algorithms for instancewise feature importance scoring tailored to structured data, particularly wher...

#12

Language

about 4 years ago

The Stereotyping Problem in Collaboratively Filtered Recommender Systems

Michael I. Jordan, Wenshuo Guo, Karl Krauth + 1 more

Recommender systems play a crucial role in mediating our access to online information. We show that such algorithms induce a particular kind of stereo...
