The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding
Investigates the understanding capabilities of large language models (LLMs) through a task called PHYSICO, designed to assess their comprehension of p...