A mathematical theory of relational generalization in transitive inference

Samuel Lippl; Kenneth Kay; Greg Jensen; Vincent P. Ferrera; L. F. Abbott

doi:10.1073/pnas.2314511121

A mathematical theory of relational generalization in transitive inference

Samuel Lippl, Kenneth Kay, Greg Jensen, Vincent P. Ferrera, L. F. Abbott

Zuckerman Institute

Producción científica › revisión exhaustiva

3 Citas (Scopus)

Resumen

Humans and animals routinely infer relations between different items or events and generalize these relations to novel combinations of items. This allows them to respond appropriately to radically novel circumstances and is fundamental to advanced cognition. However, how learning systems (including the brain) can implement the necessary inductive biases has been unclear. We investigated transitive inference (TI), a classic relational task paradigm in which subjects must learn a relation (A > B and B > C) and generalize it to new combinations of items (A > C). Through mathematical analysis, we found that a broad range of biologically relevant learning models (e.g. gradient flow or ridge regression) perform TI successfully and recapitulate signature behavioral patterns long observed in living subjects. First, we found that models with item-wise additive representations automatically encode transitive relations. Second, for more general representations, a single scalar "conjunctivity factor"determines model behavior on TI and, further, the principle of norm minimization (a standard statistical inductive bias) enables models with fixed, partly conjunctive representations to generalize transitively. Finally, neural networks in the "rich regime,"which enables representation learning and improves generalization on many tasks, unexpectedly show poor generalization and anomalous behavior on TI. We find that such networks implement a form of norm minimization (over hidden weights) that yields a local encoding mechanism lacking transitivity. Our findings show how minimal statistical learning principles give rise to a classical relational inductive bias (transitivity), explain empirically observed behaviors, and establish a formal approach to understanding the neural basis of relational abstraction.

Idioma original	English
Número de artículo	e2314511121
Publicación	Proceedings of the National Academy of Sciences of the United States of America
Volumen	121
N.º	28
DOI	https://doi.org/10.1073/pnas.2314511121
Estado	Published - jul. 9 2024

ASJC Scopus Subject Areas

General

Acceder al documento

10.1073/pnas.2314511121

Otros archivos y enlaces

Citar esto

Lippl, S., Kay, K., Jensen, G., Ferrera, V. P., & Abbott, L. F. (2024). A mathematical theory of relational generalization in transitive inference. Proceedings of the National Academy of Sciences of the United States of America, 121(28), Artículo e2314511121. https://doi.org/10.1073/pnas.2314511121

@article{32f9787dd79e44ba8cede9029e796cd6,

title = "A mathematical theory of relational generalization in transitive inference",

abstract = "Humans and animals routinely infer relations between different items or events and generalize these relations to novel combinations of items. This allows them to respond appropriately to radically novel circumstances and is fundamental to advanced cognition. However, how learning systems (including the brain) can implement the necessary inductive biases has been unclear. We investigated transitive inference (TI), a classic relational task paradigm in which subjects must learn a relation (A > B and B > C) and generalize it to new combinations of items (A > C). Through mathematical analysis, we found that a broad range of biologically relevant learning models (e.g. gradient flow or ridge regression) perform TI successfully and recapitulate signature behavioral patterns long observed in living subjects. First, we found that models with item-wise additive representations automatically encode transitive relations. Second, for more general representations, a single scalar {"}conjunctivity factor{"}determines model behavior on TI and, further, the principle of norm minimization (a standard statistical inductive bias) enables models with fixed, partly conjunctive representations to generalize transitively. Finally, neural networks in the {"}rich regime,{"}which enables representation learning and improves generalization on many tasks, unexpectedly show poor generalization and anomalous behavior on TI. We find that such networks implement a form of norm minimization (over hidden weights) that yields a local encoding mechanism lacking transitivity. Our findings show how minimal statistical learning principles give rise to a classical relational inductive bias (transitivity), explain empirically observed behaviors, and establish a formal approach to understanding the neural basis of relational abstraction.",

author = "Samuel Lippl and Kenneth Kay and Greg Jensen and Ferrera, {Vincent P.} and Abbott, {L. F.}",

note = "Publisher Copyright: Copyright {\textcopyright} 2024 the Author(s).",

year = "2024",

month = jul,

day = "9",

doi = "10.1073/pnas.2314511121",

language = "English",

volume = "121",

journal = "Proceedings of the National Academy of Sciences of the United States of America",

issn = "0027-8424",

number = "28",

}

TY - JOUR

T1 - A mathematical theory of relational generalization in transitive inference

AU - Lippl, Samuel

AU - Kay, Kenneth

AU - Jensen, Greg

AU - Ferrera, Vincent P.

AU - Abbott, L. F.

PY - 2024/7/9

Y1 - 2024/7/9

N2 - Humans and animals routinely infer relations between different items or events and generalize these relations to novel combinations of items. This allows them to respond appropriately to radically novel circumstances and is fundamental to advanced cognition. However, how learning systems (including the brain) can implement the necessary inductive biases has been unclear. We investigated transitive inference (TI), a classic relational task paradigm in which subjects must learn a relation (A > B and B > C) and generalize it to new combinations of items (A > C). Through mathematical analysis, we found that a broad range of biologically relevant learning models (e.g. gradient flow or ridge regression) perform TI successfully and recapitulate signature behavioral patterns long observed in living subjects. First, we found that models with item-wise additive representations automatically encode transitive relations. Second, for more general representations, a single scalar "conjunctivity factor"determines model behavior on TI and, further, the principle of norm minimization (a standard statistical inductive bias) enables models with fixed, partly conjunctive representations to generalize transitively. Finally, neural networks in the "rich regime,"which enables representation learning and improves generalization on many tasks, unexpectedly show poor generalization and anomalous behavior on TI. We find that such networks implement a form of norm minimization (over hidden weights) that yields a local encoding mechanism lacking transitivity. Our findings show how minimal statistical learning principles give rise to a classical relational inductive bias (transitivity), explain empirically observed behaviors, and establish a formal approach to understanding the neural basis of relational abstraction.

AB - Humans and animals routinely infer relations between different items or events and generalize these relations to novel combinations of items. This allows them to respond appropriately to radically novel circumstances and is fundamental to advanced cognition. However, how learning systems (including the brain) can implement the necessary inductive biases has been unclear. We investigated transitive inference (TI), a classic relational task paradigm in which subjects must learn a relation (A > B and B > C) and generalize it to new combinations of items (A > C). Through mathematical analysis, we found that a broad range of biologically relevant learning models (e.g. gradient flow or ridge regression) perform TI successfully and recapitulate signature behavioral patterns long observed in living subjects. First, we found that models with item-wise additive representations automatically encode transitive relations. Second, for more general representations, a single scalar "conjunctivity factor"determines model behavior on TI and, further, the principle of norm minimization (a standard statistical inductive bias) enables models with fixed, partly conjunctive representations to generalize transitively. Finally, neural networks in the "rich regime,"which enables representation learning and improves generalization on many tasks, unexpectedly show poor generalization and anomalous behavior on TI. We find that such networks implement a form of norm minimization (over hidden weights) that yields a local encoding mechanism lacking transitivity. Our findings show how minimal statistical learning principles give rise to a classical relational inductive bias (transitivity), explain empirically observed behaviors, and establish a formal approach to understanding the neural basis of relational abstraction.

UR - http://www.scopus.com/inward/record.url?scp=85197814589&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85197814589&partnerID=8YFLogxK

U2 - 10.1073/pnas.2314511121

DO - 10.1073/pnas.2314511121

M3 - Article

C2 - 38968113

AN - SCOPUS:85197814589

SN - 0027-8424

VL - 121

JO - Proceedings of the National Academy of Sciences of the United States of America

JF - Proceedings of the National Academy of Sciences of the United States of America

IS - 28

M1 - e2314511121

ER -

A mathematical theory of relational generalization in transitive inference

Resumen

ASJC Scopus Subject Areas

Acceder al documento

Otros archivos y enlaces

Huella

Citar esto