Computer Science > Machine Learning
[Submitted on 13 May 2024 (v1), last revised 25 Jul 2024 (this version, v5)]
Title:The Platonic Representation Hypothesis
View PDF HTML (experimental)Abstract:We argue that representations in AI models, particularly deep networks, are converging. First, we survey many examples of convergence in the literature: over time and across multiple domains, the ways by which different neural networks represent data are becoming more aligned. Next, we demonstrate convergence across data modalities: as vision models and language models get larger, they measure distance between datapoints in a more and more alike way. We hypothesize that this convergence is driving toward a shared statistical model of reality, akin to Plato's concept of an ideal reality. We term such a representation the platonic representation and discuss several possible selective pressures toward it. Finally, we discuss the implications of these trends, their limitations, and counterexamples to our analysis.
Submission history
From: Tongzhou Wang [view email][v1] Mon, 13 May 2024 17:58:30 UTC (1,316 KB)
[v2] Sun, 21 Jul 2024 05:11:37 UTC (4,977 KB)
[v3] Tue, 23 Jul 2024 01:42:21 UTC (4,977 KB)
[v4] Wed, 24 Jul 2024 05:01:21 UTC (4,977 KB)
[v5] Thu, 25 Jul 2024 09:33:50 UTC (4,977 KB)
Current browse context:
cs.LG
References & Citations
export BibTeX citation
Loading...
Bibliographic and Citation Tools
Bibliographic Explorer (What is the Explorer?)
Connected Papers (What is Connected Papers?)
Litmaps (What is Litmaps?)
scite Smart Citations (What are Smart Citations?)