Paper Matrix

Byte Latent Transformer: Patches Scale Better Than Tokens (2024) NEW

Artidoro Pagnoni, Ram Pasunuru, Pedro Rodriguez, John Nguyen, Benjamin Muller, Margaret Li, Chunting Zhou, Lili Yu, Jason Weston, Luke Zettlemoyer, Gargi Ghosh, Mike Lewis, Ari Holtzman, Srinivasan Iyer

The Platonic Representation Hypothesis (2024)

Minyoung Huh, Brian Cheung, Tongzhou Wang, Phillip Isola

Helpless infants are learning a foundation model (2024)

Rhodri Cusack, Marc’Aurelio Ranzato, Christine J. Charvet

Position: LLMs Can’t Plan, But Can Help Planning in LLM-Modulo Frameworks (2024) NEW

Subbarao Kambhampati, Karthik Valmeekam, Lin Guan, Mudit Verma, Kaya Stechly, Siddhant Bhambri, Lucas Saldyt, Anil Murthy

$100K or 100 Days: Trade-offs when Pre-Training with Academic Resources (2023) NEW

Apoorv Khandelwal, Tian Yun, Nihal V. Nayak, Jack Merullo, Stephen H. Bach, Chen Sun, Ellie Pavlick

Convolutional architectures are cortex-aligned de novo (2023)

Atlas Kazemian, Eric Elmoznino, Michael F. Bonner

SemanticCMC: Contrastive Learning of Meaningful Object Associations from Temporal Co-occurrence Patterns in Naturalistic Movies (2023)

Cliona O’Doherty, Rhodri Cusack

Scaling Laws for Neural Language Models (2023) NEW

Jared Kaplan, Sam McCandlish, Tom Henighan, Tom B. Brown, Benjamin Chess, Rewon Child, Scott Gray, Alec Radford, Jeffrey Wu, Dario Amodei

Deep learning-based rigid motion correction for magnetic resonance imaging: A survey (2023) NEW

Yuchou Chang, Zhiqiang Li, Gulfam Saju, Hui Mao, Tianming Liu

Abductive Knowledge Induction From Raw Data (2021) NEW

Wang-Zhou Dai, Stephen Muggleton

Attention Is All You Need (2017)

Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Łukasz Kaiser, Illia Polosukhin

Principles of Philosophy (2017)

René Descartes

Why Brains Are Not Computers, Why Behaviorism Is Not Satanism, and Why Dolphins Are Not Aquatic Apes (2015) NEW

Louise Barrett

The remarkable, yet not extraordinary, human brain as a scaled-up primate brain and its associated cost (2012)

Suzana Herculano-Houzel

Universal Intelligence: A Definition of Machine Intelligence (2007) NEW

Shane Legg, Marcus Hutter

Centaur - TODO NEW

Authors

Gemini paper - TODO NEW

Authors

Scaling Monosemanticity - TODO NEW

Authors