Beyond the imitation game: Quantifying and extrapolating the capabilities of language models A Srivastava, A Rastogi, A Rao, AAM Shoeb, A Abid, A Fisch, AR Brown, ... arXiv preprint arXiv:2206.04615, 2022 | 976 | 2022 |
Simplified Josephson-junction fabrication process for reproducibly high-performance superconducting qubits A Osman, J Simon, A Bengtsson, S Kosen, P Krantz, D P Lozano, ... Applied Physics Letters 118 (6), 2021 | 83 | 2021 |
Benign, tempered, or catastrophic: Toward a refined taxonomy of overfitting N Mallinar, J Simon, A Abedsoltan, P Pandit, M Belkin, P Nakkiran Advances in Neural Information Processing Systems 35, 1182-1195, 2022 | 56 | 2022 |
The eigenlearning framework: A conservation law perspective on kernel regression and wide neural networks JB Simon, M Dickens, D Karkada, MR DeWeese arXiv preprint arXiv:2110.03922, 2021 | 43* | 2021 |
On the stepwise nature of self-supervised learning JB Simon, M Knutins, L Ziyin, D Geisz, AJ Fetterman, J Albrecht International Conference on Machine Learning, 31852-31876, 2023 | 23 | 2023 |
Avalon: A benchmark for RL generalization using procedurally generated worlds J Albrecht, A Fetterman, B Fogelman, E Kitanidis, B Wróblewski, N Seo, ... Advances in Neural Information Processing Systems 35, 12813-12825, 2022 | 18 | 2022 |
SGD with a constant large learning rate can converge to local maxima L Ziyin, B Li, JB Simon, M Ueda arXiv preprint arXiv:2107.11774, 2021 | 14* | 2021 |
More is better in modern machine learning: when infinite overparameterization is optimal and overfitting is obligatory JB Simon, D Karkada, N Ghosh, M Belkin arXiv preprint arXiv:2311.14646, 2023 | 12 | 2023 |
Reverse engineering the neural tangent kernel JB Simon, S Anand, M DeWeese International Conference on Machine Learning, 20215-20231, 2022 | 12 | 2022 |
A spectral condition for feature learning G Yang, JB Simon, J Bernstein arXiv preprint arXiv:2310.17813, 2023 | 10 | 2023 |
Critical point-finding methods reveal gradient-flat regions of deep network losses CG Frye, J Simon, NS Wadia, A Ligeralde, MR DeWeese, KE Bouchard Neural Computation 33 (6), 1469-1497, 2021 | 9 | 2021 |
Interleaved electro-optic dual comb generation to expand bandwidth and scan rate for molecular spectroscopy and dynamics studies near 1.6 µm JR Stroud, JB Simon, GA Wagner, DF Plusquellic Optics Express 29 (21), 33155-33170, 2021 | 8 | 2021 |
Tune as you scale: Hyperparameter optimization for compute efficient training AJ Fetterman, E Kitanidis, J Albrecht, Z Polizzi, B Fogelman, M Knutins, ... arXiv preprint arXiv:2306.08055, 2023 | 4 | 2023 |
An agnostic view on the cost of overfitting in (kernel) ridge regression L Zhou, JB Simon, G Vardi, N Srebro arXiv preprint arXiv:2306.13185, 2023 | 3 | 2023 |
On Kernel Regression with Data-Dependent Kernels JB Simon arXiv preprint arXiv:2209.01691, 2022 | 2 | 2022 |
Rapid Passage Signals from CO2 at 1.6 µm Using a Dual Chirped-Pulse Electro-Optic Comb System with High-Order Interleaving JR Stroud, J Simon, GA Wagner, DF Plusquellic CLEO: Science and Innovations, SM3A.1, 2021 | 2 | 2021 |
Fast noise-resistant control of donor nuclear spin qubits in silicon J Simon, FA Calderon-Vargas, E Barnes, SE Economou Physical Review B 101 (20), 205307, 2020 | 2 | 2020 |
More is Better: when Infinite Overparameterization is Optimal and Overfitting is Obligatory JB Simon, D Karkada, N Ghosh, M Belkin The Twelfth International Conference on Learning Representations | 1 | |
Les Houches Lectures on Deep Learning at Large & Infinite Width Y Bahri, B Hanin, A Brossollet, V Erba, C Keup, R Pacelli, JB Simon arXiv preprint arXiv:2309.01592, 2023 | | 2023 |
Fast noise-resistant control for nuclear spin qubits in silicon J Simon, FA Calderon-Vargas, E Barnes, SE Economou arXiv preprint arXiv:2001.10029, 2020 | | 2020 |