Simplicity bias and optimization threshold in two-layer ReLU networks

Published:
