Hacker News
gaogao
4 months ago
on:
Training AI models might not need enormous data ce...
The intuition is that smaller models are still figuring out things like grammar, where the whole model comes into play, whereas larger models, especially in the back half of training, make localized knowledge updates that merge more easily in the AllReduce.
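A toy sketch of that intuition, not anything from the linked article: assume "localized" means each worker's update touches its own disjoint slice of the parameters, and treat the AllReduce as a plain element-wise sum. The helper names (`make_updates`, `allreduce_sum`, `interference`) are made up for illustration. Under those assumptions, localized updates pass through the merge untouched, while dense updates get mixed with everyone else's.

```python
import random

def make_updates(num_workers, dim, localized, rng):
    """Build one update vector per worker (hypothetical toy data).

    localized=True: each worker only touches its own disjoint block.
    localized=False: each worker touches every parameter.
    """
    block = dim // num_workers
    updates = []
    for w in range(num_workers):
        u = [0.0] * dim
        idxs = range(w * block, (w + 1) * block) if localized else range(dim)
        for i in idxs:
            u[i] = rng.gauss(0.0, 1.0)
        updates.append(u)
    return updates

def allreduce_sum(updates):
    """Element-wise sum across workers, standing in for a sum AllReduce."""
    return [sum(vals) for vals in zip(*updates)]

def interference(update, merged):
    """How far the merged result moved away from this worker's own update,
    measured only on the coordinates this worker actually touched."""
    sq = sum((m - u) ** 2 for u, m in zip(update, merged) if u != 0.0)
    return sq ** 0.5

if __name__ == "__main__":
    rng = random.Random(0)
    for localized in (True, False):
        ups = make_updates(4, 16, localized, rng)
        merged = allreduce_sum(ups)
        label = "localized" if localized else "dense"
        print(label, [round(interference(u, merged), 3) for u in ups])
```

In the localized case every worker's interference is zero, since nobody else wrote to its slice. Real gradients are never this cleanly separated, so this only shows the limiting case the comment is gesturing at.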