I have fond memories of cs224d [1] taught by richardsocher. It’s a bit dated at this point as it was created in the pre-transformer era, but it was a very cool introduction to applying deep learning to nlp at the time.
Similar thoughts here. That was when I realized the potential of the Internet: I didn't have to be a grad student at a tier 1 research university to learn about the frontier.
Those suggestions they make for a B200 start at $4.99 an hour.
Is that really required, for starting out?
I've been tinkering with my own from-scratch LLM, but in the early phases I don't need anything more than a 4090 on Vast.ai
I brought a group together to do this class using the YouTube videos and course materials available online. It is challenging but rewarding. We tackled it one lecture video per week. Started with over 30 learners and by last session we were down to 8.
i recently started reading "build reasoning model from scratch" then i realized that i am not really interested in building part and just want to understand theory and practice behind it.
A want like a casual lesswrong style from ground up explanation.
storus | 2 hours ago
meken | an hour ago
[1] https://cs224d.stanford.edu
egl2020 | an hour ago
tmule | an hour ago
Bilal_io | an hour ago
mindcrime | an hour ago
aerohit | an hour ago
skerit | an hour ago
Those suggestions they make for a B200 start at $4.99 an hour.
Is that really required, for starting out? I've been tinkering with my own from-scratch LLM, but in the early phases I don't need anything more than a 4090 on Vast.ai
root-parent | 58 minutes ago
flakiness | 29 minutes ago
grahameb | 9 minutes ago
airstrike | an hour ago
danbrooks | 11 minutes ago
Would be great to have a community to discuss the material - even if folks can't commit to the full course.
sonabinu | 35 minutes ago
dominotw | 12 minutes ago
A want like a casual lesswrong style from ground up explanation.