2021
-
Transformer Practice
August 13, 2021Hands-on PyTorch implementation of Scaled Dot-Product Attention and Multi-Head Attention for the Transformer.
3 min read ·pytorchnlp -
Transformer
August 13, 2021Core concepts of the Transformer model: encoder self-attention, Query/Key/Value embeddings, and multi-head attention mechanics.
4 min read ·dlnlp -
Naver Boostcamp AI Tech 2nd - Week 2 Report
August 13, 2021Week 2 of Naver Boostcamp AI Tech covering deep learning basics, data visualization, and sample variance with degrees of freedom.
1 min read ·naver-boostcamp -
Parameter Count
August 13, 2021Discussion on how the relationship between model parameter count and generalization performance shifted with scaling law findings.
1 min read ·dlnaver-boostcamp -
Convolution Practice
August 12, 2021Practical CNN implementation in PyTorch: add_module, training loops, and batch normalization.
1 min read ·computer-visiondl -
Reference Blogs
August 11, 2021A curated list of reference blogs for machine learning and engineering study.
1 min read ·dev-tools