-
Retrieval-Augmented Diffusion Model
May 18, 2022A review of the paper that enhances diffusion models using retrieval-based approaches from NLP, achieving high-fidelity image generation with fewer parameters.
7 min read ·computer-visionnlp+1 -
Med-BERT
May 17, 2022A review of Med-BERT, a BERT-based model pretrained on structured EHR data for disease prediction.
5 min read ·nlppaper-review -
Condenser, coCondenser
May 16, 2022Paper review of Condenser and coCondenser for dense retrieval pre-training.
2 min read ·nlppaper-review -
NLP Trends
October 29, 2021Survey of NLP trends including reinforcement learning for summarization and QA, and multimodal NLP+CV approaches like Show and Tell and GLAC Net.
1 min read ·nlp -
QA with Phrase Retrieval
October 20, 2021Overview of phrase retrieval for open-domain QA, using dense-sparse representations to bypass the retriever-reader pipeline.
3 min read ·nlp -
Negative In-batch
October 18, 2021An approach that enables large-batch contrastive learning under memory constraints by using in-batch negatives instead of explicit negative sampling.
1 min read ·nlp -
Reducing Training Bias
October 18, 2021Types of training bias in MRC and ODQA, and techniques to mitigate them including negative sampling and annotation bias handling.
3 min read ·nlp -
Linking MRC and Retrieval
October 17, 2021Deep dive into Open Domain Question Answering, covering the Retriever-Reader approach, distant supervision, and passage granularity strategies.
3 min read ·nlp -
Passage Retrieval - Scaling Up
October 17, 2021Scaling passage retrieval with approximate similarity search techniques: scalar quantization, inverted file indexing, and FAISS.
3 min read ·nlp -
Dense Embedding
October 14, 2021Dense embedding for passage retrieval: training bi-encoders with contrastive learning and negative sampling strategies.
3 min read ·nlp -
Passage Retrieval
October 13, 2021Fundamentals of passage retrieval for open-domain QA, covering sparse embedding, TF-IDF, and similarity search in vector space.
3 min read ·nlp -
Generation-based MRC
October 13, 2021Generation-based MRC using seq2seq models like BART: comparison with extraction-based approaches, preprocessing, and decoding strategies.
1 min read ·nlp -
Recent NLP Models
October 13, 2021Survey of post-BERT NLP models including XLNet, RoBERTa, BART, T5, Meena, and controllable language models like PPLM.
3 min read ·nlp -
GPT Language Models
October 12, 2021Evolution of GPT language models from GPT-1 to GPT-3: architecture changes, zero/few-shot learning, and scaling effects.
2 min read ·nlp -
Unicode, Tokenization
October 12, 2021Basics of Unicode handling in Python and subword tokenization methods including BPE (Byte-Pair Encoding).
1 min read ·nlppython -
MRC
October 12, 2021Introduction to Machine Reading Comprehension covering dataset types, key challenges like coreference resolution and multi-hop reasoning, and evaluation metrics.
2 min read ·nlp -
Sentence Token Classification
September 28, 2021Token-level classification tasks with BERT, covering Named Entity Recognition (NER) and POS tagging for Korean text.
1 min read ·nlp -
KLUE Dependency Parsing and Single Sentence Classification
September 28, 2021Overview of Korean dependency parsing and single sentence classification tasks in KLUE, including BERT-based model architecture and training pipeline.
2 min read ·nlp -
BERT Sentence Pair Classification Task
September 28, 2021BERT-based sentence pair classification covering NLI, semantic text pair similarity, and information retrieval QA.
1 min read ·nlp -
Training BERT
September 28, 2021BERT training pipeline details: tokenizer creation, dataset construction with NSP, masking strategies, segment handling, and truncation.
2 min read ·nlp -
BERT Applications
September 27, 2021Practical applications of BERT for sentiment analysis, relation extraction, semantic similarity, NER, and machine reading comprehension.
4 min read ·nlp -
Regex Practice
September 27, 2021Hands-on regex practice dissecting a password validation pattern with lookaheads and character classes.
1 min read ·nlppython -
Language Model Benchmark Summary
September 26, 2021Summary of language model benchmarks from GLUE to multilingual evaluations, covering the evolution from RNN to ELMo, BERT, and beyond.
3 min read ·nlpnaver-boostcamp -
Confusing Points in NLP
September 20, 2021Clarifying commonly confused NLP concepts: argmax vs. multinomial sampling, beam search vs. sampling, and Pre-LN vs. Post-LN in GPT and Transformer.
2 min read ·nlp -
Recent Trends of NLP
September 19, 2021Survey of recent NLP trends covering QA, open-domain chatbots, unsupervised machine translation, text style transfer, and in-context learning.
5 min read ·nlp -
Advanced Self-supervised Pre-training Models
September 18, 2021Overview of GPT-2, GPT-3, and ALBERT covering zero/few-shot learning, model scaling, and efficient pre-training approaches.
8 min read ·nlp -
Self-supervised Pre-training Models
September 18, 2021Overview of self-supervised pre-training models including GPT-1 and BERT, covering masked language modeling and transfer learning.
7 min read ·nlp -
Introduction to the Transformer
September 13, 2021Introduction to the Transformer architecture: self-attention with Query/Key/Value, scaled dot-product attention, and how it overcomes RNN limitations.
6 min read ·nlp -
BLEU
September 10, 2021BLEU score for machine translation evaluation: precision, recall, F1, n-gram overlap, and brevity penalty.
2 min read ·nlp -
Beam Search
September 8, 2021Explanation of beam search decoding as a trade-off between greedy decoding and exhaustive search in sequence generation.
2 min read ·nlp -
Attention
September 8, 2021Explanation of the attention mechanism in Seq2Seq models, covering attention vectors, context vectors, teacher forcing, and similarity functions.
4 min read ·nlp -
LSTM, GRU
September 7, 2021Detailed explanation of LSTM and GRU architectures, covering gate mechanisms, cell state, and how they solve RNN's long-term dependency problem.
2 min read ·nlp -
RNNs
September 7, 2021Deep dive into RNN types (one-to-one, one-to-many, many-to-many), character-level language models, and LSTM/GRU architectures.
4 min read ·nlp -
NLP Preprocessing
September 6, 2021NLP text preprocessing techniques: stopword removal, lemmatization, and punctuation handling with NLTK and spaCy.
1 min read ·nlp -
Word Embedding
September 6, 2021Introduction to word embedding and Word2Vec: how words are mapped to vector spaces to capture semantic relationships.
4 min read ·nlp -
Bag-of-Words
September 6, 2021Introduction to Bag-of-Words text representation with one-hot encoding and Naive Bayes classification.
2 min read ·nlp -
NLP Overview
September 6, 2021Overview of NLP fundamentals including tokenization, NER, sentiment analysis, machine translation, and the impact of Transformer on the field.
3 min read ·nlp -
Transformer Practice
August 13, 2021Hands-on PyTorch implementation of Scaled Dot-Product Attention and Multi-Head Attention for the Transformer.
3 min read ·pytorchnlp -
Transformer
August 13, 2021Core concepts of the Transformer model: encoder self-attention, Query/Key/Value embeddings, and multi-head attention mechanics.
4 min read ·dlnlp -
RNN
August 6, 2021Fundamentals of RNN including sequence data handling, latent autoregressive models, BPTT, and truncated backpropagation.
2 min read ·mlnlp+1