Posts tagged with "nlp" - Sungho Park

Retrieval-Augmented Diffusion Model
May 18, 2022

A review of the paper that enhances diffusion models using retrieval-based approaches from NLP, achieving high-fidelity image generation with fewer parameters.

7 min read ·

computer-vision
nlp
+1
Med-BERT
May 17, 2022

A review of Med-BERT, a BERT-based model pretrained on structured EHR data for disease prediction.

5 min read ·

nlp
paper-review
Condenser, coCondenser
May 16, 2022

Paper review of Condenser and coCondenser for dense retrieval pre-training.

2 min read ·

nlp
paper-review
NLP Trends
October 29, 2021

Survey of NLP trends including reinforcement learning for summarization and QA, and multimodal NLP+CV approaches like Show and Tell and GLAC Net.

1 min read ·

nlp
QA with Phrase Retrieval
October 20, 2021

Overview of phrase retrieval for open-domain QA, using dense-sparse representations to bypass the retriever-reader pipeline.

3 min read ·

nlp
Negative In-batch
October 18, 2021

An approach that enables large-batch contrastive learning under memory constraints by using in-batch negatives instead of explicit negative sampling.

1 min read ·

nlp
Reducing Training Bias
October 18, 2021

Types of training bias in MRC and ODQA, and techniques to mitigate them including negative sampling and annotation bias handling.

3 min read ·

nlp
Linking MRC and Retrieval
October 17, 2021

Deep dive into Open Domain Question Answering, covering the Retriever-Reader approach, distant supervision, and passage granularity strategies.

3 min read ·

nlp
Passage Retrieval - Scaling Up
October 17, 2021

Scaling passage retrieval with approximate similarity search techniques: scalar quantization, inverted file indexing, and FAISS.

3 min read ·

nlp
Dense Embedding
October 14, 2021

Dense embedding for passage retrieval: training bi-encoders with contrastive learning and negative sampling strategies.

3 min read ·

nlp
Passage Retrieval
October 13, 2021

Fundamentals of passage retrieval for open-domain QA, covering sparse embedding, TF-IDF, and similarity search in vector space.

3 min read ·

nlp
Generation-based MRC
October 13, 2021

Generation-based MRC using seq2seq models like BART: comparison with extraction-based approaches, preprocessing, and decoding strategies.

1 min read ·

nlp
Recent NLP Models
October 13, 2021

Survey of post-BERT NLP models including XLNet, RoBERTa, BART, T5, Meena, and controllable language models like PPLM.

3 min read ·

nlp
GPT Language Models
October 12, 2021

Evolution of GPT language models from GPT-1 to GPT-3: architecture changes, zero/few-shot learning, and scaling effects.

2 min read ·

nlp
Unicode, Tokenization
October 12, 2021

Basics of Unicode handling in Python and subword tokenization methods including BPE (Byte-Pair Encoding).

1 min read ·

nlp
python
MRC
October 12, 2021

Introduction to Machine Reading Comprehension covering dataset types, key challenges like coreference resolution and multi-hop reasoning, and evaluation metrics.

2 min read ·

nlp
Sentence Token Classification
September 28, 2021

Token-level classification tasks with BERT, covering Named Entity Recognition (NER) and POS tagging for Korean text.

1 min read ·

nlp
KLUE Dependency Parsing and Single Sentence Classification
September 28, 2021

Overview of Korean dependency parsing and single sentence classification tasks in KLUE, including BERT-based model architecture and training pipeline.

2 min read ·

nlp
BERT Sentence Pair Classification Task
September 28, 2021

BERT-based sentence pair classification covering NLI, semantic text pair similarity, and information retrieval QA.

1 min read ·

nlp
Training BERT
September 28, 2021

BERT training pipeline details: tokenizer creation, dataset construction with NSP, masking strategies, segment handling, and truncation.

2 min read ·

nlp
BERT Applications
September 27, 2021

Practical applications of BERT for sentiment analysis, relation extraction, semantic similarity, NER, and machine reading comprehension.

4 min read ·

nlp
Regex Practice
September 27, 2021

Hands-on regex practice dissecting a password validation pattern with lookaheads and character classes.

1 min read ·

nlp
python
Language Model Benchmark Summary
September 26, 2021

Summary of language model benchmarks from GLUE to multilingual evaluations, covering the evolution from RNN to ELMo, BERT, and beyond.

3 min read ·

nlp
naver-boostcamp
Confusing Points in NLP
September 20, 2021

Clarifying commonly confused NLP concepts: argmax vs. multinomial sampling, beam search vs. sampling, and Pre-LN vs. Post-LN in GPT and Transformer.

2 min read ·

nlp
Recent Trends of NLP
September 19, 2021

Survey of recent NLP trends covering QA, open-domain chatbots, unsupervised machine translation, text style transfer, and in-context learning.

5 min read ·

nlp
Advanced Self-supervised Pre-training Models
September 18, 2021

Overview of GPT-2, GPT-3, and ALBERT covering zero/few-shot learning, model scaling, and efficient pre-training approaches.

8 min read ·

nlp
Self-supervised Pre-training Models
September 18, 2021

Overview of self-supervised pre-training models including GPT-1 and BERT, covering masked language modeling and transfer learning.

7 min read ·

nlp
Introduction to the Transformer
September 13, 2021

Introduction to the Transformer architecture: self-attention with Query/Key/Value, scaled dot-product attention, and how it overcomes RNN limitations.

6 min read ·

nlp
BLEU
September 10, 2021

BLEU score for machine translation evaluation: precision, recall, F1, n-gram overlap, and brevity penalty.

2 min read ·

nlp
Beam Search
September 8, 2021

Explanation of beam search decoding as a trade-off between greedy decoding and exhaustive search in sequence generation.

2 min read ·

nlp
Attention
September 8, 2021

Explanation of the attention mechanism in Seq2Seq models, covering attention vectors, context vectors, teacher forcing, and similarity functions.

4 min read ·

nlp
LSTM, GRU
September 7, 2021

Detailed explanation of LSTM and GRU architectures, covering gate mechanisms, cell state, and how they solve RNN's long-term dependency problem.

2 min read ·

nlp
RNNs
September 7, 2021

Deep dive into RNN types (one-to-one, one-to-many, many-to-many), character-level language models, and LSTM/GRU architectures.

4 min read ·

nlp
NLP Preprocessing
September 6, 2021

NLP text preprocessing techniques: stopword removal, lemmatization, and punctuation handling with NLTK and spaCy.

1 min read ·

nlp
Word Embedding
September 6, 2021

Introduction to word embedding and Word2Vec: how words are mapped to vector spaces to capture semantic relationships.

4 min read ·

nlp
Bag-of-Words
September 6, 2021

Introduction to Bag-of-Words text representation with one-hot encoding and Naive Bayes classification.

2 min read ·

nlp
NLP Overview
September 6, 2021

Overview of NLP fundamentals including tokenization, NER, sentiment analysis, machine translation, and the impact of Transformer on the field.

3 min read ·

nlp
Transformer Practice
August 13, 2021

Hands-on PyTorch implementation of Scaled Dot-Product Attention and Multi-Head Attention for the Transformer.

3 min read ·

pytorch
nlp
Transformer
August 13, 2021

Core concepts of the Transformer model: encoder self-attention, Query/Key/Value embeddings, and multi-head attention mechanics.

4 min read ·

dl
nlp
RNN
August 6, 2021

Fundamentals of RNN including sequence data handling, latent autoregressive models, BPTT, and truncated backpropagation.

2 min read ·

ml
nlp
+1