Joseph Wang

Posts

Thoughts on data science, machine learning, and research

Nov 2, 2025

How I'm Building a Context-Aware Retriever to Boost RAG Quality (Part 2: Implementation)

In Part 1, I walked through the design. Now let’s see it in action with a hands-on example using the ContractNLI dataset. I’ll assume chunking is a...

Oct 25, 2025

: Software, NLP, LLM, AI

How I’m Building a Context-Aware Retriever to Boost RAG Quality (Part 1: Introduction)

In this post, I’ll share how I built a context-aware retriever for knowledge base question answering. It runs on top of an MCP server and can easil...

Sep 5, 2025

: Machine Learning, Deep Learning, NLP

Hangman with DQN and Transformers

TL;DR. We train a small bidirectional Transformer + Double DQN to play Hangman, restricted to words ≤5 characters (training and inference) to keep ...

Mar 26, 2024

: Machine Learning, Deep Learning

GMV Forecasting via xDeepFM

In this post, I aim to share how I conducted a proof of concept (PoC) to solve a real-world problem using deep learning techniques, emphasizing a c...

Nov 16, 2023

: Machine Learning, Deep Learning, NLP, LLM

Posts

How I'm Building a Context-Aware Retriever to Boost RAG Quality (Part 2: Implementation)

How I’m Building a Context-Aware Retriever to Boost RAG Quality (Part 1: Introduction)

Hangman with DQN and Transformers

GMV Forecasting via xDeepFM

Decoding LoRA: A Comprehensive Summary on Low-Rank Adaptation

Explore

About Me

Software

Books