Assignment: Query Transformation#
Assignment Metadata#
| Field | Description |
|---|---|
| Assignment Name | Query Transformation with HyDE and Decomposition |
| Course | RAG and Optimization |
| Project Name | |
| Estimated Time | 90 minutes |
| Framework | Python 3.10+, LangChain, OpenAI API, Sentence-Transformers |
Learning Objectives#
By completing this assignment, you will be able to:
Implement Hypothetical Document Embeddings (HyDE) for improved query-document matching
Build Query Decomposition pipelines to handle complex multi-part questions
Design effective prompts for LLM-based query transformation
Evaluate the impact of query transformation on retrieval quality
Apply these techniques to real-world RAG scenarios
Problem Description#
Users of your RAG system often submit queries that perform poorly in retrieval:
Short, vague queries: “remote work” instead of “What are the company’s remote work policies?”
Question-answer mismatch: Questions are interrogative while documents are declarative
Complex multi-part queries: “Compare the battery life and camera quality of iPhone 15 and Samsung S24”
Your task is to implement HyDE and Query Decomposition to transform user queries before retrieval.
Technical Requirements#
Environment Setup#
Python 3.10 or higher
Required packages:
langchain>=0.1.0
openai>=1.0.0
sentence-transformers>=2.2.0
chromadb>=0.4.0
API Requirements#
OpenAI API key (or compatible LLM endpoint)
Embedding model (e.g., text-embedding-3-small or local all-MiniLM-L6-v2)
Tasks#
Task 1: Implement HyDE (35 points)#
Build a HyDE pipeline with three stages:
Generate: Use LLM to create a hypothetical answer paragraph
Encode: Convert the hypothetical answer to an embedding
Retrieve: Search using the hypothetical answer embedding
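The three stages above can be sketched as follows. Note that `generate_text`, `embed`, and `search` are assumed placeholder hooks for your LLM, embedding model, and vector store, not real library APIs, and the prompt wording is illustrative:

```python
# Minimal HyDE sketch: Generate -> Encode -> Retrieve.
# `generate_text`, `embed`, and `search` are injected hooks (placeholders).

HYDE_PROMPT = (
    "Write a short paragraph, in the declarative style of our documentation, "
    "that directly answers the question below.\n\n"
    "Question: {query}\n\nParagraph:"
)

def hyde_search(query, generate_text, embed, search, k=5):
    # 1. Generate: ask the LLM for a hypothetical answer paragraph.
    hypothetical = generate_text(HYDE_PROMPT.format(query=query))
    # 2. Encode: embed the hypothetical answer instead of the raw query.
    vector = embed(hypothetical)
    # 3. Retrieve: search the vector store with that embedding.
    return search(vector, k)
```

Because the hooks are injected, you can unit-test the pipeline with stubs before wiring in real API calls.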
Design the generation prompt that:
Instructs the LLM to write in the style of your target documents
Includes domain-specific vocabulary guidance
Handles different query types (how-to, what-is, troubleshooting)
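One way to satisfy these prompt-design requirements is to key templates by query type. The template wording and type labels below are assumptions to adapt to your corpus:

```python
# Illustrative generation prompts per query type (wording is an assumption).
HYDE_TEMPLATES = {
    "how-to": (
        "Write a step-by-step documentation excerpt, using our product's "
        "terminology, that explains how to: {query}"
    ),
    "what-is": (
        "Write a concise, encyclopedia-style definition paragraph "
        "answering: {query}"
    ),
    "troubleshooting": (
        "Write a support-article excerpt that diagnoses and resolves the "
        "following issue: {query}"
    ),
    "default": "Write a documentation-style paragraph answering: {query}",
}

def build_hyde_prompt(query, query_type="default"):
    # Fall back to the generic template for unrecognized query types.
    template = HYDE_TEMPLATES.get(query_type, HYDE_TEMPLATES["default"])
    return template.format(query=query)
```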
Test with at least 5 query types:
Short queries (1-3 words)
Technical troubleshooting queries
Conceptual/definition queries
How-to procedure queries
Comparison queries
Task 2: Implement Query Decomposition (35 points)#
Build a Query Decomposition pipeline:
Use LLM to analyze if a query contains multiple sub-questions
Generate independent sub-queries for parallel retrieval
Aggregate retrieved documents from all sub-queries
Synthesize a final answer using all gathered context
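The four pipeline steps can be sketched as below. Here `decompose`, `retrieve`, and `synthesize` are assumed hooks (LLM splitter, retriever, answer-generation LLM), not real library calls:

```python
def decompose_and_answer(query, decompose, retrieve, synthesize):
    # 1. Analyze: the LLM returns standalone sub-queries; an empty list
    #    means the query is single-intent, so fall back to the original.
    sub_queries = decompose(query) or [query]
    # 2-3. Retrieve for each sub-query and aggregate, deduplicating docs
    #      that multiple sub-queries retrieve.
    gathered, seen = [], set()
    for sub in sub_queries:
        for doc in retrieve(sub):
            if doc not in seen:
                seen.add(doc)
                gathered.append(doc)
    # 4. Synthesize a final answer from all gathered context.
    return synthesize(query, gathered)
```

In production the per-sub-query retrievals are independent, so they can run in parallel; the sequential loop here keeps the sketch simple.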
Design the decomposition prompt that:
Identifies multiple intents within a single question
Generates standalone sub-queries (each understandable without context)
Preserves important constraints and filters from the original query
Handle these decomposition scenarios:
Comparison queries (A vs B)
Aggregation queries (list all X with property Y)
Sequential queries (first do A, then what happens to B?)
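A decomposition prompt covering these requirements might look like the sketch below. The prompt wording and the one-shot example are assumptions; the parser falls back to the original query whenever the LLM output is unusable, which keeps the pipeline robust:

```python
import json

# Assumed decomposition prompt with one few-shot example (tune to your domain).
DECOMPOSE_PROMPT = (
    "Split the question into standalone sub-questions that each make sense "
    "without the others. Keep any constraints (dates, products, filters) in "
    "every sub-question that needs them. Return a JSON list of strings; "
    "return [] if the question is already single-intent.\n\n"
    "Question: Compare the battery life of the iPhone 15 and Samsung S24\n"
    '["What is the battery life of the iPhone 15?", '
    '"What is the battery life of the Samsung S24?"]\n\n'
    "Question: {query}\n"
)

def parse_sub_queries(llm_output, original_query):
    # Fall back to the original query if the output is not a non-empty
    # JSON list of strings.
    try:
        subs = json.loads(llm_output)
    except (json.JSONDecodeError, TypeError):
        return [original_query]
    if isinstance(subs, list) and subs and all(isinstance(s, str) for s in subs):
        return subs
    return [original_query]
```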
Task 3: Comparative Evaluation (30 points)#
Create a test set with 15 queries:
5 queries suitable for HyDE (short/vague)
5 queries suitable for Decomposition (complex/multi-part)
5 baseline queries (clear, single-intent)
Compare retrieval quality across methods:
| Query ID | Query Type | Baseline Recall@5 | HyDE Recall@5 | Decomposition Recall@5 |
|---|---|---|---|---|
| Q1 | Short | | | |
| Q2 | Complex | | | |
| … | … | | | |
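The Recall@5 metric in the table can be computed as the fraction of relevant documents that appear in the top-k retrieved results, e.g.:

```python
def recall_at_k(retrieved_ids, relevant_ids, k=5):
    """Fraction of the relevant documents found in the top-k results."""
    relevant = set(relevant_ids)
    if not relevant:
        return 0.0
    hits = sum(1 for doc_id in retrieved_ids[:k] if doc_id in relevant)
    return hits / len(relevant)
```

Run it once per (query, method) pair against your labeled relevant documents to fill in the table.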
Analyze LLM outputs:
Document 3 example HyDE generations and their effectiveness
Document 3 example decompositions and sub-query quality
Identify failure cases and suggest improvements
Submission Requirements#
Required Deliverables#
Source code (Jupyter notebook or Python scripts)
README.md with setup and usage instructions
Prompt templates used for HyDE and Decomposition
Evaluation results table
Analysis document with examples and failure case analysis
Submission Checklist#
HyDE correctly generates hypothetical documents
Query Decomposition produces valid sub-queries
Both pipelines integrate with the retrieval system
Evaluation demonstrates improvement over baseline
Documentation includes prompt design rationale
Evaluation Criteria#
| Criteria | Points |
|---|---|
| HyDE implementation correctness | 20 |
| HyDE prompt design quality | 15 |
| Decomposition implementation | 20 |
| Decomposition prompt design | 15 |
| Evaluation methodology | 15 |
| Analysis and examples quality | 10 |
| Code quality and documentation | 5 |
| Total | 100 |
Hints#
For HyDE, the hypothetical answer doesn’t need to be factually correct—focus on style and vocabulary
Use few-shot examples in your prompts for more consistent LLM outputs
Consider caching LLM responses during development to save API costs
The companion notebook 03-langchain-hyde-demo.ipynb provides a starting point
For decomposition, test that each sub-query makes sense in isolation
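The caching hint can be implemented with a small in-memory wrapper around your generation function; this is one simple option (an on-disk cache would also work across sessions):

```python
import functools

def make_cached(generate):
    """Wrap an LLM call with an in-memory cache keyed on the prompt, so
    repeated development runs don't repeat identical API calls."""
    @functools.lru_cache(maxsize=None)
    def cached(prompt):
        return generate(prompt)
    return cached
```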