Module 1 · AI

📖 5 min read · By ['[“HungHM15”]']

Quiz#

Query Transformation#

Question 1: Why do raw user questions often yield poor search results in a RAG system?

A. The questions are too long.
B. The question’s vector does not match the vector of detailed legal documents.
C. The database is too small.
D. The system uses keyword search instead of vectors.

Answer: B

Question 2: What is the core idea of the Query Transformation technique?

A. To compress the database size.
B. To manually review user queries.
C. To use an LLM to rewrite, expand, or break down the user’s question.
D. To translate questions into multiple languages.

Answer: C

Question 3: What are the three steps of the HyDE process?

A. Encode, Decode, Generate
B. Generate, Encode, Retrieve
C. Retrieve, Read, Rank
D. Breakdown, Retrieval, Synthesis

Answer: B

Question 4: In the HyDE generation step, what does the system ask the LLM to do?

A. Write a hypothetical answer paragraph for the user’s question.
B. Search the internet for an answer.
C. Break the question into sub-queries.
D. Translate the question.

Answer: A

Question 5: What is a common characteristic of user questions compared to documents?

A. They are longer and descriptive.
B. They are identical in length.
C. They are often short and interrogative.
D. They use more complex technical vocabulary.

Answer: C

Question 6: What might happen if a user asks a very short question like ‘How to handle blue screen error’ without HyDE?

A. The system crashes.
B. The vector might mistakenly match documents describing screen colors.
C. The system automatically downloads new drivers.
D. The query is rejected.

Answer: B

Question 7: In the HyDE process, why is the ‘fake answer’ useful for retrieval?

A. Its vector is closer to the vector of the ‘real answer’ than the question’s vector.
B. It is always factually 100% accurate.
C. It bypasses the embedding model completely.
D. It relies purely on BM25 keyword matching.

Answer: A

Question 8: Which statement is true regarding the factual accuracy of the paragraph generated in the HyDE Generate step?

A. It is guaranteed to be factually perfect.
B. It must be verified by a human.
C. The information may be factually incorrect, but the style resembles the actual document.
D. It only contains information already present in the user’s prompt.

Answer: C

Question 9: When is the Query Decomposition technique particularly useful?

A. When the database is empty.
B. For simple, one-word queries.
C. For complex questions where a single text passage cannot contain enough information to answer.
D. When the user wants a hypothetical answer.

Answer: C

Question 10: What happens to a simple search vector if a question requires aggregating information from multiple sources?

A. It finds the perfect document immediately.
B. It hangs in between different topics.
C. It triggers a system timeout.
D. It switches to keyword search automatically.

Answer: B

Question 11: What is the first step in the Query Decomposition strategy?

A. Split multi-intent questions into single-intent questions.
B. Aggregate all text segments.
C. Generate a hypothetical answer.
D. Translate the query.

Answer: A

Question 12: What does the Synthesis step do in Query Decomposition?

A. Creates hypothetical answers.
B. Aggregates text segments found from all steps and gives them to the LLM.
C. Splits the question into multiple intents.
D. Generates a new vector for the question.

Answer: B

Question 13: How does Query Transformation ensure the system understands the true intent behind concise commands?

A. By skipping the embedding step.
B. By acting as an intelligent editor to edit and reorient user questions.
C. By asking the user to retype their question.
D. By only searching for the exact keywords provided.

Answer: B

Question 14: In the context of HyDE, what specifically helps the system find the exact technical instruction document for a ‘blue screen’ query?

A. The presence of technical keywords like ‘BSOD’, ‘driver’, and ‘Safe Mode’ appearing in the LLM’s draft.
B. The user explicitly typing ‘BSOD’.
C. A fallback to exact BM25 matching.
D. The decomposition of the query into three sub-queries.

Answer: A

Question 15: During Query Decomposition, why is performing a document search for each separate sub-question necessary?

A. To ensure each search has a clear goal and high accuracy.
B. To reduce the number of tokens used.
C. To bypass the vector database entirely.
D. Because the LLM cannot read long documents.

Answer: A

Question 16: If a user asks to ‘Compare the revenue of iPhone 15 and Samsung S24 in Q1 2024’, why does simple searching fail?

A. The LLM refuses to compare competitors.
B. The query length exceeds the maximum token limit.
C. There is no single document containing this comparison table, information is scattered.
D. The query lacks technical keywords.

Answer: C

Question 17: What is the ‘Encode’ step’s specific function in the HyDE process?

A. To compress the final response.
B. To pass the hypothetical paragraph through the embedding model to create a vector.
C. To encrypt user data for privacy.
D. To split the text into manageable chunks.

Answer: B

Question 18: Which technique is best to overcome semantic asymmetry by utilizing the creativity of the LLM?

A. BM25
B. Query Decomposition
C. HNSW
D. HyDE

Answer: D

Question 19: What ultimate role does Query Transformation play in the overall RAG architecture?

A. It replaces the vector database.
B. It acts as an intelligent editor, editing and reorienting questions before lookup.
C. It generates the final answer displayed to the user.
D. It handles the user interface.

Answer: B

Question 20: In a complex query scenario, what does the LLM do in the Final Generation stage of Query Decomposition?

A. It receives figures from the searches and self-aggregates them into a complete comparison answer.
B. It asks the user for clarification.
C. It generates a hypothetical document to search with.
D. It runs a Cypher query against a graph database.

Answer: A