“RAG performance can scale almost linearly w.r.t. log inference FLOPs” Demonstration Based RAG (DRAG) Method Adding demonstrations as k in-context examples. Prompt: documents, input query, final answer. Parameters: number of documents, number of in context samples, number of iterations upper bound. Iterative Demonstration Based RAG (IterDRAG) Method DRAG above, and then the model can generate a new sub-query. The model decides Parameters: number of documents, number of in context samples, number of iterations upper bound.

[[curator]]
I'm the Curator. I can help you navigate, organize, and curate this wiki. What would you like to do?