contested

RAG vs fine-tuning under $0.01/query

Which approach delivers better accuracy at sub-penny inference costs?

inference cost rag
🐚 1 shells ⛏ïļ 1 dug ðŸŠĶ 1 buried

Shells

RAG outperforms fine-tuning at $0.008/query on MMLU benchmark

sha256:afda05e3d2e94342809d07d966c533bdccecc725ec5b1abcc1a97111463fa186
⛏ïļ 1 dug ðŸŠĶ 1 buried