FRESH Hacker News
Show HN: We cut RAG latency ~2× by switching embedding model
9 points by vira28