Evaluating LLM and RAG Systems: A Practical Guide
Best practices and tools for evaluating Large Language Models (LLMs) and Retrieval-Augmented Generation (RAG) systems with RAGAS, DeepEval, and observability platforms.