Category: Strategy
-
Guide on Performance and Security for Advanced Production RAG: Part 9 – Logging
Appropriate logging facilitates performance, cost and security analyses. We share three key categories of data and metadata every production RAG system should maintain within their logs.
-
Guide on Performance and Security for Advanced Production RAG: Part 8 – Guardrail Implementation
Guardrails are key, but the diversity and complexity of input formats makes it challenging to put them in place, especially for multimodal systems.
-
Guide on Performance and Security for Advanced Production RAG: Part 7 – Post-retrieval Optimization
Post-retrieval, a combination of filtering and reranking of search results help to refine, optimize and stack content for LLM operations.
-
Guide on Performance and Security for Advanced Production RAG: Part 6 – Pre-retrieval Optimization
The pre-retrieval phase is a critical stage where user input is processed and transformed into optimized vector search queries
-
Guide on Performance and Security for Advanced Production RAG: Part 5 – Chunking Approaches
The way content is chunked has a profound impact on the eventual performance of a RAG system. We introduce three broad approaches to chunking.
-
Guide on Performance and Security for Advanced Production RAG: Part 4 – Data Pipeline Design
A well-designed data pipeline ensures that large volume of data is accurately and consistently incorporated. We outline 7 key design considerations for production-grade enterprise RAG data pipelines.
-
Guide on Performance and Security for Advanced Production RAG: Part 3 – Choice of Vector Databases
Different vector databases support different search algorithms, distance metrics, scalability, security capabilities, and options for performance optimization
-
Guide on Performance and Security for Advanced Production RAG: Part 2 – Model Choice, Model Precision
In any given RAG system, the choice of model and precision is often a trade-off between performance and cost
-
Guide on Performance and Security for Advanced Production RAG: Part 1 – Model Deployment Mode
API model access vs hosted model on self-managed instance: A key decision point for CTOs, CISOs and AI Engineers
-
Guide on Performance and Security for Advanced Production RAG: Overview
“The case for production-grade RAG systems in enterprises warrant much deeper scrutiny over system design, given performance, cost and security considerations.”