Author: aigosadmin
-
Benchmarking Guardrail Implementations: Deepseek, Perplexity, Grok, Gemini, ChatGPT.
We recently concluded an experiment to test the robustness of the guardrail implementations in five popular AI chat agents against a potential data exfiltration vulnerability. Our goal was to see if we could coax these systems into generating hyperlinks that, if clicked, would reveal sensitive information such as user queries and system prompts. Our test prompt was…
-
Guide on Performance and Security for Advanced Production RAG: Part 9 – Logging
Appropriate logging facilitates performance, cost and security analyses. We share three key categories of data and metadata that every production RAG system should maintain in its logs.
-
Guide on Performance and Security for Advanced Production RAG: Part 8 – Guardrail Implementation
Guardrails are key, but the diversity and complexity of input formats make it challenging to put them in place, especially for multimodal systems.
-
Guide on Performance and Security for Advanced Production RAG: Part 7 – Post-retrieval Optimization
Post-retrieval, a combination of filtering and reranking of search results helps to refine, optimize and stack content for LLM operations.
-
Guide on Performance and Security for Advanced Production RAG: Part 6 – Pre-retrieval Optimization
The pre-retrieval phase is a critical stage where user input is processed and transformed into optimized vector search queries.
-
Guide on Performance and Security for Advanced Production RAG: Part 5 – Chunking Approaches
The way content is chunked has a profound impact on the eventual performance of a RAG system. We introduce three broad approaches to chunking.
-
Guide on Performance and Security for Advanced Production RAG: Part 4 – Data Pipeline Design
A well-designed data pipeline ensures that large volumes of data are accurately and consistently incorporated. We outline 7 key design considerations for production-grade enterprise RAG data pipelines.
-
Guide on Performance and Security for Advanced Production RAG: Part 3 – Choice of Vector Databases
Different vector databases support different search algorithms, distance metrics, scalability, security capabilities, and options for performance optimization.
-
Guide on Performance and Security for Advanced Production RAG: Part 2 – Model Choice, Model Precision
In any given RAG system, the choice of model and precision is often a trade-off between performance and cost.
-
Guide on Performance and Security for Advanced Production RAG: Part 1 – Model Deployment Mode
API model access vs. a hosted model on a self-managed instance: a key decision point for CTOs, CISOs and AI Engineers.