Author: aigosadmin

Benchmarking Guardrail Implementations: Deepseek, Perplexity, Grok, Gemini, ChatGPT.

We recently concluded a set of experiments to test the robustness of guardrail implementation by five popular AI chat agents against a potential data exfiltration vulnerability. Our goal was to see if we could coax these systems into generating hyperlinks that, if clicked, would reveal sensitive information like user queries and system prompts. Our test…

March 31, 2025
Guide on Performance and Security for Advanced Production RAG: Part 9 – Logging

Appropriate logging facilitates performance, cost and security analyses. We share three key categories of data and metadata every production RAG system should maintain within their logs.

June 12, 2024
Guide on Performance and Security for Advanced Production RAG: Part 8 – Guardrail Implementation

Guardrails are key, but the diversity and complexity of input formats makes it challenging to put them in place, especially for multimodal systems.

June 12, 2024
Guide on Performance and Security for Advanced Production RAG: Part 7 – Post-retrieval Optimization

Post-retrieval, a combination of filtering and reranking of search results help to refine, optimize and stack content for LLM operations.

June 12, 2024
Guide on Performance and Security for Advanced Production RAG: Part 6 – Pre-retrieval Optimization

The pre-retrieval phase is a critical stage where user input is processed and transformed into optimized vector search queries

June 12, 2024
Guide on Performance and Security for Advanced Production RAG: Part 5 – Chunking Approaches

The way content is chunked has a profound impact on the eventual performance of a RAG system. We introduce three broad approaches to chunking.

June 12, 2024
Guide on Performance and Security for Advanced Production RAG: Part 4 – Data Pipeline Design

A well-designed data pipeline ensures that large volume of data is accurately and consistently incorporated. We outline 7 key design considerations for production-grade enterprise RAG data pipelines.

June 12, 2024
Guide on Performance and Security for Advanced Production RAG: Part 3 – Choice of Vector Databases

Different vector databases support different search algorithms, distance metrics, scalability, security capabilities, and options for performance optimization

June 9, 2024
Guide on Performance and Security for Advanced Production RAG: Part 2 – Model Choice, Model Precision

In any given RAG system, the choice of model and precision is often a trade-off between performance and cost

June 9, 2024
Guide on Performance and Security for Advanced Production RAG: Part 1 – Model Deployment Mode

API model access vs hosted model on self-managed instance: A key decision point for CTOs, CISOs and AI Engineers

June 9, 2024