Skip to content

AI & QA Learning Library

A self-contained, plain-English reference for testing AI systems โ€” from the absolute basics of machine learning to LLM evaluation, RAG testing, security red-teaming, and the tools that tie it together.

Start at the top if you're new. Jump straight to a topic if you're not.

๐Ÿ“š 21 guides ๐ŸŒฑ No prior AI knowledge needed ๐Ÿงช QA / test-engineering focus ๐Ÿ†“ Free to read & share

๐ŸŒฑ Start Here โ€” Foundations

AI Fundamentals for Beginners โญ

The one long read that covers everything from zero โ€” the AI journey, machine learning, neural networks, LLMs, RAG, how they differ, and how each is tested.

Diagrams ยท glossary ยท no jargon left unexplained

๐ŸŽญ Playwright & TypeScript

Playwright Project Anatomy โญ

Every file in a Playwright project explained โ€” package.json, node_modules, playwright.config.ts, tsconfig.json and why they exist.

For anyone opening a JS/TS test project the first time

Playwright Framework Tutorial โญ

Building a real, maintainable framework โ€” Page Objects, fixtures, test data, utils, env config, CI โ€” with every file's purpose explained.

Bare project โ†’ professional test framework

TypeScript Cheat Sheet โญ

Just enough TypeScript to read and write tests โ€” types, interfaces, classes, generics, async/await, and the symbols you'll see daily.

Skim once, keep open as reference

๐Ÿง  Core Concepts

RAG vs Agents vs Agentic RAG

The three dominant AI architectures, side by side โ€” what each is, when to use it.

MCP Servers โ€” FAQ

What the Model Context Protocol is and why it matters for connecting LLMs to tools.

Enterprise LLM Platforms

Azure OpenAI, AWS Bedrock, and OpenAI compared for production use.

๐Ÿงช Testing & Evaluation

LLM Testing Lifecycle

The end-to-end process for testing a language model, stage by stage.

RAG Automation Testing Roadmap โญ

A 6-stage plan for testing a RAG pipeline, with concrete test cases per stage.

LLM & Agent Evaluation Matrix

What to measure, and which metric answers which question.

Ragas FAQ ยท DeepEval FAQ

The two industry-standard evaluation frameworks, explained.

AI QA Agents Catalogue โญ

Five practical AI agents that augment a QA team's day-to-day work.

Autonomous QA Multi-Agent Pipeline

A 7-agent pipeline for self-running quality engineering.

Test-Case Generator Agent

A JIRA + RAG + LangGraph architecture for AI-generated test cases.

MCP Testing Roadmap

Six steps for testing MCP-based systems comprehensively.

๐Ÿ›ก๏ธ Security & Red Teaming

Prompt Injection โ€” Complete Guide

The #1 LLM security risk, how it works, and how to test for it.

Red / Blue / Purple Teams for AI

How adversarial, defensive, and collaborative security testing apply to AI.

๐Ÿงฐ Tooling

Commercial LLM / MCP Testing Tools

A survey of the paid and open-source tools in the AI-testing landscape.

๐Ÿ’ก Framing โ€” Why & How to Move

QA Evolution โ€” Testing Intelligence โญ

"QA is no longer testing software. We are testing intelligence." The case for why QA is changing.

QA โ†’ AI QA โ€” 6-Week Transition Plan โญ

A practical, week-by-week roadmap for a traditional QA moving into AI testing.


This library is free to read and share. Built with Material for MkDocs.