CoT prompting techniques — zero-shot CoT, few-shot CoT, self-consistency, tree of thoughts, and how reasoning models compare with prompted reasoning.
Build production evaluation pipelines for LLM applications — golden datasets, LLM-as-judge, rubrics, statistical significance, regression detection, and evals vs tests.
In-context learning techniques — example selection, format design, count tuning, dynamic retrieval of demonstrations, and pitfalls of few-shot prompting.
Reliable prompt structures for reasoning, extraction, classification, generation, extended thinking, and vision tasks with Claude.
End-to-end checklist and code for building reliable Retrieval-Augmented Generation pipelines — chunking, embedding, vector DBs, retrieval, and evaluation.
Techniques for reliable structured generation — JSON mode, schema-constrained decoding, function/tool calls as output, and validator pairing with Pydantic or Zod.
navigation
actions
cheat sheet pages