LLM-driven systems that pursue a goal by interleaving reasoning, tool calls, and observations inside a loop — and that decide for themselves which step to take next.
Anthropic's agentic command-line coding assistant — a tool-using LLM that lives in your terminal, reads and edits your codebase, runs commands, and coordinates multi-step engineering work through natural language.
Grounding LLM responses in chunks retrieved from an external corpus so the model reasons over real, citable sources instead of parametric memory alone.
Local-first CLI and TypeScript library that reads JSONL session logs from Claude Code, Codex, and 13 other coding agents to produce daily/weekly/monthly/session/billing-block reports of token spend.
Package-level reference for the Vercel AI SDK — streamText, generateObject, tool calling, structured output, and the multi-provider model interface.
Package-level reference for openai on npm — Chat Completions, the Responses API, streaming, tool calls, structured outputs, embeddings, and the v4→v5 migration.
Package-level reference for the autogen-agentchat / autogen-core / autogen-ext family on PyPI plus the legacy pyautogen — install, rename history, versioning, and alternatives.
Package-level reference for chromadb on PyPI — install variants, server/client split, embedding-function extras, and alternative vector stores.
Package-level reference for the crewai library on PyPI plus the crewai-tools companion — install, versioning, and multi-agent alternatives.
Package-level reference for DSPy on PyPI — the dspy / dspy-ai rename, install variants, version policy, and alternatives.
Package-level reference for google-genai (the current Gemini SDK) and its predecessor google-generativeai — install, auth, versioning, and alternatives.
Package-level reference for the guidance library on PyPI — install, LLM-provider extras, versioning, and alternatives like instructor and outlines.
Package-level reference for haystack-ai on PyPI — install variants, the farm-haystack v1 → haystack-ai v2 rename, integrations, and alternative frameworks.
Package-level reference for the langchain family on PyPI — install variants, partner packages, version churn, and alternatives.
Package-level reference for the langsmith SDK on PyPI — install, versioning, env-var setup, and observability alternatives.
Package-level reference for qdrant-client on PyPI — install variants, server version matching, gRPC vs HTTP, fastembed extras, and alternatives.
Package-level reference for ragas on PyPI — install variants, LLM-as-judge dependencies, metric churn, and alternative evaluators.
Package-level reference for semantic-kernel on PyPI — install variants, the Python vs .NET split, provider extras, and alternative frameworks.
Package-level reference for the sentence-transformers library on PyPI — install, transformers/torch deps, model registry, and embedding alternatives.
Package-level reference for the Hugging Face transformers library on PyPI — install extras, backend choice, versioning, and alternatives.
Package-level reference for trulens-eval on PyPI — install variants, the trulens umbrella rename, framework extras, and alternative evaluators.
Package-level reference for unstructured on PyPI — install variants, the huge extras tree, system-level dependencies, and alternative parsers.
Package-level reference for weaviate-client on PyPI — install variants, the v3 → v4 API split, gRPC, and alternative vector stores.
Side-by-side comparison of LangChain, LlamaIndex, AutoGen, CrewAI, Haystack, and Semantic Kernel for building LLM-powered applications and agent systems. Covers strengths, weaknesses, and when to pick each.
Build LLM programs in DSPy with declarative signatures, modules, and optimisers. Covers Predict, ChainOfThought, ReAct, BootstrapFewShot, COPRO, MIPRO, MIPROv2, and inference compilation.
Build production-grade LLM pipelines with Haystack 2.x. Covers components, the pipeline graph, indexing and querying, retrievers, generators, RAG patterns, and evaluation.
Model Context Protocol (MCP) framework overview. Covers client/server architecture, stdio vs SSE vs streamable HTTP transports, FastMCP, mcp-go, the Python and TypeScript SDKs, and comparison with custom tool servers.
Build LLM-powered applications with Microsoft Semantic Kernel. Covers the kernel, plugins, prompt templates, planners, function calling, Kernel Memory, and the Python and .NET SDKs.
Reliable prompt structures for reasoning, extraction, classification, generation, extended thinking, and vision tasks with Claude.
End-to-end checklist and code for building reliable Retrieval-Augmented Generation pipelines — chunking, embedding, vector DBs, retrieval, and evaluation.
Hugging Face Transformers, LangChain, Google Gemini SDK, and LangSmith — practical reference for AI/ML frameworks and observability tools.
Claude Code, Codex CLI, the Claude API, and prompt engineering — practical reference for building with and using large language models.
Build multi-agent AI systems with Microsoft AutoGen. Covers agents, group chats, code execution, tool registration, async runtimes, and LLM configuration.
Store and query vector embeddings locally or over a network with ChromaDB. Covers client types, collections, add, query, metadata filters, embedding functions, and LangChain/LlamaIndex integration.
Orchestrate teams of role-playing AI agents with crewAI. Covers agents, tasks, crews, tools, LLM selection, memory, YAML config, and the kickoff lifecycle.
Call Google's Gemini models from Python for text, multimodal, streaming, chat, function calling, and embeddings. Covers the genai SDK, safety settings, file API, and async usage.
Interleave Python control flow with LLM generation and enforce structured output using guidance. Covers gen(), select(), chat blocks, regex constraints, JSON schemas, and token healing.
Build LLM-powered pipelines with LangChain. Covers LCEL chains, chat models, prompts, output parsers, tools, agents, retrievers, memory, and streaming.
Trace, debug, evaluate, and monitor LLM applications with LangSmith. Covers tracing setup, datasets, evaluators, prompt hub, comparing runs, and CI integration.
Build RAG pipelines and LLM-powered data applications with LlamaIndex. Covers document loading, indexing, query engines, custom LLMs and embeddings, persistent storage, and agents.
Automate Google NotebookLM from Python with the unofficial notebooklm-py library. Covers authentication, notebook and source management, summaries, FAQ generation, and audio podcast creation.
Store and search vector embeddings with the Qdrant Python client. Covers collections, CRUD, filtered vector search, payload indexing, batch upsert, sparse/dense hybrid search, and integrations.
Measure and improve RAG pipeline quality with ragas. Covers faithfulness, answer relevancy, context precision, context recall, dataset format, LLM judges, and CI integration.
Evaluate and monitor LLM applications with TruLens. Covers the RAG triad, feedback functions, TruChain, TruLlama, custom evaluators, the dashboard, and CI integration.
Store, search, and manage vector embeddings with the Weaviate Python client. Covers collections, CRUD, vector/hybrid/BM25 search, multi-tenancy, generative search, and batch import.