AI and Agents Course

Chapter 1: Foundations of Modern AI

SubtopicDetail
1.1 Evolution from Classical AI to Foundation ModelsBrief history: symbolic AI, statistical ML, deep learning, emergence of foundation & frontier models.
1.2 Key Concepts: Tokens, Embeddings, ParametersTokenization strategies, embedding spaces, scaling of parameters and compute laws.
1.3 Model Taxonomy: SLMs vs LLMs vs MLLMsSmall vs large language models, multimodal LLMs, trade-offs in capacity, latency, cost.
1.4 Core Capabilities & LimitationsReasoning, generation, hallucination, context windows, system prompts, reliability constraints.

Chapter 2: Small Language Models (SLMs)

SubtopicDetail
2.1 Definition & Roles of SLMsEdge deployment, on-device inference, privacy and latency benefits.
2.2 Distillation & QuantizationKnowledge distillation, 8/4-bit quantization, QLoRA concepts.
2.3 Popular Open SLM FamiliesPhi, Mistral small, TinyLlama, Gemma variants—capabilities & benchmarks.
2.4 When to Choose SLMs over LLMsCost modeling, latency thresholds, data governance, offline resilience.

Chapter 3: Large Language Models (LLMs)

SubtopicDetail
3.1 Architecture Recap & Scaling LawsTransformer blocks, depth vs width, Chinchilla scaling principles.
3.2 Instruction Tuning & AlignmentSFT, RLHF, DPO, safety layers, preference data pipelines.
3.3 Function Calling & Structured OutputsJSON mode, schema constraints, tool invocation protocols.
3.4 Cost, Latency & Throughput OptimizationBatching, caching, speculative decoding, dynamic routing.

Chapter 4: Multimodal & Vision-Language Models

SubtopicDetail
4.1 Multimodal Encoders & FusionImage, audio, video encoders; joint embedding spaces.
4.2 Vision-Language ArchitecturesBLIP, Flamingo, LLaVA style adaptors & projection heads.
4.3 Image & Document UnderstandingOCR integration, layout parsing, chart/table reasoning.
4.4 Multimodal Tool UseGrounding outputs with external perception & analysis tools.

Chapter 5: Tool Use & Function Calling

SubtopicDetail
5.1 Prompt Patterns for Tool InvocationSchema design, tool selection heuristics, reasoning scaffolds.
5.2 OpenAI / JSON / Schema ApproachesFunction calling schemas, JSON schema validation, error recovery.
5.3 Tool Error Handling & RetriesGuardrails, tool feedback loops, self-correction prompts.
5.4 Chaining Tools & PlanningSequential vs parallel tool execution design patterns (LangChain tool chains, LangGraph stateful graphs).

Chapter 6: Model Context Protocol (MCP) & Open Ecosystems

SubtopicDetail
6.1 Overview of MCPGoals: portability, interoperability, decoupling model & tools.
6.2 MCP Server & Client RolesResource provision, capability advertisement, session negotiation.
6.3 Integrating MCP with AgentsRouting requests, context expansion, secure tool boundaries.
6.4 Emerging Standards & ProtocolsOpenAI tool schemas, LangChain tool interfaces, LangGraph workflow graphs, Open RAG specs.

Chapter 7: Agent Foundations

SubtopicDetail
7.1 What is an AI Agent?Definitions: autonomy, goals, environment interaction.
7.2 Core Agent LoopPerceive → Reason → Act → Reflect cycle.
7.3 Prompt Engineering for AgentsSystem vs scratchpad vs reflection prompts.
7.4 Memory & State ManagementShort-term scratchpads, episodic & semantic memory stores.

Chapter 8: Agent Architectures I

SubtopicDetail
8.1 ReAct & Chain-of-ThoughtCombining reasoning traces with tool/action steps.
8.2 Reflexion & Self-CritiqueIterative refinement loops, critique generation patterns.
8.3 Tree of Thoughts / Graph SearchExploring reasoning branches (ToT), graph execution with LangGraph, pruning heuristics.
8.4 Multi-Agent Role SpecializationOrchestrators, planners, executors & evaluators.

Chapter 9: Agent Architectures II

SubtopicDetail
9.1 Hierarchical & Modular AgentsTask decomposition, sub-goal scheduling.
9.2 Planning with LLMsBackward vs forward planning, dynamic replanning.
9.3 Retrieval-Augmented AgentsDynamic context injection via vector stores.
9.4 Evaluator & Judge AgentsQuality scoring, safety checks, meta-reasoning roles.

Chapter 10: Retrieval Augmented Generation (RAG)

SubtopicDetail
10.1 Embeddings & Vector StoresEmbedding models, ANN indexes (FAISS, HNSW).
10.2 Chunking & Index ConstructionSemantic vs fixed-size chunking, metadata strategies.
10.3 Retrieval PipelinesHybrid search (BM25+vector), rerankers, filtering (LangChain retrievers & LangGraph orchestrations).
10.4 Advanced RAG PatternsQuery rewriting, multi-hop, tool-enriched retrieval.

Chapter 11: Hugging Face & Model Ecosystem

SubtopicDetail
11.1 Navigating the HubModel cards, datasets, spaces, licensing considerations.
11.2 Popular Foundation ModelsLLaMA, Mistral, Mixtral, GPT-NeoX, Gemma, T5 families.
11.3 Specialized Models (Vision, Audio)Whisper, CLIP, Stable Diffusion, Wav2Vec, SAM.
11.4 Fine-Tuning & LoRA AdaptationPEFT, adapters, parameter efficiency, eval tracking.

Chapter 12: Reasoning & Advanced Prompting

SubtopicDetail
12.1 Chain-of-Thought & DeliberateStructured reasoning traces for improved accuracy.
12.2 Program-Aided & Tool-Augmented ReasoningPAL, code interpreters, symbolic hybrids.
12.3 Self-Consistency & Majority VotingEnsembles of reasoning paths for reliability.
12.4 Prompt Compression & Context OptimizationWindow management, summarization, context distillation.

Chapter 13: Memory & Long-Term Context

SubtopicDetail
13.1 Episodic vs Semantic MemoryTemporal logs vs knowledge graphs.
13.2 Vector Memory StoresPersisting conversation embeddings & retrieval strategies.
13.3 Memory Consolidation PatternsSummarization cycles, forgetting strategies.
13.4 Context Window ExtensionsSliding windows, recurrence, external memory tools.

Chapter 14: Evaluation & Benchmarks

SubtopicDetail
14.1 Quantitative BenchmarksMMLU, GSM8K, BBH, MT-Bench: interpreting scores.
14.2 Human & AI Feedback LoopsPreference data, rubric scoring, judge models.
14.3 Agent-Specific EvaluationTask success, tool efficiency, autonomy metrics.
14.4 Regression & Monitoring DashboardsTracking drift, latency, cost, and quality trends.

Chapter 15: Safety, Ethics & Governance

SubtopicDetail
15.1 Prompt Injection & JailbreaksAttack surfaces, sanitization strategies.
15.2 Data Privacy & PII HandlingRedaction, classification, isolation of sensitive content.
15.3 Alignment & Content ModerationPolicy filters, refusal patterns, fallback flows.
15.4 Governance & AuditabilityAction logs, reproducibility, compliance frameworks.

Chapter 16: Deployment & Infrastructure

SubtopicDetail
16.1 Serving ArchitecturesGateway, model router, scaling replicas, autoscaling.
16.2 Caching & Session StrategiesKV caches, prefix reuse, partial decoding.
16.3 Observability & TelemetryTracing prompts, metrics, structured logging.
16.4 Cost Optimization TechniquesModel routing, tiered quality, re-ranking cascades.

Chapter 17: Optimization & Performance Engineering

SubtopicDetail
17.1 Inference OptimizationQuantization, pruning, speculative & streaming decode.
17.2 Hardware AccelerationGPUs vs TPUs vs NPUs; attention offloading.
17.3 Throughput & Concurrency ControlAdmission control, batching algorithms.
17.4 Latency DiagnosticsToken breakdown, network vs compute bottlenecks.

Chapter 18: Specialized Agent Modalities

SubtopicDetail
18.1 Code Agents & Execution SandboxesExecution safety, timeouts, state isolation.
18.2 Data Analysis & Notebook AgentsEDA automation, visualization toolchains.
18.3 Robotic & Embodied AgentsPerception-action loops, simulation-to-real gaps.
18.4 Autonomous Web & API AgentsBrowser automation, DOM plans, API chaining.

Chapter 19: Future Directions & Research Frontiers

SubtopicDetail
19.1 Continual & Lifelong LearningAdaptive fine-tuning, catastrophic forgetting mitigation.
19.2 Neuro-Symbolic & Hybrid SystemsSymbol grounding, reasoning augmentation.
19.3 Open Weights vs Closed ModelsEconomics, innovation velocity, community impact.
19.4 Autonomous Alignment & Self-EvaluationAgentic red teaming, reflective safety loops.

Chapter 20: Capstone & Integration

SubtopicDetail
20.1 Designing an End-to-End Agent SystemBlueprint: ingestion, memory, planning, tool layer.
20.2 Production Hardening ChecklistResilience, fallbacks, rate limiting, observability.
20.3 Case Studies & Reference ArchitecturesPatterns across customer support, analytics, automation.
20.4 Next Steps & Learning PathsOpen research problems, community resources, benchmarking.

Chapter 21: Practical LangChain/LangGraph Implementation Patterns

SubtopicDetail
21.1 LangChain Fundamentals & Core ComponentsChains, prompts, memory, retrievers, agents - building blocks and composition patterns.
21.2 LangGraph State Machines & Workflow DesignGraph-based agent flows, state management, conditional routing, human-in-the-loop patterns.
21.3 Advanced Integration PatternsCustom tools, multi-model routing, error handling, streaming responses, production deployment.
21.4 Performance Optimization & DebuggingLangSmith tracing, async execution, caching strategies, cost optimization techniques.
Start Course →