A literature note for keeping up with AI agent architectures and workflows, which seem to change daily. Occasionally, I write a point-in-time summary to establish a baseline for where we are as an industry:

- [[From Buzz to Building - Introduction to GenAI for Developers - Part 2 - The Technical Stack]]
- [[A High-level Overview of the AI Agent Technical Stack in Early 2026]]

# How Did We Get Here?

![[image-32.png]]

The billion-dollar question: what is phase 5?

# AI Agent Architectures

## Single Agent

### Reason Act (ReAct)

### Reasoning and Acting through Scratchpad Examples (RAISE)

## Multi-Agent

### Centralized vs. decentralized

### Holonic

### Coalition

# Architecture Components

## Orchestrator

### Orchestration Approaches

#### Role-based Orchestration

#### Task-based Orchestration

### Orchestrator Types

#### Rule-based Orchestrators

#### LLM-based Orchestrators

## Communication

## The Brain

### Cognitive Mindset

#### Reasoning

#### Planning

#### Knowledge Retrieval

#### Optimization

## Memory

### Prompt Caching

https://x.com/trq212/status/2024574133011673516

# Workflows and Orchestration Approaches

- Thread-Based (i.e., Process-based) Workflow ([AGENT THREADS](https://www.youtube.com/watch?v=-WBHNFAB0OE))
    - Base Thread: Your fundamental unit of work
    - P Thread: Parallel execution for scaling output
    - C Thread: Chained work for production-sensitive tasks
    - F Thread: Fusion threads for rapid prototyping and confidence
    - B Thread: Meta structures with agents prompting agents
    - L Thread: Long duration, high autonomy workflows
- The Coordinator
- Sequential
- Iterative Refinement between 2 agents

## Coding Workflows

- Plan -> Build -> Review -> Fix

![[image-31.png]]

# Technical Stack

# Sources

-