SAP AI & Agentic Architecture Center
Strategic and technical reference for enterprise architects, AI architects, and CIO/CAIO organisations. Covers SAP Business AI strategy, Joule Agents, multi-agent orchestration, enterprise agent patterns, AI governance, SAP Sapphire announcements, and canonical reference architectures.
SAP AI Strategy
How SAP positions Business AI as an embedded capability across every application, process, and platform layer — not as a standalone product.
SAP defines SAP Business AI as AI that is relevant (grounded in SAP business context and data), reliable (enterprise-grade with human controls), and responsible (governed, auditable, privacy-preserving). This is not a standalone AI product — it is AI embedded at every layer of the SAP Business Suite, delivered through SAP Joule and the SAP AI Foundation platform.
The strategic bet is that AI-powered ERP has a structural advantage over general-purpose AI: SAP systems hold the most complete, trusted representation of enterprise operations — purchase orders, financial ledgers, HR records, supply chains, customer contracts. AI that reasons over this data and can directly act on released APIs creates business value that generic LLMs cannot replicate. SAP calls this the AI-First ERP vision.
| SAP AI Layer | What it is | Personas | Commercial model |
|---|---|---|---|
| SAP Business AI | Embedded AI capabilities in SAP applications | Business users, process owners | Included in application licence |
| SAP Joule | Conversational AI copilot across SAP applications | End users, knowledge workers | RISE/GROW included; Joule Booster add-on |
| Joule Agents | Autonomous multi-step agents on Joule infrastructure | Power users, process experts | Joule Booster + AI Core (CPEA) |
| AI Foundation | SAP AI Core + Generative AI Hub + Launchpad | Developers, ML engineers, architects | RISE/GROW included (standard); CPEA for extended |
| SAP AI Core | MLOps platform — train, deploy, serve models | ML engineers, platform teams | CPEA (Resource Units + Inference Units) |
| Generative AI Hub | 20+ foundation models via SAP-managed endpoints | Developers, prompt engineers | CPEA (per 1K tokens per model) |
Agentic AI Overview
The shift from copilot assistance to autonomous goal-directed agents that plan, act, and adapt.
What is Agentic AI?
An AI Agent is a software system that combines a large language model with a set of tools and a reasoning loop. The LLM acts as a decision engine — it receives a goal, creates a plan, selects appropriate tools (API calls, database queries, calculations), observes results, and iterates until the goal is achieved or it needs human input.
SAP's implementation uses the ReAct pattern (Reason–Act–Observe) where the agent thinks step-by-step, calls one tool at a time, and refines its reasoning based on what the tool returns — grounded in SAP business context via the Orchestration Service.
Agent vs. Copilot (Assistant)
Autonomous Workflows
Agents execute workflows that previously required human coordination across systems. Example: month-end close — the agent queries all outstanding items, triggers reconciliation tasks, posts corrections after approval, and generates the close status report. The human role shifts from executor to approver.
Human-in-the-Loop (HITL)
SAP's design principle: agents autonomously read and analyse; humans explicitly approve writes. Confirmation gates are implemented via Joule confirmation cards, SAP Build Process Automation approval steps, or SAP Task Center tasks — all creating a traceable audit log.
Multi-Agent Systems
Complex goals are decomposed across specialised sub-agents running in parallel. An Orchestrator Agent (high-capability LLM) plans and delegates. Specialist Agents(optimised for cost/latency) execute: data retrieval, process execution, RAG knowledge lookups, and escalation routing — then the orchestrator synthesises the final response.
SAP Agent Architecture
How user intent flows from natural language through Joule, the agent reasoning engine, and the tool layer to SAP business systems.
End user in SAP Fiori Launchpad or Work Zone. Interacts in natural language through the embedded Joule panel.
SAP Joule classifies intent via the Skill Router and either dispatches to a pre-built skill or routes to a Joule Agent for complex goals.
LLM-powered reasoning engine (GPT-4o or Claude 3.5). Uses the ReAct loop: reasons about the next action, calls a tool, observes results, repeats.
Callable functions the agent uses: OData V4 APIs, CAP service functions, Build Process Automation workflows, HANA Vector search, Integration Suite iFlows.
SAP S/4HANA, SuccessFactors, Ariba — the systems of record. Only accessed via released, role-enforced APIs. Never bypassed via direct DB access.
Multi-Agent Orchestration Pattern
Joule Agents
Building autonomous agents on the Joule infrastructure using Joule Studio and the emerging Agent Builder.
Joule Agent Builder
The Joule Agent Builder (Preview — SAP Road Map) is a guided, low-code tool within Joule Studio for defining agent personas, goals, tool sets, and escalation rules. It generates the agent skeleton TypeScript code and registers the agent with the Joule Skill Catalogue so it appears in the Joule panel.
Agent Design Principles
- Single Responsibility
- Each agent is designed for one domain (Finance, HR, IT). Avoids cross-domain coupling and maintains clear security boundaries.
- Tool-First Design
- Define the tool set before writing agent logic. Tools are reusable across agents — a "Get PO Status" tool is shared by Finance, Procurement, and Reporting agents.
- Fail Safe by Default
- On any uncertainty, the agent surfaces to the user for clarification rather than taking a default action. No silent failures.
- Idempotent Write Tools
- All tool functions that modify data are designed to be idempotent — calling them twice has the same effect as once. Prevents double-posting.
- Minimal Privilege
- Agent service account has exactly the OData scopes needed. No wildcard roles. Audited separately from the human user scopes.
ctx.clarify().ctx.sessionContext. Long-term user preferences and delegation rules can be stored as persistent entities linked to the user's BTP identity.Enterprise Agent Patterns
Complete architecture, workflow, API, and security reference for five production-grade Joule Agent patterns.
The HR Agent handles employee queries across SuccessFactors — leave balances, payslip access, holiday calendars, and benefit enrolment — and can initiate leave requests subject to manager confirmation. It uses RAG over company HR policies to ground all answers in official documentation.
- Joule Studio skill with SuccessFactors OData V2 tools
- HANA Vector Engine — HR policy knowledge base
- SAP IAS Principal Propagation for employee identity
- Build Process Automation — leave approval workflow
- SAP Task Center — manager approval task
- 1Employee asks: "How many leave days do I have?"
- 2Agent calls SF OData API: GET /LeaveBalance
- 3Checks policy RAG: entitlement rules for employee type
- 4Responds with balance + policy context
- 5If booking: presents confirmation card for employee
- 6On confirm: POST /TimeOff → triggers manager workflow
- 7Manager approves in Task Center → agent notifies employee
- SF: GET /LeaveBalanceOData V2
- SF: POST /TimeOffOData V2
- SF: GET /PayStatementOData V2
- BPA: Trigger leave workflowREST
- Task Center: Create taskREST
- Principal Propagation: employee JWT to SuccessFactors
- SAP IAS: single sign-on, MFA enforced
- Least-privilege scope: only HR OData scopes
- PII masking in Orchestration Service
- Audit log: every API call logged with user + timestamp
The Procurement Agent manages the full purchase-to-pay cycle from within the Joule panel. It reads PO and PR status from S/4HANA, retrieves supplier data from Ariba, recommends vendors using Gen AI Hub reasoning over historical spend data, and routes approvals through configured approval workflows.
- S/4HANA OData V4: MM purchasing APIs
- SAP Ariba Network API for supplier data
- HANA Cloud: spend analytics + supplier scoring
- Gen AI Hub: vendor recommendation reasoning
- Build Process Automation: multi-level approval workflow
- 1User: "Create a PR for 50 Dell laptops under IT budget"
- 2Agent retrieves approved vendor list from Ariba
- 3LLM scores vendors by price/quality/delivery from HANA data
- 4Presents recommendation with justification to user
- 5User confirms vendor and quantity
- 6Agent calls POST /PurchaseRequisition in S/4HANA
- 7Triggers multi-level approval workflow via BPA
- S/4: GET /PurchaseOrderOData V4
- S/4: POST /PurchaseRequisitionOData V4
- Ariba: GET /SuppliersREST
- HANA: Spend analytics viewSQL / OData
- BPA: Approval workflow triggerREST
- Role-based purchasing limits enforced at API layer
- Ariba API key stored in BTP Destination Service (no hardcode)
- Spend limit validation before PR creation
- Dual-control: buyer creates, manager approves
- SAP GRC integration for compliance checks (roadmap)
The Finance Agent accelerates period-close activities by autonomously querying G/L accounts, identifying discrepancies, generating variance commentary using Gen AI Hub, and routing journal entry approvals through the finance controller. It replaces hours of manual query-and-report cycles.
- S/4HANA FI OData V4: G/L, controlling, cost centres
- SAP Analytics Cloud API: actuals vs. plan data
- Datasphere: consolidated financial data fabric
- Gen AI Hub (GPT-4o): variance narrative generation
- Orchestration Service: grounded on accounting policies
- 1CFO asks: "Summarise Q3 variance vs. plan by cost centre"
- 2Agent queries SAC API for actuals and plan data
- 3Retrieves cost centre structure from S/4HANA
- 4LLM analyses variance, generates narrative per cost centre
- 5RAG: grounds narrative against accounting policy docs
- 6Presents board-ready summary with top 3 variances
- 7CFO requests journal correction → triggers FI approval
- S/4: GET /GLAccountOData V4
- S/4: POST /JournalEntryOData V4
- SAC: GET /PlanDataREST API
- Datasphere: Consumption APIOData V4
- Gen AI Hub: Chat completionsAI API
- FI posting period controls enforced via S/4HANA authorisation
- Journal entries: maker-checker approval mandatory
- Sensitive financial data: Orchestration PII masking enabled
- SOD (Segregation of Duties): agent cannot both post and approve
- All G/L queries logged in SAP Security Audit Log
The IT Service Agent deflects Level 1 ITSM tickets through intelligent self-service. It searches the IT knowledge base via RAG, creates incidents if self-service fails, monitors ticket progress, and escalates to on-call engineers. Integrates with ITSM tools via SAP Integration Suite.
- SAP Integration Suite: ITSM connector (ServiceNow / Jira)
- HANA Vector Engine: IT knowledge base (KB articles)
- Gen AI Hub: solution recommendation with RAG grounding
- Build Process Automation: escalation workflow
- SAP Event Mesh: real-time ticket status notifications
- 1Employee: "My VPN is not connecting from home"
- 2Agent: RAG search of KB → finds VPN reset procedure
- 3Presents step-by-step fix with screenshots (if found)
- 4If unresolved: "Shall I create an incident?" → confirm
- 5POST incident via Integration Suite → ITSM tool
- 6Agent monitors via Event Mesh for status updates
- 7Notifies employee when ticket is assigned/resolved
- Integration Suite: POST /IncidentiFlow REST
- Integration Suite: GET /Ticket/{id}iFlow REST
- HANA Vector: KB searchSQL cosine similarity
- Event Mesh: ticket-updates topicAMQP / REST
- Gen AI Hub: Orchestration (RAG)AI API
- ITSM access via Integration Suite OAuth2 client credentials
- KB articles: role-filtered (no access to security-classified docs)
- Incident creation limited to reporting user's own assets
- Escalation: PagerDuty/on-call only via Integration Suite (not direct)
- Content filter: blocks requests to query other users' tickets
The Project Management Agent provides real-time project intelligence from SAP Project System and SAP PPM. It reads milestone progress, identifies at-risk deliverables using LLM reasoning, suggests corrective actions grounded in PMO policies, and creates status reports for programme governance.
- S/4HANA Project System OData V4 APIs
- Datasphere: consolidated project data + KPIs
- HANA Vector Engine: PMO policies and templates
- Gen AI Hub: risk narrative and corrective action
- SAP Analytics Cloud: programme dashboard data
- 1PMO asks: "Which projects are at risk this quarter?"
- 2Agent queries all active projects from PS APIs
- 3Calculates SPI/CPI (Schedule/Cost Performance Index)
- 4LLM identifies risk patterns, cross-references PMO policy (RAG)
- 5Generates risk register with root causes and recommendations
- 6Creates SAC dashboard update with narrative commentary
- 7Optionally: creates corrective action tasks in Work Zone
- S/4: GET /Project + MilestonesOData V4
- S/4: GET /WBSElementOData V4
- Datasphere: GET /ProjectKPIsOData V4
- SAC: GET /PlanVsActualREST API
- Work Zone: POST /TaskREST API
- Project data access: role-filtered to user's portfolio
- No cross-project data leakage in multi-tenant scenarios
- RAG knowledge: PMO policies filtered by classification level
- Report output: cannot include budget data without FI role
- Audit trail: every project query logged with requestor identity
AI Governance
SAP's framework for Responsible AI, security controls, auditability, data privacy, and regulatory compliance.
SAP Responsible AI Framework
SAP Responsible AI is built on five commitments, published by SAP and embedded into every Business AI capability. These are not aspirational statements — they map to specific technical controls in AI Core and the Orchestration Service.
Security Controls
- Principal Propagation
- User JWT forwarded from BTP to S/4HANA via Destination Service OAuth2 SAML Bearer. No service-account escalation for user-initiated actions.
- Least-Privilege API Scopes
- Agent service accounts hold only the OData scopes required for defined tools. Reviewed during Joule Studio security review.
- BTP Destination Service
- All credentials (API keys, OAuth client secrets) stored in BTP Destination Service. Never hardcoded in agent code or templates.
- Network Isolation
- AI Core and Gen AI Hub run in SAP-managed VPCs. Private Link available for S/4HANA on-premise connectivity. No public internet exposure of backend APIs.
- Content Filtering
- Azure AI Content Safety filters on all LLM input/output via Orchestration Service. Configurable per deployment.
Auditability
- SAP Security Audit Log
- All Joule skill invocations and agent API calls logged in S/4HANA Security Audit Log with user, timestamp, and action.
- BTP Audit Log Service
- BTP-side agent actions logged in the BTP Audit Log Service — available for SIEM integration via Cloud Logging Service.
- HITL Audit Trail
- Every human confirmation (approve / reject / modify) creates an immutable record in Build Process Automation or Task Center.
- Token Usage Tracking
- Gen AI Hub tracks token consumption per resource group and user context. Supports chargeback reporting by cost centre.
Data Privacy
Data residency: All Gen AI Hub inference occurs within SAP-operated infrastructure in the configured BTP region. Prompts are never forwarded directly to model vendor endpoints — SAP is the data controller.
PII masking: The Orchestration Service applies PII detection and anonymisation to prompts before LLM calls. De-anonymisation applied to the response before returning to the user. Masked PII never exposed to model providers.
Data Processing Agreement: SAP Gen AI Hub is covered by SAP's standard DPA. Customer data is not used for model retraining by any third-party model provider under SAP's agreements.
Retention: Conversation logs in HANA Cloud subject to configurable retention policies. No permanent retention of prompt content by SAP AI infrastructure.
Compliance
- EU AI Act
- SAP classifies most SAP Business AI capabilities as Limited Risk (transparency obligations). SAP provides compliance documentation for customers' own AI Act assessments.
- GDPR / Data Protection
- Data residency in EU BTP regions (EU10, EU11). PII masking. Data subject rights supported via SAP Privacy Governance.
- SOX / Financial Controls
- Finance Agent patterns include segregation of duties controls. Maker-checker enforcement at the OData API layer.
- ISO 27001 / SOC 2
- SAP AI Core and Gen AI Hub are in scope for SAP's BTP ISO 27001 and SOC 2 Type II certifications. Documentation available via SAP Trust Center.
- Industry-Specific
- SAP Healthcare, Financial Services, and Public Sector industry clouds have additional compliance controls layered over AI Foundation services.
SAP Sapphire & Key AI Announcements
Availability status for major SAP AI capabilities announced at SAP Sapphire 2025 and through recent SAP Road Map updates.
AI Reference Architectures
Five canonical enterprise AI architectures for SAP solutions — from S/4HANA+Joule to Business Data Cloud+Joule.
S/4HANA + Joule
Production-ready architecture for Joule-enabled SAP S/4HANA — on-premise, RISE (PCE), or GROW (Public Cloud).
Standard Joule deployment for RISE/GROW customers. All pre-built skills (Finance, HR, Procurement) plus optional custom skills via Joule Studio.
Joule only calls released OData V4 APIs. No ABAP modifications. S/4HANA 2023+ required. BTP connectivity via Cloud Connector (on-prem) or internal (PCE).
Principal Propagation via BTP Destination Service. User JWT exchanged for SAML assertion. S/4HANA authorisation enforced at API layer — no elevation.
CAP + Joule
Custom Joule Agent backed by a CAP service — for domain-specific business logic and composite scenarios.
Reference Architecture B — CAP + Joule Agent Flow
CAP service exposes typed CDS action as a Joule tool. Handles business logic, S/4HANA OData calls, and data transformation — the agent calls it as a single action.
The Joule Agent tool definition maps to the CAP action URL. Tool schema (JSON Schema) is defined in the Joule Studio skill descriptor.
Joule → CAP: XSUAA JWT (BTP user identity). CAP → S/4HANA: Principal Propagation via Destination Service. CAP XSUAA scopes enforce access.
CAP on Cloud Foundry or Kyma. Multi-tenant with IAS tenant routing. AI Core skill container separately deployed and registered in Joule Studio.
Work Zone + Joule
Digital workplace experience with Joule embedded — Task Center, approval workflows, and notification-driven agent handoffs.
Reference Architecture C — Work Zone + Joule Agent Flow
SAP Build Work Zone (Advanced Edition) is the single entry point. Joule panel appears across all apps in the launchpad. Task Center aggregates tasks from S/4HANA, SuccessFactors, and BPA workflows.
Joule Agents initiated from Work Zone operate in the context of the active Work Zone tile or My Inbox task. Context (document ID, process step) is passed to the agent via ctx.appContext.
AI Core + Joule
Joule Studio custom skills with AI Core as the execution engine — for RAG-grounded, LLM-powered agent capabilities.
Reference Architecture D — AI Core + Joule Skill Execution
Joule Studio skills run as Docker containers on AI Core compute. The skill TypeScript handler calls ctx.callApi() for OData, ctx.searchKnowledge() for RAG, and ctx.generateResponse() for LLM calls.
The Orchestration Service provides the RAG pipeline: embed query → HANA Vector search → inject context → LLM call → content filter. Skills call the Orchestration endpoint, not the LLM directly.
Skill handlers can override model selection per call — GPT-4o for complex reasoning, Claude 3.5 for document analysis, GPT-4o mini for low-latency responses. All routed via Gen AI Hub.
Business Data Cloud + Joule
AI-powered analytics and natural language data access — Joule over Datasphere and SAC for strategic decision support.
Business Data Cloud (BDC) creates a unified, governed data fabric from S/4HANA, SuccessFactors, and external sources. Datasphere handles integration and semantic modelling; SAC provides BI and planning.
Joule provides natural language query over the BDC data model. The Gen AI Hub Orchestration Service retrieves relevant fact data and business definitions from the HANA Vector Engine before LLM reasoning.
Executive Q&A: "What drove the EBITDA miss in Q3?" Report narration (SAC Smart Insights). Predictive planning assistance. Cross-domain analysis combining HR, Finance, and Supply Chain data.
Reference Architecture Selection Guide
Licensing
Commercial model for SAP Business AI, Joule, AI Foundation, AI Core, and Generative AI Hub.
Joule
SAP's generative AI copilot embedded across SAP applications — providing natural language interaction for navigation, transactions, insights, and code generation across the SAP portfolio.
Core Joule skills included in RISE with SAP. Joule Booster (additional skill pack) is a separate entitlement for RISE customers. Standalone access requires SAP AI Business Services licensing.
AI Core
SAP's MLOps service on SAP BTP — providing infrastructure for AI model training, deployment, serving, and lifecycle management including access to the Generative AI Hub.
CPEA consumption-based: Resource Units for model training/serving, Inference Units for production AI workloads. Storage charged separately.
Generative AI Hub
SAP's curated access point for 20+ foundation models (GPT-4o, Claude, Gemini, Llama, DALL-E, and SAP-specific models) — with data privacy, usage tracking, and SAP context grounding.
Access via SAP AI Core (Standard plan). Token consumption billed per model per 1,000 tokens. All inference processed within SAP-operated infrastructure for data sovereignty.
SAP Business AI — Licensing by Deployment Model
Capability | RISE with SAPS/4HANA Private CloudGenerally Available | GROW with SAPS/4HANA Public CloudGenerally Available | Joule BoosterRISE/GROW add-onGenerally Available | CPEA (AI Core / Gen AI Hub)Consumption-basedGenerally Available |
|---|---|---|---|---|
| SAP Joule — Core Skills | ||||
| Joule panel in Fiori Launchpad | n/a | |||
| S/4HANA Finance skills (11 GA) | n/a | |||
| SuccessFactors HR skills | n/a | |||
| Joule in Work Zone | Work Zone licence | Work Zone licence | Booster includes | n/a |
| Joule Booster Features | ||||
| Extended skill packs (Procurement, Sales) | n/a | |||
| Custom Joule skills (Joule Studio) | n/a | |||
| Joule Agent Builder (Preview) | Planned | n/a | ||
| Automated BTP provisioning (Booster) | n/a | |||
| AI Foundation (Platform) | ||||
| SAP AI Core — Standard plan | Included (limited) | Included (limited) | Extended via Booster | Full (CPEA) |
| Gen AI Hub — model access | Included (limited) | Included (limited) | Extended model access | Full 20+ models |
| Orchestration Service (RAG) | Included | Included | Included | Included |
| SAP AI Launchpad | Separate sub. | Separate sub. | Separate sub. | Separate sub. |
| Custom AI / MLOps (CPEA) | ||||
| AI Core training executions (GPU) | n/a | n/a | n/a | Resource Units |
| Inference deployments (serving) | n/a | n/a | n/a | Inference Units |
| Token consumption (LLM calls) | n/a | n/a | n/a | Per 1K tokens / model |
| BYOM fine-tuning | n/a | n/a | n/a | AI Core Standard required |
AI Units (AIU) — Consumption Metric
AI Units (AIU) are the SAP commercial unit for Business AI services consumed under CPEA. Different AI actions consume different quantities of AI Units:
- LLM inference (Gen AI Hub): measured per 1,000 tokens, rate varies by model (GPT-4o vs. Llama 3)
- AI Core training (GPU compute): measured in Resource Units per GPU-hour
- Inference serving (model deployment): measured in Inference Units per pod-hour
- Vector search (HANA Vector Engine): included in HANA Cloud licence — no separate AI Unit charge
- Orchestration Service: no additional AI Unit charge beyond the underlying LLM token cost