mirror of https://github.com/langgenius/dify.git synced 2026-03-05 07:37:07 +08:00

Files

GareArc bd898968b1 refactor(telemetry): migrate to type-safe enum-based event routing with centralized enterprise filtering

Changes:
- Change TelemetryEvent.name from str to TraceTaskName enum for type safety
- Remove hardcoded trace_task_name_map from facade (no mapping needed)
- Add centralized enterprise-only filter in TelemetryFacade.emit()
- Rename is_telemetry_enabled() to is_enterprise_telemetry_enabled()
- Update all 11 call sites to pass TraceTaskName enum values
- Remove redundant enterprise guard from draft_trace.py
- Add unit tests for TelemetryFacade.emit() routing (6 tests)
- Add unit tests for TraceQueueManager telemetry guard (5 tests)
- Fix test fixture scoping issue for full test suite compatibility
- Fix tenant_id handling in agent tool callback handler

Benefits:
- 100% type-safe: basedpyright catches errors at compile time
- No string literals: eliminates entire class of typo bugs
- Single point of control: centralized filtering in facade
- All guards removed except facade
- Zero regressions: 4887 tests passing

Verification:
- make lint: PASS
- make type-check: PASS (0 errors, 0 warnings)
- pytest: 4887 passed, 8 skipped

2026-02-05 15:12:02 -08:00

entities

chore: use from __future__ import annotations (#30254 )

2026-01-06 23:57:20 +09:00

graph

chore: use from __future__ import annotations (#30254 )

2026-01-06 23:57:20 +09:00

graph_engine

refactor(telemetry): migrate to type-safe enum-based event routing with centralized enterprise filtering

2026-02-05 15:12:02 -08:00

graph_events

Enhanced GraphEngine Pause Handling (#28196 )

2025-11-26 19:59:34 +08:00

node_events

Feat/support multimodal embedding (#29115 )

2025-12-09 14:41:46 +08:00

nodes

feat(telemetry): add input/output token split to enterprise OTEL traces

2026-02-03 19:27:11 -08:00

repositories

chore: use from __future__ import annotations (#30254 )

2026-01-06 23:57:20 +09:00

runtime

chore: use from __future__ import annotations (#30254 )

2026-01-06 23:57:20 +09:00

utils

fix: fix numeric type conversion issue in if-else condition comparison (#28155 )

2025-11-21 12:58:08 +08:00

__init__.py

FEAT: NEW WORKFLOW ENGINE (#3160 )

2024-04-08 18:51:46 +08:00

constants.py

feat: knowledge pipeline (#25360 )

2025-09-18 12:49:10 +08:00

conversation_variable_updater.py

remove bare list, dict, Sequence, None, Any (#25058 )

2025-09-06 03:32:23 +08:00

enums.py

feat(telemetry): add input/output token split to enterprise OTEL traces

2026-02-03 19:27:11 -08:00

errors.py

feat: knowledge pipeline (#25360 )

2025-09-18 12:49:10 +08:00

README.md

feat(graph-engine): make layer runtime state non-null and bound early (#30552 )

2026-01-05 16:43:42 +08:00

system_variable.py

chore: use from __future__ import annotations (#30254 )

2026-01-06 23:57:20 +09:00

variable_loader.py

feat(graph_engine): Support pausing workflow graph executions (#26585 )

2025-10-19 21:33:41 +08:00

workflow_entry.py

fix: use node factory for single-step workflow nodes (#30859 )

2026-01-13 10:11:18 +08:00

workflow_type_encoder.py

feat: knowledge pipeline (#25360 )

2025-09-18 12:49:10 +08:00

README.md

Workflow

Project Overview

This is the workflow graph engine module of Dify, implementing a queue-based distributed workflow execution system. The engine handles agentic AI workflows with support for parallel execution, node iteration, conditional logic, and external command control.

Architecture

Core Components

The graph engine follows a layered architecture with strict dependency rules:

Graph Engine (graph_engine/) - Orchestrates workflow execution
- Manager - External control interface for stop/pause/resume commands
- Worker - Node execution runtime
- Command Processing - Handles control commands (abort, pause, resume)
- Event Management - Event propagation and layer notifications
- Graph Traversal - Edge processing and skip propagation
- Response Coordinator - Path tracking and session management
- Layers - Pluggable middleware (debug logging, execution limits)
- Command Channels - Communication channels (InMemory, Redis)
Graph (graph/) - Graph structure and runtime state
- Graph Template - Workflow definition
- Edge - Node connections with conditions
- Runtime State Protocol - State management interface
Nodes (nodes/) - Node implementations
- Base - Abstract node classes and variable parsing
- Specific Nodes - LLM, Agent, Code, HTTP Request, Iteration, Loop, etc.
Events (node_events/) - Event system
- Base - Event protocols
- Node Events - Node lifecycle events
Entities (entities/) - Domain models
- Variable Pool - Variable storage
- Graph Init Params - Initialization configuration

Key Design Patterns

Command Channel Pattern

External workflow control via Redis or in-memory channels:

# Send stop command to running workflow
channel = RedisChannel(redis_client, f"workflow:{task_id}:commands")
channel.send_command(AbortCommand(reason="User requested"))

Layer System

Extensible middleware for cross-cutting concerns:

engine = GraphEngine(graph)
engine.layer(DebugLoggingLayer(level="INFO"))
engine.layer(ExecutionLimitsLayer(max_nodes=100))

engine.layer() binds the read-only runtime state before execution, so layer hooks can assume graph_runtime_state is available.

Event-Driven Architecture

All node executions emit events for monitoring and integration:

NodeRunStartedEvent - Node execution begins
NodeRunSucceededEvent - Node completes successfully
NodeRunFailedEvent - Node encounters error
GraphRunStartedEvent/GraphRunCompletedEvent - Workflow lifecycle

Variable Pool

Centralized variable storage with namespace isolation:

# Variables scoped by node_id
pool.add(["node1", "output"], value)
result = pool.get(["node1", "output"])

Import Architecture Rules

The codebase enforces strict layering via import-linter:

Workflow Layers (top to bottom):
- graph_engine → graph_events → graph → nodes → node_events → entities
Graph Engine Internal Layers:
- orchestration → command_processing → event_management → graph_traversal → domain
Domain Isolation:
- Domain models cannot import from infrastructure layers
Command Channel Independence:
- InMemory and Redis channels must remain independent

Common Tasks

Adding a New Node Type

Create node class in nodes/<node_type>/
Inherit from BaseNode or appropriate base class
Implement _run() method
Register in nodes/node_mapping.py
Add tests in tests/unit_tests/core/workflow/nodes/

Implementing a Custom Layer

Create class inheriting from Layer base
Override lifecycle methods: on_graph_start(), on_event(), on_graph_end()
Add to engine via engine.layer()

Debugging Workflow Execution

Enable debug logging layer:

debug_layer = DebugLoggingLayer(
    level="DEBUG",
    include_inputs=True,
    include_outputs=True
)