ChiMod Developer Guide

Overview
Architecture
Coding Style
Module Structure
Configuration and Code Generation
Task Development
Synchronization Primitives
Pool Query and Task Routing
Client-Server Communication
Memory Management
- CHI_IPC Buffer Allocation
- Task Data Structures
Build System Integration
External ChiMod Development
Example Module

Overview

Chimaera modules (ChiMods) are dynamically loadable components that extend the runtime with new functionality. Each module consists of:

Client library: Minimal code for task submission from user processes
Runtime library: Server-side execution logic
Task definitions: Shared structures for client-server communication
Configuration: YAML metadata describing the module

Header Organization: All ChiMod headers are organized under the namespace directory structure (include/[namespace]/[module_name]/) to provide clear namespace separation and prevent header conflicts.

Architecture

Core Principles

Client-Server Separation: Clients only submit tasks; runtime handles all logic
Shared Memory Communication: Tasks are allocated in shared memory segments
Task-Based Processing: All operations are expressed as tasks with methods
Zero-Copy Design: Data stays in shared memory; only pointers are passed

Key Components

ChiMod/
├── include/
│   └── [namespace]/
│       └── MOD_NAME/
│           ├── MOD_NAME_client.h     # Client API
│           ├── MOD_NAME_runtime.h    # Runtime container
│           ├── MOD_NAME_tasks.h      # Task definitions
│           └── autogen/
│               └── MOD_NAME_methods.h    # Method constants
├── src/
│   ├── MOD_NAME_client.cc        # Client implementation
│   ├── MOD_NAME_runtime.cc       # Runtime implementation
│   └── autogen/
│       └── MOD_NAME_lib_exec.cc  # Auto-generated virtual method implementations
├── chimaera_mod.yaml              # Module configuration
└── CMakeLists.txt                 # Build configuration

Include Directory Structure:

All ChiMod headers are organized under the namespace directory
Structure: include/[namespace]/[module_name]/
Example: Admin headers are in include/[namespace]/admin/ (where [namespace] is the namespace from chimaera_repo.yaml)
Headers follow naming pattern: [module_name]_[type].h
Auto-generated headers are in the autogen/ subdirectory
Note: The namespace comes from chimaera_repo.yaml and the chimod directory name doesn't need to match the namespace

Coding Style

General Guidelines

Namespace: All module code under chimaera::MOD_NAME
Naming Conventions:
- Classes: PascalCase (e.g., CustomTask)
- Methods: PascalCase for public, camelCase for private
- Variables: snake_case_ with trailing underscore for members
- Constants: kConstantName
- Enums: kEnumValue
Header Guards: Use #ifndef MOD_NAME_COMPONENT_H_
Includes: System headers first, then library headers, then local headers
Comments: Use Doxygen-style comments for public APIs

Code Formatting

namespace chimaera::MOD_NAME {

/**
 * Brief description
 * 
 * Detailed description if needed
 * @param param_name Parameter description
 * @return Return value description
 */
class ExampleClass {
 public:
  // Public methods
  void PublicMethod();
  
 private:
  // Private members with trailing underscore
  u32 member_variable_;
};

}  // namespace chimaera::MOD_NAME

Module Structure

Task Definition (MOD_NAME_tasks.h)

Task definition patterns:

CreateParams Structure: Define configuration parameters for pool creation
- Uses cereal serialization for programmatic creation (direct API calls)
- Implements LoadConfig() for declarative creation (compose mode from YAML)
- Must define chimod_lib_name static string for module loading
CreateTask Template: Use GetOrCreatePoolTask template for container creation (non-admin modules)
Custom Tasks: Define custom tasks with SHM/Emplace constructors and HSHM data members

#ifndef MOD_NAME_TASKS_H_
#define MOD_NAME_TASKS_H_

#include <chimaera/chimaera.h>
#include <[namespace]/MOD_NAME/autogen/MOD_NAME_methods.h>
// Include admin tasks for GetOrCreatePoolTask
#include <[namespace]/admin/admin_tasks.h>

namespace chimaera::MOD_NAME {

/**
 * CreateParams for MOD_NAME chimod
 * Contains configuration parameters for MOD_NAME container creation
 */
struct CreateParams {
  // MOD_NAME-specific parameters
  std::string config_data_;
  chi::u32 worker_count_;

  // Required: chimod library name for module manager
  static constexpr const char* chimod_lib_name = "chimaera_MOD_NAME";

  // Default constructor
  CreateParams() : worker_count_(1) {}

  // Constructor with parameters
  CreateParams(const std::string& config_data = "",
               chi::u32 worker_count = 1)
      : config_data_(config_data), worker_count_(worker_count) {}

  // Serialization support for cereal
  template<class Archive>
  void serialize(Archive& ar) {
    ar(config_data_, worker_count_);
  }

  /**
   * Load configuration from PoolConfig (for compose mode)
   * Required for compose feature support
   * @param pool_config Pool configuration from compose section
   */
  void LoadConfig(const chi::PoolConfig& pool_config) {
    // Parse YAML config string
    YAML::Node config = YAML::Load(pool_config.config_);

    // Load module-specific parameters from YAML
    if (config["config_data"]) {
      config_data_ = config["config_data"].as<std::string>();
    }
    if (config["worker_count"]) {
      worker_count_ = config["worker_count"].as<chi::u32>();
    }
  }
};

/**
 * CreateTask - Initialize the MOD_NAME container
 * Type alias for GetOrCreatePoolTask with CreateParams (uses kGetOrCreatePool method)
 * Non-admin modules should use GetOrCreatePoolTask instead of BaseCreateTask
 */
using CreateTask = chimaera::admin::GetOrCreatePoolTask<CreateParams>;

CreateParams Dual-Mode System

CreateParams supports two modes of pool creation, controlled internally by BaseCreateTask::GetParams():

Programmatic mode (default): When you call AsyncCreate() from the client API, CreateParams is serialized with cereal and sent as binary data. GetParams() deserializes it directly.
Compose mode: When pools are created from YAML configuration (via the compose feature), GetParams() detects the compose flag and instead deserializes a chi::PoolConfig struct, then calls your LoadConfig() method to populate the parameters from the YAML config string.

// Internal logic in BaseCreateTask::GetParams() (admin_tasks.h)
CreateParamsT GetParams() const {
  if (do_compose_) {
    // Compose mode: deserialize PoolConfig, then call LoadConfig
    chi::PoolConfig pool_config =
        chi::Task::Deserialize<chi::PoolConfig>(chimod_params_);
    CreateParamsT params;
    params.LoadConfig(pool_config);
    return params;
  } else {
    // Programmatic mode: direct cereal deserialization
    return chi::Task::Deserialize<CreateParamsT>(chimod_params_);
  }
}

The chi::PoolConfig struct carries the YAML compose entry:

mod_name_ - ChiMod library name (e.g., "chimaera_bdev")
pool_name_ - Pool identifier or file path
pool_id_ - Pool ID
pool_query_ - Scheduling query (dynamic or local)
config_ - Remaining YAML fields as a raw string for LoadConfig() to parse
restart_ - Whether the pool should be restarted

This means every CreateParams must implement both serialize() (for programmatic mode) and LoadConfig() (for compose mode). See the Compose Configuration Feature section for full details on the YAML format and usage.

/**
 * Custom operation task
 */
struct CustomTask : public chi::Task {
  // Task-specific data using standard types
  INOUT std::string data_;        // Input/output string
  IN chi::u32 operation_id_;      // Input parameter
  OUT chi::u32 result_code_;      // Output result

  // Default constructor
  CustomTask()
      : chi::Task(),
        operation_id_(0),
        result_code_(0) {}

  // Emplace constructor
  explicit CustomTask(
      const chi::TaskId &task_id,
      const chi::PoolId &pool_id,
      const chi::PoolQuery &pool_query,
      const std::string &data,
      chi::u32 operation_id)
      : chi::Task(task_id, pool_id, pool_query, Method::kCustom),
        data_(data),
        operation_id_(operation_id),
        result_code_(0) {
    task_id_ = task_id;
    pool_id_ = pool_id;
    method_ = Method::kCustom;
    task_flags_.Clear();
    pool_query_ = pool_query;
  }
};

}  // namespace chimaera::MOD_NAME

#endif  // MOD_NAME_TASKS_H_

Client Implementation (MOD_NAME_client.h/cc)

The client provides an async-only API for task submission. All operations return chi::Future<TaskType> objects:

#ifndef MOD_NAME_CLIENT_H_
#define MOD_NAME_CLIENT_H_

#include <chimaera/chimaera.h>
#include <[namespace]/MOD_NAME/MOD_NAME_tasks.h>

namespace chimaera::MOD_NAME {

class Client : public chi::ContainerClient {
 public:
  Client() = default;
  explicit Client(const chi::PoolId& pool_id) { Init(pool_id); }

  /**
   * Async Create operation - returns Future for task completion
   * Caller must call task.Wait() and check GetReturnCode()
   */
  chi::Future<CreateTask> AsyncCreate(
      const chi::PoolQuery& pool_query,
      const std::string& pool_name,
      const chi::PoolId& custom_pool_id,
      const CreateParams& params = CreateParams()) {
    auto* ipc_manager = CHI_IPC;

    // CRITICAL: CreateTask MUST use admin pool for GetOrCreatePool processing
    auto task = ipc_manager->NewTask<CreateTask>(
        chi::CreateTaskId(),
        chi::kAdminPoolId,  // Always use admin pool for CreateTask
        pool_query,
        CreateParams::chimod_lib_name,  // ChiMod name from CreateParams
        pool_name,              // User-provided pool name
        custom_pool_id,         // Target pool ID to create
        this,                   // Client pointer for PostWait callback
        params);                // CreateParams with configuration

    // Submit to runtime and return Future
    return ipc_manager->Send(task);
  }

  /**
   * Async Custom operation - example of a typical async method
   */
  chi::Future<CustomTask> AsyncCustom(
      const chi::PoolQuery& pool_query,
      const std::string& input_data,
      chi::u32 operation_id) {
    auto* ipc_manager = CHI_IPC;
    auto task = ipc_manager->NewTask<CustomTask>(
        chi::CreateTaskId(),
        pool_id_,           // Use client's pool_id_ for non-Create operations
        pool_query,
        input_data,
        operation_id);
    return ipc_manager->Send(task);
  }
};

}  // namespace chimaera::MOD_NAME

#endif  // MOD_NAME_CLIENT_H_

Usage Pattern:

// Create client and initialize
chimaera::MOD_NAME::Client client;
const chi::PoolId pool_id(7000, 0);

// Async create
auto create_task = client.AsyncCreate(chi::PoolQuery::Dynamic(), "my_pool", pool_id);
create_task.Wait();

if (create_task->GetReturnCode() != 0) {
    std::cerr << "Create failed" << std::endl;
    return;
}

// Perform operations
auto op_task = client.AsyncCustom(chi::PoolQuery::Local(), "data", 1);
op_task.Wait();

// Access results
std::cout << "Result: " << op_task->result_code_ << std::endl;

ChiMod CreateTask Pool Assignment Requirements

CRITICAL: All ChiMod clients implementing Create functions MUST use the explicit chi::kAdminPoolId variable when constructing CreateTask operations. You CANNOT use pool_id_ for CreateTask operations.

Why This is Required

CreateTask operations are actually GetOrCreatePoolTask operations that must be processed by the admin ChiMod to create or find the target pool. The pool_id_ variable is not initialized until after the Create operation completes successfully.

Correct Usage

// CORRECT: Always use chi::kAdminPoolId for CreateTask
auto task = ipc_manager->NewTask<CreateTask>(
    chi::CreateTaskId(),
    chi::kAdminPoolId,          // REQUIRED: Use admin pool for CreateTask
    pool_query,
    CreateParams::chimod_lib_name,
    pool_name,
    pool_id_,                   // Target pool ID to create
    params);

Incorrect Usage

// WRONG: Never use pool_id_ for CreateTask operations
auto task = ipc_manager->NewTask<CreateTask>(
    chi::CreateTaskId(),
    pool_id_,                   // WRONG: pool_id_ is not initialized yet
    pool_query,
    CreateParams::chimod_lib_name,
    pool_name,
    pool_id_,
    params);

Key Points

Admin Pool Processing: CreateTask is a GetOrCreatePoolTask that must be handled by the admin pool
Uninitialized Variable: pool_id_ is not set until after Create completes
Universal Requirement: This applies to ALL ChiMod clients, including admin, bdev, and custom modules
Create Responsibility: Create operations are responsible for allocating new pool IDs using the admin pool

ChiMod Name Requirements

CRITICAL: All ChiMod clients MUST use CreateParams::chimod_lib_name instead of hardcoding module names.

Correct Usage

// CORRECT: Use CreateParams::chimod_lib_name
auto task = ipc_manager->NewTask<CreateTask>(
    chi::CreateTaskId(),
    chi::kAdminPoolId,
    pool_query,
    CreateParams::chimod_lib_name,  // Dynamic reference to CreateParams
    pool_name,
    pool_id,
    params);

Incorrect Usage

// WRONG: Never hardcode module names
auto task = ipc_manager->NewTask<CreateTask>(
    chi::CreateTaskId(),
    chi::kAdminPoolId,
    pool_query,
    "chimaera_MOD_NAME",  // Hardcoded name breaks flexibility
    pool_name,
    pool_id,
    params);

Why This is Required

Namespace Flexibility: Allows ChiMods to work with different namespace configurations
Single Source of Truth: The module name is defined once in CreateParams
External ChiMods: Essential for external ChiMods using custom namespaces
Maintainability: Changes to module names only require updating CreateParams

Implementation Pattern

All non-admin ChiMods using GetOrCreatePoolTask must follow this pattern:

// In AsyncCreate method
auto task = ipc_manager->NewTask<CreateTask>(
    chi::CreateTaskId(),
    chi::kAdminPoolId,                    // Always use admin pool
    pool_query,
    CreateParams::chimod_lib_name,        // REQUIRED: Use static member
    pool_name,                            // Pool identifier
    pool_id,                              // Target pool ID
    /* ...CreateParams arguments... */);

Note: The admin ChiMod uses BaseCreateTask directly and doesn't require the chimod name parameter.

Runtime Container (MOD_NAME_runtime.h/cc)

The runtime container executes tasks server-side:

#ifndef MOD_NAME_RUNTIME_H_
#define MOD_NAME_RUNTIME_H_

#include <chimaera/chimaera.h>
#include <[namespace]/MOD_NAME/MOD_NAME_tasks.h>

namespace chimaera::MOD_NAME {

class Container : public chi::Container {
 public:
  Container() = default;
  ~Container() override = default;

  /**
   * Initialize container with pool information (REQUIRED)
   * This is called by the framework before Create is called
   */
  void Init(const chi::PoolId& pool_id, const std::string& pool_name) override {
    // Call base class initialization
    chi::Container::Init(pool_id, pool_name);

    // Initialize the client for this ChiMod
    client_ = Client(pool_id);
  }

  /**
   * Create the container (Method::kCreate)
   * This method creates queues and sets up container resources
   * NOTE: Container is already initialized via Init() before Create is called
   */
  void Create(hipc::FullPtr<CreateTask> task, chi::RunContext& ctx) {
    // Container is already initialized via Init() before Create is called
    // Do NOT call Init() here

    // Additional container-specific initialization logic here
    std::cout << "Container created and initialized for pool: " << pool_name_
              << " (ID: " << pool_id_ << ")" << std::endl;
  }

  /**
   * Custom operation (Method::kCustom)
   */
  void Custom(hipc::FullPtr<CustomTask> task, chi::RunContext& ctx) {
    // Process the operation
    std::string result = processData(task->data_,
                                    task->operation_id_);
    task->data_ = result;
    task->result_code_ = 0;
    // Task completion is handled by the framework
  }

 private:
  std::string processData(const std::string& input, u32 op_id) {
    // Business logic here
    return input + "_processed";
  }
};

}  // namespace chimaera::MOD_NAME

// Define ChiMod entry points using CHI_TASK_CC macro
CHI_TASK_CC(chimaera::MOD_NAME::Container)

#endif  // MOD_NAME_RUNTIME_H_

Execution Modes and Dynamic Scheduling

The Chimaera runtime supports two execution modes through the ExecMode enum in RunContext, enabling sophisticated task routing patterns:

ExecMode Overview

/**
 * Execution mode for task processing
 */
enum class ExecMode : u32 {
  kExec = 0,              /**< Normal task execution (default) */
  kDynamicSchedule = 1    /**< Dynamic scheduling - route after execution */
};

The execution mode is accessible through the RunContext parameter passed to all task methods:

void YourMethod(hipc::FullPtr<YourTask> task, chi::RunContext& rctx) {
  // Check execution mode
  if (rctx.exec_mode_ == chi::ExecMode::kDynamicSchedule) {
    // Dynamic scheduling logic - modify task routing
    task->pool_query_ = chi::PoolQuery::Broadcast();
    return;  // Return early - task will be re-routed
  }

  // Normal execution logic (kExec mode)
  // ... perform actual work ...
}

kExec Mode (Default)

Purpose: Normal task execution mode where tasks are processed completely and then marked as finished.

Behavior:

Tasks execute their full logic
Results are written to task output parameters
Worker calls EndTask() after execution completes
Task is marked as completed and cleaned up

When to use: The default mode for all standard task processing.

kDynamicSchedule Mode

Purpose: Two-phase execution where the first execution determines routing, then the task is re-routed and executed again.

Behavior:

Worker sets exec_mode = kDynamicSchedule before first execution
Task method examines state and modifies task->pool_query_ for routing
Task returns early without performing full execution
Worker calls RerouteDynamicTask() instead of EndTask()
Task is re-routed using the updated pool_query_
Task executes again in normal kExec mode with the new routing

When to use:

Tasks that need runtime-dependent routing decisions
Cache optimization patterns (check local, then broadcast if not found)
Conditional distributed execution based on state

Example: GetOrCreatePool with Dynamic Scheduling

The admin ChiMod's GetOrCreatePool method demonstrates the canonical dynamic scheduling pattern:

void Runtime::GetOrCreatePool(
    hipc::FullPtr<chimaera::admin::GetOrCreatePoolTask<chimaera::admin::CreateParams>> task,
    chi::RunContext &rctx) {

  auto *pool_manager = CHI_POOL_MANAGER;
  std::string pool_name = task->pool_name_.str();

  // PHASE 1: Dynamic scheduling - determine routing
  if (rctx.exec_mode_ == chi::ExecMode::kDynamicSchedule) {
    // Check if pool exists locally first
    chi::PoolId existing_pool_id = pool_manager->FindPoolByName(pool_name);

    if (!existing_pool_id.IsNull()) {
      // Pool exists locally - route to local execution only
      HLOG(kDebug, "Admin: Pool '{}' found locally (ID: {}), using Local query",
            pool_name, existing_pool_id);
      task->pool_query_ = chi::PoolQuery::Local();
    } else {
      // Pool doesn't exist - broadcast creation to all nodes
      HLOG(kDebug, "Admin: Pool '{}' not found locally, broadcasting creation",
            pool_name);
      task->pool_query_ = chi::PoolQuery::Broadcast();
    }
    return;  // Return early - worker will re-route task
  }

  // PHASE 2: Normal execution - actually create/get the pool
  HLOG(kDebug, "Admin: Executing GetOrCreatePool task - ChiMod: {}, Pool: {}",
        task->chimod_name_.str(), pool_name);

  task->return_code_ = 0;
  task->error_message_ = "";

  try {
    if (!pool_manager->CreatePool(task.Cast<chi::Task>(), &rctx)) {
      task->return_code_ = 2;
      task->error_message_ = "Failed to create or get pool via PoolManager";
      return;
    }

    task->return_code_ = 0;
    pools_created_++;

    HLOG(kDebug, "Admin: Pool operation completed successfully - ID: {}, Name: {}",
          task->new_pool_id_, pool_name);

  } catch (const std::exception &e) {
    task->return_code_ = 99;
    task->error_message_ =
        std::string("Exception during pool creation: ") + e.what();
    HLOG(kError, "Admin: Pool creation failed with exception: {}", e.what());
  }
}

Using Dynamic() PoolQuery

The PoolQuery::Dynamic() factory method triggers dynamic scheduling:

// Client code - request dynamic routing
auto pool_query = chi::PoolQuery::Dynamic();
client.Create(pool_query, "my_pool_name", pool_id);

What happens internally:

Worker recognizes Dynamic() pool query
Sets rctx.exec_mode_ = ExecMode::kDynamicSchedule
Routes task to local node first
Task method checks cache and updates pool_query_
Worker re-routes with updated query
Task executes again in normal mode with correct routing

Benefits of Dynamic Scheduling

Performance Optimization:

Avoids redundant operations (e.g., pool creation when pool already exists)
Reduces network overhead by checking local state first
Enables intelligent routing based on runtime conditions

Cache Optimization Pattern:

// Check local cache first
if (rctx.exec_mode_ == chi::ExecMode::kDynamicSchedule) {
  if (LocalCacheHas(resource_id)) {
    task->pool_query_ = chi::PoolQuery::Local();  // Found locally
  } else {
    task->pool_query_ = chi::PoolQuery::Broadcast();  // Need to fetch
  }
  return;
}

// Normal execution with optimized routing
auto resource = GetResource(resource_id);

State-Dependent Routing:

// Route based on runtime conditions
if (rctx.exec_mode_ == chi::ExecMode::kDynamicSchedule) {
  if (ShouldExecuteDistributed(task)) {
    task->pool_query_ = chi::PoolQuery::Broadcast();
  } else {
    task->pool_query_ = chi::PoolQuery::Local();
  }
  return;
}

Implementation Guidelines

DO:

✅ Check rctx.exec_mode_ at the start of your method
✅ Return early after modifying pool_query_ in dynamic mode
✅ Keep dynamic scheduling logic lightweight (fast checks only)
✅ Use dynamic scheduling for cache optimization patterns

DON'T:

❌ Perform expensive operations in dynamic scheduling mode
❌ Modify task output parameters in dynamic scheduling mode
❌ Call Wait() or spawn subtasks in dynamic scheduling mode
❌ Use dynamic scheduling for simple operations that don't need routing optimization

Worker Implementation Details

The worker automatically handles dynamic scheduling:

// Worker::ExecTask() logic
if (run_ctx->exec_mode_ == ExecMode::kDynamicSchedule) {
  // After task returns, call RerouteDynamicTask instead of EndTask
  RerouteDynamicTask(task_ptr, run_ctx);
  return;
}

// Normal mode - end task after execution
EndTask(task_ptr, run_ctx);

The RerouteDynamicTask() method:

Resets task flags (TASK_STARTED, TASK_ROUTED)
Re-routes task using updated pool_query_
Sets exec_mode = kExec for next execution
Schedules task for execution again

Configuration and Code Generation

Overview

Chimaera uses a two-level configuration system with automated code generation:

chimaera_repo.yaml: Repository-wide configuration (namespace, version, etc.)
chimaera_mod.yaml: Module-specific configuration (method IDs, metadata)
chimaera repo refresh: Utility script that generates autogen files from YAML configurations

chimaera_repo.yaml

Located at chimods/chimaera_repo.yaml, this file defines repository-wide settings:

# Repository Configuration
namespace: chimaera        # MUST match namespace in all chimaera_mod.yaml files
version: 1.0.0
description: "Chimaera Runtime ChiMod Repository"

# Module discovery - directories to scan for ChiMods
modules:
  - MOD_NAME
  - admin  
  - bdev

Key Requirements:

The namespace field MUST be identical in both chimaera_repo.yaml and all chimaera_mod.yaml files
Used by build system for CMake package generation and installation paths
Determines export target names: $\{namespace\}::$\{module\}_runtime, $\{namespace\}::$\{module\}_client

chimaera_mod.yaml

Each ChiMod must have its own configuration file specifying methods and metadata:

# MOD_NAME ChiMod Configuration
module_name: MOD_NAME
namespace: chimaera        # MUST match chimaera_repo.yaml namespace
version: 1.0.0

# Inherited Methods (fixed IDs)
kCreate: 0        # Container creation (required)
kDestroy: 1       # Container destruction (required)
kNodeFailure: -1  # Not implemented (-1 means disabled)
kRecover: -1      # Not implemented 
kMigrate: -1      # Not implemented
kUpgrade: -1      # Not implemented

# Custom Methods (start from 10, use sequential IDs)
kCustom: 10       # Custom operation method
kCoMutexTest: 20  # CoMutex synchronization testing method
kCoRwLockTest: 21 # CoRwLock reader-writer synchronization testing method

Method ID Assignment Rules:

0-9: Reserved for system methods (kCreate=0, kDestroy=1, etc.)
10+: Custom methods (assign sequential IDs starting from 10)
Disabled methods: Use -1 to disable inherited methods not implemented
Consistency: Once assigned, never change method IDs (breaks compatibility)

chimaera repo refresh Utility

The chimaera repo refresh utility automatically generates autogen files from YAML configurations.

Usage

# From project root, regenerate all autogen files
./build/bin/chimaera repo refresh chimods

# The utility will:
# 1. Read chimaera_repo.yaml for global settings
# 2. Scan each module's chimaera_mod.yaml 
# 3. Generate MOD_NAME_methods.h with method constants
# 4. Generate MOD_NAME_lib_exec.cc with virtual method dispatch

Generated Files

For each ChiMod, the utility generates:

include/[namespace]/MOD_NAME/autogen/MOD_NAME_methods.h:

namespace chimaera::MOD_NAME {
namespace Method {
GLOBAL_CONST chi::u32 kCreate = 0;
GLOBAL_CONST chi::u32 kDestroy = 1;
GLOBAL_CONST chi::u32 kCustom = 10;
GLOBAL_CONST chi::u32 kCoMutexTest = 20;
}  // namespace Method
}  // namespace chimaera::MOD_NAME

src/autogen/MOD_NAME_lib_exec.cc:
- Virtual method dispatch (Runtime::Run, etc.)
- Task serialization support (SaveIn/Out, LoadIn/Out)
- Memory management (Del, NewCopy, Aggregate)

When to Run chimaera repo refresh

ALWAYS run chimaera repo refresh when:

Adding new methods to chimaera_mod.yaml
Changing method IDs or names
Adding new ChiMods to the repository
Modifying namespace or version information

Important Notes

Never manually edit autogen files - they are overwritten by chimaera repo refresh
Run chimaera repo refresh before building after YAML changes
Commit autogen files to git so other developers don't need to regenerate
Method IDs are permanent - changing them breaks binary compatibility

Workflow Summary

Define methods in chimaera_mod.yaml with sequential IDs
Implement corresponding methods in MOD_NAME_runtime.h/MOD_NAME_runtime.cc
Run ./build/bin/chimaera repo refresh chimods to generate autogen files
Build project with make - autogen files provide the dispatch logic
Autogen files handle virtual method routing, serialization, and memory management

This automated approach ensures consistency across all ChiMods and reduces boilerplate code maintenance.

Task Development

Task Requirements

Inherit from chi::Task: All tasks must inherit the base Task class
Two Constructors: SHM and emplace constructors are mandatory
Serializable Types: Use HSHM types (chi::string, chi::vector, etc.) for member variables
Method Assignment: Set the method_ field to identify the operation
FullPtr Usage: All task method signatures use hipc::FullPtr<TaskType> instead of raw pointers
Copy Method: Required - copies task data for network transport and replication
Aggregate Method: Optional - combines results from task replicas when needed

Task Methods: Copy and Aggregate

All tasks must implement a Copy() method. The network adapter calls Copy() when tasks are sent to remote nodes — without it, your task cannot be transported across the network. Aggregate() is optional and is used to combine results from task replicas after distributed execution.

The autogenerated NewCopy dispatcher allocates a new task and calls your Copy() method, but it does not call Task::Copy() for you. You must call the base method yourself.

Copy Method

The Copy() method creates a deep copy of a task for network transport and replication. You must call Task::Copy() first to copy the base task fields (pool_id, task_id, pool_query, method, task_flags, period, return_code, completer, stat).

Signature:

void Copy(const hipc::FullPtr<YourTask> &other);

Implementation Pattern:

struct WriteTask : public chi::Task {
  IN Block block_;
  IN hipc::ShmPtr<> data_;
  IN size_t length_;
  OUT chi::u64 bytes_written_;

  void Copy(const hipc::FullPtr<WriteTask> &other) {
    // REQUIRED: Copy base Task fields first
    Task::Copy(other.template Cast<Task>());
    // Copy task-specific fields
    block_ = other->block_;
    data_ = other->data_;
    length_ = other->length_;
    bytes_written_ = other->bytes_written_;
  }
};

Key Points:

Always call Task::Copy(other.template Cast<Task>()) as the first line
Then copy all task-specific fields from the source task
The destination task (this) is already constructed - don't call constructors
For pointer fields, decide if you need deep or shallow copy based on ownership

Aggregate Method (Optional)

The Aggregate() method combines results from a task replica back into the original task. It is optional — implement it when your task is distributed across multiple nodes and you need to merge replica results. When implemented, you must call Task::Aggregate() first to propagate the return code and completer from the replica.

Signature:

void Aggregate(const hipc::FullPtr<YourTask> &other);

Implementation Patterns:

Pattern 1: Last-Writer-Wins (Simple Override)

struct WriteTask : public chi::Task {
  IN Block block_;
  IN hipc::ShmPtr<> data_;
  OUT chi::u64 bytes_written_;

  void Aggregate(const hipc::FullPtr<WriteTask> &other) {
    // REQUIRED: Aggregate base Task fields first
    Task::Aggregate(other);
    // Copy the result from the completed replica
    bytes_written_ = other->bytes_written_;
  }
};

Pattern 2: Accumulation (Sum/Max/Min)

struct GetStatsTask : public chi::Task {
  OUT chi::u64 total_bytes_;
  OUT chi::u64 operation_count_;
  OUT chi::u64 max_latency_us_;

  void Aggregate(const hipc::FullPtr<GetStatsTask> &other) {
    // REQUIRED: Aggregate base Task fields first
    Task::Aggregate(other);
    // Sum cumulative metrics
    total_bytes_ += other->total_bytes_;
    operation_count_ += other->operation_count_;
    // Take maximum for latency
    max_latency_us_ = std::max(max_latency_us_, other->max_latency_us_);
  }
};

Pattern 3: Full Copy Aggregate

struct ReadTask : public chi::Task {
  IN Block block_;
  OUT hipc::ShmPtr<> data_;
  OUT chi::u64 bytes_read_;

  void Aggregate(const hipc::FullPtr<ReadTask> &other) {
    // REQUIRED: Aggregate base Task fields first
    Task::Aggregate(other);
    // For reads, take all results from the replica
    Copy(other);
  }
};

Copy/Aggregate Usage in Networking

When tasks are sent across nodes, the network adapter uses Copy and Aggregate:

Send Phase: Copy() creates a replica of the origin task for network transport

container->NewCopy(task->method_, origin_task, replica, /* replica_flag */);
// Internally calls: replica->Copy(origin_task)

Recv Phase: Aggregate() merges replica results back into the origin task

container->Aggregate(task->method_, origin_task, replica);
// Internally calls: origin_task->Aggregate(replica)

Autogeneration: The code generator creates dispatcher methods that allocate a new task and call your Copy(). Note that NewCopy does not call Task::Copy() for you — your Copy() implementation must do that itself:

// In autogen/MOD_NAME_lib_exec.cc
void NewCopyTask(Runtime* runtime, chi::u32 method,
                 hipc::FullPtr<chi::Task> orig_task,
                 hipc::FullPtr<chi::Task>& new_task,
                 bool deep_copy) {
  switch (method) {
    case Method::kWrite: {
      auto orig = orig_task.Cast<WriteTask>();
      new_task = CHI_IPC->NewTask<WriteTask>(...);
      // Calls YOUR Copy(), which must call Task::Copy() internally
      new_task.Cast<WriteTask>()->Copy(orig);
      break;
    }
  }
}

Complete Example: ReadTask with Copy and Aggregate

struct ReadTask : public chi::Task {
  IN Block block_;
  OUT hipc::ShmPtr<> data_;
  INOUT size_t length_;
  OUT chi::u64 bytes_read_;

  ReadTask()
      : chi::Task(), length_(0), bytes_read_(0) {}

  explicit ReadTask(const chi::TaskId &task_node,
                    const chi::PoolId &pool_id,
                    const chi::PoolQuery &pool_query,
                    const Block &block,
                    hipc::ShmPtr<> data,
                    size_t length)
      : chi::Task(task_node, pool_id, pool_query, Method::kRead),
        block_(block), data_(data), length_(length), bytes_read_(0) {
    task_id_ = task_node;
    pool_id_ = pool_id;
    method_ = Method::kRead;
    task_flags_.Clear();
    pool_query_ = pool_query;
  }

  void Copy(const hipc::FullPtr<ReadTask> &other) {
    // REQUIRED: Copy base Task fields first
    Task::Copy(other.template Cast<Task>());
    // Copy task-specific fields
    block_ = other->block_;
    data_ = other->data_;
    length_ = other->length_;
    bytes_read_ = other->bytes_read_;
  }

  void Aggregate(const hipc::FullPtr<ReadTask> &other) {
    // REQUIRED: Aggregate base Task fields first
    Task::Aggregate(other);
    // For reads, take the result from the replica
    Copy(other);
  }
};

Task Naming Conventions

CRITICAL: All task names MUST follow consistent naming patterns to ensure proper code generation and maintenance.

Required Naming Pattern

The naming convention enforces consistency across function names, task types, and method constants:

Function Name → Task Name → Method Constant
FunctionName()  → FunctionNameTask  → kFunctionName

Examples

Correct Naming:

// Client Function: AsyncGetStats()
// Task: GetStatsTask
// Method: kGetStats

// In bdev_client.h (async-only API)
chi::Future<GetStatsTask> AsyncGetStats(const chi::PoolQuery& pool_query);

// In bdev_tasks.h
struct GetStatsTask : public chi::Task {
  OUT PerfMetrics metrics_;
  OUT chi::u64 remaining_size_;
  // ... constructors and methods
};

// In chimaera_mod.yaml
kGetStats: 14    # Get performance statistics

// In bdev_runtime.h
void GetStats(hipc::FullPtr<GetStatsTask> task, chi::RunContext& ctx);

Incorrect Naming Examples:

// WRONG: Function and task names don't match
chi::Future<StatTask> AsyncGetStats(...);  // Task name doesn't match function
struct StatTask { ... };                    // Task name doesn't match function

// WRONG: Method constant doesn't match function  
GLOBAL_CONST chi::u32 kStat = 14;    // Method doesn't match function name

// WRONG: Runtime method doesn't match function
void Stat(hipc::FullPtr<StatTask> task, ...);  // Runtime method doesn't match

Naming Rules

Client Function Names: Use Async prefix with descriptive verbs (e.g., AsyncGetStats, AsyncAllocateBlocks, AsyncWriteData)
Task Names: Remove Async prefix and append "Task" (e.g., GetStatsTask, AllocateBlocksTask)
Method Constants: Prefix with "k" and match the base function name (e.g., kGetStats, kAllocateBlocks)
Runtime Methods: Match the base function name without Async prefix (e.g., GetStats())

Backward Compatibility

When renaming tasks, provide backward compatibility aliases:

// In bdev_tasks.h - provide alias for old name
using StatTask = GetStatsTask;  // Backward compatibility

// In autogen/bdev_methods.h - provide constant alias
GLOBAL_CONST chi::u32 kGetStats = 14;
GLOBAL_CONST chi::u32 kStat = kGetStats;  // Backward compatibility

// In bdev_runtime.h - provide wrapper methods
void GetStats(hipc::FullPtr<GetStatsTask> task, chi::RunContext& ctx);  // Primary
void Stat(hipc::FullPtr<StatTask> task, chi::RunContext& ctx) {         // Wrapper
  GetStats(task, ctx);
}

Benefits of Consistent Naming

Code Generation: Automated tools can reliably generate method dispatch code
Maintenance: Clear correlation between client functions and runtime implementations
Documentation: Self-documenting code with predictable naming patterns
Debugging: Easy to trace from client calls to runtime execution
Testing: Consistent patterns make it easier to write comprehensive tests

Method System and Auto-Generated Files

Method Definitions (autogen/MOD_NAME_methods.h)

Method IDs are now defined as namespace constants instead of enum class values. This eliminates the need for static casting:

#ifndef MOD_NAME_AUTOGEN_METHODS_H_
#define MOD_NAME_AUTOGEN_METHODS_H_

#include <chimaera/chimaera.h>

namespace chimaera::MOD_NAME {

namespace Method {
  // Inherited methods
  GLOBAL_CONST chi::u32 kCreate = 0;
  GLOBAL_CONST chi::u32 kDestroy = 1;
  GLOBAL_CONST chi::u32 kNodeFailure = 2;
  GLOBAL_CONST chi::u32 kRecover = 3;
  GLOBAL_CONST chi::u32 kMigrate = 4;
  GLOBAL_CONST chi::u32 kUpgrade = 5;
  
  // Module-specific methods
  GLOBAL_CONST chi::u32 kCustom = 10;
}

} // namespace chimaera::MOD_NAME

#endif // MOD_NAME_AUTOGEN_METHODS_H_

Key Changes:

Namespace instead of enum class: Use Method::kMethodName directly
GLOBAL_CONST values: No more static casting required
Include chimaera.h: Required for GLOBAL_CONST macro
Direct assignment: method_ = Method::kCreate; (no casting)

BaseCreateTask Template System

For modules that need container creation functionality, use the BaseCreateTask template instead of implementing custom CreateTask. However, there are different approaches depending on whether your module is the admin module or a regular ChiMod:

GetOrCreatePoolTask vs BaseCreateTask Usage

For Non-Admin Modules (Recommended Pattern):

All non-admin ChiMods should use GetOrCreatePoolTask which is a specialized version of BaseCreateTask designed for external pool creation:

#include <[namespace]/admin/admin_tasks.h>  // Include admin templates

namespace chimaera::MOD_NAME {

/**
 * CreateParams for MOD_NAME container creation
 */
struct CreateParams {
  // Module-specific configuration
  std::string config_data_;
  chi::u32 worker_count_;
  
  // Required: chimod library name
  static constexpr const char* chimod_lib_name = "chimaera_MOD_NAME";

  // Constructors
  CreateParams() : worker_count_(1) {}

  CreateParams(const std::string& config_data = "",
               chi::u32 worker_count = 1)
      : config_data_(config_data), worker_count_(worker_count) {}

  // Cereal serialization
  template<class Archive>
  void serialize(Archive& ar) {
    ar(config_data_, worker_count_);
  }
};

/**
 * CreateTask - Non-admin modules should use GetOrCreatePoolTask
 * This uses Method::kGetOrCreatePool and is designed for external pool creation
 */
using CreateTask = chimaera::admin::GetOrCreatePoolTask<CreateParams>;

}  // namespace chimaera::MOD_NAME

For Admin Module Only:

The admin module itself uses BaseCreateTask directly with Method::kCreate:

namespace chimaera::admin {

/**
 * CreateTask - Admin uses BaseCreateTask with Method::kCreate and IS_ADMIN=true
 */
using CreateTask = BaseCreateTask<CreateParams, Method::kCreate, true>;

}  // namespace chimaera::admin

BaseCreateTask Template Parameters

The BaseCreateTask template has three parameters with smart defaults:

template <typename CreateParamsT, 
          chi::u32 MethodId = Method::kGetOrCreatePool, 
          bool IS_ADMIN = false>
struct BaseCreateTask : public chi::Task

Template Parameters:

CreateParamsT: Your module's parameter structure (required)
MethodId: Method ID for the task (default: kGetOrCreatePool)
IS_ADMIN: Whether this is an admin operation (default: false)

GetOrCreatePoolTask Template:

The GetOrCreatePoolTask template is a convenient alias that uses the optimal defaults for non-admin modules:

template<typename CreateParamsT>
using GetOrCreatePoolTask = BaseCreateTask<CreateParamsT, Method::kGetOrCreatePool, false>;

When to Use Each Pattern:

GetOrCreatePoolTask: For all non-admin ChiMods (recommended)
BaseCreateTask with Method::kCreate: Only for admin module internal operations
BaseCreateTask with Method::kGetOrCreatePool: Same as GetOrCreatePoolTask (not typically used directly)

BaseCreateTask Structure

BaseCreateTask provides a unified structure for container creation and pool operations:

template <typename CreateParamsT, chi::u32 MethodId, bool IS_ADMIN>
struct BaseCreateTask : public chi::Task {
  // Pool operation parameters
  INOUT chi::string chimod_name_;     // ChiMod name for loading
  IN chi::string pool_name_;          // Target pool name
  INOUT chi::string chimod_params_;   // Serialized CreateParamsT
  INOUT chi::PoolId pool_id_;          // Input: requested ID, Output: actual ID
  
  // Results
  OUT chi::u32 result_code_;           // 0 = success, non-zero = error
  OUT chi::string error_message_;     // Error description if failed
  
  // Runtime flag set by template parameter
  volatile bool is_admin_;             // Set to IS_ADMIN template value
  
  // Serialization methods
  template<typename... Args>
  void SetParams(AllocT* alloc, Args &&...args);

  CreateParamsT GetParams() const;
};

Key Features:

Single pool_id: Serves as both input (requested) and output (result)
Serialized parameters: chimod_params_ stores serialized CreateParamsT
Error checking: Use result_code_ != 0 to check for failures
Template-driven behavior: IS_ADMIN template parameter sets volatile variable
No static casting: Direct method assignment using namespace constants

Usage Examples

Non-Admin ChiMod Container Creation (Recommended):

// Use GetOrCreatePoolTask for all non-admin modules
using CreateTask = chimaera::admin::GetOrCreatePoolTask<MyCreateParams>;

Admin Module Container Creation:

// Admin module uses BaseCreateTask with Method::kCreate and IS_ADMIN=true
using CreateTask = chimaera::admin::BaseCreateTask<AdminCreateParams, Method::kCreate, true>;

Data Annotations

IN: Input-only parameters (read by runtime)
OUT: Output-only parameters (written by runtime)
INOUT: Bidirectional parameters

Task Lifecycle

Client allocates task in shared memory using ipc_manager->NewTask()
Client enqueues task pointer to IPC queue
Worker dequeues and executes task
Framework calls ipc_manager->DelTask() to deallocate task from shared memory
Task memory is properly reclaimed from the appropriate memory segment

Note: Individual DelTaskType methods are no longer required. The framework's autogenerated Del dispatcher automatically calls ipc_manager->DelTask() for proper shared memory deallocation.

C++20 Coroutines for ChiMod Development

Chimaera uses C++20 coroutines to enable cooperative task execution within runtime methods. This section covers the coroutine primitives and patterns used in ChiMod development.

Overview

Coroutines allow runtime methods to suspend execution (co_await) and resume later without blocking the worker thread. This is essential for:

Nested Pool Creation: Create methods that need to create sub-pools
Subtask Execution: Runtime methods that spawn and wait for child tasks
I/O Operations: Async I/O that needs to yield while waiting

TaskResume Return Type

All runtime methods that use coroutines must return chi::TaskResume:

class Runtime : public chi::Container {
public:
  // Coroutine method - can use co_await and co_return
  chi::TaskResume Create(hipc::FullPtr<CreateTask> task, chi::RunContext& rctx);

  // Non-coroutine method - regular void return
  void Custom(hipc::FullPtr<CustomTask> task, chi::RunContext& rctx);
};

Key Points:

Methods returning TaskResume are coroutines and can use co_await/co_return
Methods returning void are regular functions and cannot use coroutine keywords
The TaskResume type integrates with Chimaera's task scheduling system

co_await - Suspending Execution

Use co_await to suspend the current coroutine until an operation completes:

chi::TaskResume Runtime::Create(hipc::FullPtr<CreateTask> task, chi::RunContext& rctx) {
  // Create a subtask (e.g., create a BDev pool for storage)
  auto bdev_task = bdev_client_.AsyncCreate(
      chi::PoolQuery::Local(),
      "storage_device",
      chi::PoolId(7001, 0),
      chimaera::bdev::BdevType::kFile);

  // Suspend until subtask completes - worker can process other tasks
  co_await bdev_task;

  // Execution resumes here after bdev_task completes
  if (bdev_task->GetReturnCode() != 0) {
    task->SetReturnCode(1);
    task->error_message_ = "Failed to create storage device";
    co_return;
  }

  // Continue with remaining initialization
  task->SetReturnCode(0);
  co_return;
}

What Happens During co_await:

Coroutine state is saved
Worker thread is released to process other tasks
When awaited operation completes, coroutine is scheduled for resumption
Execution continues from the point after co_await

co_return - Completing Coroutines

Use co_return to complete a coroutine. For TaskResume coroutines, co_return takes no value:

chi::TaskResume Runtime::Create(hipc::FullPtr<CreateTask> task, chi::RunContext& rctx) {
  // Early return on error
  if (!ValidateParams(task)) {
    task->SetReturnCode(1);
    co_return;  // Complete coroutine immediately
  }

  // Normal completion
  task->SetReturnCode(0);
  co_return;  // Complete coroutine
}

Important Notes:

Always use co_return (not return) in coroutine methods
co_return takes no arguments for TaskResume coroutines
Ensure all code paths end with co_return

Common Coroutine Patterns

Pattern 1: Nested Pool Creation

The most common use case is creating dependent pools during container initialization:

chi::TaskResume Runtime::Create(hipc::FullPtr<CreateTask> task, chi::RunContext& rctx) {
  // Initialize container state
  pool_id_ = task->pool_id_;

  // Create dependent BDev pool for storage
  auto bdev_task = bdev_client_.AsyncCreate(
      chi::PoolQuery::Local(),
      task->storage_path_.str(),
      chi::PoolId(pool_id_.IsLocal() + 1, 0),
      chimaera::bdev::BdevType::kFile,
      task->storage_size_);

  co_await bdev_task;

  if (bdev_task->GetReturnCode() != 0) {
    task->SetReturnCode(2);
    task->error_message_ = "Storage initialization failed";
    co_return;
  }

  storage_pool_id_ = bdev_task->new_pool_id_;
  task->SetReturnCode(0);
  co_return;
}

Pattern 2: Sequential Subtask Execution

Execute multiple subtasks in sequence:

chi::TaskResume Runtime::Initialize(hipc::FullPtr<InitTask> task, chi::RunContext& rctx) {
  // Step 1: Initialize storage
  auto storage_task = storage_client_.AsyncInit(chi::PoolQuery::Local());
  co_await storage_task;

  if (storage_task->GetReturnCode() != 0) {
    task->SetReturnCode(1);
    co_return;
  }

  // Step 2: Initialize network (depends on storage)
  auto network_task = network_client_.AsyncInit(chi::PoolQuery::Local());
  co_await network_task;

  if (network_task->GetReturnCode() != 0) {
    task->SetReturnCode(2);
    co_return;
  }

  // Step 3: Complete initialization
  task->SetReturnCode(0);
  co_return;
}

Pattern 3: Parallel Subtask Execution

For independent subtasks, launch them all first, then await:

chi::TaskResume Runtime::ParallelInit(hipc::FullPtr<InitTask> task, chi::RunContext& rctx) {
  // Launch multiple independent tasks
  auto task1 = client1_.AsyncInit(chi::PoolQuery::Local());
  auto task2 = client2_.AsyncInit(chi::PoolQuery::Local());
  auto task3 = client3_.AsyncInit(chi::PoolQuery::Local());

  // Await all tasks (order doesn't matter for independent tasks)
  co_await task1;
  co_await task2;
  co_await task3;

  // Check all results
  if (task1->GetReturnCode() != 0 ||
      task2->GetReturnCode() != 0 ||
      task3->GetReturnCode() != 0) {
    task->SetReturnCode(1);
    co_return;
  }

  task->SetReturnCode(0);
  co_return;
}

When to Use Coroutines

Use coroutines (TaskResume return type) when:

✅ Creating nested/dependent pools in Create methods
✅ Spawning and waiting for subtasks
✅ Performing async I/O operations that need to yield
✅ Any operation that might block and should yield to other tasks

Use regular void methods when:

✅ Simple synchronous operations
✅ Operations that complete quickly without waiting
✅ Methods that don't spawn subtasks or wait for external events

PoolManager Coroutine Integration

The PoolManager methods that create/destroy pools are coroutines:

// In PoolManager (internal usage)
TaskResume CreatePool(hipc::FullPtr<chi::Task> task, chi::RunContext* rctx);
TaskResume DestroyPool(hipc::FullPtr<chi::Task> task, chi::RunContext* rctx);

Why These Are Coroutines:

CreatePool co_awaits the container's Create method
Create methods may need to co_await nested pool creations
The coroutine chain allows proper suspension and resumption

Admin Runtime Coroutine Methods

The admin runtime has coroutine methods for pool management:

// Admin runtime coroutine methods
chi::TaskResume GetOrCreatePool(hipc::FullPtr<GetOrCreatePoolTask<T>> task, chi::RunContext& rctx);
chi::TaskResume DestroyPool(hipc::FullPtr<DestroyPoolTask> task, chi::RunContext& rctx);

These methods co_await PoolManager operations internally.

Best Practices

DO:

✅ Use co_return (not return) in coroutine methods
✅ Check return codes after co_await
✅ Use TaskResume return type for methods that need co_await
✅ Handle errors before continuing after co_await

DON'T:

❌ Mix return and co_return in the same method
❌ Use co_await in non-coroutine (void) methods
❌ Forget to co_return at the end of coroutine methods
❌ Ignore return codes from awaited tasks

Migration from Non-Coroutine Methods

To convert a regular method to a coroutine:

Change return type: void → chi::TaskResume
Change all return;: → co_return;
Add co_await: For any async operations that need waiting
Update autogen: Ensure dispatch code handles the new return type

Framework Del Implementation

The autogenerated Del dispatcher handles task cleanup:

inline void Del(Runtime* runtime, chi::u32 method, hipc::FullPtr<chi::Task> task_ptr) {
  auto* ipc_manager = CHI_IPC;
  Method method_enum = static_cast<Method>(method);

  switch (method_enum) {
    case Method::kCreate: {
      ipc_manager->DelTask(task_ptr.Cast<CreateTask>());
      break;
    }
    case Method::kCustom: {
      ipc_manager->DelTask(task_ptr.Cast<CustomTask>());
      break;
    }
    default:
      ipc_manager->DelTask(task_ptr);
      break;
  }
}

This ensures proper shared memory deallocation without requiring module-specific cleanup code.

Synchronization Primitives

Chimaera provides specialized cooperative synchronization primitives designed for the runtime's task-based architecture. These should be used instead of standard synchronization primitives like std::mutex, std::shared_mutex, or pthread_mutex when synchronizing access to module data structures.

Why Use Chimaera Synchronization Primitives?

Critical: Always use CoMutex and CoRwLock for module synchronization:

Cooperative Design: Compatible with Chimaera's fiber-based task execution
TaskId Grouping: Tasks sharing the same TaskId can proceed together (bypassing locks)
Deadlock Prevention: Designed to prevent deadlocks in the runtime environment
Runtime Integration: Automatically integrate with CHI_CUR_WORKER and task context
Performance: Optimized for the runtime's execution model

Do NOT use these standard synchronization primitives in module code:

❌ std::mutex - Can cause fiber blocking issues
❌ std::shared_mutex - Not compatible with task execution model
❌ pthread_mutex_t - Can deadlock with runtime scheduling
❌ std::condition_variable - Incompatible with cooperative scheduling

CoMutex: Cooperative Mutual Exclusion

CoMutex provides mutual exclusion with TaskId grouping support. Tasks sharing the same TaskId can bypass the lock and execute concurrently.

Basic Usage

#include <chimaera/comutex.h>

class Runtime : public chi::Container {
private:
  // Static member for shared synchronization across all container instances
  static chi::CoMutex shared_mutex_;
  
  // Instance member for per-container synchronization
  chi::CoMutex instance_mutex_;

public:
  void SomeTask(hipc::FullPtr<SomeTaskType> task, chi::RunContext& rctx) {
    // Manual lock/unlock
    shared_mutex_.Lock();
    // ... critical section ...
    shared_mutex_.Unlock();
    
    // OR use RAII scoped lock (recommended)
    chi::ScopedCoMutex lock(instance_mutex_);
    // ... critical section ...
    // Automatically unlocks when leaving scope
  }
};

// Static member definition (required)
chi::CoMutex Runtime::shared_mutex_;

Key Features

Automatic Task Context: Uses CHI_CUR_WORKER internally - no task parameters needed
TaskId Grouping: Tasks with the same TaskId bypass the mutex
RAII Support: ScopedCoMutex for automatic lock management
Try-Lock Support: Non-blocking lock attempts

API Reference

namespace chi {
  class CoMutex {
  public:
    // Blocking operations
    void Lock();                    // Block until lock acquired
    void Unlock();                  // Release the lock
    bool TryLock();                 // Non-blocking lock attempt
    
    // No task parameters needed - uses CHI_CUR_WORKER automatically
  };
  
  // RAII wrapper (recommended)
  class ScopedCoMutex {
  public:
    explicit ScopedCoMutex(CoMutex& mutex);  // Locks in constructor
    ~ScopedCoMutex();                        // Unlocks in destructor
  };
}

CoRwLock: Cooperative Reader-Writer Lock

CoRwLock provides reader-writer semantics with TaskId grouping. Multiple readers can proceed concurrently, but writers have exclusive access.

Basic Usage

#include <chimaera/corwlock.h>

class Runtime : public chi::Container {
private:
  static chi::CoRwLock data_lock_;  // Protect shared data structures

public:
  void ReadTask(hipc::FullPtr<ReadTaskType> task, chi::RunContext& rctx) {
    // Manual reader lock
    data_lock_.ReadLock();
    // ... read operations ...
    data_lock_.ReadUnlock();
    
    // OR use RAII scoped reader lock (recommended)
    chi::ScopedCoRwReadLock lock(data_lock_);
    // ... read operations ...
    // Automatically unlocks when leaving scope
  }
  
  void WriteTask(hipc::FullPtr<WriteTaskType> task, chi::RunContext& rctx) {
    // RAII scoped writer lock (recommended)
    chi::ScopedCoRwWriteLock lock(data_lock_);
    // ... write operations ...
    // Automatically unlocks when leaving scope
  }
};

// Static member definition
chi::CoRwLock Runtime::data_lock_;

Key Features

Multiple Readers: Concurrent read access when no writers are active
Exclusive Writers: Writers get exclusive access, blocking all other operations
TaskId Grouping: Tasks with same TaskId can bypass reader locks
Automatic Context: Uses CHI_CUR_WORKER for task identification
RAII Support: Scoped locks for both readers and writers

API Reference

namespace chi {
  class CoRwLock {
  public:
    // Reader operations
    void ReadLock();                // Acquire reader lock
    void ReadUnlock();              // Release reader lock
    bool TryReadLock();             // Non-blocking reader lock attempt
    
    // Writer operations  
    void WriteLock();               // Acquire exclusive writer lock
    void WriteUnlock();             // Release writer lock
    bool TryWriteLock();            // Non-blocking writer lock attempt
  };
  
  // RAII wrappers (recommended)
  class ScopedCoRwReadLock {
  public:
    explicit ScopedCoRwReadLock(CoRwLock& lock);  // Acquire read lock
    ~ScopedCoRwReadLock();                        // Release read lock
  };
  
  class ScopedCoRwWriteLock {
  public:
    explicit ScopedCoRwWriteLock(CoRwLock& lock); // Acquire write lock
    ~ScopedCoRwWriteLock();                       // Release write lock
  };
}

TaskId Grouping Behavior

Both CoMutex and CoRwLock support TaskId grouping, which allows related tasks to bypass synchronization:

// Tasks created with the same TaskId can proceed together
auto task_id = chi::CreateTaskId();

// These tasks share the same TaskId - they can bypass CoMutex/CoRwLock
auto task1 = ipc_manager->NewTask<Task1>(task_id, pool_id, pool_query, ...);
auto task2 = ipc_manager->NewTask<Task2>(task_id, pool_id, pool_query, ...);

// This task has a different TaskId - must respect locks normally
auto task3 = ipc_manager->NewTask<Task3>(chi::CreateTaskId(), pool_id, pool_query, ...);

Key Points:

Tasks with the same TaskId are considered "grouped" and can bypass locks
Use TaskId grouping for logically related operations that don't need mutual exclusion
Different TaskIds must respect normal lock semantics

Best Practices

Use RAII Wrappers: Always prefer ScopedCoMutex and ScopedCoRw*Lock over manual lock/unlock
Static vs Instance: Use static members for cross-container synchronization, instance members for per-container data
Member Definition: Don't forget to define static members in your .cc file
Choose Appropriate Lock: Use CoRwLock for read-heavy workloads, CoMutex for simple mutual exclusion
Minimal Critical Sections: Keep locked sections as small as possible
TaskId Design: Group related tasks that can safely bypass locks

Example: Module with Synchronized Data Structure

// In MOD_NAME_runtime.h
class Runtime : public chi::Container {
private:
  // Synchronized data structure
  chi::hash_map<chi::u32, ModuleData> data_map_;
  
  // Synchronization primitives
  static chi::CoRwLock data_lock_;        // For data_map_ access
  static chi::CoMutex operation_mutex_;   // For exclusive operations

public:
  void ReadData(hipc::FullPtr<ReadDataTask> task, chi::RunContext& rctx);
  void WriteData(hipc::FullPtr<WriteDataTask> task, chi::RunContext& rctx);
  void ExclusiveOperation(hipc::FullPtr<ExclusiveTask> task, chi::RunContext& rctx);
};

// In MOD_NAME_runtime.cc  
chi::CoRwLock Runtime::data_lock_;
chi::CoMutex Runtime::operation_mutex_;

void Runtime::ReadData(hipc::FullPtr<ReadDataTask> task, chi::RunContext& rctx) {
  chi::ScopedCoRwReadLock lock(data_lock_);  // Multiple readers allowed
  
  // Safe to read data_map_ concurrently
  auto it = data_map_.find(task->key_);
  if (it != data_map_.end()) {
    task->result_data_ = it->second;
    task->result_ = 0;  // Success
  } else {
    task->result_ = 1;  // Not found
  }
}

void Runtime::WriteData(hipc::FullPtr<WriteDataTask> task, chi::RunContext& rctx) {
  chi::ScopedCoRwWriteLock lock(data_lock_);  // Exclusive writer access
  
  // Safe to modify data_map_ exclusively
  data_map_[task->key_] = task->new_data_;
  task->result_ = 0;  // Success
}

void Runtime::ExclusiveOperation(hipc::FullPtr<ExclusiveTask> task, chi::RunContext& rctx) {
  chi::ScopedCoMutex lock(operation_mutex_);  // Exclusive operation
  
  // Perform operation that requires complete exclusivity
  // ... complex operation ...
  task->result_ = 0;  // Success
}

This synchronization model ensures thread-safe access to module data structures while maintaining compatibility with Chimaera's cooperative task execution system.

Pool Query and Task Routing

Overview of PoolQuery

PoolQuery is a fundamental component of Chimaera's task routing system that determines where and how tasks are executed across the distributed runtime. It provides flexible routing strategies for load balancing, locality optimization, and distributed execution patterns.

PoolQuery Types

The chi::PoolQuery class provides six different routing modes through static factory methods:

1. Local Mode

chi::PoolQuery::Local()

Purpose: Routes tasks to the local node only
Use Case: Operations that must execute on the calling node
Example: MPI-based container creation, node-specific diagnostics

// Client usage in MPI environment
const chi::PoolId custom_pool_id(7000, 0);
client.Create(chi::PoolQuery::Local(), "my_pool", custom_pool_id);

2. Direct ID Mode

chi::PoolQuery::DirectId(ContainerId container_id)

Purpose: Routes to a specific container by its unique ID
Use Case: Targeted operations on known containers
Example: Container-specific configuration changes

// Route to container with ID 42
auto query = chi::PoolQuery::DirectId(ContainerId(42));
client.UpdateConfig(query, new_config);

3. Direct Hash Mode

chi::PoolQuery::DirectHash(u32 hash)

Purpose: Routes using consistent hash-based load balancing
Use Case: Distributing operations across containers deterministically
Example: Key-value store operations where keys map to specific containers

// Hash-based routing for a key
u32 hash = std::hash<std::string>{}(key);
auto query = chi::PoolQuery::DirectHash(hash);
client.Put(query, key, value);

4. Range Mode

chi::PoolQuery::Range(u32 offset, u32 count)

Purpose: Routes to a range of containers
Use Case: Batch operations across multiple containers
Example: Parallel scan operations, bulk updates

// Process containers 10-19 (10 containers starting at offset 10)
auto query = chi::PoolQuery::Range(10, 10);
client.BulkUpdate(query, update_data);

5. Broadcast Mode

chi::PoolQuery::Broadcast()

Purpose: Routes to all containers in the pool
Use Case: Global operations affecting all containers
Example: Configuration updates, global cache invalidation

// Broadcast configuration change to all containers
auto query = chi::PoolQuery::Broadcast();
client.InvalidateCache(query);

6. Physical Mode

chi::PoolQuery::Physical(u32 node_id)

Purpose: Routes to a specific physical node by ID
Use Case: Node-specific operations in distributed deployments
Example: Remote node administration, cross-node data migration

// Execute on physical node 3
auto query = chi::PoolQuery::Physical(3);
client.NodeDiagnostics(query);

7. Dynamic Mode (Recommended for Create Operations)

chi::PoolQuery::Dynamic()

Purpose: Intelligent routing with automatic caching optimization
Use Case: Create operations that benefit from local cache checking
Behavior: Uses dynamic scheduling (ExecMode::kDynamicSchedule) for cache optimization
1. Check if pool exists locally using PoolManager
2. If pool exists: change pool_query to Local (execute locally using existing pool)
3. If pool doesn't exist: change pool_query to Broadcast (create pool on all nodes)
Benefits:
- Avoids redundant pool creation attempts
- Eliminates unnecessary network overhead for existing pools
- Automatic fallback to broadcast creation when needed
Example: Container creation with automatic caching

// Recommended: Use Dynamic() for Create operations
const chi::PoolId custom_pool_id(7000, 0);
client.Create(chi::PoolQuery::Dynamic(), "my_pool", custom_pool_id);

// Dynamic scheduling will:
// - Check local cache for "my_pool"
// - If found: switch to Local mode (fast path)
// - If not found: switch to Broadcast mode (creation path)

PoolQuery Usage Guidelines

Best Practices

Never use null queries: Always specify an explicit PoolQuery type
Default to Dynamic for Create: Use PoolQuery::Dynamic() for container creation to enable automatic caching optimization
Alternative: Use Broadcast or Local explicitly:
- Use Broadcast() when you want to force distributed creation regardless of cache
- Use Local() in MPI jobs when you want node-local containers only
Consider locality: Prefer local execution to minimize network overhead for regular operations
Use appropriate granularity: Match routing mode to operation scope

Common Patterns

Container Creation Pattern (Recommended):

// Recommended: Use Dynamic for automatic cache optimization
// This checks local cache first and falls back to broadcast creation if needed
const chi::PoolId custom_pool_id(7000, 0);
client.Create(chi::PoolQuery::Dynamic(), "my_pool_name", custom_pool_id);

Container Creation Pattern (Explicit Broadcast):

// Alternative: Use Broadcast to force distributed creation regardless of cache
// This ensures the container is created across all nodes in distributed environments
const chi::PoolId custom_pool_id(7000, 0);
client.Create(chi::PoolQuery::Broadcast(), "my_pool_name", custom_pool_id);

Container Creation Pattern (MPI Environments):

// In MPI jobs, Local may be more efficient for node-local containers
// Use Local when you want node-local containers only
const chi::PoolId custom_pool_id(7000, 0);
client.Create(chi::PoolQuery::Local(), "my_pool_name", custom_pool_id);

Load-Balanced Operations:

// Use hash-based routing for even distribution
for (const auto& item : items) {
  u32 hash = ComputeHash(item.id);
  auto query = chi::PoolQuery::DirectHash(hash);
  client.Process(query, item);
}

Batch Processing:

// Process containers in chunks
const u32 total_containers = pool_info->num_containers_;
const u32 batch_size = 10;
for (u32 offset = 0; offset < total_containers; offset += batch_size) {
  u32 count = std::min(batch_size, total_containers - offset);
  auto query = chi::PoolQuery::Range(offset, count);
  client.BatchProcess(query, batch_data);
}

Runtime Routing Implementation

The runtime uses PoolQuery to determine task routing through several stages:

Query Validation: Ensures the query parameters are valid
Container Resolution: Maps query to specific container(s)
Task Distribution: Routes task to appropriate worker queues
Load Balancing: Applies distribution strategies for multi-container queries

PoolQuery in Task Definitions

Tasks must include PoolQuery in their constructors (no allocator parameter needed):

class CustomTask : public chi::Task {
 public:
  CustomTask(const chi::TaskId &task_id,
             const chi::PoolId &pool_id,
             const chi::PoolQuery &pool_query,  // Required parameter
             /* custom parameters */)
      : chi::Task(task_id, pool_id, pool_query, method_id) {
    // Task initialization
  }
};

Advanced PoolQuery Features

Query Introspection

PoolQuery query = PoolQuery::Range(0, 10);

// Check routing mode
if (query.IsRangeMode()) {
  u32 offset = query.GetRangeOffset();
  u32 count = query.GetRangeCount();
  // Process range parameters
}

// Get routing mode enum
RoutingMode mode = query.GetRoutingMode();
switch (mode) {
  case RoutingMode::Local:
    // Handle local routing
    break;
  case RoutingMode::Broadcast:
    // Handle broadcast
    break;
  // ... other cases
}

Combining with Task Priorities

// High-priority broadcast
auto query = chi::PoolQuery::Broadcast();
auto task = ipc_manager->NewTask<UpdateTask>(
    chi::CreateTaskId(),
    pool_id,
    query,
    update_data
);
return ipc_manager->Send(task);

Troubleshooting PoolQuery Issues

Common Errors:

Null Query Error: "NEVER use a null pool query"
- Solution: Always use a factory method like PoolQuery::Local()
Invalid Container ID: Container not found for DirectId query
- Solution: Verify container exists before using DirectId
Range Out of Bounds: Range exceeds available containers
- Solution: Check pool size before creating Range queries
Node ID Invalid: Physical node ID doesn't exist
- Solution: Validate node IDs against cluster configuration

Client-Server Communication

Client Implementation Patterns

Chimaera uses an async-only client API pattern. All client operations are asynchronous, returning chi::Future<TaskType> objects. This design:

Enables parallel task submission for better performance
Provides consistent API across all operations
Allows fine-grained control over task completion timing
Simplifies the codebase by eliminating duplicate sync/async methods

Async Create Pattern

IMPORTANT: All ChiMod clients MUST update their pool_id_ field with the actual pool ID returned from completed CreateTask operations. This is essential because:

CreateTask operations may return a different pool ID than initially specified
Pool creation may reuse existing pools with different IDs
Subsequent client operations depend on the correct pool ID

Required Pattern for All Client AsyncCreate Methods:

chi::Future<CreateTask> AsyncCreate(const chi::PoolQuery& pool_query,
                                    const std::string& pool_name,
                                    const chi::PoolId& custom_pool_id,
                                    /* other module-specific parameters */) {
    auto* ipc_manager = CHI_IPC;
    auto task = ipc_manager->NewTask<CreateTask>(
        chi::CreateTaskId(),
        chi::kAdminPoolId,  // Always use admin pool for CreateTask
        pool_query,
        CreateParams::chimod_lib_name,
        pool_name,
        custom_pool_id,
        /* module-specific parameters */
        this  // Client pointer for PostWait callback
    );
    return ipc_manager->Send(task);
}

Required Parameters for All AsyncCreate Methods:

pool_query: Task routing strategy (use Dynamic() recommended, Broadcast() for non-MPI, Local() for MPI)
pool_name: User-provided name for the pool (must be unique, used as file path for file-based modules)
custom_pool_id: Explicit pool ID for the container being created (must not be null)
Module-specific parameters: Additional parameters specific to the ChiMod (e.g., BDev type, size)

Why Pool ID Update Is Required:

Pool Reuse: CreateTask is actually a GetOrCreatePoolTask that may return an existing pool
ID Assignment: The admin ChiMod may assign a different pool ID than requested
Client Consistency: All subsequent operations must use the correct pool ID
Distributed Operation: Pool IDs must be consistent across all client instances

Usage Pattern (Caller Side):

// Create a client and async create the pool
chimaera::bdev::Client bdev_client;
const chi::PoolId pool_id(7000, 0);

auto task = bdev_client.AsyncCreate(
    chi::PoolQuery::Dynamic(),
    "/path/to/storage.dat",
    pool_id,
    chimaera::bdev::BdevType::kFile,
    1024 * 1024 * 1024);  // 1GB

// Wait for completion
task.Wait();

// Check result
if (task->GetReturnCode() != 0) {
    std::cerr << "Create failed: " << task->error_message_.str() << std::endl;
    return;
}

// The client's pool_id_ is updated via PostWait callback
// Now can use the client for operations
auto read_task = bdev_client.AsyncRead(chi::PoolQuery::Local(), block, buffer, size);
read_task.Wait();

Examples of Correct AsyncCreate Implementation:

// Admin client AsyncCreate method
chi::Future<CreateTask> AsyncCreate(const chi::PoolQuery& pool_query,
                                    const std::string& pool_name,
                                    const chi::PoolId& custom_pool_id) {
    auto* ipc_manager = CHI_IPC;
    auto task = ipc_manager->NewTask<CreateTask>(
        chi::CreateTaskId(),
        chi::kAdminPoolId,
        pool_query,
        CreateParams::chimod_lib_name,
        pool_name,
        custom_pool_id,
        this);
    return ipc_manager->Send(task);
}

// BDev client AsyncCreate method (with module-specific parameters)
chi::Future<CreateTask> AsyncCreate(const chi::PoolQuery& pool_query,
                                    const std::string& pool_name,
                                    const chi::PoolId& custom_pool_id,
                                    BdevType bdev_type,
                                    chi::u64 total_size = 0,
                                    chi::u32 io_depth = 128,
                                    chi::u32 alignment = 4096) {
    auto* ipc_manager = CHI_IPC;
    auto task = ipc_manager->NewTask<CreateTask>(
        chi::CreateTaskId(),
        chi::kAdminPoolId,
        pool_query,
        CreateParams::chimod_lib_name,
        pool_name,
        custom_pool_id,
        bdev_type, total_size, io_depth, alignment,
        this);
    return ipc_manager->Send(task);
}

Common Mistakes to Avoid:

❌ Using null PoolId for custom_pool_id: Create operations REQUIRE explicit, non-null pool IDs
❌ Forgetting PostWait callback: Ensure client pointer is passed to NewTask for pool_id_ update
❌ Using original pool_id_: The task may return a different pool ID than initially specified
❌ Accessing results before Wait(): Always call task.Wait() before reading task fields
❌ Implementing synchronous wrappers: Use async-only pattern, let callers handle waiting
❌ Using Local instead of Dynamic/Broadcast: Use Dynamic() (recommended) or Broadcast() for distributed container creation

Critical Validation:

The runtime validates that custom_pool_id is not null during Create operations. If a null PoolId is provided, the Create operation will fail with an error:

// WRONG - This will fail with error
chi::PoolId null_id;  // Null pool ID
client.Create(chi::PoolQuery::Broadcast(), "my_pool", null_id);
// Error: "Cannot create pool with null PoolId. Explicit pool IDs are required."

// CORRECT - Always provide explicit pool IDs
const chi::PoolId custom_pool_id(7000, 0);
client.Create(chi::PoolQuery::Broadcast(), "my_pool", custom_pool_id);

This pattern is mandatory for all ChiMod clients and ensures correct pool ID management throughout the client lifecycle.

Memory Segments

Three shared memory segments are used:

Main Segment: Tasks and control structures
Client Data Segment: User data buffers
Runtime Data Segment: Runtime-only data

IPC Queue

Tasks are submitted via the IPC manager:

// Client side - create and submit task, returns Future
auto task = ipc_manager->NewTask<CustomTask>(...);
auto future = ipc_manager->Send(task);

// Wait for completion
future.Wait();

// Access results
auto result = future->output_field_;

Memory Management

Task Memory Allocation

Tasks are allocated in private memory using standard new/delete. Use standard C++ types (std::string, std::vector) for task data fields:

// Standard C++ types work in task definitions
std::string my_string = "initial value";
std::vector<int> my_vec = {1, 2, 3};

Best Practices

Use standard C++ types (std::string, std::vector) for task data fields
Use FullPtr for cross-process references
Let framework handle task cleanup via ipc_manager->DelTask()

Task Allocation and Deallocation Pattern

// Client side - allocation (NewTask uses main allocator automatically)
auto task = ipc_manager->NewTask<CustomTask>(
    chi::CreateTaskId(),
    pool_id_,
    pool_query,
    input_data,
    operation_id);

// Client side - cleanup (after task completion)
ipc_manager->DelTask(task);

// Runtime side - automatic cleanup (no code needed)
// Framework Del dispatcher calls ipc_manager->DelTask() automatically

CHI_IPC Buffer Allocation

The CHI_IPC singleton provides centralized buffer allocation for shared memory operations in client code. Use this for allocating temporary buffers that need to be shared between client and runtime processes.

Important: AllocateBuffer returns hipc::FullPtr<char>, not hipc::ShmPtr<>. It is NOT a template function.

Basic Usage

#include <chimaera/chimaera.h>

// Get the IPC manager singleton
auto* ipc_manager = CHI_IPC;

// Allocate a buffer in shared memory (returns FullPtr<char>)
size_t buffer_size = 1024;
hipc::FullPtr<char> buffer_ptr = ipc_manager->AllocateBuffer(buffer_size);

// Use the buffer (example: copy data into it)
char* buffer_data = buffer_ptr.ptr_;
memcpy(buffer_data, source_data, data_size);

// Alternative: Use directly
strncpy(buffer_ptr.ptr_, "example data", buffer_size);

// The buffer will be automatically freed when buffer_ptr goes out of scope
// or when explicitly deallocated by the framework

Use Cases for CHI_IPC Buffers

Temporary data transfer: When passing large data to tasks
Intermediate storage: For computations that need shared memory
I/O operations: Reading/writing data that needs to be accessible by runtime

Best Practices

// ✅ Good: Use CHI_IPC for temporary shared buffers
auto* ipc_manager = CHI_IPC;
hipc::FullPtr<char> temp_buffer = ipc_manager->AllocateBuffer(data_size);

// ✅ Good: Use std::string/std::vector for task data fields
std::string task_string = "persistent data";

// ❌ Avoid: Don't use CHI_IPC for small, simple task parameters
// Use standard types directly in task definitions instead

Task Data Structures

Use standard C++ types for task data fields. The framework handles serialization automatically.

Strings and Vectors

// Task definition using standard types
struct CustomTask : public chi::Task {
  INOUT std::string input_data_;
  INOUT std::string output_data_;

  // Default constructor
  CustomTask() : chi::Task() {}

  // Emplace constructor
  explicit CustomTask(const chi::TaskId& task_id,
                      const chi::PoolId& pool_id,
                      const chi::PoolQuery& pool_query,
                      const std::string& input)
      : chi::Task(task_id, pool_id, pool_query, Method::kCustom),
        input_data_(input) {}

  std::string getResult() const {
    return output_data_;
  }
};

// Task definition using standard vector
struct ProcessArrayTask : public chi::Task {
  INOUT std::vector<chi::u32> data_array_;
  INOUT std::vector<chi::f32> result_array_;

  // Default constructor
  ProcessArrayTask() : chi::Task() {}

  // Emplace constructor
  explicit ProcessArrayTask(const chi::TaskId& task_id,
                            const chi::PoolId& pool_id,
                            const chi::PoolQuery& pool_query,
                            const std::vector<chi::u32>& input_data)
      : chi::Task(task_id, pool_id, pool_query, Method::kProcessArray),
        data_array_(input_data) {}
};

Serialization Support

Standard types automatically support serialization for task communication:

// Task definition - no manual serialization needed
struct SerializableTask : public chi::Task {
  INOUT std::string message_;
  INOUT std::vector<chi::u64> timestamps_;

  // Cereal automatically handles standard types
  template<class Archive>
  void serialize(Archive& ar) {
    ar(message_, timestamps_);  // Works automatically
  }
};

GPU-Compatible Data Structures

For GPU-compatible modules, HSHM provides shared-memory data structures (hshm::string, hshm::ipc::vector, hshm::ipc::ring_buffer) with cross-platform annotations. See the Vector Guide, Ring Buffer Guide, and String Guide for details.

Bulk Transfer Support with ar.bulk

For tasks that involve large data transfers (such as I/O operations), Chimaera provides ar.bulk() for efficient bulk data serialization. This feature integrates with the Lightbeam networking layer to enable zero-copy data transfer and RDMA optimization.

Overview

The ar.bulk() method marks data pointers for bulk transfer during task serialization. This is essential for:

Large I/O Operations: Read/write tasks with multi-megabyte payloads
Zero-Copy Transfer: Avoiding unnecessary data copies during serialization
RDMA Optimization: Preparing data for remote direct memory access
Distributed Execution: Sending tasks with large data buffers across nodes

Bulk Transfer Flags

Two flags control bulk transfer behavior:

// Defined in hermes_shm/lightbeam/lightbeam.h
#define BULK_EXPOSE  // Metadata only - no data transfer (receiver allocates)
#define BULK_XFER    // Marks bulk for actual data transmission

Flag Usage:

BULK_EXPOSE: Sender exposes metadata (size, pointer info) but doesn't transfer data
- Receiver sees the bulk size and allocates local buffer
- Useful when receiver will write data (e.g., Read operations)
BULK_XFER: Marks bulk for actual data transmission
- Data is transferred over network
- Used when sender has data to send (e.g., Write operations)

Basic Usage Pattern

Write Operation (Sender Has Data)

For write operations, the sender has data to transfer:

struct WriteTask : public chi::Task {
  IN Block block_;              // Block to write to
  IN hipc::ShmPtr<> data_;       // Data buffer pointer
  IN size_t length_;            // Data length
  OUT chi::u64 bytes_written_;  // Result

  /** Serialize IN and INOUT parameters */
  template <typename Archive>
  void SerializeIn(Archive &ar) {
    ar(block_, length_);
    // Use BULK_XFER to transfer data from sender to receiver
    ar.bulk(data_, length_, BULK_XFER);
  }

  /** Serialize OUT and INOUT parameters */
  template <typename Archive>
  void SerializeOut(Archive &ar) {
    ar(bytes_written_);
    // No bulk transfer needed for output (just metadata)
  }
};

Workflow:

Client Side (SerializeIn):
- Serializes block_ and length_ metadata
- Marks data_ buffer with BULK_XFER flag
- Lightbeam transmits the data buffer to receiver
Runtime Side (Execute):
- Receives metadata and data buffer
- Executes write operation using transferred data
- Sets bytes_written_ result
Client Side (SerializeOut):
- Receives bytes_written_ result
- No bulk transfer needed for small output values

Read Operation (Receiver Needs Data)

For read operations, the receiver allocates buffer for incoming data:

struct ReadTask : public chi::Task {
  IN Block block_;              // Block to read from
  OUT hipc::ShmPtr<> data_;      // Data buffer pointer (allocated by receiver)
  INOUT size_t length_;         // Requested/actual length
  OUT chi::u64 bytes_read_;     // Result

  /** Serialize IN and INOUT parameters */
  template <typename Archive>
  void SerializeIn(Archive &ar) {
    ar(block_, length_);
    // Use BULK_EXPOSE - metadata only, receiver will allocate buffer
    ar.bulk(data_, length_, BULK_EXPOSE);
  }

  /** Serialize OUT and INOUT parameters */
  template <typename Archive>
  void SerializeOut(Archive &ar) {
    ar(length_, bytes_read_);
    // Use BULK_XFER to transfer read data back to client
    ar.bulk(data_, length_, BULK_XFER);
  }
};

Workflow:

Client Side (SerializeIn):
- Serializes block_ and length_ metadata
- Marks data_ with BULK_EXPOSE (no data sent yet)
- Receiver sees buffer size needed
Runtime Side (Execute):
- Receives metadata including buffer size
- Allocates local buffer for data_
- Executes read operation filling the buffer
- Sets length_ and bytes_read_ results
Client Side (SerializeOut):
- Marks data_ with BULK_XFER flag
- Lightbeam transfers read data back to client
- Client receives length_, bytes_read_, and data buffer

API Reference

template <typename Archive>
void ar.bulk(hipc::ShmPtr<> ptr, size_t size, uint32_t flags);

Parameters:

ptr: Pointer to data buffer (hipc::ShmPtr<>, hipc::FullPtr, or raw pointer)
size: Size of data in bytes
flags: Transfer flags (BULK_EXPOSE or BULK_XFER)

Behavior:

Records bulk transfer metadata in the archive
For BULK_XFER: Prepares data for network transmission
For BULK_EXPOSE: Records metadata only (size and pointer info)
Integrates with Lightbeam networking for actual data transfer

Advanced Pattern: Bidirectional Transfer

Some operations require data transfer in both directions:

struct ProcessTask : public chi::Task {
  INOUT hipc::ShmPtr<> data_;    // Data buffer (modified in-place)
  INOUT size_t length_;         // Buffer length

  /** Serialize IN and INOUT parameters */
  template <typename Archive>
  void SerializeIn(Archive &ar) {
    ar(length_);
    // Send input data to runtime
    ar.bulk(data_, length_, BULK_XFER);
  }

  /** Serialize OUT and INOUT parameters */
  template <typename Archive>
  void SerializeOut(Archive &ar) {
    ar(length_);
    // Send modified data back to client
    ar.bulk(data_, length_, BULK_XFER);
  }
};

Integration with Lightbeam

The ar.bulk() calls integrate seamlessly with the Lightbeam networking layer:

Archive Records Bulks:
- TaskOutputArchive stores bulk metadata in bulk_transfers_ vector
- Each bulk includes pointer, size, and flags
Lightbeam Transmission:
- Bulks marked BULK_XFER are transmitted via Send() and RecvBulks()
- Bulks marked BULK_EXPOSE provide metadata only
- Receiver inspects all bulks to determine buffer sizes
Zero-Copy Optimization:
- Data stays in original buffers during serialization
- Only pointers and metadata are serialized
- Actual data transfer handled separately by Lightbeam

Complete Example: BDev Read Task

struct ReadTask : public chi::Task {
  IN Block block_;              // Block descriptor
  OUT hipc::ShmPtr<> data_;      // Data buffer
  INOUT size_t length_;         // Buffer length
  OUT chi::u64 bytes_read_;     // Bytes actually read

  /** SHM default constructor */
  ReadTask()
      : chi::Task(), length_(0), bytes_read_(0) {}

  /** Emplace constructor */
  explicit ReadTask(const chi::TaskId &task_node,
                    const chi::PoolId &pool_id,
                    const chi::PoolQuery &pool_query,
                    const Block &block,
                    hipc::ShmPtr<> data,
                    size_t length)
      : chi::Task(task_node, pool_id, pool_query, Method::kRead),
        block_(block), data_(data), length_(length), bytes_read_(0) {}

  /** Serialize IN and INOUT parameters */
  template <typename Archive>
  void SerializeIn(Archive &ar) {
    ar(block_, length_);
    // BULK_EXPOSE: Tell receiver the buffer size, but don't send data yet
    // Receiver will allocate local buffer
    ar.bulk(data_, length_, BULK_EXPOSE);
  }

  /** Serialize OUT and INOUT parameters */
  template <typename Archive>
  void SerializeOut(Archive &ar) {
    ar(length_, bytes_read_);
    // BULK_XFER: Transfer the read data back to client
    ar.bulk(data_, length_, BULK_XFER);
  }

  /** Copy from another ReadTask */
  void Copy(const hipc::FullPtr<ReadTask> &other) {
    // Copy task-specific fields only
    // Base Task fields are copied automatically by NewCopy
    block_ = other->block_;
    data_ = other->data_;
    length_ = other->length_;
    bytes_read_ = other->bytes_read_;
  }

  /** Aggregate results from replica */
  void Aggregate(const hipc::FullPtr<ReadTask> &other) {
    // For reads, just copy the result from the replica
    Copy(other);
  }
};

Best Practices

DO:

✅ Use BULK_XFER when sender has data to transmit (Write operations)
✅ Use BULK_EXPOSE when receiver needs to allocate buffer (Read operations)
✅ Always specify both SerializeIn() and SerializeOut() for consistency
✅ Use ar.bulk() for data buffers larger than a few KB
✅ Ensure data buffer lifetime extends until serialization completes

DON'T:

❌ Don't use ar.bulk() for small data (< 4KB) - serialize directly instead
❌ Don't forget to specify bulk size - it determines receiver buffer allocation
❌ Don't mix ar() and ar.bulk() for the same data - choose one approach
❌ Don't use BULK_EXPOSE for write operations (sender has data to send)
❌ Don't use BULK_XFER in SerializeIn for read operations (no data to send yet)

Performance Considerations

Buffer Alignment: Ensure buffers are properly aligned (typically 4KB for I/O operations)
Size Thresholds: Use bulk transfer for data > 4KB; use regular serialization for smaller data
Zero-Copy: Lightbeam can use zero-copy techniques when data is in shared memory
RDMA Ready: The bulk transfer API is designed for future RDMA transport integration

Troubleshooting

Common Issues:

Missing Data Transfer:
- Ensure BULK_XFER flag is used when data should be transmitted
- Check that SerializeOut uses BULK_XFER for read operations
Buffer Size Mismatch:
- Verify length_ parameter matches actual buffer size
- Ensure receiver allocates buffer matching the exposed size
Serialization Order:
- Serialize metadata (block, length) before ar.bulk() call
- This ensures receiver knows buffer size before allocating

Build System Integration

CMakeLists.txt Template

ChiMod CMakeLists.txt files should use the standardized ChimaeraCommon.cmake functions for consistency and proper configuration:

cmake_minimum_required(VERSION 3.10)

# Create both client and runtime libraries for your module
# This creates targets: ${NAMESPACE}_${CHIMOD_NAME}_runtime and ${NAMESPACE}_${CHIMOD_NAME}_client
# CMake aliases: ${NAMESPACE}::${CHIMOD_NAME}_runtime and ${NAMESPACE}::${CHIMOD_NAME}_client
add_chimod_client(
  CHIMOD_NAME YOUR_MODULE_NAME
  SOURCES src/YOUR_MODULE_NAME_client.cc
)
add_chimod_runtime(
  CHIMOD_NAME YOUR_MODULE_NAME
  SOURCES src/YOUR_MODULE_NAME_runtime.cc src/autogen/YOUR_MODULE_NAME_lib_exec.cc
)

# Installation is automatic - no separate install_chimod() call required

CMakeLists.txt Guidelines

DO:

Use add_chimod_client() and add_chimod_runtime() utility functions (installation is automatic)
Set CHIMOD_NAME to your module's name
List source files explicitly in SOURCES parameters
Include autogen source files in runtime SOURCES
Keep the CMakeLists.txt minimal and consistent

DON'T:

Use manual add_library() calls - use the utilities instead
Call install_chimod() separately - it's handled automatically
Include relative paths like ../include/* - use proper include directories
Set custom compile definitions - the utilities handle this
Manually configure target properties - the utilities provide standard settings

ChiMod Build Functions Reference

`add_chimod_client()` Function

Creates a ChiMod client library target with automatic dependency management.

add_chimod_client(
  SOURCES source_file1.cc source_file2.cc ...
  [COMPILE_DEFINITIONS definition1 definition2 ...]
  [LINK_LIBRARIES library1 library2 ...]
  [LINK_DIRECTORIES directory1 directory2 ...]
  [INCLUDE_LIBRARIES target1 target2 ...]
  [INCLUDE_DIRECTORIES directory1 directory2 ...]
)

Parameters:

SOURCES (required): List of source files for the client library
COMPILE_DEFINITIONS (optional): Additional preprocessor definitions beyond automatic ones
LINK_LIBRARIES (optional): Additional libraries to link beyond automatic dependencies
LINK_DIRECTORIES (optional): Additional library search directories
INCLUDE_LIBRARIES (optional): Target libraries whose include directories should be inherited
INCLUDE_DIRECTORIES (optional): Additional include directories beyond automatic ones

Automatic Behavior:

Creates target: ${NAMESPACE}_${CHIMOD_NAME}_client
Creates alias: ${NAMESPACE}::${CHIMOD_NAME}_client
Automatically links core Chimaera library (chimaera::cxx or hermes_shm::cxx)
For non-admin ChiMods: automatically links chimaera_admin_client
Automatically includes module headers from include/ directory
Installs library and headers with proper CMake export configuration

Example:

add_chimod_client(
  SOURCES src/my_module_client.cc
  COMPILE_DEFINITIONS MY_MODULE_DEBUG=1
  LINK_LIBRARIES additional_lib
  INCLUDE_DIRECTORIES ${EXTERNAL_INCLUDE_DIR}
)

`add_chimod_runtime()` Function

Creates a ChiMod runtime library target with automatic dependency management.

add_chimod_runtime(
  SOURCES source_file1.cc source_file2.cc ...
  [COMPILE_DEFINITIONS definition1 definition2 ...]
  [LINK_LIBRARIES library1 library2 ...]
  [LINK_DIRECTORIES directory1 directory2 ...]
  [INCLUDE_LIBRARIES target1 target2 ...]
  [INCLUDE_DIRECTORIES directory1 directory2 ...]
)

Parameters:

SOURCES (required): List of source files for the runtime library (include autogen files)
COMPILE_DEFINITIONS (optional): Additional preprocessor definitions beyond automatic ones
LINK_LIBRARIES (optional): Additional libraries to link beyond automatic dependencies
LINK_DIRECTORIES (optional): Additional library search directories
INCLUDE_LIBRARIES (optional): Target libraries whose include directories should be inherited
INCLUDE_DIRECTORIES (optional): Additional include directories beyond automatic ones

Automatic Behavior:

Creates target: ${NAMESPACE}_${CHIMOD_NAME}_runtime
Creates alias: ${NAMESPACE}::${CHIMOD_NAME}_runtime
Automatically defines CHIMAERA_RUNTIME=1 for runtime code
Automatically links core Chimaera library (chimaera::cxx or hermes_shm::cxx)
Automatically links rt library for POSIX real-time support
For non-admin ChiMods: automatically links both chimaera_admin_runtime and chimaera_admin_client
Automatically includes module headers from include/ directory
Links to client library if it exists
Installs library and headers with proper CMake export configuration

Example:

add_chimod_runtime(
  SOURCES 
    src/my_module_runtime.cc 
    src/autogen/my_module_lib_exec.cc
  COMPILE_DEFINITIONS MY_MODULE_RUNTIME_DEBUG=1
  LINK_LIBRARIES libaio
  INCLUDE_DIRECTORIES ${LIBAIO_INCLUDE_DIR}
)

Configuration Requirements

Before using these functions, ensure your ChiMod directory contains:

chimaera_mod.yaml: Module configuration file defining the module name
```
module_name: my_module
```
Include structure: Headers organized as include/[namespace]/[module_name]/
Source files: Client and runtime implementations with autogen files for runtime

Typical Usage Pattern

Most ChiMods use both functions together:

# Create client library
add_chimod_client(
  SOURCES src/my_module_client.cc
)

# Create runtime library
add_chimod_runtime(
  SOURCES 
    src/my_module_runtime.cc 
    src/autogen/my_module_lib_exec.cc
)

Function Dependencies: Both functions automatically handle common dependencies:

Core Library: Automatically links appropriate Chimaera core library
Runtime Libraries: add_chimod_runtime() automatically links rt library for async I/O operations
Admin ChiMod Integration: For non-admin chimods, both functions automatically link admin libraries and include admin headers
Client-Runtime Linking: Runtime automatically links to client library when both exist

This eliminates the need for manual dependency configuration in individual ChiMod CMakeLists.txt files.

Target Naming and Linking

Target Format

The system uses underscore-based target names for consistency with CMake conventions:

Target Names:

Runtime: ${NAMESPACE}_${CHIMOD_NAME}_runtime (e.g., chimaera_admin_runtime)
Client: ${NAMESPACE}_${CHIMOD_NAME}_client (e.g., chimaera_admin_client)

CMake Aliases (Recommended):

Runtime: ${NAMESPACE}::${CHIMOD_NAME}_runtime (e.g., chimaera::admin_runtime)
Client: ${NAMESPACE}::${CHIMOD_NAME}_client (e.g., chimaera::admin_client)

Package Names:

Format: ${NAMESPACE}_${CHIMOD_NAME} (e.g., chimaera_admin)
Used with: find_package(chimaera_admin REQUIRED)
Core package: chimaera (automatically included by find_package(chimaera))

External Application Linking (Based on test/unit/external/CMakeLists.txt):

# External applications typically only need ChiMod client libraries
# Core library dependencies are automatically included
find_package(chimaera REQUIRED)
find_package(chimaera_admin REQUIRED)

target_link_libraries(my_external_app
  chimaera::admin_client            # Admin client (includes all dependencies)
  ${CMAKE_THREAD_LIBS_INIT}         # Threading support
)
# Note: chimaera::cxx is automatically included by ChiMod client libraries

Internal Development Linking:

# For internal development within the Chimaera project
target_link_libraries(internal_app
  chimaera::admin_client            # ChiMod client
  chimaera::bdev_client             # BDev client 
  # Core dependencies are automatically linked by ChiMod libraries
)

Automatic Dependencies

The ChiMod build functions automatically handle common dependencies:

For Runtime Code:

rt library: Automatically linked for POSIX real-time library support (async I/O operations)
Admin ChiMod: Automatically linked for all non-admin ChiMods (both runtime and client)
Admin includes: Automatically added to include directories for non-admin ChiMods

For All ChiMods:

Creates both client and runtime shared libraries
Sets proper include directories (include/, ${CMAKE_SOURCE_DIR}/include)
Automatically links core Chimaera dependencies
Sets required compile definitions (CHI_CHIMOD_NAME, CHI_NAMESPACE)
Configures proper build flags and settings

Simplified Development: ChiMod developers no longer need to manually specify:

rt library dependencies
Admin ChiMod dependencies (chimaera_admin_runtime, chimaera_admin_client)
Admin include directories
Core Chimaera library dependencies
Common linking patterns

Important Note for External Applications: External applications linking against ChiMod libraries receive all necessary dependencies automatically. The ChiMod client libraries include the core Chimaera library as a transitive dependency.

Automatic Installation: The ChiMod build functions automatically handle installation:

Installs libraries to the correct destination
Sets up proper runtime paths
Configures installation properties
Includes automatic dependencies in exported CMake packages
No separate install_chimod() call required

Targets Created by ChiMod Functions

When you call add_chimod_client() and add_chimod_runtime() with CHIMOD_NAME YOUR_MODULE_NAME, they create the following CMake targets using the underscore-based naming format:

Target Naming System

Actual Target Names: ${NAMESPACE}_${CHIMOD_NAME}_runtime and ${NAMESPACE}_${CHIMOD_NAME}_client
CMake Aliases: ${NAMESPACE}::${CHIMOD_NAME}_runtime and ${NAMESPACE}::${CHIMOD_NAME}_client (recommended)
Package Names: ${NAMESPACE}_${CHIMOD_NAME} (for find_package())

Runtime Target: `${NAMESPACE}_${CHIMOD_NAME}_runtime`

Target Name: chimaera_YOUR_MODULE_NAME_runtime (e.g., chimaera_admin_runtime, chimaera_MOD_NAME_runtime)
CMake Alias: chimaera::YOUR_MODULE_NAME_runtime (e.g., chimaera::admin_runtime) - recommended for linking
Type: Shared library (.so file)
Purpose: Contains server-side execution logic, runs in the Chimaera runtime process
Compile Definitions:
- CHI_CHIMOD_NAME="${CHIMOD_NAME}" - Module name for runtime identification
- CHI_NAMESPACE="${NAMESPACE}" - Project namespace
Include Directories:
- include/ - Local module headers
- $\{CMAKE_SOURCE_DIR\}/include - Chimaera framework headers
Dependencies: Links against chimaera library, rt library (automatic), admin dependencies (automatic)

Client Target: `${NAMESPACE}_${CHIMOD_NAME}_client`

Target Name: chimaera_YOUR_MODULE_NAME_client (e.g., chimaera_admin_client, chimaera_MOD_NAME_client)
CMake Alias: chimaera::YOUR_MODULE_NAME_client (e.g., chimaera::admin_client) - recommended for linking
Type: Shared library (.so file)
Purpose: Contains client-side API, runs in user processes
Compile Definitions:
- CHI_CHIMOD_NAME="${CHIMOD_NAME}" - Module name for client identification
- CHI_NAMESPACE="${NAMESPACE}" - Project namespace
Include Directories:
- include/ - Local module headers
- $\{CMAKE_SOURCE_DIR\}/include - Chimaera framework headers
Dependencies: Links against chimaera library, admin dependencies (automatic)

Namespace Configuration

The namespace is automatically read from chimaera_repo.yaml files. The system searches up the directory tree from the CMakeLists.txt location to find the first chimaera_repo.yaml file:

Main project chimaera_repo.yaml:

namespace: chimaera  # Main project namespace

Module repository chimods/chimaera_repo.yaml:

namespace: chimods   # Modules get this namespace

This means modules in the chimods/ directory will use the "chimods" namespace, creating targets like chimods_admin_runtime, while other components use the main project namespace.

Example Output Files

For a module named "admin" with namespace "chimods" (from chimods/chimaera_repo.yaml), the build produces:

build/bin/libchimods_admin_runtime.so    # Runtime library  
build/bin/libchimods_admin_client.so     # Client library

Using the Targets

You can reference these targets in your CMakeLists.txt using the full target name:

# Add custom properties to the runtime target
set_target_properties(chimaera_${CHIMOD_NAME}_runtime PROPERTIES
  VERSION 1.0.0
  SOVERSION 1
)

# Add additional dependencies if needed
target_link_libraries(chimaera_${CHIMOD_NAME}_runtime PRIVATE some_external_lib)

# Or use the global property to get the actual target name
get_property(RUNTIME_TARGET GLOBAL PROPERTY ${CHIMOD_NAME}_RUNTIME_TARGET)
target_link_libraries(${RUNTIME_TARGET} PRIVATE some_external_lib)

Module Configuration (chimaera_mod.yaml)

name: MOD_NAME
version: 1.0.0
description: "Module description"
author: "Author Name"
methods:
  - kCreate
  - kCustom
dependencies: []

Auto-Generated Method Files

Each module requires an auto-generated methods file at include/[namespace]/MOD_NAME/autogen/MOD_NAME_methods.h. This file must:

Include chimaera.h: Required for GLOBAL_CONST macro
Use namespace constants: Define methods as GLOBAL_CONST chi::u32 values
Follow naming convention: Method names should start with k (e.g., kCreate, kCustom)

Required Template:

#ifndef MOD_NAME_AUTOGEN_METHODS_H_
#define MOD_NAME_AUTOGEN_METHODS_H_

#include <chimaera/chimaera.h>

namespace chimaera::MOD_NAME {

namespace Method {
  // Standard inherited methods (always include these)
  GLOBAL_CONST chi::u32 kCreate = 0;
  GLOBAL_CONST chi::u32 kDestroy = 1;
  GLOBAL_CONST chi::u32 kNodeFailure = 2;
  GLOBAL_CONST chi::u32 kRecover = 3;
  GLOBAL_CONST chi::u32 kMigrate = 4;
  GLOBAL_CONST chi::u32 kUpgrade = 5;
  
  // Module-specific methods (customize these)
  GLOBAL_CONST chi::u32 kCustom = 10;
  // Add more module-specific methods starting from 10+
}

} // namespace chimaera::MOD_NAME

#endif // MOD_NAME_AUTOGEN_METHODS_H_

Important Notes:

GLOBAL_CONST is required: Do not use const or constexpr - use GLOBAL_CONST
Include chimaera.h: This header defines the GLOBAL_CONST macro
Standard methods 0-5: Always include the inherited methods (kCreate through kUpgrade)
Custom methods 10+: Start custom methods from ID 10 to avoid conflicts
No static casting needed: Use method values directly (e.g., method_ = Method::kCreate;)

Runtime Entry Points

Use the CHI_TASK_CC macro to define module entry points:

// At the end of your runtime source file (_runtime.cc)
CHI_TASK_CC(your_namespace::YourContainerClass)

This macro automatically generates all required extern "C" functions and gets the module name from YourContainerClass::CreateParams::chimod_lib_name:

alloc_chimod() - Creates container instance
new_chimod() - Creates and initializes container
get_chimod_name() - Returns module name
destroy_chimod() - Destroys container instance

Requirements for CHI_TASK_CC to work:

Your runtime class must define a public typedef: using CreateParams = your_namespace::CreateParams;
Your CreateParams struct must have: static constexpr const char* chimod_lib_name = "your_module_name";

IMPORTANT: The chimod_lib_name should NOT include the _runtime suffix. The module manager automatically appends _runtime when loading the library. For example, use "chimaera_mymodule" not "chimaera_mymodule_runtime".

Example:

namespace chimaera::your_module {

struct CreateParams {
  static constexpr const char* chimod_lib_name = "chimaera_your_module";
  // ... other parameters
};

class Runtime : public chi::Container {
public:
  using CreateParams = chimaera::your_module::CreateParams;  // Required for CHI_TASK_CC
  // ... rest of class
};

}  // namespace chimaera::your_module

is_chimaera_chimod_ - Module identification flag

Auto-Generated Code Pattern

Overview

ChiMods use auto-generated source files to implement the Container virtual APIs (Run, Monitor, Del, SaveIn, LoadIn, SaveOut, LoadOut, NewCopy). This approach provides consistent dispatch logic and reduces boilerplate code.

New Pattern: Auto-Generated Source Files

Instead of using inline functions in headers, ChiMods now use auto-generated .cc source files that implement the virtual methods directly. This pattern:

Eliminates inline dispatchers: Virtual methods are implemented directly in auto-generated source
Reduces header dependencies: No need to include autogen headers in runtime files
Improves compilation: Source files compile once, not in every including file
Maintains consistency: All ChiMods use the same dispatch pattern

File Structure

src/
└── autogen/
    └── MOD_NAME_lib_exec.cc    # Auto-generated virtual method implementations

The auto-generated source file contains:

Container virtual method implementations (Run, Del, etc.)
Switch-case dispatch based on method IDs
Proper task type casting and method invocation
IPC manager integration for task lifecycle management

Auto-Generated Source Template

/**
 * Auto-generated execution implementation for MOD_NAME ChiMod
 * Implements Container virtual APIs directly using switch-case dispatch
 * 
 * This file is autogenerated - do not edit manually.
 * Changes should be made to the autogen tool or the YAML configuration.
 */

#include <[namespace]/MOD_NAME/MOD_NAME_runtime.h>
#include <[namespace]/MOD_NAME/autogen/MOD_NAME_methods.h>
#include <chimaera/chimaera.h>

namespace chimaera::MOD_NAME {

//==============================================================================
// Runtime Virtual API Implementations
//==============================================================================

void Runtime::Run(chi::u32 method, hipc::FullPtr<chi::Task> task_ptr, chi::RunContext& rctx) {
  switch (method) {
    case Method::kCreate: {
      Create(task_ptr.Cast<CreateTask>(), rctx);
      break;
    }
    case Method::kDestroy: {
      Destroy(task_ptr.Cast<DestroyTask>(), rctx);
      break;
    }
    case Method::kCustom: {
      Custom(task_ptr.Cast<CustomTask>(), rctx);
      break;
    }
    default: {
      // Unknown method - do nothing
      break;
    }
  }
}

void Runtime::Del(chi::u32 method, hipc::FullPtr<chi::Task> task_ptr) {
  // Use IPC manager to deallocate task from shared memory
  auto* ipc_manager = CHI_IPC;
  
  switch (method) {
    case Method::kCreate: {
      ipc_manager->DelTask(task_ptr.Cast<CreateTask>());
      break;
    }
    case Method::kDestroy: {
      ipc_manager->DelTask(task_ptr.Cast<DestroyTask>());
      break;
    }
    case Method::kCustom: {
      ipc_manager->DelTask(task_ptr.Cast<CustomTask>());
      break;
    }
    default: {
      // For unknown methods, still try to delete from main segment
      ipc_manager->DelTask(task_ptr);
      break;
    }
  }
}

// SaveIn, LoadIn, SaveOut, LoadOut, and NewCopy follow similar patterns...

} // namespace chimaera::MOD_NAME

CMake Integration

The auto-generated source file must be included in the RUNTIME_SOURCES:

add_chimod_client(
  CHIMOD_NAME MOD_NAME
  SOURCES src/MOD_NAME_client.cc
)
add_chimod_runtime(
  CHIMOD_NAME MOD_NAME
  SOURCES src/MOD_NAME_runtime.cc src/autogen/MOD_NAME_lib_exec.cc
)

Benefits

Cleaner Runtime Code: Runtime implementations focus on business logic, not dispatching
Better Compilation: Source files compile once instead of being inlined in every header include
Consistent Pattern: All ChiMods use identical dispatch logic
Header Simplification: No need to include complex autogen headers
Better IDE Support: Proper source files work better with IDEs than inline templates

Important Notes

Auto-generated files: These files should be generated by tools, not hand-written
Do not edit: Manual changes to autogen files will be lost when regenerated
Template consistency: All ChiMods should follow the same autogen template pattern
Build integration: Autogen source files must be included in CMake build

External ChiMod Development

When developing ChiMods in external repositories (outside the main Chimaera project), you need to link against the installed Chimaera libraries and use the CMake package discovery system.

Prerequisites

Before developing external ChiMods, ensure Chimaera is properly installed:

# Configure and build Chimaera
cmake --preset debug
cmake --build build

# Install Chimaera libraries and CMake configs  
cmake --install build --prefix /usr/local

This installs:

Core Chimaera library (libcxx.so)
ChiMod libraries (libchimaera_admin_runtime.so, libchimaera_admin_client.so, etc.)
CMake package configuration files for external discovery
Header files for development

External ChiMod Repository Structure

Your external ChiMod repository should follow this structure:

my_external_chimod/
├── chimaera_repo.yaml          # Repository namespace configuration
├── CMakeLists.txt              # Root CMake configuration
├── modules/                    # ChiMod modules directory (name is flexible)
│   └── my_module/
│       ├── chimaera_mod.yaml   # Module configuration
│       ├── CMakeLists.txt      # Module build configuration  
│       ├── include/
│       │   └── [namespace]/
│       │       └── my_module/
│       │           ├── my_module_client.h
│       │           ├── my_module_runtime.h
│       │           ├── my_module_tasks.h
│       │           └── autogen/
│       │               └── my_module_methods.h
│       └── src/
│           ├── my_module_client.cc
│           ├── my_module_runtime.cc
│           └── autogen/
│               └── my_module_lib_exec.cc

Note: The directory name for modules (shown here as modules/) is flexible. You can use chimods/, components/, plugins/, or any other name that fits your project structure. The directory name doesn't need to match the namespace.

Repository Configuration (chimaera_repo.yaml)

Create a chimaera_repo.yaml file in your repository root to define the namespace:

# Repository-level configuration
namespace: myproject      # Your custom namespace (replaces "chimaera")

This namespace will be used for:

CMake target names: myproject_my_module_runtime, myproject_my_module_client
Library file names: libmyproject_my_module_runtime.so, libmyproject_my_module_client.so
C++ namespaces: myproject::my_module

Root CMakeLists.txt

Your repository's root CMakeLists.txt must find and link to the installed Chimaera packages:

cmake_minimum_required(VERSION 3.20)
project(my_external_chimod)

set(CMAKE_CXX_STANDARD_20)
set(CMAKE_CXX_STANDARD_REQUIRED ON)

# Find required Chimaera packages
# These packages are installed by 'cmake --install build --prefix /usr/local'
find_package(chimaera REQUIRED)              # Core Chimaera (automatically includes ChimaeraCommon.cmake)
find_package(chimaera_admin REQUIRED)        # Admin ChiMod (often required)

# Set CMAKE_PREFIX_PATH if Chimaera is installed in a custom location
# set(CMAKE_PREFIX_PATH "/path/to/[namespace]/install" ${CMAKE_PREFIX_PATH})

# ChimaeraCommon.cmake utilities are automatically included by find_package(chimaera)
# This provides add_chimod_client(), add_chimod_runtime(), and other build functions

# Add subdirectories containing your ChiMods
add_subdirectory(modules/my_module)  # Use your actual directory name

ChiMod CMakeLists.txt

Each ChiMod's CMakeLists.txt uses the standard Chimaera build utilities:

cmake_minimum_required(VERSION 3.20)

# Create both client and runtime libraries using standard Chimaera utilities
# These functions are provided by ChimaeraCommon.cmake (automatically included via find_package(chimaera))
# Creates targets: my_namespace_my_module_client, my_namespace_my_module_runtime
# Creates aliases: my_namespace::my_module_client, my_namespace::my_module_runtime
add_chimod_client(
  CHIMOD_NAME my_module
  SOURCES src/my_module_client.cc
)
add_chimod_runtime(
  CHIMOD_NAME my_module
  SOURCES
    src/my_module_runtime.cc 
    src/autogen/my_module_lib_exec.cc
)

# Installation is automatic - no separate install_chimod() call required
# Package name: my_namespace_my_module (for find_package)

# Optional: Add additional dependencies if your ChiMod needs external libraries
# get_property(RUNTIME_TARGET GLOBAL PROPERTY my_module_RUNTIME_TARGET)
# get_property(CLIENT_TARGET GLOBAL PROPERTY my_module_CLIENT_TARGET)
# target_link_libraries(${RUNTIME_TARGET} PRIVATE some_external_lib)
# target_link_libraries(${CLIENT_TARGET} PRIVATE some_external_lib)

External Applications Using Your ChiMod

Once installed, external applications can find and link to your ChiMod. Based on our external unit test patterns (see test/unit/external-chimod/CMakeLists.txt):

# External application CMakeLists.txt
find_package(my_namespace_my_module REQUIRED)  # Your ChiMod package
find_package(chimaera REQUIRED)                # Core Chimaera (automatically includes utilities)
find_package(chimaera_admin REQUIRED)          # Admin ChiMod (often required)

# Simple linking pattern - ChiMod libraries include all dependencies
target_link_libraries(my_external_app
  my_namespace::my_module_client    # Your ChiMod client
  chimaera::admin_client            # Admin client (if needed)
  ${CMAKE_THREAD_LIBS_INIT}         # Threading support
)
# Core Chimaera library is automatically included by ChiMod dependencies

External ChiMod Implementation

Your external ChiMod implementation follows the same patterns as internal ChiMods:

CreateParams Configuration

In your my_module_tasks.h, the CreateParams must reference your custom namespace:

struct CreateParams {
  // Your module-specific parameters
  std::string config_data_;
  chi::u32 worker_count_;
  
  // CRITICAL: Library name must match your namespace
  static constexpr const char* chimod_lib_name = "myproject_my_module";
  
  // Constructors and serialization...
};

C++ Namespace

Use your custom namespace throughout your implementation:

// In all header and source files
namespace myproject::my_module {

// Your ChiMod implementation...
class Runtime : public chi::Container {
  // Implementation...
};

class Client : public chi::ContainerClient {  
  // Implementation...
};

} // namespace myproject::my_module

Building External ChiMods

# Configure your external ChiMod project
mkdir build && cd build
cmake ..

# Build your ChiMods
make

# Optional: Install your ChiMods
make install

The build system will automatically:

Link all necessary core Chimaera dependencies
Link against chimaera::admin_client and chimaera::admin_runtime (for non-admin modules)
Generate libraries with your custom namespace: libmyproject_my_module_runtime.so
Configure proper include paths and dependencies

Usage in Applications

Applications using your external ChiMod would reference it as:

#include <chimaera/chimaera.h>
#include <[namespace]/my_module/my_module_client.h>
#include <[namespace]/admin/admin_client.h>

int main() {
  // Initialize Chimaera (client mode with embedded runtime)
  chi::CHIMAERA_INIT(chi::ChimaeraMode::kClient, true);

  // Create your ChiMod client
  const chi::PoolId pool_id = chi::PoolId(7000, 0);
  myproject::my_module::Client client(pool_id);

  // Use your ChiMod
  auto pool_query = chi::PoolQuery::Local();
  client.Create(pool_query);
}

CHIMAERA_INIT Initialization Modes

Chimaera provides a unified initialization function CHIMAERA_INIT() that supports different operational modes:

Client Mode with Embedded Runtime (Most Common):

// Initialize both client and runtime in single process
// Recommended for: Applications, unit tests, and benchmarks
chi::CHIMAERA_INIT(chi::ChimaeraMode::kClient, true);

Client-Only Mode (Advanced):

// Initialize client only - connects to external runtime
// Recommended for: Production deployments with separate runtime process
chi::CHIMAERA_INIT(chi::ChimaeraMode::kClient, false);

Runtime/Server Mode (Advanced):

// Initialize runtime/server only - no client
// Recommended for: Standalone runtime processes
chi::CHIMAERA_INIT(chi::ChimaeraMode::kServer, false);

Usage Example (Unit Tests/Benchmarks):

#include <chimaera/chimaera.h>
#include <[namespace]/my_module/my_module_client.h>

TEST(MyModuleTest, BasicOperation) {
  // Initialize both client and runtime in single process
  chi::CHIMAERA_INIT(chi::ChimaeraMode::kClient, true);

  // Create your ChiMod client
  const chi::PoolId pool_id = chi::PoolId(7000, 0);
  myproject::my_module::Client client(pool_id);

  // Test your ChiMod functionality
  auto pool_query = chi::PoolQuery::Local();
  client.Create(pool_query, "test_pool");

  // Assertions and test logic...
}

When to Use Each Mode:

Client with Embedded Runtime (kClient, true): Unit tests, benchmarks, and standalone applications
Client Only (kClient, false): Production applications connecting to existing external runtime
Server/Runtime Only (kServer, false): Dedicated runtime processes

Dependencies and Installation Paths

External ChiMod development requires these components to be installed:

Core Package: chimaera (includes main library and ChimaeraCommon.cmake utilities)
Admin ChiMod: chimaera::admin_client and chimaera::admin_runtime (required for most modules)
CMake Configs: Package discovery files (automatically installed with packages)
Headers: All Chimaera framework headers (installed with packages)

All build utilities (add_chimod_client(), add_chimod_runtime()) are automatically available via find_package(chimaera).

If Chimaera is installed in a custom location, set CMAKE_PREFIX_PATH:

export CMAKE_PREFIX_PATH="/path/to/[namespace]/install:$CMAKE_PREFIX_PATH"

Common External Development Issues

ChimaeraCommon.cmake Not Found:

Ensure Chimaera was installed with cmake --install build --prefix <path>
Verify CMAKE_PREFIX_PATH includes the Chimaera installation directory
Check that find_package(chimaera REQUIRED) succeeded (ChimaeraCommon.cmake is included automatically)

Library Name Mismatch:

Ensure CreateParams::chimod_lib_name exactly matches your namespace and module name
For namespace "myproject" and module "my_module": chimod_lib_name = "myproject_my_module"
The system automatically appends "_runtime" to find the runtime library
Target names use format: myproject_my_module_runtime and myproject_my_module_client

Missing Dependencies:

The ChiMod build functions automatically link admin and rt library dependencies
Ensure all external dependencies (Boost, MPI, etc.) are available in your build environment
Use the same dependency versions that Chimaera was built with
For runtime code, rt library is automatically included for async I/O support

External ChiMod Checklist

Repository Configuration: chimaera_repo.yaml with custom namespace
CMake Setup: Root CMakeLists.txt finds chimaera package
ChiMod Configuration: chimaera_mod.yaml with method definitions
Library Name: CreateParams::chimod_lib_name matches namespace pattern
C++ Namespace: All code uses custom namespace consistently
Build Integration: ChiMod CMakeLists.txt uses add_chimod_client() and add_chimod_runtime() (installation is automatic)
Dependencies: All required external libraries available at build time
Automatic Linking: Rely on ChiMod build functions for rt and admin dependencies

Example Module

See the chimods/MOD_NAME directory for a complete working example that demonstrates:

Task definition with proper constructors
Client API with async-only methods
Runtime container with execution logic
Build system integration
YAML configuration

Creating a New Module

Copy the MOD_NAME template directory
Rename all MOD_NAME occurrences to your module name
Update the chimaera_mod.yaml configuration
Define your tasks in the _tasks.h file
Implement client API in _client.h/cc
Implement runtime logic in _runtime.h/cc
Add CHI_TASK_CC(YourContainerClass) at the end of runtime source
Add to the build system
Test with client and runtime

Recent Changes and Best Practices

Container Initialization Pattern

Starting with the latest version, container initialization has been simplified:

No Separate Init Method: The Init method has been merged with Create
Create Does Everything: The Create method now handles both container creation and initialization
Access to Task Data: Since Create receives the CreateTask, you have access to pool_id and pool_query from the task

Framework-Managed Task Cleanup

Task cleanup is handled by the framework using the IPC manager:

No Custom Del Methods Required: Individual DelTaskType methods are no longer needed
IPC Manager Handles Cleanup: The framework automatically calls ipc_manager->DelTask() to deallocate tasks from shared memory
Memory Segment Deallocation: Tasks are properly removed from their respective memory segments (typically kMainSegment)

Simplified ChiMod Entry Points

ChiMod entry points are now hidden behind the CHI_TASK_CC macro:

Single Macro Call: Replace complex extern "C" blocks with one macro
Automatic Container Integration: Works seamlessly with chi::Container base class
Cleaner Module Code: Eliminates boilerplate entry point code

// Old approach (complex extern "C" block)
extern "C" {
  chi::ChiContainer* alloc_chimod() { /* ... */ }
  chi::ChiContainer* new_chimod(/*...*/) { /* ... */ }
  const char* get_chimod_name() { /* ... */ }
  void destroy_chimod(/*...*/) { /* ... */ }
  bool is_chimaera_chimod_ = true;
}

// New approach (simple macro)
CHI_TASK_CC(chimaera::MOD_NAME::Runtime)

void Create(hipc::FullPtr<CreateTask> task, chi::RunContext& ctx) {
  // Container is already initialized via Init() before Create is called
  // Do NOT call Init() here

  // Container-specific initialization logic
  // All tasks will be routed through the external queue lanes
  // which are automatically mapped to workers at runtime startup

  // Container is now ready for operation
}

FullPtr Parameter Pattern

All runtime methods use hipc::FullPtr<TaskType>:

void Custom(hipc::FullPtr<CustomTask> task, chi::RunContext& ctx) { ... }

Benefits of FullPtr:

Shared Memory Safety: Provides safe access across process boundaries
Automatic Dereferencing: Use task->field just like raw pointers
Memory Management: Framework handles allocation/deallocation
Null Checking: Use task.IsNull() to check validity

Custom Namespace Configuration

Overview

While the default namespace is chimaera, you can customize the namespace for your ChiMod modules. This is useful for:

Project Branding: Use your own project or company namespace
Avoiding Conflicts: Prevent naming conflicts with other ChiMod collections
Module Organization: Group related modules under a custom namespace

Configuring Custom Namespace

The namespace is controlled by the chimaera_repo.yaml file in your project root:

namespace: your_custom_namespace

For example:

namespace: mycompany

Required Changes for Custom Namespace

When using a custom namespace, you must update several components:

1. CreateParams chimod_lib_name

The most critical change is updating the chimod_lib_name in your CreateParams:

// Default chimaera namespace
struct CreateParams {
  static constexpr const char* chimod_lib_name = "chimaera_your_module";
};

// Custom namespace example
struct CreateParams {
  static constexpr const char* chimod_lib_name = "mycompany_your_module";
};

2. Module Namespace Declaration

Update your module's C++ namespace:

// Default
namespace chimaera::your_module {
  // module code
}

// Custom
namespace mycompany::your_module {
  // module code
}

3. CMake Library Names

The CMake system automatically uses your custom namespace. Libraries will be named:

Default: libchimaera_module_runtime.so, libchimaera_module_client.so
Custom: libmycompany_module_runtime.so, libmycompany_module_client.so

4. Runtime Integration

If your runtime code references the admin module or other system modules, update the references:

// Default admin module reference
auto* admin_chimod = module_manager->GetChiMod("chimaera_admin");

// Custom namespace admin module
auto* admin_chimod = module_manager->GetChiMod("mycompany_admin");

Checklist for Custom Namespace

Update chimaera_repo.yaml with your custom namespace
Update CreateParams::chimod_lib_name to use custom namespace prefix
Update C++ namespace declarations in all module files
Update runtime references to admin module and other system modules
Update any hardcoded module names in configuration or startup code
Rebuild all modules after namespace changes
Update library search paths if needed for deployment

Example: Complete Custom Namespace Module

# chimaera_repo.yaml
namespace: mycompany

// mymodule_tasks.h
namespace mycompany::mymodule {

struct CreateParams {
  static constexpr const char* chimod_lib_name = "mycompany_mymodule";
  // ... other parameters
};

using CreateTask = chimaera::admin::BaseCreateTask<CreateParams, Method::kCreate>;

}  // namespace mycompany::mymodule

// mymodule_runtime.h
namespace mycompany::mymodule {

class Runtime : public chi::Container {
public:
  using CreateParams = mycompany::mymodule::CreateParams;  // Required for CHI_TASK_CC
  // ... rest of class
};

}  // namespace mycompany::mymodule

// mymodule_runtime.cc
CHI_TASK_CC(mycompany::mymodule::Runtime)

Important Notes

Library Name Consistency: The chimod_lib_name must exactly match what the CMake system generates
Admin Module: If you customize the namespace, you may also want to rebuild the admin module with your custom namespace
Backward Compatibility: Changing namespace breaks compatibility with existing deployments using default namespace
Documentation: Update any module-specific documentation to reflect the new namespace

Advanced Topics

Task Scheduling

Tasks can be scheduled with different priorities:

kLowLatency: For time-critical operations
kHighLatency: For batch processing

Automatic Routing Architecture

The framework handles all task routing automatically:

Client-Side Enqueuing:
- Tasks are enqueued via IpcManager::Enqueue() from client code
- Lane selection uses PID+TID hash for automatic distribution across lanes
- Formula: lane_id = hash(PID, TID) % num_lanes
Worker-Lane Mapping (1:1 Direct Mapping):
- Number of lanes automatically equals number of sched workers (default: 8)
- Each worker assigned exactly one lane: worker i → lane i
- No round-robin needed - perfect 1:1 correspondence
- Lane headers track assigned worker ID
No Configuration Required:
- Lane count automatically matches sched worker count from config
- No separate task_queue_lanes configuration needed
- Change worker count → lane count adjusts automatically

Example: With 8 sched workers (default):

8 lanes created automatically in external queue
Worker 0 → Lane 0, Worker 1 → Lane 1, ..., Worker 7 → Lane 7
Client tasks distributed via hash to lanes 0-7
Each worker processes tasks from its dedicated lane

Error Handling

void Custom(hipc::FullPtr<CustomTask> task, chi::RunContext& ctx) {
  try {
    // Operation logic
    task->result_code_ = 0;
  } catch (const std::exception& e) {
    task->result_code_ = 1;
    task->data_ = chi::string(main_allocator_, e.what());
  }
  // Framework handles task completion automatically
}

Debugging Tips

Check Shared Memory: Use ipcs -m to view segments
Verify Task State: Check task completion status
Monitor Queue Depth: Use GetProcessQueue() to inspect queues
Enable Debug Logging: Set CHI_DEBUG environment variable
Use GDB: Attach to runtime process for debugging

Common Issues and Solutions

Tasks Not Being Executed:

Cause: Tasks not being routed to worker queues
Solution: Verify task pool_query is set correctly and pool exists
Debug: Add logging in task execution methods to verify they're being called

Queue Overflow or Deadlocks:

Cause: Tasks being enqueued but not dequeued from lanes
Solution: Verify lane creation in Create() method and proper task routing
Debug: Check lane sizes with lane->Size() and lane->IsEmpty()

Memory Leaks in Shared Memory:

Cause: Tasks not being properly cleaned up
Solution: Ensure framework Del dispatcher is working correctly
Debug: Monitor shared memory usage with ipcs -m

Performance Considerations

Minimize Allocations: Reuse buffers when possible
Batch Operations: Submit multiple tasks together
Use Appropriate Segments: Put large data in client_data_segment
Avoid Blocking: Use async operations when possible
Profile First: Measure before optimizing

Unit Testing

Unit testing for ChiMods is covered in the separate Module Test Guide. This guide provides comprehensive information on:

Test environment setup and configuration
Environment variables and module discovery
Test framework integration patterns
Complete test examples with fixtures
CMake integration and build setup
Best practices for ChiMod testing

The test guide demonstrates how to test both runtime and client components in the same process, enabling comprehensive integration testing without complex multi-process coordination.

Quick Reference Checklist

When creating a new Chimaera module, ensure you have:

Task Definition Checklist (`_tasks.h`)

Tasks inherit from chi::Task or use GetOrCreatePoolTask template (recommended for non-admin modules)
Use GetOrCreatePoolTask: For non-admin modules instead of BaseCreateTask directly
Use BaseCreateTask with IS_ADMIN=true: Only for admin module
SHM default constructor (if custom task)
Emplace constructor with all required parameters (if custom task)
Uses HSHM serializable types (chi::string, chi::vector, etc.)
Method constant assigned in constructor (e.g., method_ = Method::kCreate;)
No static casting: Use Method namespace constants directly
Include auto-generated methods file for Method constants

Runtime Container Checklist (`_runtime.h/cc`)

Inherits from chi::Container
Init() method overridden - calls base class Init() then initializes client for this ChiMod
Create() method does NOT call chi::Container::Init() (container is already initialized before Create is called)
All task methods use hipc::FullPtr<TaskType> parameters
NO custom Del methods needed - framework calls ipc_manager->DelTask() automatically
Uses CHI_TASK_CC(ClassName) macro for entry points
Routing is automatic - tasks are routed through external queue lanes mapped to workers (1:1 worker-to-lane mapping)

Client API Checklist (`_client.h/cc`)

Inherits from chi::ContainerClient
Uses CHI_IPC->NewTask<TaskType>() for allocation
Uses CHI_IPC->Send() for task submission (returns Future)
Async-only API: All methods return chi::Future<TaskType>
CRITICAL: AsyncCreate passes this pointer for PostWait callback to update pool_id_

Build System Checklist

CMakeLists.txt creates both client and runtime libraries
chimaera_mod.yaml defines module metadata
Auto-generated methods file: autogen/MOD_NAME_methods.h with Method namespace
Include chimaera.h: In methods file for GLOBAL_CONST macro
GLOBAL_CONST constants: Use namespace constants, not enum class
Proper install targets configured
Links against chimaera library

Common Pitfalls to Avoid

Pool Name Requirements

CRITICAL: All ChiMod Create functions MUST require a user-provided pool_name parameter. Never auto-generate pool names using pool_id_ during Create operations.

Why Pool Names Are Required

pool_id_ Not Available: pool_id_ is not set until after Create completes
User Intent: Users should explicitly name their pools for better organization
Uniqueness: Users can ensure uniqueness better than auto-generation
Debugging: Named pools are easier to identify during debugging

Pool Naming Guidelines

Descriptive Names: Use names that identify purpose or content
File-based Devices: For BDev file devices, pool_name serves as the file path
RAM-based Devices: For BDev RAM devices, pool_name should be unique identifier
Unique Identifiers: Consider timestamp + PID combinations when needed

Correct Pool Naming Usage

// BDev file-based device - pool_name is the file path
std::string file_path = "/path/to/device.dat";
const chi::PoolId bdev_pool_id(7000, 0);
bdev_client.Create(pool_query, file_path, bdev_pool_id,
                   chimaera::bdev::BdevType::kFile);

// BDev RAM-based device - pool_name is unique identifier
std::string pool_name = "my_ram_device_" + std::to_string(timestamp);
const chi::PoolId ram_pool_id(7001, 0);
bdev_client.Create(pool_query, pool_name, ram_pool_id,
                   chimaera::bdev::BdevType::kRam, ram_size);

// Other ChiMods - pool_name is descriptive identifier
std::string pool_name = "my_container_" + user_identifier;
const chi::PoolId mod_pool_id(7002, 0);
mod_client.Create(pool_query, pool_name, mod_pool_id);

Incorrect Pool Naming Usage

// WRONG: Using pool_id_ before it's set (will be 0 or garbage)
std::string bad_name = "pool_" + std::to_string(pool_id_.ToU64());

// WRONG: Using empty strings
client.Create(pool_query, "", pool_id);

// WRONG: Auto-generating inside Create function
// Create functions should not auto-generate names
void Create(pool_query) {
    std::string auto_name = "pool_" + generate_id();  // Wrong approach
}

Client Interface Pattern (Async-Only)

All ChiMod clients should follow the async-only interface pattern:

class Client : public chi::ContainerClient {
 public:
  // Async-only Create with required pool_name - returns Future
  chi::Future<CreateTask> AsyncCreate(
      const chi::PoolQuery& pool_query,
      const std::string& pool_name,
      const chi::PoolId& custom_pool_id /* user-provided name */) {
    auto* ipc_manager = CHI_IPC;
    // Use pool_name directly, never generate internally
    auto task = ipc_manager->NewTask<CreateTask>(
        chi::CreateTaskId(),
        chi::kAdminPoolId,  // Always use admin pool
        pool_query,
        CreateParams::chimod_lib_name,  // Never hardcode
        pool_name,          // User-provided name
        custom_pool_id,     // Target pool ID
        this                // Client pointer for PostWait callback
    );
    return ipc_manager->Send(task);
  }

  // Example of async-only operation pattern
  chi::Future<CustomTask> AsyncCustom(
      const chi::PoolQuery& pool_query,
      const std::string& input_data,
      chi::u32 operation_id) {
    auto* ipc_manager = CHI_IPC;
    auto task = ipc_manager->NewTask<CustomTask>(
        chi::CreateTaskId(),
        pool_id_,           // Use client's pool_id_ for non-Create operations
        pool_query,
        input_data,
        operation_id);
    return ipc_manager->Send(task);
  }
};

Usage Pattern (Caller Side):

// Initialize and create
chimaera::my_module::Client client;
const chi::PoolId pool_id(7000, 0);

auto create_task = client.AsyncCreate(chi::PoolQuery::Dynamic(), "my_pool", pool_id);
create_task.Wait();

if (create_task->GetReturnCode() != 0) {
    // Handle error
    return;
}

// Perform operations
auto op_task = client.AsyncCustom(chi::PoolQuery::Local(), "data", 1);
op_task.Wait();

// Access results
auto result = op_task->output_data_.str();

BDev-Specific Requirements

Single Interface: Use only one AsyncCreate() method (no multiple overloads)
File Devices: pool_name parameter serves as the file path
RAM Devices: pool_name parameter serves as unique identifier
Method Signature: AsyncCreate(pool_query, pool_name, custom_pool_id, bdev_type, total_size=0, io_depth=32, alignment=4096)

Compose Configuration Feature

The compose feature allows automatic pool creation from YAML configuration files. This enables declarative infrastructure setup where all required pools can be defined in configuration and created during runtime initialization or via utility script.

CreateParams LoadConfig Requirement

CRITICAL: All ChiMod CreateParams structures MUST implement a LoadConfig() method to support compose feature.

/**
 * Load configuration from PoolConfig (for compose mode)
 * Required for compose feature support
 * @param pool_config Pool configuration from compose section
 */
void LoadConfig(const chi::PoolConfig& pool_config) {
  // Parse YAML config string
  YAML::Node config = YAML::Load(pool_config.config_);

  // Load module-specific parameters from YAML
  if (config["parameter_name"]) {
    parameter_name_ = config["parameter_name"].as<Type>();
  }

  // Parse size strings (e.g., "2GB", "512MB")
  if (config["capacity"]) {
    std::string capacity_str = config["capacity"].as<std::string>();
    total_size_ = hshm::ConfigParse::ParseSize(capacity_str);
  }
}

Compose Configuration Format

compose:
- mod_name: chimaera_bdev        # ChiMod library name
  pool_name: ram://test          # Pool name (or file path for BDev)
  pool_query: dynamic            # Either "dynamic" or "local"
  pool_id: 200.0                 # Pool ID in "major.minor" format
  capacity: 2GB                  # Module-specific parameters
  bdev_type: ram                 # Additional parameters as needed
  io_depth: 32
  alignment: 4096

- mod_name: chimaera_another_mod
  pool_name: my_pool
  pool_query: local
  pool_id: 201.0
  custom_param: value

Usage Modes

1. Automatic During Runtime Init: Pools are automatically created when runtime initializes if compose section is present in configuration:

export CHI_SERVER_CONF=/path/to/config_with_compose.yaml
chimaera runtime start

2. Manual via chimaera compose Utility: Create pools using compose configuration against running runtime:

chimaera compose /path/to/compose_config.yaml

Implementation Checklist

When adding compose support to a ChiMod:

Add LoadConfig(const chi::PoolConfig& pool_config) method to CreateParams
Parse all module-specific parameters from YAML config
Handle optional parameters with defaults
Use hshm::ConfigParse::ParseSize() for size strings
Include <yaml-cpp/yaml.h> and <chimaera/config_manager.h> in tasks header
Test with compose configuration before release

Example Admin ChiMod LoadConfig

void LoadConfig(const chi::PoolConfig& pool_config) {
  // Admin doesn't have additional configuration fields
  // YAML config parsing would go here for modules with config fields
  (void)pool_config;  // Suppress unused parameter warning
}

Example BDev ChiMod LoadConfig

void LoadConfig(const chi::PoolConfig& pool_config) {
  YAML::Node config = YAML::Load(pool_config.config_);

  // Load BDev type
  if (config["bdev_type"]) {
    std::string type_str = config["bdev_type"].as<std::string>();
    if (type_str == "file") {
      bdev_type_ = BdevType::kFile;
    } else if (type_str == "ram") {
      bdev_type_ = BdevType::kRam;
    }
  }

  // Load capacity (parse size strings)
  if (config["capacity"]) {
    std::string capacity_str = config["capacity"].as<std::string>();
    total_size_ = hshm::ConfigParse::ParseSize(capacity_str);
  }

  // Load optional parameters
  if (config["io_depth"]) {
    io_depth_ = config["io_depth"].as<chi::u32>();
  }
  if (config["alignment"]) {
    alignment_ = config["alignment"].as<chi::u32>();
  }
}

Table of Contents
Overview
Architecture
- Core Principles
- Key Components
Coding Style
- General Guidelines
- Code Formatting
Module Structure
- Task Definition (MOD_NAME_tasks.h)
  - CreateParams Dual-Mode System
- Client Implementation (MOD_NAME_client.h/cc)
- ChiMod CreateTask Pool Assignment Requirements
- ChiMod Name Requirements
- Runtime Container (MOD_NAME_runtime.h/cc)
- Execution Modes and Dynamic Scheduling
Configuration and Code Generation
- Overview
- chimaera_repo.yaml
- chimaera_mod.yaml
- chimaera repo refresh Utility
- Workflow Summary
Task Development
- Task Requirements
- Task Methods: Copy and Aggregate
- Task Naming Conventions
- Method System and Auto-Generated Files
- Data Annotations
- Task Lifecycle
C++20 Coroutines for ChiMod Development
- Overview
- TaskResume Return Type
- co_await - Suspending Execution
- co_return - Completing Coroutines
- Common Coroutine Patterns
- When to Use Coroutines
- PoolManager Coroutine Integration
- Admin Runtime Coroutine Methods
- Best Practices
- Migration from Non-Coroutine Methods
- Framework Del Implementation
Synchronization Primitives
- Why Use Chimaera Synchronization Primitives?
- CoMutex: Cooperative Mutual Exclusion
- CoRwLock: Cooperative Reader-Writer Lock
- TaskId Grouping Behavior
- Best Practices
- Example: Module with Synchronized Data Structure
Pool Query and Task Routing
- Overview of PoolQuery
- PoolQuery Types
- PoolQuery Usage Guidelines
  - Best Practices
  - Common Patterns
- Runtime Routing Implementation
- PoolQuery in Task Definitions
- Advanced PoolQuery Features
  - Query Introspection
  - Combining with Task Priorities
- Troubleshooting PoolQuery Issues
Client-Server Communication
- Client Implementation Patterns
  - Async Create Pattern
- Memory Segments
- IPC Queue
Memory Management
- Task Memory Allocation
- Best Practices
- Task Allocation and Deallocation Pattern
- CHI_IPC Buffer Allocation
- Task Data Structures
  - Strings and Vectors
  - Serialization Support
- Bulk Transfer Support with ar.bulk
Build System Integration
- CMakeLists.txt Template
- CMakeLists.txt Guidelines
- ChiMod Build Functions Reference
- Target Naming and Linking
  - Target Format
- Automatic Dependencies
- Targets Created by ChiMod Functions
- Module Configuration (chimaera_mod.yaml)
- Auto-Generated Method Files
- Runtime Entry Points
Auto-Generated Code Pattern
- Overview
- New Pattern: Auto-Generated Source Files
- File Structure
- Auto-Generated Source Template
- CMake Integration
- Benefits
- Important Notes
External ChiMod Development
- Prerequisites
- External ChiMod Repository Structure
- Repository Configuration (chimaera_repo.yaml)
- Root CMakeLists.txt
- ChiMod CMakeLists.txt
- External Applications Using Your ChiMod
- External ChiMod Implementation
  - CreateParams Configuration
  - C++ Namespace
- Building External ChiMods
- Usage in Applications
- CHIMAERA_INIT Initialization Modes
- Dependencies and Installation Paths
- Common External Development Issues
- External ChiMod Checklist
Example Module
- Creating a New Module
Recent Changes and Best Practices
- Container Initialization Pattern
- Framework-Managed Task Cleanup
- Simplified ChiMod Entry Points
- FullPtr Parameter Pattern
Custom Namespace Configuration
- Overview
- Configuring Custom Namespace
- Required Changes for Custom Namespace
- Checklist for Custom Namespace
- Example: Complete Custom Namespace Module
- Important Notes
Advanced Topics
- Task Scheduling
- Automatic Routing Architecture
- Error Handling
Debugging Tips
- Common Issues and Solutions
Performance Considerations
Unit Testing
Quick Reference Checklist
- Task Definition Checklist (_tasks.h)
- Runtime Container Checklist (_runtime.h/cc)
- Client API Checklist (_client.h/cc)
- Build System Checklist
- Common Pitfalls to Avoid
Pool Name Requirements
- Why Pool Names Are Required
- Pool Naming Guidelines
- Correct Pool Naming Usage
- Incorrect Pool Naming Usage
- Client Interface Pattern (Async-Only)
- BDev-Specific Requirements
Compose Configuration Feature
- CreateParams LoadConfig Requirement
- Compose Configuration Format
- Usage Modes
- Implementation Checklist
- Example Admin ChiMod LoadConfig
- Example BDev ChiMod LoadConfig

Table of Contents​

Overview​

Architecture​

Core Principles​

Key Components​

Coding Style​

General Guidelines​

Code Formatting​

Module Structure​

Task Definition (MOD_NAME_tasks.h)​

CreateParams Dual-Mode System​

Client Implementation (MOD_NAME_client.h/cc)​

ChiMod CreateTask Pool Assignment Requirements​

Why This is Required​

Correct Usage​

Incorrect Usage​

Key Points​

ChiMod Name Requirements​

Correct Usage​

Incorrect Usage​

Why This is Required​

Implementation Pattern​

Runtime Container (MOD_NAME_runtime.h/cc)​

Execution Modes and Dynamic Scheduling​

ExecMode Overview​

kExec Mode (Default)​

kDynamicSchedule Mode​

Example: GetOrCreatePool with Dynamic Scheduling​

Using Dynamic() PoolQuery​

Benefits of Dynamic Scheduling​

Implementation Guidelines​

Worker Implementation Details​

Configuration and Code Generation​

Overview​

chimaera_repo.yaml​

chimaera_mod.yaml​

chimaera repo refresh Utility​

Usage​

Generated Files​

When to Run chimaera repo refresh​

Important Notes​

Workflow Summary​

Task Development​

Task Requirements​

Task Methods: Copy and Aggregate​

Copy Method​

Aggregate Method (Optional)​

Copy/Aggregate Usage in Networking​

Complete Example: ReadTask with Copy and Aggregate​

Task Naming Conventions​

Required Naming Pattern​

Examples​

Naming Rules​

Backward Compatibility​

Benefits of Consistent Naming​

Method System and Auto-Generated Files​

Method Definitions (autogen/MOD_NAME_methods.h)​

BaseCreateTask Template System​

GetOrCreatePoolTask vs BaseCreateTask Usage​

BaseCreateTask Template Parameters​

BaseCreateTask Structure​

Usage Examples​

Data Annotations​

Task Lifecycle​

C++20 Coroutines for ChiMod Development​

Overview​

TaskResume Return Type​

co_await - Suspending Execution​

co_return - Completing Coroutines​

Common Coroutine Patterns​

Pattern 1: Nested Pool Creation​

Pattern 2: Sequential Subtask Execution​

Pattern 3: Parallel Subtask Execution​

When to Use Coroutines​

PoolManager Coroutine Integration​

Admin Runtime Coroutine Methods​

Best Practices​

Migration from Non-Coroutine Methods​

Framework Del Implementation​

Synchronization Primitives​

Table of Contents

Overview

Architecture

Core Principles

Key Components

Coding Style

General Guidelines

Code Formatting

Module Structure

Task Definition (MOD_NAME_tasks.h)

CreateParams Dual-Mode System

Client Implementation (MOD_NAME_client.h/cc)

ChiMod CreateTask Pool Assignment Requirements

Why This is Required

Correct Usage

Incorrect Usage

Key Points

ChiMod Name Requirements

Correct Usage

Incorrect Usage

Why This is Required

Implementation Pattern

Runtime Container (MOD_NAME_runtime.h/cc)

Execution Modes and Dynamic Scheduling

ExecMode Overview

kExec Mode (Default)

kDynamicSchedule Mode

Example: GetOrCreatePool with Dynamic Scheduling

Using Dynamic() PoolQuery

Benefits of Dynamic Scheduling

Implementation Guidelines

Worker Implementation Details

Configuration and Code Generation

Overview

chimaera_repo.yaml

chimaera_mod.yaml

chimaera repo refresh Utility

Usage

Generated Files

When to Run chimaera repo refresh

Important Notes

Workflow Summary

Task Development

Task Requirements

Task Methods: Copy and Aggregate

Copy Method

Aggregate Method (Optional)

Copy/Aggregate Usage in Networking

Complete Example: ReadTask with Copy and Aggregate

Task Naming Conventions

Required Naming Pattern

Examples

Naming Rules

Backward Compatibility

Benefits of Consistent Naming

Method System and Auto-Generated Files

Method Definitions (autogen/MOD_NAME_methods.h)

BaseCreateTask Template System

GetOrCreatePoolTask vs BaseCreateTask Usage

BaseCreateTask Template Parameters

BaseCreateTask Structure

Usage Examples

Data Annotations

Task Lifecycle

C++20 Coroutines for ChiMod Development

Overview

TaskResume Return Type

co_await - Suspending Execution

co_return - Completing Coroutines

Common Coroutine Patterns

Pattern 1: Nested Pool Creation

Pattern 2: Sequential Subtask Execution

Pattern 3: Parallel Subtask Execution

When to Use Coroutines

PoolManager Coroutine Integration

Admin Runtime Coroutine Methods

Best Practices

Migration from Non-Coroutine Methods

Framework Del Implementation

Synchronization Primitives