Simulation reporting
====================

Jano can describe a temporal simulation over a concrete dataset and expose it in three complementary ways:

- a structured ``SimulationSummary``,
- a standalone HTML timeline report,
- or plot-ready ``SimulationChartData`` that you can feed into your own Python visualizations.

The entry point is ``describe_simulation()`` on ``TemporalBacktestSplitter``.

If you want to run a full simulation without manual fold iteration, the recommended interface is ``WalkForwardPolicy``.

The overall workflow is deliberately layered:

- use high-level classes when the question is already encapsulated,
- inspect or prune iterations through ``plan()`` when needed,
- and fall back to manual fold iteration when you want to compose everything yourself.

The overall workflow is deliberately layered:

- use high-level classes when the question is already encapsulated,
- inspect or prune iterations through ``plan()`` when needed,
- and fall back to manual fold iteration when you want to compose everything yourself.

The same API accepts three tabular inputs:

- ``pandas.DataFrame``
- ``numpy.ndarray`` using integer column references such as ``time_col=0``
- ``polars.DataFrame`` when the optional Polars dependency is installed

That means the temporal configuration stays the same even if the upstream data source changes. The only thing that changes is how you reference columns:

- by name for pandas and Polars
- by integer position for NumPy arrays

Example
-------

.. container:: example-block

   pandas.DataFrame

.. code-block:: python

   import pandas as pd

   from jano import TemporalPartitionSpec, WalkForwardPolicy

   frame = pd.DataFrame(
       {
           "timestamp": pd.date_range("2024-01-01", periods=365, freq="D"),
           "feature": range(365),
           "target": range(100, 465),
       }
   )

   policy = WalkForwardPolicy(
       time_col="timestamp",
       partition=TemporalPartitionSpec(
           layout="train_test",
           train_size="10D",
           test_size="5D",
       ),
       step="5D",
       strategy="rolling",
   )

   result = policy.run(
       frame,
       title="Walk-forward simulation",
   )

   print(result.total_folds)
   print(result.to_frame().head())

   html = result.html
   chart_data = result.chart_data

   print(html[:120])
   print(chart_data.segment_stats)

If you want to inspect the simulation before materializing folds, use ``plan()``:

.. container:: example-block

   Planned simulation

.. code-block:: python

   plan = policy.plan(frame, title="Walk-forward plan")
   print(plan.total_folds)
   print(plan.to_frame().head())

   filtered = plan.exclude_windows(
       train=[("2025-12-20", "2026-01-05")],
   ).select_from_iteration(5)

   result = filtered.materialize()

The plan frame includes the iteration index plus segment boundaries and row counts, so you can inspect the structure first and only materialize the folds you actually want.

You can anchor the simulation to a specific point in time and cap the number of folds:

.. container:: example-block

   Anchored simulation

.. code-block:: python

   policy = WalkForwardPolicy(
       time_col="timestamp",
       partition=TemporalPartitionSpec(
           layout="train_test",
           train_size="15D",
           test_size="4D",
       ),
       step="1D",
       strategy="rolling",
       start_at="2025-09-01",
       max_folds=15,
   )

   result = policy.run(frame, title="15 daily retraining iterations")

``WalkForwardPolicy`` also accepts ``end_at`` if you want to constrain the simulation to a bounded time window before folds are generated.

If your source data is a NumPy array, reference the time column by integer position:

.. container:: example-block

   NumPy input

.. code-block:: python

   import numpy as np

   values = np.array(
       [
           ["2025-09-01", 0.2, 1],
           ["2025-09-02", 0.4, 0],
           ["2025-09-03", 0.1, 1],
           ["2025-09-04", 0.3, 0],
       ],
       dtype=object,
   )

   simulation = TemporalSimulation(
       time_col=0,
       partition=TemporalPartitionSpec(
           layout="train_test",
           train_size="2D",
           test_size="1D",
       ),
       step="1D",
       strategy="single",
   )

If your source data is a Polars frame, the same configuration works with named columns:

.. container:: example-block

   polars.DataFrame

.. code-block:: python

   import polars as pl

   frame = pl.DataFrame(
       {
           "timestamp": ["2025-09-01", "2025-09-02", "2025-09-03", "2025-09-04"],
           "feature": [0.2, 0.4, 0.1, 0.3],
           "target": [1, 0, 1, 0],
       }
   ).with_columns(pl.col("timestamp").str.strptime(pl.Datetime, "%Y-%m-%d"))

   simulation = TemporalSimulation(
       time_col="timestamp",
       partition=TemporalPartitionSpec(
           layout="train_test",
           train_size="2D",
           test_size="1D",
       ),
       step="1D",
       strategy="single",
   )

   result = simulation.run(frame)

Low-level manual control
------------------------

When you need direct control over folds or want to integrate with an external training loop, use ``TemporalBacktestSplitter`` directly.

.. code-block:: python

   from jano import TemporalBacktestSplitter, TemporalPartitionSpec

   splitter = TemporalBacktestSplitter(
       time_col="timestamp",
       partition=TemporalPartitionSpec(
           layout="train_test",
           train_size="10D",
           test_size="5D",
       ),
       step="5D",
       strategy="rolling",
   )

   for split in splitter.iter_splits(frame):
       print(split.summary())

The same splitter can also precompute the full partition geometry:

.. code-block:: python

   plan = splitter.plan(frame)
   print(plan.to_frame()[["iteration", "train_start", "train_end", "test_start", "test_end"]])

This is the fully manual mode. It is the right place when you want to compose the full process yourself: partition layouts, temporal gaps, special date exclusions, feature lookback windows, model training loops or any custom evaluation logic that should not be hidden behind a higher-level helper.

Fixed cutoff studies
--------------------

These are special use cases on top of the basic simulation workflow.

Jano now exposes them as dedicated temporal policies instead of leaving them as manual recipes.

.. container:: example-block

   Fixed test, expanding train

.. code-block:: python

   from jano import TrainHistoryPolicy

   policy = TrainHistoryPolicy(
       "timestamp",
       cutoff="2025-09-15",
       train_sizes=["7D", "14D", "21D", "28D"],
       test_size="4D",
   )

   result = policy.evaluate(
       frame,
       model=model,
       target_col="target",
       feature_cols=["feature_1", "feature_2"],
       metrics=["mae", "rmse"],
   )

   print(result.to_frame()[["train_size", "mae", "rmse"]])
   print(result.find_optimal_train_size(metric="rmse", tolerance=0.01))

This keeps the same test slice fixed while train expands toward the past. It is the right shape for questions about training-history sufficiency, data efficiency and whether more historical data is really worth carrying into production training jobs.

The opposite special case is also common: keep train fixed, move test forward day by day and measure for how long a model or rule keeps its performance without retraining. That pattern answers a different operational question:

- how many days can this object stay in production before it degrades?
- how quickly does performance decay after the training cutoff?
- how often should retraining happen?

In other words:

- fixed test + growing train helps study training-history sufficiency
- fixed train + moving test helps study performance durability after deployment

.. container:: example-block

   Fixed train, moving test

.. code-block:: python

   from jano import DriftMonitoringPolicy

   policy = DriftMonitoringPolicy(
       "timestamp",
       cutoff="2025-09-15",
       train_size="30D",
       test_size="3D",
       step="1D",
       max_windows=10,
   )

   result = policy.evaluate(
       frame,
       model=model,
       target_col="target",
       feature_cols=["feature_1", "feature_2"],
       metrics=["mae", "rmse"],
   )

   print(result.to_frame()[["window", "test_start", "rmse"]])
   print(result.find_drift_onset(metric="rmse", threshold=0.15, baseline="first"))

This keeps the same training history fixed while the evaluation window moves forward over time. It is the right shape when you want to estimate how long an object can stay in production before retraining becomes necessary.

Composed policy: optimize train history inside each walk-forward iteration
--------------------------------------------------------------------------

When the question is more complex, you can still stay on the recommended surface.

``RollingTrainHistoryPolicy`` runs an outer walk-forward loop and, inside each iteration,
chooses the smallest train window that stays within tolerance of the best score for that
iteration's fixed test slice.

.. container:: example-block

   Walk-forward with inner train-history optimization

.. code-block:: python

   from jano import RollingTrainHistoryPolicy, TemporalPartitionSpec

   policy = RollingTrainHistoryPolicy(
       "timestamp",
       partition=TemporalPartitionSpec(
           layout="train_test",
           train_size="30D",
           test_size="1D",
       ),
       step="1D",
       strategy="rolling",
       max_folds=10,
       train_sizes=["5D", "10D", "15D", "30D"],
   )

   result = policy.evaluate(
       frame,
       model=model,
       target_col="target",
       feature_cols=["feature_1", "feature_2"],
       metrics="rmse",
       metric="rmse",
       tolerance=0.01,
   )

   print(result.to_frame().head())
   print(result.summary())

Temporal semantics and leakage control
--------------------------------------

When a single timestamp column is not enough, you can pass a ``TemporalSemanticsSpec`` instead of a plain ``time_col`` string.

This lets you separate:

- the timeline used for reporting and global simulation bounds,
- the internal ordering column,
- and the timestamp column used to decide whether each segment is eligible.

That matters in production-like datasets where availability and event time differ. For example, a flight may depart on one day but only become usable for supervised training when its arrival is known.

.. code-block:: python

   from jano import TemporalBacktestSplitter, TemporalPartitionSpec, TemporalSemanticsSpec

   splitter = TemporalBacktestSplitter(
       time_col=TemporalSemanticsSpec(
           timeline_col="departured_at",
           segment_time_cols={
               "train": "arrived_at",
               "test": "departured_at",
           },
       ),
       partition=TemporalPartitionSpec(
           layout="train_test",
           train_size="14D",
           test_size="3D",
           gap_before_train="1D",
           gap_before_test="1D",
           gap_after_test="2D",
       ),
       step="1D",
       strategy="rolling",
   )

In that configuration, the simulation is reported over the ``departured_at`` timeline, while the train set only includes rows whose ``arrived_at`` falls inside the train window. This prevents rows from entering train before the target would actually be available in production.

Feature-specific lookback windows
---------------------------------

Some pipelines need another layer beyond the fold definition itself: different feature groups
may need different amounts of past data even when the supervised ``train`` segment is fixed.

.. code-block:: python

   from jano import FeatureLookbackSpec

   split = next(splitter.iter_splits(frame))
   lookbacks = FeatureLookbackSpec(
       default_lookback="15D",
       group_lookbacks={"lag_features": "65D"},
       feature_groups={"lag_features": ["lag_30", "lag_60"]},
   )

   history = split.slice_feature_history(
       frame,
       lookbacks,
       time_col="timestamp",
       segment_name="train",
   )

   recent_context = history["__default__"]
   lag_context = history["lag_features"]

This is useful when recent features only need a short context window while lagged or
seasonal features need a much deeper historical slice for the same model.

Simple HTML preview
-------------------

Below is a compact mock of the kind of timeline the generated HTML report shows.

.. raw:: html

   <div class="simulation-preview">
     <div class="preview-top">
       <div class="preview-kicker">Simulation report</div>
       <div class="preview-title">Walk-forward simulation</div>
       <div class="preview-meta">
         <span class="preview-chip">Rows: 365</span>
         <span class="preview-chip">Folds: 6</span>
         <span class="preview-chip">Strategy: rolling</span>
       </div>
     </div>
     <div class="preview-body">
       <div class="preview-row">
         <div class="preview-label">Fold 0</div>
         <div class="preview-track">
           <span class="preview-segment train" style="left: 0%; width: 44%;"></span>
           <span class="preview-segment validation" style="left: 48%; width: 12%;"></span>
           <span class="preview-segment test" style="left: 64%; width: 16%;"></span>
         </div>
       </div>
       <div class="preview-row">
         <div class="preview-label">Fold 1</div>
         <div class="preview-track">
           <span class="preview-segment train" style="left: 8%; width: 44%;"></span>
           <span class="preview-segment validation" style="left: 56%; width: 12%;"></span>
           <span class="preview-segment test" style="left: 72%; width: 16%;"></span>
         </div>
       </div>
       <div class="preview-row">
         <div class="preview-label">Fold 2</div>
         <div class="preview-track">
           <span class="preview-segment train" style="left: 16%; width: 44%;"></span>
           <span class="preview-segment validation" style="left: 64%; width: 12%;"></span>
           <span class="preview-segment test" style="left: 80%; width: 14%;"></span>
         </div>
       </div>
     </div>
   </div>

What it returns
---------------

By default, ``describe_simulation()`` returns a ``SimulationSummary`` object with:

- dataset span and row count,
- total number of folds,
- fold-by-fold segment boundaries,
- a tabular view through ``to_frame()``,
- a serializable structure through ``to_dict()``,
- plot-ready timeline metadata through ``chart_data``,
- and an HTML report accessible through ``html`` or ``write_html()``.

You can also request a specific output directly:

- ``output="summary"`` returns ``SimulationSummary``,
- ``output="html"`` returns the rendered HTML string,
- ``output="chart_data"`` returns ``SimulationChartData``.

What the HTML shows
-------------------

The generated report draws one line per fold over the full dataset timeline and now includes:

- a richer summary header with dataset span, fold count, strategy and sizing mode,
- segment profile cards with average, minimum and maximum row counts,
- a clearer per-fold timeline with labels and row-count chips.

Each segment is color-coded:

- train in blue,
- validation in orange,
- test in green.

This makes it easier to inspect how a proposed simulation will behave before plugging it into a model or evaluation pipeline.

Using chart data directly
-------------------------

``SimulationChartData`` is designed for downstream plotting without reparsing HTML.

It includes:

- fold-level segment positions in timeline percentages,
- original start and end timestamps,
- row counts per segment,
- segment colors and aggregate statistics.

Example:

.. code-block:: python

   chart_data = splitter.describe_simulation(frame, output="chart_data")

   first_fold = chart_data.folds[0]
   first_train = first_fold["segments"]["train"]

   print(first_train["offset_pct"], first_train["width_pct"])
   print(chart_data.segment_stats["train"])