What is TanaT#

TanaT (Temporal ANalysis of Trajectories) is an extensible Python library for temporal sequence analysis.

The name of the library refers to a variety of wine grape that originates from south of France, taking continuity with the TraMineR library which widely inspired this work (Traminer is also a variety of wine grape).

It has been thought to address analysis and modeling issues of patient care pathways. This application guides the choices of the first features to be implemented. Nonetheless, our core data model fits the needs of multiple different contexts.

What Makes TanaT Different?#

Unlike traditional time series libraries (more especially aeon or sktime ), TanaT is designed for irregularly sampled, symbolic event-based temporal data. In practice, this means:

  • Sequences in a pool can have different lengths (different numbers of observations per individual).

  • Observations carry complex features (e.g. symbolic labels "GP", "EMERGENCY") rather than purely numeric values.

  • Classical data analysis methods do not apply directly. TanaT provides dedicated metrics and algorithms instead.

These characteristics are common in multiple contexts such as healthcare (patient pathways), web analytics (user journeys), industry 4.0 (logs), tourism (human activities), etc.

Compared to the few alternative libraries for sequence analysis (TraMineR or Sequenzo ), TanaT proposes an original and rich data model. The first fundamental difference lies in the way temporal information is represented: contrary to the above libraries, we do not discretize time. This makes our representation lighter for most cases. The second core difference is the richness of event description.

Core Framework Functionalities#

The TanaT framework provides a complete workflow for temporal sequence analysis, from data ingestion to advanced analytics and visualization.

Data Containers

Flexible representations for events, intervals, and states, at both individual and population levels.

Metadata

Automatic inference and explicit control of temporal and feature metadata.

Builder & Storage

Persistent storage and retrieval of sequences and trajectories.

Visualisation

Rich visualization tools for exploring and interpreting temporal sequences and analysis results.

Zeroing & Alignment

Temporal alignment and reference-date management for comparative analysis.

Ready to try it?#

Head to Installation to set up TanaT, then follow First Steps to build your first sequence pool in under five minutes.