tanat.store.sequence.builder.type package
Submodules
tanat.store.sequence.builder.type.event module
Event sequence store builder.
- class tanat.store.sequence.builder.type.event.EventSequenceStoreBuilder
  Bases: SequenceStoreBuilder
  Fluent builder for Event sequence stores.
  Exposes time_column explicitly on every add_* call.
  - add_csv(path: str | Path, *, id_column: str, features: str | list[str], time_column: str | None = None, is_static: bool = False, **reader_kwargs) -> EventSequenceStoreBuilder
    Register a CSV file.
  - add_dataframe(data: DataFrame | LazyFrame, *, id_column: str, features: str | list[str], time_column: str | None = None, is_static: bool = False, **_kw) -> EventSequenceStoreBuilder
    Register an in-memory Polars / Pandas DataFrame.
  - add_parquet(path: str | Path, *, id_column: str, features: str | list[str], time_column: str | None = None, is_static: bool = False, **reader_kwargs) -> EventSequenceStoreBuilder
    Register a Parquet file (glob patterns supported).
  - add_sql(connection: str, query: str, *, id_column: str, features: str | list[str], time_column: str | None = None, is_static: bool = False, **sql_kwargs) -> EventSequenceStoreBuilder
    Register a SQL query (requires connectorx).
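Because every add_* method returns the builder itself, registrations can be chained. The following is a minimal usage sketch based only on the signatures listed above. The file paths and the column names (patient_id, event_type, date) are hypothetical, and the step that turns the builder into a store is not covered in this section, so the sketch stops at the builder.

```python
def register_event_sources(events_csv: str, events_parquet_glob: str):
    """Chain two registrations on one builder and return the builder."""
    # Imported lazily so this module still loads where tanat is not installed.
    from tanat.store.sequence.builder.type import EventSequenceStoreBuilder

    return (
        EventSequenceStoreBuilder()
        # Hypothetical CSV with one row per event.
        .add_csv(
            events_csv,
            id_column="patient_id",
            features=["event_type"],
            time_column="date",
        )
        # Hypothetical Parquet files matched by a glob pattern.
        .add_parquet(
            events_parquet_glob,
            id_column="patient_id",
            features="event_type",
            time_column="date",
        )
    )
```

A call might look like register_event_sources("events.csv", "events/*.parquet"), assuming those files exist and contain the named columns.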
tanat.store.sequence.builder.type.interval module
Interval sequence store builder.
- class tanat.store.sequence.builder.type.interval.IntervalSequenceStoreBuilder(*, sort_anchor: str = 'start')
  Bases: SequenceStoreBuilder
  Fluent builder for Interval sequence stores.
  Exposes start_column and end_column explicitly on every add_* call. Pass sort_anchor at construction time to control the intra-sequence sort column: "start" (default), "end", or "middle" (midpoint of each interval).
  - add_csv(path: str | Path, *, id_column: str, features: str | list[str], start_column: str | None = None, end_column: str | None = None, is_static: bool = False, **reader_kwargs) -> IntervalSequenceStoreBuilder
    Register a CSV file.
  - add_dataframe(data: DataFrame | LazyFrame, *, id_column: str, features: str | list[str], start_column: str | None = None, end_column: str | None = None, is_static: bool = False, **_kw) -> IntervalSequenceStoreBuilder
    Register an in-memory Polars / Pandas DataFrame.
  - add_parquet(path: str | Path, *, id_column: str, features: str | list[str], start_column: str | None = None, end_column: str | None = None, is_static: bool = False, **reader_kwargs) -> IntervalSequenceStoreBuilder
    Register a Parquet file (glob patterns supported).
  - add_sql(connection: str, query: str, *, id_column: str, features: str | list[str], start_column: str | None = None, end_column: str | None = None, is_static: bool = False, **sql_kwargs) -> IntervalSequenceStoreBuilder
    Register a SQL query (requires connectorx).
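The effect of sort_anchor can be illustrated without the library. The sketch below is plain Python, not tanat's implementation: the Interval tuple and both helper functions are hypothetical names introduced only to show how the three anchors can order the same intervals differently.

```python
from typing import List, NamedTuple


class Interval(NamedTuple):
    start: float
    end: float
    label: str


def anchor_value(interval: Interval, sort_anchor: str = "start") -> float:
    """Return the value an interval is sorted by for a given anchor."""
    if sort_anchor == "start":
        return interval.start
    if sort_anchor == "end":
        return interval.end
    if sort_anchor == "middle":
        # Midpoint of the interval.
        return (interval.start + interval.end) / 2
    raise ValueError(f"unknown sort_anchor: {sort_anchor!r}")


def sorted_labels(intervals: List[Interval], sort_anchor: str = "start") -> List[str]:
    """Sort intervals by the chosen anchor and return their labels."""
    ordered = sorted(intervals, key=lambda iv: anchor_value(iv, sort_anchor))
    return [iv.label for iv in ordered]


intervals = [
    Interval(0, 10, "a"),  # midpoint 5.0
    Interval(2, 4, "b"),   # midpoint 3.0
    Interval(4, 9, "c"),   # midpoint 6.5
]

by_start = sorted_labels(intervals, "start")    # ['a', 'b', 'c']
by_end = sorted_labels(intervals, "end")        # ['b', 'c', 'a']
by_middle = sorted_labels(intervals, "middle")  # ['b', 'a', 'c']
```

A long interval that starts early ("a") moves from first to last when the anchor changes from "start" to "end", which is why the anchor has to be chosen once, at construction time, for the whole store.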
tanat.store.sequence.builder.type.state module
State sequence store builder.
- class tanat.store.sequence.builder.type.state.StateSequenceStoreBuilder(*, end_value: datetime | int | float | None = None, validate_continuity: bool = True)
  Bases: SequenceStoreBuilder
  Fluent builder for State sequence stores.
  States are contiguous and non-overlapping intervals defined by start_column and an optional end_column.
  When end_column is omitted at registration time, T_END is auto-computed as the next T_START within each sequence. Pass end_value at construction time to set the sentinel for the last state of every sequence (None → leaves the last T_END as null).
  When end_column is provided by the user, validate_continuity (default: True) checks that states are truly contiguous (T_END[i] == T_START[i+1] within each sequence) before writing. Set it to False to skip this check on large datasets where the cost of a full collect() is unacceptable.
  - __init__(*, end_value: datetime | int | float | None = None, validate_continuity: bool = True) -> None
  - add_csv(path: str | Path, *, id_column: str, features: str | list[str], start_column: str | None = None, end_column: str | None = None, is_static: bool = False, **reader_kwargs) -> StateSequenceStoreBuilder
    Register a CSV file.
  - add_dataframe(data: DataFrame | LazyFrame, *, id_column: str, features: str | list[str], start_column: str | None = None, end_column: str | None = None, is_static: bool = False, **_kw) -> StateSequenceStoreBuilder
    Register an in-memory Polars / Pandas DataFrame.
  - add_parquet(path: str | Path, *, id_column: str, features: str | list[str], start_column: str | None = None, end_column: str | None = None, is_static: bool = False, **reader_kwargs) -> StateSequenceStoreBuilder
    Register a Parquet file (glob patterns supported).
  - add_sql(connection: str, query: str, *, id_column: str, features: str | list[str], start_column: str | None = None, end_column: str | None = None, is_static: bool = False, **sql_kwargs) -> StateSequenceStoreBuilder
    Register a SQL query (requires connectorx).
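The rule "T_END is the next T_START within each sequence" can be sketched in plain Python. This is an illustration of the described behaviour, not the library's code: derive_t_end and the tuple layout are hypothetical, and the sketch assumes states are ordered by start time inside each sequence.

```python
from itertools import groupby
from operator import itemgetter


def derive_t_end(rows, end_value=None):
    """Fill in T_END for (sequence_id, t_start, state) rows.

    Each state ends where the next state of the same sequence starts.
    The last state of every sequence receives end_value (None by default).
    """
    # Order by sequence id, then by start time.
    ordered = sorted(rows, key=itemgetter(0, 1))
    result = []
    for seq_id, group in groupby(ordered, key=itemgetter(0)):
        states = list(group)
        # Shift the start times by one position; pad the tail with the sentinel.
        next_starts = [t_start for _, t_start, _ in states[1:]] + [end_value]
        for (_, t_start, state), t_end in zip(states, next_starts):
            result.append((seq_id, t_start, t_end, state))
    return result


rows = [
    ("p1", 0, "A"),
    ("p1", 5, "B"),
    ("p2", 3, "A"),
    ("p1", 9, "C"),
]

open_ended = derive_t_end(rows)               # last T_END stays None
closed = derive_t_end(rows, end_value=100)    # last T_END becomes 100
```

With end_value left at None, the final state of "p1" is (p1, 9, None, "C"); with end_value=100 it becomes (p1, 9, 100, "C"). Every earlier state is unaffected by the sentinel.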
Module contents
Register SequenceStoreBuilder subtypes.
- class tanat.store.sequence.builder.type.EventSequenceStoreBuilder
  Bases: SequenceStoreBuilder
  Fluent builder for Event sequence stores.
  Exposes time_column explicitly on every add_* call.
  - add_csv(path: str | Path, *, id_column: str, features: str | list[str], time_column: str | None = None, is_static: bool = False, **reader_kwargs) -> EventSequenceStoreBuilder
    Register a CSV file.
  - add_dataframe(data: DataFrame | LazyFrame, *, id_column: str, features: str | list[str], time_column: str | None = None, is_static: bool = False, **_kw) -> EventSequenceStoreBuilder
    Register an in-memory Polars / Pandas DataFrame.
  - add_parquet(path: str | Path, *, id_column: str, features: str | list[str], time_column: str | None = None, is_static: bool = False, **reader_kwargs) -> EventSequenceStoreBuilder
    Register a Parquet file (glob patterns supported).
  - add_sql(connection: str, query: str, *, id_column: str, features: str | list[str], time_column: str | None = None, is_static: bool = False, **sql_kwargs) -> EventSequenceStoreBuilder
    Register a SQL query (requires connectorx).
- class tanat.store.sequence.builder.type.IntervalSequenceStoreBuilder(*, sort_anchor: str = 'start')
  Bases: SequenceStoreBuilder
  Fluent builder for Interval sequence stores.
  Exposes start_column and end_column explicitly on every add_* call. Pass sort_anchor at construction time to control the intra-sequence sort column: "start" (default), "end", or "middle" (midpoint of each interval).
  - add_csv(path: str | Path, *, id_column: str, features: str | list[str], start_column: str | None = None, end_column: str | None = None, is_static: bool = False, **reader_kwargs) -> IntervalSequenceStoreBuilder
    Register a CSV file.
  - add_dataframe(data: DataFrame | LazyFrame, *, id_column: str, features: str | list[str], start_column: str | None = None, end_column: str | None = None, is_static: bool = False, **_kw) -> IntervalSequenceStoreBuilder
    Register an in-memory Polars / Pandas DataFrame.
  - add_parquet(path: str | Path, *, id_column: str, features: str | list[str], start_column: str | None = None, end_column: str | None = None, is_static: bool = False, **reader_kwargs) -> IntervalSequenceStoreBuilder
    Register a Parquet file (glob patterns supported).
  - add_sql(connection: str, query: str, *, id_column: str, features: str | list[str], start_column: str | None = None, end_column: str | None = None, is_static: bool = False, **sql_kwargs) -> IntervalSequenceStoreBuilder
    Register a SQL query (requires connectorx).
- class tanat.store.sequence.builder.type.StateSequenceStoreBuilder(*, end_value: datetime | int | float | None = None, validate_continuity: bool = True)
  Bases: SequenceStoreBuilder
  Fluent builder for State sequence stores.
  States are contiguous and non-overlapping intervals defined by start_column and an optional end_column.
  When end_column is omitted at registration time, T_END is auto-computed as the next T_START within each sequence. Pass end_value at construction time to set the sentinel for the last state of every sequence (None → leaves the last T_END as null).
  When end_column is provided by the user, validate_continuity (default: True) checks that states are truly contiguous (T_END[i] == T_START[i+1] within each sequence) before writing. Set it to False to skip this check on large datasets where the cost of a full collect() is unacceptable.
  - __init__(*, end_value: datetime | int | float | None = None, validate_continuity: bool = True) -> None
  - add_csv(path: str | Path, *, id_column: str, features: str | list[str], start_column: str | None = None, end_column: str | None = None, is_static: bool = False, **reader_kwargs) -> StateSequenceStoreBuilder
    Register a CSV file.
  - add_dataframe(data: DataFrame | LazyFrame, *, id_column: str, features: str | list[str], start_column: str | None = None, end_column: str | None = None, is_static: bool = False, **_kw) -> StateSequenceStoreBuilder
    Register an in-memory Polars / Pandas DataFrame.
  - add_parquet(path: str | Path, *, id_column: str, features: str | list[str], start_column: str | None = None, end_column: str | None = None, is_static: bool = False, **reader_kwargs) -> StateSequenceStoreBuilder
    Register a Parquet file (glob patterns supported).
  - add_sql(connection: str, query: str, *, id_column: str, features: str | list[str], start_column: str | None = None, end_column: str | None = None, is_static: bool = False, **sql_kwargs) -> StateSequenceStoreBuilder
    Register a SQL query (requires connectorx).
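The continuity condition T_END[i] == T_START[i+1] can be checked with a few lines of plain Python. The sketch below only illustrates the condition that validate_continuity is described as enforcing; find_continuity_breaks and the tuple layout are hypothetical, and the library's own check may be implemented differently.

```python
def find_continuity_breaks(rows):
    """Return (sequence_id, t_end, next_t_start) for every gap or overlap.

    rows are (sequence_id, t_start, t_end) tuples. Within a sequence,
    each state must end exactly where the next one starts.
    """
    # Group states by sequence id, preserving first-seen order of ids.
    by_sequence = {}
    for seq_id, t_start, t_end in rows:
        by_sequence.setdefault(seq_id, []).append((t_start, t_end))

    breaks = []
    for seq_id, states in by_sequence.items():
        states.sort(key=lambda state: state[0])  # order by t_start
        # Compare each state with its successor.
        for (_, t_end), (next_start, _) in zip(states, states[1:]):
            if t_end != next_start:
                breaks.append((seq_id, t_end, next_start))
    return breaks


rows = [
    ("p1", 0, 5), ("p1", 5, 9),  # contiguous
    ("p2", 0, 4), ("p2", 6, 8),  # gap between 4 and 6
    ("p3", 0, 5), ("p3", 4, 8),  # overlap: ends at 5, next starts at 4
]

breaks = find_continuity_breaks(rows)
```

Here breaks contains one entry for the gap in "p2" and one for the overlap in "p3", while "p1" passes. A check of this kind has to look at every row, which is consistent with the note above that validation requires a full collect() on lazily loaded data.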