Forked from Tembo Time-Series API

The purpose of this extension is to provide a cohesive user experience around the creation, maintenance, and use of time-series tables. Originally written by Tembo, Inc and later transfered to Adam Hendel, this fork serves to keep the pg_timeseries extension buildable against current PostgreSQL with currently available dependencies, primarily in support of the OpenNMS pgtimeseries tss integration plugin

Installation

Getting Started

Assuming you already have a partitioned table created, simply call the enable_ts_table function with your table name.

SELECT enable_ts_table('sensor_readings');

With this one call, several things will happen:

The table will be restructured as a series of partitions using PostgreSQL's native PARTITION features
Each partition covers a particular range of time (one week by default)
New partitions will be created for some time in the future (one month by default)
Once an hour, a maintenance job will create any missing partitions as well as needed future ones

Using your tables

So you've got a table. Now what?

Indexes

The time-series tables you create start out life as little more than typical partitioned PostgreSQL tables. But this simplicity also means all of PostgreSQL's existing functionality will "just work" with them. A fairly important piece of a time-series table is an index along the time dimension.

Traditional B-Tree indexes work well for time-series data, but you may wish to benchmark BRIN indexes as well, as they may perform better in specific query scenarios (often queries with many results). Start with B-Tree if you don't anticipate more than a million records in each partition (by default, partitions are one week long).

Partition Sizing

Related to the above information on indexes is the question of partition size. Because calculating the total size of partitioned tables can be tedious, this extension provides several easy-to-use views surfacing this information.

To examine the table (data), index, and total size for each of your partitions, simple query the time-series partition information view, ts_part_info. A general rule of thumb is that each partition should be able to fit within roughly one quarter of your available memory. This assumes that not much apart from the time-series workload is going on, and things like parallel workers may complicate matters, but work on getting partition total size down to around a quarter of your memory and you're off to a good start.

Retention

On the other hand, you may be worried about plugging a firehose of data into your storage layer to begin with… While the ts_table_info view may allay your fears, at some point you will want to remove some of your time-series data.

Fortunately, it's incredibly easy to simply drop time-series partitions on a schedule. Call set_ts_retention_policy with your time-series table and an interval (say, '90 days') to establish such a policy. Once an hour, any partitions falling entirely outside the retention window will be dropped. Use clear_ts_retention_policy to revert to the default behavior (infinite retention). Each of these functions will return the previous retention policy when called.

Compression

Compression options still be configured to maintain backwards compatibility, but no longer applies and the apply_compression_policy function has been stubbed out.

Analytics Helpers

This extension includes several functions intended to make writing correct time-series queries easier. Certain concepts can be difficult to express in standard SQL and helper functions can aid in readability and maintainability.

`first` and `last`

These two functions help clean up the syntax of a fairly common pattern: a query is grouped by one dimension, but a user wants to know what the first or last row in a group is when ordered by a different dimension.

For instance, you might have a cloud computing platform reporting metrics and wish to know the latest (in time) CPU utilization metric for each machine in the platform:

SELECT machine_id,
       last(cpu_util, recorded_at)
FROM events
GROUP BY machine_id;

`date_bin_table`

This function automates the tedium of aligning time-series values to a given width, or "stride", and makes sure to include NULL rows for any time periods where the source table has no data points.

It must be called against a time-series table, but apart from that consideration using it is pretty straightforward:

SELECT * FROM date_bin_table(NULL::target_table, '1 hour', '[2024-02-01 00:00, 2024-02-02 15:00]');

The output of this query will differ from simply hitting the target table directly in three ways:

Rows will be sorted by time, ascending
The time column's values will be binned to the provided width
Extra rows will be added for periods with no data. They will include the time stamp for that bin and NULL in all other columns

Requirements

The pg_timeseries extension depends on other extensions:

We recommend referring to documentation within these projects for more advanced use cases, or for a better understanding at how this extension works.

Roadmap

This fork of pg_timeseries is primarily geared towards the OpenNMS pg_timeseries plugin, and getting the extension buildable again.

Remove the requirement for the abandoned columnar extension
Remove the requirement for the abandoned Tembo fork of pg_ivm
Make it buildable without these requirements
Make sure it works with all currently supported releases of PostgreSQL

Name		Name	Last commit message	Last commit date
Latest commit History 39 Commits
.github		.github
doc		doc
sql		sql
test		test
timeseries-pg		timeseries-pg
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
META.json		META.json
Makefile		Makefile
README.md		README.md
Trunk.toml		Trunk.toml
timeseries.conf		timeseries.conf
timeseries.control		timeseries.control

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Forked from Tembo Time-Series API

Installation

Getting Started

Using your tables

Indexes

Partition Sizing

Retention

Compression

Analytics Helpers

`first` and `last`

`date_bin_table`

Requirements

Roadmap

About

Uh oh!

Releases

Packages

Languages

License

dino2gnt/pg_timeseries_extension

Folders and files

Latest commit

History

Repository files navigation

Forked from Tembo Time-Series API

Installation

Getting Started

Using your tables

Indexes

Partition Sizing

Retention

Compression

Analytics Helpers

first and last

date_bin_table

Requirements

Roadmap

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

`first` and `last`

`date_bin_table`

Packages