This repository contains the MATCH_RECOGNIZE query templates and scripts for importing raw datasets into Trino (Hive + HDFS) and PostgreSQL 14.
The raw CSV files for the datasets can be obtained from here.
Directory structure:
- query-templates:
- (benchmark dataset)
- (pattern definition)
- (pattern name).sql: MATCH_RECOGNIZE query with example parameters.
- (pattern definition)
- (benchmark dataset)
- scripts:
- hive_import_(dataset).sql: create tables in Hive after copying the raw CSV files into HDFS.
- trino_import_(dataset).sql: create tables in Trino from Hive tables.
- postgres_import_(dataset).sql: create tables in PostgreSQL from raw CSV files. Citus extension is required.