Filesystem and Usage
SQreamDB writes and reads data from disk.
The SQreamDB storage directory, sometimes referred to as a storage cluster is a collection of database objects, metadata database, and logs.
Each SQreamDB worker and the metadata server must have access to the storage cluster in order to function properly.
Directory organization
The cluster root is the directory in which all data for SQreamDB is stored.
databases
The databases directory houses all of the actual data in tables and columns.
Each database is stored as its own directory. Each table is stored under its respective database, and columns are stored in their respective table.
In the example above, the database named retail
contains a table directory with a directory named 23
.
Tip
To find table IDs, use a catalog query:
master=> SELECT table_name, table_id FROM sqream_catalog.tables WHERE table_name = 'customers';
table_name | table_id
-----------+---------
customers | 23
Each table directory contains a directory for each physical column. An SQL column may be built up of several physical columns (e.g. if the data type is nullable).
Tip
To find column IDs, use a catalog query:
master=> SELECT column_id, column_name FROM sqream_catalog.columns WHERE table_id=23;
column_id | column_name
----------+------------
0 | name@null
1 | name@val
2 | age@null
3 | age@val
4 | email@null
5 | email@val
Each column directory will contain extents, which are collections of chunks.
metadata
or rocksdb
SQreamDB’s metadata is an embedded key-value store, based on RocksDB. RocksDB helps SQreamDB ensure efficient storage for keys, handle atomic writes, snapshots, durability, and automatic recovery.
The metadata is where all database objects are stored, including roles, permissions, database and table structures, chunk mappings, and more.
temp
The temp
directory is where SQreamDB writes temporary data.
The directory to which SQreamDB writes temporary data can be changed to any other directory on the filesystem. SQreamDB recommends remapping this directory to a fast local storage to get better performance when executing intensive larger-than-RAM operations like sorting. SQreamDB recommends an SSD or NVMe drive, in mirrored RAID 1 configuration.
If desired, the temp
folder can be redirected to a local disk for improved performance, by setting the tempPath
setting in the legacy configuration file.
logs
The logs directory contains logs produced by SQreamDB.
See more about the logs in the Logging guide.