This document introduces the relevant parameters of the data table category in the system configuration parameters.
Notes!
To ensure the stability and security of the system, please be sure to manually modify the relevant parameters**.
When the CREATE TABLE
command does not explicitly specify the access method, set the default table access method.
Data Type | Default Value | Value Range | Set Classification |
---|---|---|---|
enum | heap | heap / ao_row / ao_column / mars2 / mars3 | master; session |
Controls the creation of the table when using CREATE TABLE
or CREATE TABLE AS
that does not contain the DISTRIBUTED BY
clause.
CREATE TABLE
, if this parameter value is off
(default) and the Create Table command does not contain the DISTRIBUTED BY
clause, the YMatrix database will select the table distribution key according to the following command:LIKE
or INHERITS
clause is specified, YMatrix will copy the distribution key from the source table or the parent table.PRIMARY KEY
or UNIQUE
constraint is specified, YMatrix selects the largest subset of all key columns as the distributed key.LIKE
or INHERITS
clause is specified, YMatrix selects the first appropriate column as the distribution key. (Columns with geometric or user-defined data types do not meet the criteria for YMatrix distribution key columns.)on
, the YMatrix database follows these rules to create the table when the DISTRIBUTED BY
clause is not specified:PRIMARY KEY
or UNIQUE
column is not specified, the distribution of the table is random (DISTRIBUTED RANDOMLY
). Table distribution is random even if the table creation command contains LIKE
or INHERITS
clause.PRIMARY KEY
or UNIQUE
column is specified, the DISTRIBUTED BY
clause must also be specified. If the DISTRIBUTED BY
clause is not specified as part of the table creation command, the command will fail.CREATE TABLE AS
command that does not contain a distribution clause:off
, the table distribution policy is determined based on the command.on
, the table distribution policy is random.| Data Type | Default | Set Category | | --- | --- | --- | --- | | boolean | off | master; system; reload |
When creating a table using the CREATE TABLE command, set the default values for the following table storage options.- The supported items are as follows, specifying multiple storage option values as comma-separated lists:
ALTER DATABASE mytest SET gp_default_storage_options = 'orientation=column, compressiontype=rle_type'
. Create appended tables in the mytest database using column-oriented tables and RLE compression. You need to specify only appendoptimized=TRUE
in the WITH
clause.gpconfig -c 'gp_default_storage_options' -v 'appendoptimized=true, orientation=column'
.gpconfig -s 'gp_default_storage_options'
.Default value | Value range | Set classification |
---|---|---|
appendoptimized=FALSE blocksize=32768 checksum=TRUE compresstype=none compresslevel=0 orientation=ROW |
appendoptimized= TRUE / FALSE blocksize= integer between 8192 and 2097152 checksum= TRUE / FALSE compresstype= ZLIB / ZSTD / QUICKLZ2 / RLE_TYPE / NONE compresslevel= integer between 0 and 19 orientation= ROW / COLUMN |
Use the gpconfig tool to specify: master; session; reload Specify: table |
Enable compressed Segment files during the VACUUM
command.
VACUUM
only truncates the Segment file to the EOF
value, and the current behavior is the same.| Data Type | Default | Set Category | | --- | --- | | boolean | on | master; session; reload; superuser |
Specifies the threshold ratio (percentage) of hidden rows when running VACUUM
without the FULL
option (lazy VACUUM
).
Data Type | Default Value | Value Range | Set Classification |
---|---|---|---|
int | 10 | 0 ~ 100 | master; session; reload |
Specifies the minimum size for safe write operations to AO tables in an immature file system.- When specifying a number of bytes greater than 0
, the appended optimized writer adds the fill data to that number to prevent data corruption due to file system errors.
32768
is usually used.Data Type | Default Value | Value Range | Set Classification |
---|---|---|---|
int | 0 | 0 ~ INT_MAX | master; system; restart |
Compression threshold. Used to control how many tuples (tuples) are compressed at one time, and is the upper limit of the number of Tuples compressed in the same unit.
Data Type | Default Value | Value Range | Set Classification |
---|---|---|---|
int | 1000 | 1 ~ 10000 | table |
Used to control all MARS2 tables, how many Runs can trigger merges when L0 reaches.
level0_merge_threshold
.Data Type | Default Value | Value Range | Set Classification |
---|---|---|---|
int | 32 | 10 ~ 2048 | master; session/system; reload |
Used to control how many Runs to trigger merges in a single table L0.
Data Type | Default Value | Value Range | Set Classification |
---|---|---|---|
int | 32 | 1 ~ 2048 | table |
Controls the size of L0 -> L1 upgrade (MB). When L0 merges the result Run exceeds this size, it will be upgraded to L1.
Data Type | Default Value | Value Range | Set Classification |
---|---|---|---|
int | 25 | 1 ~ 10000 | table |
Controls the size of L1 -> L2 upgrade (MB) in single table. When L1 merges, the result Run exceeds this size will be upgraded to L2.
Data Type | Default Value | Value Range | Set Classification |
---|---|---|---|
int | 1000 | 1 ~ 10000 | table |
Controls the sorted memory size (KB) of a single insert, which will share if the insert target table is a partitioned table.
2097152KB
, i.e. 2GB. The value range is 128KB ~ 2147483647KB
(i.e. 2048GB).Data Type | Default Value | Value Range | Set Classification |
---|---|---|---|
int | 2097152 | 128 ~ 2147483647 | master; session/system; reload |
Controls at least how much sorted memory is allocated for a single partition table.
16384KB
, i.e. 16MB. The value range is 128KB ~ 2147483647KB
(i.e. 2048GB).Data Type | Default Value | Value Range | Set Classification |
---|---|---|---|
int | 16384 | 128 ~ 2147483647 | master; session/system; reload |
Notes!
This part of the parameters must be configured using theWITH
clause when creating the table. It is suitable for a single table and cannot be modified once it is configured.
Compression threshold. Used to control how many tuples (tuples) are compressed at one time, and is the upper limit of the number of Tuples compressed in the same unit.
Data Type | Default Value | Value Range | Set Classification |
---|---|---|---|
int | 1200 | 1 ~ 100000 | table |
Specifies the magnification factor for the Level size.
rowstore_size * (level_size_amplifier ^ level)
. The larger the value, the slower the reading speed and the faster the writing speed.Data Type | Default Value | Value Range | Set Classification |
---|---|---|---|
int | 8 | 1 ~ 1000 | table |
Specifies the data loading mode inside MARS3.
normal
represents the normal mode. The newly written data is first written to the row storage run of L0 layer, and then accumulated to the column storage run of L1 layer. Compared with the bulk
mode, the column storage conversion will change from synchronous to asynchronous, but it is suitable for high-frequency small batch write scenarios with sufficient I/O capabilities and delay-sensitive.bluk
represents a batch loading mode, suitable for low-frequency large-batch writing scenarios, directly write to the column storage run of L1 layer. Compared with normal
mode, one I/O is reduced. The column storage conversion has changed from asynchronous to synchronous, suitable for low-frequency large-batch writing of data with insufficient I/O capabilities and insensitive to delay.Data Type | Default Value | Value Range | Set Classification |
---|---|---|---|
enum | normal | normal / bluk | table |
Used to control when L0 Run switches. When the data size exceeds the parameter value (in MB), the next Run will be switched.
Data Type | Default Value | Value Range | Set Classification |
---|---|---|---|
int | 64 | 8 ~ 1024 | table |
Turn on/off file system cache for object storage.
Data Type | Default | Set Category |
---|---|---|
boolean | off | master; session; reload |
Sets how often to update the access authorization for the session.
Data Type | Default Value | Value Range | Set Classification |
---|---|---|---|
int | 0 | 0 ~ INT_MAX | master; system; reload |
The database background task regularly detects the time interval (unit s) of whether the data dump requirements meets the time interval (unit s).
Data Type | Default Value | Value Range | Set Classification |
---|---|---|---|
int | 3600 | 1 ~ INT_MAX | master; system; reload |
Turn on/off the memory cache of object storage.
Data Type | Default | Set Category |
---|---|---|
boolean | off | master; session; reload |
Turn on/off the automatic dump function.
Data Type | Default | Set Category |
---|---|---|
boolean | off | master; system; reload |
Enable or disable the use of external tables that execute operating system commands or scripts on the Segment host (CREATE EXTERNAL TABLE EXECUTE
syntax).
Data Type | Default | Set Category |
---|---|---|
boolean | on | master; system; restart |
Enable filter pushdown when reading data from an external table.
Data Type | Default | Set Category |
---|---|---|
boolean | on | master; session; reload |
Set the number of segments to scan external table data during external table operations, with the goal of preventing the system from overloading the scan data and taking away resources from other concurrent operations.
gpfdist://
protocol to access external table data.Data Type | Default Value | Value Range | Set Classification |
---|---|---|---|
int | 64 | 1 ~ INT_MAX | master; session; reload |
Assuming this parameter value is n
, the YMatrix database stops processing the input row if the previous n
row contains a erroneous format when using the COPY
command or importing data from an external table.
n
row, the YMatrix database continues to process the input row.0
will disable this limit.SEGMENT REJECT LIMIT
clause for the COPY
command or external table definition to limit the number of rejected rows.INT_MAX
is the maximum value that can be stored as an integer in the system.Data Type | Default Value | Value Range | Set Classification |
---|---|---|---|
int | 1000 | 0 ~ INT_MAX | master; session; reload |
For single-row error handling on COPY
and external table SELECT
, set the number of rows processed before SEGMENT REJECT LIMIT n PERCENT
begins.
Data Type | Default Value | Value Range | Set Classification |
---|---|---|---|
int | 300 | 0 ~ INT_MAX | master; session; reload |
Discard packets from the gpfdist v1 protocol.
Data Type | Default | Set Category |
---|---|---|
boolean | off | master; session; reload |
When an SQL query is read from an external table, the parameter value specifies the time (in seconds) the YMatrix database waits before canceling the query when the data stops returning from the external table.- Default value 0
, specifying no timeout. The YMatrix database does not cancel the query.
readable_external_table_timeout
. If gpfdist
does not return any data within the specified time, the YMatrix database cancels the query.Data Type | Default Value | Value Range | Set Classification |
---|---|---|---|
int | 0 | 0 ~ INT_MAX | master; system; reload |
The buffer size (KB) of YMatrix databases used for network communication, such as the gpfdist
utility and external web tables (using HTTP).
32KB ~ 131072KB
(i.e. 128MB).Data Type | Default Value | Value Range | Set Classification |
---|---|---|---|
int | 64 | 32 ~ 131072 | segments; session; reload |
This parameter controls whether SSL certificate authentication is enabled when using the gpfdists
protocol to define external tables of the YMatrix database to use SSL security.
on
, and SSL authentication is enabled when the YMatrix database communicates with the gpfdist
utility to read data from or write data to an external data source.off
will disable SSL certificate authentication. Ignore these SSL exceptions:gpfdist
.gpfdist
.off
to disable authentication when testing communication between an external table of the YMatrix database and the gpfdist
utility that provides external data.gpfdists
SSL certificate will expose security risks.Data Type | Default | Set Category |
---|---|---|
boolean | on | master; session; reload |