YMatrix - The “hyper-converged” database trusted by large enterprises

YMatrix is an enterprise-grade distributed database product based on the PostgreSQL system. YMatrix integrates time series, analysis (OLAP), transactions (OLTP), and AI capabilities into one, offering core advantages such as full coverage, low cost, high performance, high availability, easy scalability, and security compliance. It uses the concept of “hyper-convergence” to solve the problems of complex traditional architecture and high operation and maintenance costs, providing enterprises with a “one-stop” data storage solution.

Full Scenario Capabilities

Time Series Scenarios

Specializes in time series scenarios, providing powerful high concurrency capabilities and deep optimization for time series scenarios such as intelligent connected vehicles and smart factories. Supports advanced syntax such as CTE and window functions, as well as native time series-specific functions. Specifically supports out-of-order and batch writing for complex network environments. Supports zero service interruption cluster horizontal expansion to flexibly respond to data growth. Supports downgrading cold data to object storage, which can significantly reduce storage costs.

Analysis Scenarios

Supports massive data volumes ranging from TB to PB, providing reliable and high-performance data processing and data service capabilities for large-scale enterprise reporting, BI, and other analytical applications. Not only does it offer powerful performance and excellent multi-table JOIN capabilities, but it also supports advanced analytical features such as window functions and materialized views. In addition to traditional data warehouse batch processing capabilities, it features the innovative Domino stream computing engine, enabling real-time stream processing of data via SQL to replace Flink and Spark.

Transaction Scenarios

Complete ACID properties provide financial-grade data reliability, meeting the requirements of important systems such as finance and ERP for database performance, data accuracy, and consistency. It also supports advanced features such as stored procedures, triggers, and off-site disaster recovery, making it widely applicable to various complex transaction scenarios.

AI Scenarios

It provides vector search capabilities for large language models (LLMs), helping enterprises quickly build “AI agents” based on business data; without the need for Spark, PL/Python can be executed directly within the database, fully utilizing hardware resources and improving machine learning efficiency; it also supports multi-modal data management and hybrid search capabilities.

Core Advantages

All Scenarios

  • A single database supports time series, analytics, transactions, and AI four major scenarios workloads
  • Complete ACID characteristics, supports SQL : 2016 standard
  • Supports multi-model data management and hybrid search for structured data (relational tables), semi-structured data (JSON, XML, Vector), and unstructured data (text, images, video)
  • Supports vector storage, vector indexing, quantization algorithms, and other capabilities to quickly build “enterprise intelligence agents”

Low Cost

High Performance

  • Supports multi-node + multi-core parallel computing and batch data analysis
  • Supports out-of-order, batch writing in complex network environments, with high concurrent writing capabilities
  • Integrated HTAP architecture significantly enhances the efficiency of complex queries and in-database analysis
  • The Domino stream computing engine supports real-time data streaming and rapid processing within the database, enabling sub-second, real-time, and incremental data analysis capabilities
  • Real-time data streaming and rapid processing within the database provide sub-second, real-time, and incremental data analysis capabilities, dynamically displaying analysis results based on data changes

High availability

Scalable

  • Supports online smooth scaling, which can be performed via command line or graphical interface
  • Supports clusters with up to 100+ nodes, suitable for processing TB to PB-level data
  • Fully compatible with PostgreSQL/Greenplum ecosystem upstream and downstream toolchains

Security and Compliance

  • Permission control: Role-based access control mechanism, Row and column permission control
  • Identity authentication: Trust-based authentication, password authentication, PAM certificates, and other authentication methods
  • Encryption: Provides different levels of encryption. Storage encryption (supports national cryptography SM4), specified field encryption, GSSAPI authentication, client-side encryption, SSL encrypted transmission, cross-network encrypted passwords, and database partition encryption
  • Security auditing: Records user login/logout events and post-login database operations, with configurable audit levels based on security requirements
  • Resource control: Strict address access restrictions to ensure trusted user origins, configurable maximum concurrent connection limits, and default connection timeout policies

Advanced components

Visualized Operation and Maintenance MatrixUI

MatrixUI is a graphical operation and maintenance management tool designed to provide operation and maintenance engineers with a simple, easy-to-use, and comprehensive information source.

  • Graphical installation: Complete cluster deployment in 10 minutes, simulate time series scenario queries and writes in 3 minutes
  • Graphical operations monitoring: One-click self-service inspection, one-click second-level scaling, cluster inspection, Kafka import quick configuration, load analysis

    High-concurrency writing MatrixGate

    MatrixGate is a high-performance data loading tool that distributes data evenly across all data nodes to achieve parallel writing

  • Supports integration with different data sources and data types
  • Supports batch writing and streaming writing of data
  • Low latency, high concurrency: Supports parallel writing of massive amounts of data, fully utilizes bandwidth to compress data, and can improve write speed by up to 100 times
  • Supports UPSERT capabilities: Used to address complex write issues such as data out-of-order and batching in scenarios where data is merged in batches. Compared to traditional single-node write solutions, it can improve speed by up to 100 times and is suitable for high-throughput, low-latency streaming data write scenarios.

Incremental Backup MatrixArchive

MatrixArchive is to save all data saved by a normally running YMatrix database cluster at a specific point in time and backup data according to certain rules. Backed up data can ensure data integrity and consistency at that specific point in time. In addition, from all files for a given backup, an available YMatrix database cluster can be restored, with the content consistent with the data from the original cluster at a specific point in time.

Point-to-point migration MatrixShift

MatrixShift is a dedicated data migration tool that supports full, incremental, and conditional migration from various versions of Greenplum and YMatrix. It has the characteristics of efficient (point-to-point transmission, small table optimization, transmission compression), flexible configuration, etc.

  • Supports migration of cluster data from Greenplum to YMatrix
  • Full-scene migration: Supports a variety of migration scenarios such as full-scale, incremental, and conditional filtering.
  • Point-to-point efficient migration: implements data transmission from Segment to Segment, eliminating the possible single-point bottlenecks in regular migration operations

In-depth understanding of YMatrix databases

YMatrix System Architecture Quick Installation Standard Cluster Deployment Solution Scenario Application

In addition to providing [Enterprise Version] (https://ymatrix.cn/download), YMatrix also provides a free trial of [Community Version] (https://ymatrix.cn/download). Welcome your use and experience!