YMatrix - Enterprise-Grade Hyper-Converged Database

YMatrix is a hyper-converged relational database product


YMatrix is a hyper-converged database developed by Simdwave based on the classic open-source databases PostgreSQL and Greenplum. In addition to excelling in time-series scenarios, it also supports traditional use cases such as Online Transaction Processing (OLTP) and Online Analytical Processing (OLAP).

It addresses enterprise requirements including high availability, security, high performance, automated operations, visualized installation, and data processing, ensuring reliable deployment for enterprise users.

Its core values are hyper-convergence, high read/write performance, high compression ratio, and high availability.

A community edition of YMatrix is available—your feedback and experience are welcome.

YMatrix delivers core value through hyper-convergence, high performance, and high availability


  1. Hyper-Converged Architecture

The hyper-converged architecture of YMatrix resolves the issue of data silos in traditional databases, enabling "one database, multiple uses."

  • Micro-Kernel: By configuring different micro-kernels (combinations of storage engine and execution engine), YMatrix adapts to various business scenarios. For example:
    • HEAP storage engine + Volcano execution engine for OLTP workloads.
    • MARS3 storage engine + Vectorized execution engine for time-series workloads.
  • MPP: YMatrix leverages an MPP architecture to distribute query loads across the cluster, utilizing all system resources in parallel to achieve high performance.
  1. High Performance

YMatrix focuses on performance across all scenarios, including ingestion, storage, querying, analytics, and machine learning.

  • Ingestion: MatrixGate, a high-speed data loading tool, supports multiple data sources and types, batch and streaming ingestion, enabling real-time data loading with full transactional guarantees.
  • Storage: Offers the MARS series storage engines, supporting hybrid row-column storage and advanced encoding/compression. Also supports traditional engines like HEAP and AO. Supports automatic expiration of old partitions and creation of new ones.
  • Querying: Features a vectorized execution engine and Runtime Filter optimization, delivering industry-leading performance in analytical and real-time query workloads.
  1. High Availability
  • Automatic Failover: YMatrix 5.X introduces a new automated operations mechanism. When the Master or Primary segment fails, the system automatically switches to the standby node to complete failover.
  • Automatic Failback: After failover, only the new Primary/Master exists without a healthy Mirror/Standby. If another failure occurs, recovery is not possible. Use the mxrecover tool to rebuild a healthy Mirror/Standby for the new Primary/Master.
  • Streaming Replication: Supports data synchronization between primary and standby nodes using PostgreSQL's streaming replication protocol.

YMatrix provides visualized deployment and management, enterprise-grade security, and a comprehensive ecosystem


  1. Visualized Installation and Operations
  • Graphical Installation: Deploy a cluster in 10 minutes; simulate time-series write and query workloads in 3 minutes.
  • Graphical Monitoring and Management: One-click health checks and second-level horizontal scaling.
  1. Enterprise-Grade Security

YMatrix offers 360-degree access security, covering authentication, privilege control, encryption, auditing, and resource management.

  • Authentication: Supports multiple methods including trust, password, and PAM authentication.
  • Privilege Control: Implements Role-Based Access Control (RBAC), simplifying user-to-permission mapping.
  • Encryption: Provides multi-layered encryption:
    • Encrypted password storage.
    • Column-level encryption.
    • SSL host authentication.
    • Client-side encryption.
    • Network data encryption.
    • Password encryption over networks.
    • Tablespace-level encryption.
  • Auditing: Logs user login/logout events and database activities, with audit levels configurable based on security needs.
  • Resource Control: Enforces strict IP-based access restrictions; limits maximum concurrent connections per user; includes default connection timeout policies.
  1. Comprehensive Ecosystem
  • Fully compatible with upstream and downstream tools from the PostgreSQL/Greenplum ecosystem.

YMatrix supports diverse business scenarios


  • Complex data processing requiring a converged architecture

In traditional enterprises, massive amounts of data are often scattered across departments, systems, and applications due to organizational strategy, architectural design, or digital transformation efforts. These isolated data silos hinder interoperability and utilization, forming barriers that impede competitive advantage. Beyond technical complexity, they severely impact business operations, management, and growth—making them a critical obstacle in digital transformation.

YMatrix’s hyper-converged architecture has been successfully deployed in production environments such as factory data platforms, corporate group data warehouses, intelligent connected vehicles, and IoT device operations. It significantly reduces technical barriers related to selection, procurement, usage, and maintenance, receiving positive feedback. For example, in smart manufacturing, a single YMatrix instance can collect, store, compute, model, query, and analyze data from ERP (Enterprise Resource Planning), MES (Manufacturing Execution Systems), and equipment systems.

  • Scenarios involving complex time-series analysis

Time-series data forms the foundation of IoT, connected vehicles, industrial internet, and smart cities. Its defining characteristic is real-time processing, placing high demands on database write and storage capabilities. Enterprises must address challenges such as cost-effective performance, secure and rapid scaling to prevent data backlog, and lowering technical barriers to respond quickly to evolving data needs.

YMatrix is optimized for time-series workloads. Thanks to the physical sorting, asynchronous upload, and batch ingestion features of the MARS storage engine, combined with MatrixGate’s high-concurrency, high-performance bulk ingestion, YMatrix exceeds expectations in real-time ingestion, high-speed writes, real-time queries, and transactional integrity.

YMatrix supports graphical scaling—simple operations enable second-level expansion. It also supports smooth, online scaling without service interruption, ensuring business continuity, minimizing downtime losses, and reducing risks.

  • Broad IoT scenarios with massive device counts

Typical broad IoT applications include smart campuses, smart homes, intelligent transportation, smart water systems, smart agriculture, and meteorological monitoring. Massive devices generate massive volumes of data, making storage cost (compression ratio) and access efficiency (decompression speed) key factors in building stable data infrastructure. High-speed ingestion and real-time query performance directly affect end-user experience.

Beyond PB-scale cluster capacity, YMatrix features patented Encoding Chain compression technology. Business users can select optimal encoding schemes tailored to each column’s data characteristics, achieving superior cost-effectiveness and saving over 50% in storage costs—turning massive data storage into a manageable burden.

Leveraging MatrixGate’s distributed, high-concurrency, streaming, and batch ingestion capabilities, YMatrix achieves second-level data ingestion when paired with adequate hardware.

With full vectorization (version 5.0 and above), YMatrix achieves SSB benchmark performance 1.24x that of ClickHouse, reaching world-class levels in throughput and latency.

  • Traditional Data Warehouse (OLAP) Scenarios

YMatrix is compatible with the PostgreSQL/Greenplum ecosystem, supporting classic OLAP workloads in finance, telecom, government, energy, and manufacturing sectors, including Business Intelligence (BI) and reporting.

These scenarios typically involve non-time-series data processed via Hadoop ecosystems: historical data stored in Hadoop, with Spark used for computing report metrics—a complex pipeline.

With YMatrix, you can unify structured and unstructured data, federated data access, Kafka stream integration via GUI, and hot/cold data separation—all within one platform. This simplifies data consumption workflows while providing automatic failover and recovery mechanisms, resulting in a secure, simple, and easy-to-use solution.