From Greenplum to Apache Cloudberry
Presented by:
Shine Zhang
Shine is a distinguished technology leader with over two decades of experience in software engineering and database innovation.
As the CTO at Synx Data Labs, Shine drives the development of cutting-edge distributed PostgreSQL solutions, championing innovation through agile methodologies.
Previously, Shine held pivotal roles at Broadcom and VMware, leading groundbreaking projects in Greenplum databases, enterprise data migration, and Kubernetes integration.
A former senior engineer at Microsoft, Shine made significant contributions to SQL Server advancements in data warehousing and XML query processing.
Beyond the tech realm, Shine is passionate about serving communities and enjoys biking along the scenic trails of North Dallas, Texas.
If you are interested in using the PostgreSQL kernel for analytics, come and join us! We are introducing Apache Cloudberry to the community as a PostgreSQL variant. It is built on the PostgreSQL kernel and designed to process distributed analytic workloads. Cloudberry adopts an MPP (massively parallel processing) shared-nothing architecture fully integrated with PostgreSQL 14.4.
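To make the shared-nothing idea concrete, here is a minimal Python sketch of how an MPP system can answer a grouped SUM: rows are hash-distributed across segments, each segment aggregates only its own rows, and a coordinator merges the partial results. This is an illustration under stated assumptions, not Cloudberry's actual implementation; the segment count, the CRC32 hash, and every function name below are hypothetical.

```python
from collections import defaultdict
from zlib import crc32

NUM_SEGMENTS = 4  # hypothetical cluster size, chosen only for illustration


def segment_for(key: str, num_segments: int = NUM_SEGMENTS) -> int:
    """Pick a segment from a stable hash of the distribution key."""
    return crc32(key.encode("utf-8")) % num_segments


def distribute(rows, num_segments: int = NUM_SEGMENTS):
    """Place each (key, amount) row on exactly one segment."""
    segments = [[] for _ in range(num_segments)]
    for key, amount in rows:
        segments[segment_for(key, num_segments)].append((key, amount))
    return segments


def partial_sum(segment_rows):
    """Aggregate locally, using only the rows stored on this segment."""
    totals = defaultdict(int)
    for key, amount in segment_rows:
        totals[key] += amount
    return dict(totals)


def gather(partials):
    """Coordinator step: merge the per-segment partial results."""
    merged = defaultdict(int)
    for partial in partials:
        for key, subtotal in partial.items():
            merged[key] += subtotal
    return dict(merged)


rows = [("east", 10), ("west", 5), ("east", 7), ("north", 3), ("west", 1)]
segments = distribute(rows)
result = gather(partial_sum(seg) for seg in segments)
print(result)
```

Because every row with the same key hashes to the same segment, no segment needs to read another segment's data during the local aggregation step, which is the property "shared-nothing" refers to.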
We added quite a few new features to make it a production-ready choice for enterprises. Some highlights include:
- Faster: we added a vectorized execution engine, aggregation pushdown, parallel execution, and incremental materialized views.
- Safer: we offer high availability and built-in data security mechanisms.
- Easier to use: we added queries across multiple clusters, PAX as a hybrid row-column storage format, dynamic tables to manage unstructured data, and a central console dashboard for easier management.
- Works with AI: within Cloudberry, you can build a vector database using pgvector and do full-text search with ZomboDB.
We are also more than happy to share the 2025 roadmap proposal with the developer community, and we are working on adding more tools to make it easier to bootstrap and use Cloudberry. Please join us to learn more about the project and help make the PostgreSQL community better together!
- Duration: 50 min
- Conference: Postgres Conference 2025
- Track: Variants and Cloud
- Difficulty: Easy