Citus Data Blog

Thoughts on scaling out PostgreSQL, big data architectures, distributed systems, and the PostgreSQL community.

Sharding Postgres with semi-structured data and its performance implications

If you’re looking at Citus, it’s likely you’ve outgrown a single-node database. In most cases your application is no longer performing as you’d like. If your data is still under 100 GB, a single Postgres instance will still work well for you and is a great choice. At levels beyond that Citus can help, but how you model your data has a major impact on how much performance you’re able to get out of the system.

Craig Kerstiens Jul 25, 2016

When to use unstructured datatypes in Postgres–Hstore vs. JSON vs. JSONB

Since Postgres started supporting NoSQL (via hstore, json, and jsonb), the question of when to use Postgres in relational mode vs. NoSQL mode has come up a lot. Do you entirely abandon traditional table structures and go with documents all the way? Or do you intermingle both? The answer, unsurprisingly, is: it depends. Each of the newer models, including hstore, JSON, and JSONB, has its ideal use cases. Here we’ll dig deeper into each and see when you should consider using them.

Craig Kerstiens Jul 14, 2016

Citus Cloud now Generally Available

At Citus we want to enable you to build real-time applications across large amounts of data with ease. One part of that is that Citus makes it simple for you to shard your data and use scale-out capabilities to leverage all your processing power. Another part is Citus Cloud: our managed, hosted offering of Citus running on AWS.

Today, taking advantage of Citus becomes even easier with Citus Cloud going into general availability. You can read on to discover what’s included with Citus Cloud or sign up to get started right away.

Craig Kerstiens Jul 13, 2016

PG Conf SV - Call For Papers Extended

PG Conf Silicon Valley is happening again this year in November, and we’re looking to make it even better and more informative than last year. To do that we’re looking to you, both to attend and to come speak. We’ve already received a lot of great...

Craig Kerstiens Jul 8, 2016

Announcing Citus 5.1, Improving your data ingest speed and experience

At Citus we want to make dealing with large operational and analytical workloads easier. Data ingestion speed is key, as ingestion is the necessary first step in working with any new database. Moreover, ingestion is something you’ll do repeatedly in testing and development, so the bulk-loading user experience is important as well. With the release of Citus 5.1 the experience of loading data is much better all around, and we’ve managed to sneak in a few other improvements as well. Read more below or give it a try today.

Sumedh Pathak May 20, 2016

Scalable Real-time Product Search using PostgreSQL with Citus

Product search is a common, yet sometimes challenging, use case for online retailers and marketplaces. It typically involves a combination of full-text search and filtering by attributes that differ for every product category. More complex use cases may have many sellers that offer the same product, but with a different price and different properties.

Marco Slot Apr 28, 2016

Introducing Citus Cloud

At Citus we believe in making databases easier. Key to that is empowering users to scale Postgres beyond the typical limits of a single node. Our latest Citus release makes it easier than ever to scale memory and processors while retaining access to familiar SQL queries and rich Postgres features. But database management can be tricky even in the single-node case, so we at Citus have been hard at work building the next step in our journey to make databases easier: Citus Cloud, an on-demand cloud service on top of Amazon Web Services available today in private beta.

Craig Kerstiens Apr 19, 2016

Master-less Distributed Queue with PG Paxos

The following post is contributed by 8Kdata

An introduction to pg_paxos

Pg_paxos is a database-level implementation of the widely renowned Paxos protocol, invented by Leslie Lamport. Pg_paxos offers a master-less (or multi-master, if you prefer) layer that can be enabled directly in the database without the need for external tools or transaction managers.

8Kdata Apr 13, 2016
