Posts
All the articles I've posted.
-
Data This Week #7
Netflix's RDS-to-Aurora PostgreSQL migration, DuckDB cost optimization, real-time dashboards with LISTEN/NOTIFY, Airflow on Minikube, and the dbt vs. SQLMesh debate in 2026.
-
Data This Week #6
Xiaomi's unified lakehouse with Doris & Paimon, Top-K in Postgres, dbt run monitoring, PostgreSQL internals, Netflix's DataJunction semantic layer, and schema evolution debates.
-
Data This Week #5
Spark DAG compilation deep dive, query federation with StarRocks, Pinterest's CDC migration, CyberArk AI with Iceberg, Databricks Zerobus Ingest, and data quality tooling debates.
-
Data This Week #4
How OpenAI scales PostgreSQL for ChatGPT, Dropbox's enterprise RAG, 3x faster Spark on Iceberg, dbt with DuckDB, local AWS Lakehouse setups, and new tool Alibaba ZVec.
-
Data This Week #3
BigQuery cost optimization, Apache Iceberg updates, MinIO alternatives, AWS SageMaker governance, and new tools like Nao — curated for data engineers.
-
Data This Week #2
RisingWave HTTP streaming to Iceberg, CedarDB string compression, Alibaba open-sources AliSQL (MySQL + DuckDB), Databricks Lakebase GA, and AI-powered data quality monitoring.
-
Data This Week #1
PostgreSQL dominance in 2025, Arrow-based database connectivity, Uber's petabyte-scale replication, Netflix AI graph search, and new tools OpenEverest and Pandas 3.0.