Linux 7.0 Cuts PostgreSQL Performance in Half on AWS — Fix Won't Be Quick

5 min read · 1 source · breaking
├── "This is a critical kernel-level regression that cannot be fixed by database tuning"
│  ├── crcastle (Hacker News, 171 pts) → read

crcastle submitted the Phoronix report highlighting that an AWS engineer found PostgreSQL performance halved on Linux 7.0 versus the 6.x series. The framing emphasizes that the fix 'may not be easy,' suggesting the regression is rooted deep in kernel architectural changes rather than being a simple patchable bug.

│  └── Michael Larabel (Phoronix) → read

Reports that the root cause lies in kernel-level changes introduced in the 7.0 release cycle — likely in the scheduler, memory management, or I/O subsystem — not in PostgreSQL itself. No amount of postgresql.conf tuning will recover the lost performance, making this a problem only the kernel community can address.

├── "The real-world impact is a budget emergency for cloud infrastructure, not just a technical curiosity"
│  └── top10.dev editorial (top10.dev) → read below

Argues that a 50% throughput regression on the most widely deployed database running on the most widely used cloud platform translates directly to doubled infrastructure costs. AWS runs millions of PostgreSQL instances across RDS, Aurora, and EC2, making this a financial crisis for anyone who upgrades without rigorous kernel-level benchmarking.

└── "The regression likely stems from a fundamental architectural change that will be difficult to resolve"
  └── AWS engineer (via Phoronix) → read

The AWS engineer who identified the regression through production-representative benchmarks assessed that fixing the issue 'may not be easy.' This suggests the performance drop originates from a deliberate architectural change in the kernel — such as scheduler or memory subsystem rework — rather than an accidental bug amenable to a quick point-release patch.

What Happened

An engineer at AWS has reported that PostgreSQL performance drops by approximately 50% when running on Linux 7.0 compared to the 6.x kernel series. The regression was identified through production-representative benchmarks on AWS infrastructure, and the findings have drawn significant attention on Hacker News (171 points), signaling broad concern across the infrastructure community.

The performance hit is not a minor edge case — it's a halving of throughput on one of the most widely deployed databases in the world, running on the most widely used cloud platform. The report, covered by Phoronix, indicates that the root cause lies in kernel-level changes introduced in the 7.0 release cycle, not in PostgreSQL itself. This is a critical distinction: no amount of `postgresql.conf` tuning will recover what the kernel took away.

Perhaps most concerning is the assessment that fixing the issue "may not be easy." This suggests the regression stems from a fundamental architectural change in the kernel — likely in the scheduler, memory management, or I/O subsystem — rather than a simple bug that can be patched in a point release.

Why It Matters

PostgreSQL and Linux are the foundational pairing for a massive portion of production infrastructure. AWS alone runs millions of PostgreSQL instances across RDS, Aurora PostgreSQL-compatible, and customer-managed EC2 deployments. A 50% throughput regression doesn't just mean slower queries — it means doubled infrastructure costs to maintain the same performance envelope, or degraded user experience for anyone who upgrades without testing.

When your kernel upgrade doubles your database bill, that's not a performance regression — it's a budget emergency.

The Linux kernel's major version transitions have a history of database performance surprises. The 5.x to 6.x transition brought its own set of scheduler and memory management changes that affected database workloads, though none as dramatic as what's being reported here. PostgreSQL is particularly sensitive to kernel behavior because of its process-per-connection architecture and heavy reliance on the OS page cache and buffer management. Unlike databases that manage their own memory pools more aggressively, Postgres trusts the kernel to do the right thing with shared buffers, huge pages, and I/O scheduling.

The Hacker News discussion reflects a community that has learned this lesson repeatedly. Database administrators and infrastructure engineers know that kernel upgrades on database servers are never routine — they're treated with the same caution as a major PostgreSQL version upgrade, complete with shadow traffic testing and gradual rollouts.

The "not easy" fix is the real story here. When a kernel developer says a fix won't be straightforward, it typically means one of two things: the regression is a side effect of a deliberate architectural improvement that benefits other workloads, or the fix requires rethinking assumptions that are baked into multiple subsystems. Either way, it means the Linux kernel community faces an uncomfortable trade-off: revert useful changes to restore database performance, or ask the database community to wait while a proper solution is engineered.

The Kernel-Database Interface Problem

This regression highlights a deeper structural issue in how databases and operating systems co-evolve. PostgreSQL's architecture was designed for an era of Linux kernel behavior that has been incrementally changing. The implicit contract between PostgreSQL and the Linux kernel — around process scheduling, memory page management, and I/O prioritization — has no formal specification, and it breaks silently.

Modern kernel development optimizes for a broad set of workloads: containers, microservices, cloud-native applications with many short-lived processes. Database workloads look fundamentally different — long-lived processes, large shared memory segments, sequential and random I/O patterns that don't match the assumptions of general-purpose schedulers. Every time the kernel improves for the common case, it risks degrading the database case.

This is not a new tension. The introduction of transparent huge pages (THP) years ago caused similar PostgreSQL performance disasters, leading to the now-standard advice to disable THP on database servers. The cgroup v2 migration introduced its own set of database-specific gotchas. Each kernel generation adds another item to the "things to check before upgrading your database server's kernel" list.
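As an illustration of the first item on that checklist, here is a minimal sketch for inspecting and disabling THP on a Linux database host. The sysfs path is the standard location; note the runtime write does not survive a reboot, so persist the setting through your boot configuration or a systemd unit.

```shell
# Show the active THP mode -- the bracketed value is the one in effect,
# e.g. "always madvise [never]"
cat /sys/kernel/mm/transparent_hugepage/enabled

# Disable THP until the next reboot (long-standing advice for PostgreSQL hosts);
# make it permanent via a kernel boot parameter or a systemd unit
echo never | sudo tee /sys/kernel/mm/transparent_hugepage/enabled
```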

The PostgreSQL community has historically responded to these issues by adding kernel-specific workarounds — configuration parameters that compensate for kernel behavior changes. But this approach has limits. At some point, the database needs a kernel that behaves predictably, not a pile of workarounds for kernel regressions.

What This Means for Your Stack

If you're running PostgreSQL on Linux in production — whether self-managed on EC2, on bare metal, or even on managed services where you control the kernel — do not upgrade to Linux 7.0 on database servers until this is resolved. Pin your kernel version explicitly. If you're using rolling-release distributions (Arch, Fedora, etc.) on database infrastructure, this is a good time to reconsider that choice.

If you're running managed PostgreSQL services (RDS, Aurora, Cloud SQL), your cloud provider will handle this — but it may delay their adoption of 7.0 features you were counting on. Contact your provider's support to ask about their kernel qualification timeline.

For teams doing infrastructure-as-code, add kernel version constraints to your database server provisioning. Your Terraform modules, Ansible playbooks, or CloudFormation templates should treat the kernel version as a first-class configuration parameter for database nodes, not something that floats with the latest AMI.

For capacity planning, factor in the possibility that kernel upgrades may not be performance-neutral. If your database servers are running at 60%+ CPU utilization, a 50% throughput regression means you're going from healthy to overloaded with a single `apt upgrade`. Build kernel version testing into your load testing pipeline.
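A minimal A/B harness for that pipeline: run an identical pgbench workload on each kernel and compare throughput. The pgbench flags are standard; the tps figures below are placeholders standing in for your two measured runs, not real results.

```shell
# On each kernel, initialize once and run the same workload:
#   pgbench -i -s 100 bench            # initialize at scale factor 100
#   pgbench -c 16 -j 4 -T 300 bench    # 16 clients, 4 threads, 5 minutes
# Record the tps figure pgbench reports from each run, then compare:

tps_6x=12000    # placeholder: tps measured on the 6.x kernel
tps_70=6000     # placeholder: tps measured on 7.0

ratio=$(awk -v a="$tps_6x" -v b="$tps_70" 'BEGIN { printf "%.2f", b / a }')
echo "7.0 retains ${ratio}x of 6.x throughput"
```

With the placeholder numbers the ratio comes out at 0.50 — the halving the report describes.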

Looking Ahead

This regression will likely accelerate two trends. First, expect more database vendors and cloud providers to invest in kernel-bypass technologies — io_uring, user-space networking, and direct storage access — that reduce their dependency on kernel behavior. Second, the conversation about whether PostgreSQL's process-per-connection model needs fundamental rethinking will get louder, as each kernel generation makes the assumptions underlying that architecture a little less reliable. For now, the practical advice is simple: don't upgrade, test everything, and watch the kernel mailing list for resolution timelines.

Hacker News 361 pts 109 comments

AWS Engineer Reports PostgreSQL Perf Halved by Linux 7.0, Fix May Not Be Easy

→ read on Hacker News
lfittl · Hacker News

It's worth reading this follow-up LKML post by Andres Freund (who works on Postgres): https://lore.kernel.org/lkml/yr3inlzesdb45n6i6lpbimwr7b25kqk...

galbar · Hacker News

It's not a good look to break userspace applications without a deprecation period where both old and new solutions exist, allowing for a transition period.

harshreality · Hacker News

Background on PREEMPT_LAZY: https://lwn.net/Articles/994322/

dsr_ · Hacker News

Nobody sensible runs the latest kernel; nobody running PG in production should be afraid of setting a non-default at either boot time or as a sysctl. So this will, most likely, be another step in building a PG database server (turn off pre-emption if your kernel is 7.0 or later and PG is pre-whateve

longislandguido · Hacker News

Anyone check to see if Jia Tan has submitted any kernel patches lately?
