high-perf-browser
Rating is derived from the repo's GitHub stars and shown for reference.
name: high-perf-browser
description: 'Optimize web performance through network protocols, resource loading, and browser rendering internals. Use when the user mentions "page load speed", "Core Web Vitals", "HTTP/2", "resource hints", "network latency", "render blocking", "TCP optimization", "service worker", or "critical rendering path". Also trigger when diagnosing slow page loads, optimizing time to first byte, choosing between WebSocket and SSE, or reducing bundle sizes. Covers TCP/TLS optimization, caching strategies, WebSocket/SSE, and protocol selection. For UI visual performance, see refactoring-ui. For font loading, see web-typography.'
license: MIT
metadata:
author: wondelai
version: "1.2.0"
High Performance Browser Networking Framework
A systematic approach to web performance grounded in how browsers, protocols, and networks actually work. Apply these principles when building frontend applications, setting performance budgets, configuring servers, or diagnosing slow page loads.
Core Principle
Latency, not bandwidth, is the bottleneck. Most web performance problems stem from too many round trips, not too little throughput. A 5x bandwidth increase yields diminishing returns; a 5x latency reduction transforms the user experience.
The foundation: Every request passes through DNS resolution, TCP handshake, TLS negotiation, and HTTP exchange before a single byte of content arrives — each step adding round-trip latency. High-performance applications minimize round trips, parallelize requests, and eliminate unnecessary network hops. Understanding the protocol stack is the prerequisite for meaningful optimization.
Scoring
Goal: 10/10. When reviewing or building web applications, rate performance 0-10 based on adherence to the principles below. A 10/10 means full alignment with all guidelines; lower scores indicate gaps to address. Always provide the current score and the specific improvements needed to reach 10/10.
The High Performance Browser Networking Framework
Six domains for building fast, resilient web applications:
1. Network Fundamentals
Core concept: Every HTTP request pays a latency tax — DNS lookup, TCP three-way handshake, TLS negotiation — before any application data flows. Reducing or eliminating these round trips is the single highest-leverage optimization.
Why it works: Light travels at a finite speed: a New York–London packet takes ~28ms one way regardless of bandwidth. These physics-level constraints cannot be solved with bigger pipes — only with fewer trips.
Key insights:
- TCP three-way handshake adds one full RTT before data transfer begins
- TCP slow start limits initial throughput to ~14KB (10 segments) in the first round trip — keep critical resources under this threshold
- TLS 1.2 adds 2 RTTs; TLS 1.3 reduces this to 1 RTT (0-RTT with session resumption)
- Head-of-line blocking in TCP means one lost packet stalls all streams on that connection
- Bandwidth-delay product caps in-flight data; high-latency links underutilize bandwidth
Code applications:
| Context | Pattern | Example |
|---|---|---|
| Connection warmup | Pre-establish connections to critical origins | <link rel="preconnect" href="https://cdn.example.com"> |
| DNS prefetch | Resolve third-party domains early (saves 20-120ms) | <link rel="dns-prefetch" href="https://analytics.example.com"> |
| TLS optimization | TLS 1.3 + session resumption | ssl_protocols TLSv1.3; with session tickets |
| Connection reuse | Keep-alive avoids repeated handshakes | Connection: keep-alive (default in HTTP/1.1+) |
See: references/network-fundamentals.md for TCP congestion control, bandwidth-delay product, and TLS handshake details.
2. HTTP Protocol Evolution
Core concept: HTTP evolved from a simple request-response protocol into a multiplexed, binary system. Choosing the right protocol version and configuring it properly eliminates entire categories of performance problems.
Why it works: HTTP/1.1 forces workarounds (domain sharding, sprites, concatenation) because it cannot multiplex. HTTP/2 multiplexes but inherits TCP head-of-line blocking; HTTP/3 (QUIC over UDP) eliminates it. Each generation removes a bottleneck — and makes the previous generation's workarounds counterproductive.
Key insights:
- HTTP/1.1 allows one outstanding request per TCP connection; browsers open 6 per host as a workaround
- HTTP/2 multiplexes unlimited streams over one connection — domain sharding becomes counterproductive
- HPACK header compression in HTTP/2 cuts repetitive header overhead by 85-95%
- HTTP/3 (QUIC) eliminates TCP head-of-line blocking and enables 0-RTT resumption and connection migration
- Prefer
103 Early Hintsover HTTP/2 Server Push (which over-pushes and is widely deprecated) - Connection coalescing lets one HTTP/2 connection serve multiple hostnames sharing a certificate
Code applications:
| Context | Pattern | Example |
|---|---|---|
| HTTP/2 migration | Remove HTTP/1.1 workarounds | Undo domain sharding, sprites, file concatenation |
| 103 Early Hints | Send preload hints before the full response | 103 with Link: </style.css>; rel=preload |
| QUIC/HTTP/3 | Advertise HTTP/3 on CDN or origin | Alt-Svc: h3=":443" header |
| Stream prioritization | Signal resource importance | CSS and fonts highest priority; images lower |
See: references/http-protocols.md for protocol comparison, migration strategies, and server push vs. Early Hints.
3. Resource Loading and Critical Rendering Path
Core concept: The browser must build the DOM, CSSOM, and render tree before painting pixels: HTML → DOM → CSSOM → Render Tree → Layout → Paint → Composite. Any resource that blocks this pipeline delays first paint.
Why it works: CSS is render-blocking (no paint until CSSOM is ready) while JavaScript is parser-blocking (<script> halts DOM construction until it downloads and executes) — so each needs a different optimization strategy. Every blocking resource adds latency directly to time-to-first-paint.
Key insights:
asyncdownloads in parallel and executes immediately (use for independent scripts);deferdownloads in parallel but executes after DOM parsing (use for most scripts)<link rel="preload">fetches critical resources at high priority now;rel="prefetch"fetches likely next-navigation resources at low priority- Inline above-the-fold CSS and async-load the rest to eliminate the render-blocking CSS request
- Fonts can block text rendering for up to 3s — use
font-display: swap
Code applications:
| Context | Pattern | Example |
|---|---|---|
| Critical CSS | Inline above-the-fold styles in <head> |
<style>/* critical */</style> + async full CSS |
| Script loading | defer by default; async for independents |
<script src="app.js" defer></script> |
| Resource hints | Preload critical fonts, hero images | <link rel="preload" href="font.woff2" as="font" crossorigin> |
| Image optimization | Lazy-load below-fold; modern formats | <img loading="lazy" src="photo.avif" srcset="..."> |
See: references/resource-loading.md for async/defer behavior, resource hint strategies, and image and font optimization.
4. Caching Strategies
Core concept: The fastest network request is one that never happens. Layer caches — browser memory, disk, service worker, CDN, origin — to eliminate round trips for repeat visitors.
Why it works: Cache-Control headers tell the browser and intermediaries exactly how long a response stays valid; content-hashed URLs make aggressive immutable caching safe. Each cache hit eliminates a full network round trip.
Key insights:
Cache-Control: no-cachestill caches but revalidates every time;no-storenever caches — don't confuse themETag/Last-Modifiedenable conditional requests (304 Not Modified) that skip the body transfer- Service workers provide a programmable cache layer that works offline (cache-first shell, network-first dynamic content)
- Misconfigured
Varyheaders cause CDN cache pollution — serve the wrong encoding or format to the wrong client
Code applications:
| Context | Pattern | Example |
|---|---|---|
| Static assets | Immutable cache + hash busting | style.a1b2c3.css with Cache-Control: max-age=31536000, immutable |
| HTML documents | Revalidate on every request | Cache-Control: no-cache with ETag |
| API responses | Short TTL + background refresh | Cache-Control: max-age=60, stale-while-revalidate=3600 |
| CDN config | Cache at edge with correct Vary | Vary: Accept-Encoding, Accept |
See: references/caching-strategies.md for cache hierarchy, service worker patterns, and CDN configuration.
5. Core Web Vitals Optimization
Core concept: Core Web Vitals — LCP, INP, CLS — are Google's user-centric metrics covering loading, interactivity, and visual stability. They impact search ranking and reflect real user experience.
Why it works: These metrics measure what users perceive, not what servers report: a fast TTFB means nothing if the hero image loads late (LCP) or main-thread JavaScript blocks input (INP). Optimizing for them is optimizing for real perception.
Key insights (targets):
- LCP < 2.5s — optimize the largest visible element (hero image, heading block, video poster)
- INP < 200ms — keep the main thread free; break long tasks
- CLS < 0.1 — reserve space for dynamic content before it loads
- TTFB < 800ms and FCP < 1.8s feed everything downstream
- Measure with Real User Monitoring (RUM) in production — lab/synthetic tests miss real-device and network variance
Code applications:
| Context | Pattern | Example |
|---|---|---|
| LCP | Preload LCP element; raise its priority | <img src="hero.webp" fetchpriority="high"> |
| INP | Break long tasks; yield to main thread | scheduler.yield() or setTimeout chunking |
| CLS | Reserve space for async content | <img width="800" height="600"> or CSS aspect-ratio |
| Performance budget | Block deploys that exceed thresholds | LCP < 2.5s, INP < 200ms, CLS < 0.1 in CI |
See: references/core-web-vitals.md for measurement tools, debugging workflows, and optimization checklists.
6. Real-Time Communication
Core concept: When data must flow continuously, the transport choice — WebSocket, SSE, or long polling — determines latency, resource usage, and scalability.
Why it works: HTTP's request-response model adds overhead to every real-time update. WebSocket offers full-duplex with ~2-byte framing; SSE offers simpler server-to-client push over plain HTTP. Match the transport to the data flow direction and frequency instead of defaulting to the most powerful option.
Key insights:
- WebSocket: bidirectional (chat, gaming, collaborative editing); SSE: server-to-client only, auto-reconnects, proxy-friendly, simpler
- Long polling is a fallback only — high overhead from repeated HTTP requests
- Each WebSocket is a separate TCP connection that bypasses HTTP/2 multiplexing
- Send heartbeat/ping frames — mobile networks silently drop idle connections
- Reconnect with exponential backoff and queue messages while disconnected
Code applications:
| Context | Pattern | Example |
|---|---|---|
| Chat / collaboration | WebSocket + heartbeat + reconnection | new WebSocket('wss://...') with ping every 30s |
| Live feeds / notifications | SSE for server-to-client streaming | new EventSource('/api/updates') |
| Connection resilience | Exponential backoff on reconnect | 1s, 2s, 4s, 8s... capped at 30s |
| Scaling | Pub/sub broker behind WebSocket servers | Redis Pub/Sub or NATS |
See: references/real-time-communication.md for WebSocket lifecycle, SSE patterns, and scaling strategies.
Common Mistakes
| Mistake | Why It Fails | Fix |
|---|---|---|
| Adding bandwidth to fix slow pages | Latency is the bottleneck, not throughput | Reduce round trips: preconnect, cache, CDN |
| Loading all JS upfront | Parser-blocking scripts delay paint and interactivity | Code-split; defer; lazy-load non-critical modules |
| No resource hints | Browser discovers critical resources too late | preconnect + preload for above-fold criticals |
Missing Cache-Control / no-store everywhere |
Every visit re-downloads everything | Proper max-age + content hashing |
| Ignoring CLS | Layout shifts destroy trust and ranking | Explicit dimensions on images, embeds, ads |
| WebSocket for everything | Needless complexity when SSE/polling suffices | Match transport to data flow; SSE for server push |
| Domain sharding on HTTP/2 | Defeats multiplexing; extra TCP connections | Consolidate origins; let HTTP/2 multiplex |
| No compression | Text resources transfer at full size | Enable Brotli (preferred) or Gzip on server/CDN |
Quick Diagnostic
| Question | If No | Action |
|---|---|---|
| Is TTFB under 800ms? | Server or network too slow | CDN, server caching, check backend |
| Is LCP under 2.5s? | Largest element loads too late | Preload LCP resource; fetchpriority="high" |
| Is INP under 200ms? | Main thread blocked | Break long tasks; defer non-critical JS |
| Is CLS under 0.1? | Elements shift after render | Explicit dimensions; reserve space |
| Are static assets content-hashed and cached? | Repeat visitors re-download | Hashed filenames + Cache-Control: immutable |
| Is HTTP/2 or HTTP/3 enabled? | No multiplexing or header compression | Enable HTTP/2 on server; HTTP/3 via CDN |
| Are render-blocking resources minimized? | CSS and sync JS delay first paint | Inline critical CSS; defer scripts; prune unused CSS |
| Is compression enabled (Brotli/Gzip)? | Uncompressed text transfers | Enable Brotli on server/CDN; Gzip fallback |
Reference Files
- network-fundamentals.md: TCP handshake, congestion control, TLS optimization, DNS resolution, head-of-line blocking
- http-protocols.md: HTTP/1.1 workarounds, HTTP/2 multiplexing, HTTP/3 and QUIC, migration strategies
- resource-loading.md: Critical rendering path, async/defer, resource hints, image and font optimization
- caching-strategies.md: Cache-Control headers, service workers, CDN configuration, cache invalidation
- core-web-vitals.md: LCP, INP, CLS optimization, measurement tools, performance budgets
- real-time-communication.md: WebSocket, SSE, long polling, connection management, scaling
Further Reading
Based on Ilya Grigorik's comprehensive guide to browser networking and web performance:
- "High Performance Browser Networking" by Ilya Grigorik (the complete reference for networking protocols, browser internals, and performance optimization)
- hpbn.co -- Free online edition maintained by the author
About the Author
Ilya Grigorik is a web performance engineer who spent over a decade at Google working on Chrome, web platform performance, and HTTP standards, and co-chaired the W3C Web Performance Working Group. His book High Performance Browser Networking (O'Reilly, 2013) is widely regarded as the definitive reference on how browsers interact with the network.