Design Uber / Ride-Sharing

Geospatial Indexing, Ride Matching, Surge Pricing, ETA Calculation, and Real-Time Location Tracking

Uber processes 25M+ rides per day across 10,000+ cities. The core challenges: tracking millions of drivers sending location updates every few seconds via WebSockets, performing geospatial queries to match riders with nearby drivers in real time using geohash or H3 indexing, computing dynamic ETA factoring in traffic, road networks, and historical patterns, implementing surge pricing that balances supply and demand, and building a matching algorithm that optimizes for rider wait time, driver utilization, and fairness. At peak, the system handles millions of location updates per second and completes matches in under 5 seconds.

Geospatial Indexing Visualizer (Geohash)

Geohash divides the world into a grid of cells. Each character added to the hash increases precision. Nearby locations share common prefixes, enabling efficient spatial queries. Click a cell to see its geohash at different precision levels.

Precision level: 3 chars

Click a cell to see its geohash encoding

Ride Matching Algorithm Simulator

When a rider requests a ride, the system finds nearby drivers, scores them by distance, rating, and acceptance rate, then matches the best candidate. Click on the grid to place a rider, then watch the matching algorithm run.

Place a rider and click "Run Matching" to simulate the algorithm

Capacity Estimation

Estimate the infrastructure requirements for a ride-sharing platform at scale.

Active riders (millions): 50

Active drivers (millions): 5

Rides per day (millions): 25

Avg ride duration (min): 15

Location updates/sec per driver: 4

Location updates/sec--

Rides/sec--

Storage/day--

Peak QPS--

WebSocket conns--

Server count--

Redis memory--

Kafka throughput--

ETA Calculation Engine

ETA combines straight-line distance with road network factors and real-time traffic conditions. Click on the grid to set start and end points, then adjust the time of day to see how traffic affects the estimate.

Time of day: 12:00 (Afternoon)

Set start and end points to calculate ETA

Surge Pricing Calculator

Surge pricing dynamically adjusts fares based on the ratio of ride requests (demand) to available drivers (supply). When demand exceeds supply, prices increase to incentivize more drivers and manage rider expectations.

Ride requests (demand): 100

Available drivers (supply): 80

1.0x1.5x2.0x2.5x3.0x+

1.0x

Balanced

Demand/Supply Ratio--

Base fare$12.00

Surge fare--

Tier--

Architecture

Rider App

Driver App

→

API Gateway

→

Trip Service

↓

Location Service

Redis (driver coords)

↓

Matching Service

Geohash + scoring

↓

Pricing Service

Surge + fare calc

↓

PostgreSQL

Trips, Users

↓

Kafka

Events, Logs

↓

Maps Service

External API (ETA)

Key Design Decisions

Geohash vs QuadTree vs H3 Geohash String-based, easy to store in Redis
Prefix matching for proximity
Edge cases at cell boundaries
Simple to implement
 
vs
 QuadTree Adaptive resolution per region
Better for non-uniform distribution
In-memory tree structure
Complex to distribute across nodes
 
vs
 H3 (Uber's choice) Hexagonal grid, uniform distances
Hierarchical resolution levels
No boundary edge cases
Open source, battle-tested
 

Location Storage: Redis vs In-Memory

Redis with GEO commands is the recommended choice. GEOADD stores driver locations, GEORADIUS queries nearby drivers within a given radius. Redis handles 100K+ operations/sec per node. For extreme scale, shard by geohash prefix. In-memory stores are faster but lose data on restart and are harder to distribute.

Push vs Pull for Driver Locations

Push model via WebSockets — drivers push location updates every 3-5 seconds. The server does not poll. WebSocket connections are persistent, reducing connection overhead. For 5M active drivers, that means ~5M concurrent WebSocket connections distributed across a fleet of connection servers. Use consistent hashing to route drivers to specific servers.