How Things Work - A Visual Guide to Technology

Every time you tap your phone screen, swipe through photos, or pinch to zoom, you are exploiting a fundamental property of physics: capacitance. The capacitive touchscreen has become so ubiquitous that we rarely pause to consider how it works - how a sheet of glass can detect the precise location of your fingertip, track multiple fingers simultaneously, and reject accidental touches from your palm. The engineering is elegant, the physics is straightforward, and the implications for human-computer interaction are profound.

The physics of capacitance

Capacitance is the ability of a system to store electric charge. The simplest capacitor is two conductive plates separated by an insulator. When voltage is applied, charge accumulates on the plates, creating an electric field between them. The amount of charge stored for a given voltage - the capacitance - depends on the plate area, the separation distance, and the properties of the insulator.

Your body is electrically conductive - you are mostly salt water, after all. When your finger approaches a charged conductor, it distorts the electric field and changes the capacitance of the system. A capacitive touchscreen detects these changes and uses them to determine where you touched.

The capacitance of a parallel-plate capacitor follows a simple formula: C = ε₀ × εᵣ × A / d, where ε₀ is the permittivity of free space, εᵣ is the relative permittivity of the insulator, A is the plate area, and d is the separation distance. A fingertip near an electrode effectively creates a new capacitor in parallel with the existing electrode capacitance. The human body adds perhaps 100-200 picofarads to ground, which couples to the electrode and changes its effective capacitance by a measurable amount - typically a few percent.

// Capacitance of a parallel plate capacitor
// C = ε₀ × εᵣ × A / d
function capacitance(
  area: number,           // plate area in m²
  distance: number,       // separation in m
  dielectric: number = 1  // relative permittivity
): number {
  const EPSILON_0 = 8.854e-12 // F/m (vacuum permittivity)
  return EPSILON_0 * dielectric * area / distance
}

The electrode grid

A modern capacitive touchscreen contains a grid of transparent electrodes made from Indium Tin Oxide (ITO). This remarkable material is both electrically conductive and optically transparent - essential properties when you need to sense touch while still seeing the display beneath.

The electrodes are arranged in rows and columns, typically on two separate layers or on opposite sides of a single substrate. The rows might be the 'drive' electrodes that transmit signals, while the columns are 'sense' electrodes that detect changes. Where each row crosses each column, there is a small capacitance between them - this is the mutual capacitance at that intersection.

ITO has limitations: indium is expensive and increasingly scarce, and ITO films can crack when flexed. Alternatives are emerging: silver nanowire networks offer better flexibility, carbon nanotubes provide similar transparency with better durability, and metal mesh patterns (thin enough to be invisible) work well for larger displays. Apple's newer devices use a metal mesh approach for some sensing layers, while flexible displays often use silver nanowires that can bend without breaking.

Mutual Capacitance Sensing Grid

TX0

TX1

TX2

TX3

TX4

TX5

TX6

TX7

RX0

RX1

RX2

RX3

RX4

RX5

RX6

RX7

Active Touch Points

Click grid to simulate touch

Sensing Mode

Mutual capacitance measures the coupling between TX (transmit) and RX (receive) electrodes at each intersection. Touch reduces this coupling.

How Mutual Capacitance Works

• TX electrodes transmit a signal while RX electrodes measure the received charge
• Each intersection forms a capacitor between the crossing conductors
• A finger near an intersection couples some charge to ground, reducing measured capacitance
• True multi-touch: each node is measured independently (no ghost points)

Interactive capacitive grid demonstration. Click to simulate touch and see how the capacitance changes at each intersection.

Mutual capacitance sensing

When your finger approaches an intersection, it absorbs some of the electric field that would otherwise couple between the electrodes. The measured signal at that intersection decreases. By scanning through all the rows and columns, the controller builds a complete map of capacitance changes across the screen - a 'touch image' showing exactly where fingers are making contact.

This mutual capacitance approach is what enables multi-touch. Because each intersection is measured independently, the system can track multiple fingers without ambiguity. Early capacitive screens used a simpler approach - self-capacitance - that could only reliably track one or two touches.

The scanning process happens continuously - typically 60 to 240 times per second on modern devices. The controller drives one row at a time with an AC signal while measuring the response on all columns simultaneously. With perhaps 30 rows and 50 columns, that's 1,500 intersection points scanned dozens of times per second. The entire scan takes just a few milliseconds, fast enough to track even rapid gestures with minimal lag.

Self-capacitance vs mutual capacitance

An older and simpler approach is self capacitance sensing. Instead of measuring the capacitance between electrodes, self capacitance measures the capacitance of each electrode to ground. When your finger - which is effectively grounded through your body - approaches an electrode, its self capacitance increases.

Self capacitance is cheaper to implement but has a critical limitation: it cannot accurately track multiple touches. If you place two fingers on the screen at positions (A, B) and (C, D), the row electrodes see touches at rows A and C, while column electrodes see touches at columns B and D. The system cannot tell whether the actual touches are at (A, B) and (C, D) or at (A, D) and (C, B) - these ghost points are indistinguishable.

Many modern touchscreens use a hybrid approach, combining self-capacitance for initial detection with mutual capacitance for disambiguation. Self-capacitance is more sensitive, especially for hovering detection before actual contact. Once a potential touch is detected, the system switches to mutual capacitance scanning in that region for accurate tracking. This combination provides both sensitivity and multi-touch accuracy while managing power consumption.

Signal processing

The capacitance changes from touch are tiny - often just a few femtofarads (10^-15 farads) against a background of picofarads. Detecting these small changes requires sophisticated analog circuitry and careful shielding from electrical noise. Display panels themselves are significant noise sources, with their rapidly switching pixels generating electromagnetic interference.

Touch Signal Processing Pipeline

Raw SignalStep 1 of 6

Analog voltage from sensor with noise

Raw Signal

The capacitive sensor outputs an analog voltage proportional to the capacitance at each node. Environmental noise from display drivers, chargers, and RF interference contaminates this signal.

Touch signal processing pipeline from raw capacitance measurements to final touch coordinates.

The touch controller uses various techniques to extract signal from noise: differential measurement rejects common-mode interference, averaging multiple samples reduces random noise, and synchronizing measurements with display blanking periods avoids the worst interference. Some controllers can even adapt their noise rejection in real-time based on detected interference patterns.

Charger noise is a particular challenge. Cheap or damaged chargers can inject significant noise into the device's ground plane, creating interference that looks like touch input. This is why some phones become nearly unusable while charging from low-quality adapters - the touch controller cannot distinguish real touches from electrical noise. High-quality devices include additional filtering and ground isolation to maintain touch accuracy regardless of charging conditions.

Touch tracking algorithms

Touch tracking algorithms use prediction and matching. Based on a touch's position and velocity, the system predicts where it will be in the next frame. Each detected touch in the new frame is matched to the prediction that best fits. When touches get too close together or cross paths, the algorithms must make educated guesses - occasionally getting it wrong, which is why fast crossing gestures sometimes confuse touch interfaces.

Most touch controllers implement some form of Kalman filtering - a mathematical technique that combines noisy measurements with motion predictions to estimate the true touch position. The filter maintains a model of position, velocity, and sometimes acceleration, updating the model with each new measurement while accounting for measurement uncertainty. This smooths jittery input while preserving responsiveness to sudden direction changes.

Palm rejection is another algorithmic challenge. When you write with a stylus, your palm rests on the screen. The system must distinguish between the small, intentional stylus touches and the large, accidental palm contact. Machine learning models trained on vast datasets of touch patterns help modern devices make these distinctions reliably.

Palm rejection uses multiple cues: palm touches are typically larger in area, lower in pressure (on devices that measure it), and appear at screen edges while other input happens centrally. The contact shape matters too - fingertips are roughly circular while palm edges are elongated. Some systems even use the touch timing: a palm typically lands as a slow, spreading contact, while intentional touches are more immediate. Combining these features in a classifier lets the system reject palms while still accepting legitimate large touches.

Latency and responsiveness

Early touchscreens had latencies of 100ms or more - noticeably sluggish. Modern smartphones achieve 20-40ms, with some gaming phones pushing below 10ms. Reducing latency makes interfaces feel more direct and responsive, as if you are manipulating objects rather than sending commands to a computer.

Achieving low latency requires optimization at every stage: fast sensor scanning, efficient signal processing, quick communication to the application processor, and minimal delay in the graphics pipeline. Apple's ProMotion displays scan touch at 120Hz - twice the rate of standard screens - enabling latencies that can be felt as increased responsiveness even if not consciously perceived.

Some systems use touch prediction to reduce perceived latency. By extrapolating the touch trajectory a frame or two into the future, the system can begin rendering a response before the touch data fully arrives. This can make 40ms latency feel like 20ms for smooth, predictable motions. The prediction can be wrong for sudden direction changes, requiring a correction that might be visible as a small jump - a trade-off between perceived responsiveness and occasional visual artifacts.

Beyond basic touch

Advanced touchscreens can detect more than just contact. Hover detection senses a finger approaching before it touches - the capacitance change begins before physical contact. This enables preview behaviors and can help predict where a touch will land.

Some devices add pressure sensitivity, though this typically requires additional sensor technology beyond basic capacitive touch. Apple's Force Touch and 3D Touch used separate strain gauges or capacitive pressure sensors beneath the screen to measure how hard you pressed. These enable new interaction possibilities - like the difference between a tap and a long press, but in the pressure domain.

Emerging touch technologies push further. Ultrasonic touch systems use sound waves instead of capacitance, enabling touch sensing through thick gloves or even underwater. In-display fingerprint sensors use the same basic principles but with much higher resolution to capture fingerprint ridges. Some research systems can identify individual users by the unique electrical properties of their touch. The capacitive touchscreen revolutionized computing interfaces; its successors may be equally transformative.

The capacitive touchscreen transformed how we interact with computers, replacing buttons and styluses with direct manipulation. The physics is just capacitance - a property known for centuries - but the engineering required to make it work reliably at scale represents decades of refinement. Every tap, swipe, and pinch relies on precisely measuring tiny changes in electric fields, processing noisy signals in real-time, and translating physical gestures into digital commands.