Class D modulators can be implemented in many ways, supported by a large quantity of related research and intellectual property. This article will only introduce fundamental concepts. All Class D modulation techniques encode information about the audio signal into a stream of pulses. Generally, the pulse widths are linked to the amplitude of the audio signal, and the spectrum of the pulses includes the desired audio signal plus undesired (but unavoidable) high-frequency content. The total integrated high-frequency power in all schemes is roughly the same, since the total power in the time-domain waveforms is similar, and by Parseval's theorem, power in the time domain must equal power in the frequency domain. However, the distribution of energy varies widely: in some schemes, there are high energy tones atop a low noise floor, while in other schemes, the energy is shaped so that tones are eliminated but the noise floor is higher.
The most common modulation technique is pulse-width modulation (PWM). Conceptually, PWM compares the input audio signal to a triangular or ramping waveform that runs at a fixed carrier frequency. This creates a stream of pulses at the carrier frequency. Within each period of the carrier, the duty ratio of the PWM pulse is proportional to the amplitude of the audio signal. In the example of Figure 7, the audio input and triangular wave are both centered around 0 V, so that for 0 input, the duty ratio of the output pulses is 50%. For large positive input, it is near 100%, and it is near 0% for large negative input. If the audio amplitude exceeds that of the triangle wave, full modulation occurs, where the pulse train stops switching, and the duty ratio within individual periods is either 0% or 100%.
Figure 7. PWM concept and example.
PWM is attractive because it allows 100-dB or better audio-band SNR at PWM carrier frequencies of a few hundred kilohertz—low enough to limit switching losses in the output stage. Also, many PWM modulators are stable up to nearly 100% modulation, in concept permitting high output power-up to the point of overloading. However, PWM has several problems: First, the PWM process inherently adds distortion in many implementations (Reference 4); next, harmonics of the PWM carrier frequency produce EMI within the AM radio band; and finally, PWM pulse widths become very small near full modulation. This causes problems in most switching output-stage gate-driver circuits—with their limited drive capability, they cannot switch properly at the excessive speeds needed to reproduce short pulses with widths of a few nanoseconds. Consequently, full modulation is often unattainable in PWM-based amplifiers, limiting maximum achievable output power to something less than the theoretical maximum—which considers only power-supply voltage, transistor on resistance, and speaker impedance.
An alternative to PWM is pulse-density modulation (PDM), in which the number of pulses in a given time window is proportional to the average value of the input audio signal. Individual pulse widths cannot be arbitrary as in PWM, but are instead "quantized" to multiples of the modulator clock period. 1 bit sigma-delta modulation is a form of PDM.