Introduction
In early semiconductor processing, a "gate-first" flow was the standard method for fabricating metal-oxide-semiconductor field-effect transistors (MOSFETs) . In this legacy approach, the gate electrode—typically made of doped polysilicon—was patterned before the formation of the source and drain (S/D) regions . However, as transistor scaling advanced toward sub-30nm dimensions, the conventional silicon dioxide gate dielectric met its physical limits due to excessive quantum tunneling leakage .
To overcome this barrier, high-k metal gate (HKMG) technology was introduced to maintain strong capacitive coupling with the channel without accelerating leakage . Integrating high-k dielectrics and metal gate electrodes, however, presented a significant thermal budget challenge: the extremely high temperatures required to activate S/D dopants caused the delicate metal gates and high-k materials to interdiffuse, recrystallize, and shift their threshold voltage (Vt) .
To protect these sensitive gate materials from thermal degradation, the industry shifted to a "gate-last" or replacement metal gate (RMG) scheme . At the center of this integration logic is the dummy gate (DG)—a temporary, sacrificial placeholder gate structure formed early in the front end of line (FEOL) process flow . The dummy gate (DG) defines the physical channel area, stands resilient against the high thermal budgets of dopant activation, and is subsequently removed and replaced by the final, high-performance metal gate stack .
Physics & Mechanism
To understand why a dummy gate (DG) is necessary, one must examine the solid-state physics of transistor doping and interface thermodynamics . The modulation of silicon conductivity depends heavily on the introduction of donor or acceptor impurities, which shift the Fermi level and establish low-resistance S/D regions . These dopants must be thermally activated via high-temperature annealing, which restores the damaged silicon crystal lattice following ion implantation .
However, high-temperature treatments pose a severe threat to metal-oxide-semiconductor interfaces . If a metal gate is present during this high-temperature anneal, thermal energy drives the diffusion of metal atoms into the underlying gate dielectric, inducing interface states and Fermi level pinning . This pinning alters the metal-semiconductor work function, leading to unpredictable shifts in the transistor's threshold voltage . Furthermore, high temperatures can cause the amorphous high-k dielectric (such as hafnium-based oxides) to crystallize, creating grain boundaries that act as high-leakage paths for electrical current (Engineering Practice).
The integration of a dummy gate (DG) solves this physical dilemma by decoupling the high-temperature S/D activation annealing from the work function metal deposition:
[Dummy Gate Patterning]
│
▼
[High-Temp S/D Dopant Annealing] <-- Protected by stable Dummy Gate
│
▼
[Dummy Gate Removal (Etching)]
│
▼
[Low-Temp HKMG Deposition] <-- Protected from high-temp degradation
By utilizing a sacrificial material (typically amorphous silicon or polysilicon) as a dummy gate (DG), the device undergoes high-temperature S/D activation while the active gate region remains safely filled with the dummy material . The dummy gate (DG) acts as a mechanical barrier and physical template, preserving the pristine geometry of the channel .
Once the thermal steps are complete, the dummy gate (DG) is selectively etched away to reveal a clean trench . Because the final work function metals and gate electrodes are subsequently deposited at low temperatures, they do not undergo high-temperature degradation . This preserves the precise electrostatic field control of the gate over the inversion layer . Ultimately, this process architecture ensures that dual gate configurations (optimized separately for NMOS and PMOS) retain their target work function values, securing a steep subthreshold swing (SS) and reducing off-state leakage current .
Process Principles
The fabrication and subsequent replacement of a dummy gate (DG) involve a highly sensitive sequence of deposition, planarization, and etching steps where process parameters directionally affect device outcomes .
Sacrificial Film Deposition
The process begins with the deposition of the dummy gate (DG) material, typically amorphous silicon or polysilicon, over a thin oxide layer that protects the underlying single-crystal silicon substrate . This deposition is usually performed via low-pressure chemical vapor deposition (LPCVD) or plasma-enhanced chemical vapor deposition (PECVD) (Engineering Practice). Increasing the deposition temperature generally increases the crystallinity of the silicon film, which directionally improves its mechanical stability but can make subsequent selective etching more difficult due to grain-boundary-induced etch rate variations . Conversely, lower temperatures favor amorphous silicon, which offers higher isotropic etch uniformity and smoother line edges (Engineering Practice).
Chemical Mechanical Planarization (CMP)
Following dummy gate patterning, S/D epitaxy, and interlayer dielectric (ILD) oxide deposition, the wafer must be planarized to expose the top surface of the dummy gate . This step relies on chemical mechanical planarization (CMP), where the removal rate is heavily influenced by the pattern density of the layout . If the local pattern density is too low, the CMP pad bends into the wider oxide regions, causing "dishing" and erosion that reduce the final height of both the oxide and the dummy gate .
To balance these mechanical forces, chip layouts include "inactive gates" (or dummy features) that act as planarization loads [A2, A3]. Retaining these inactive gates throughout the CMP steps minimizes height variations between dense and isolated gate structures, ensuring a uniform remaining dummy gate height across the entire wafer [A2, A3].
Selective Etching
The core of the RMG scheme is the complete and highly selective removal of the dummy gate (DG) without damaging the surrounding structures . Wet etching is typically favored for its high selectivity, often utilizing basic solutions like tetramethylammonium hydroxide (TMAH) to dissolve the amorphous silicon or polysilicon dummy gate .
The chemical reaction rate of TMAH etching is highly sensitive to temperature and solution concentration (Engineering Practice). Increasing the etching temperature directionally accelerates the removal rate but can degrade selectivity to the surrounding silicon nitride spacers or oxide isolation structures . If the chemical selectivity of the dummy gate removal process is insufficient, the surrounding spacers may become recessed, which alters the final channel length and can lead to gate-to-source/drain short-circuits during subsequent metal deposition .
High Etch Temperature ──► Faster DG Removal Rate ──► Lower Selectivity ──► Spacer Recess / Channel Damage
Challenges & Failure Modes
Implementing a dummy gate (DG) integration flow introduces several physical and chemical failure modes that must be carefully managed to maintain line yield .
CMP Dishing and Gate Height Inconsistency
If the layout density of dummy gates is non-uniform, local variations in CMP polishing rates will lead to systematic gate height deviations . When the dummy gate is later replaced by metal, these height variations translate directly into differences in metal gate volume and cross-sectional area [A2, A3]. From an electrical standpoint, a reduced metal gate cross-section increases gate line resistance, which degrades high-frequency switching performance and causes asymmetric RC propagation delays across the circuit [T4, A2].
Etch Residues and Void Formation
As transistor dimensions shrink, the aspect ratio of the dummy gate trench increases significantly . During the dummy gate (DG) removal process, transport of the chemical etchant to the bottom of these narrow trenches becomes mass-transport limited, governed by the boundary layer velocity and diffusion dynamics . If etchant diffusion is restricted, incomplete removal of the sacrificial silicon occurs, leaving behind residues at the trench bottom .
These residues block the subsequent atomic layer deposition (ALD) of work function metals, leading to void formation or preventing the metal gate from contacting the gate dielectric altogether . This failure mode manifests electrically as a catastrophic loss of gate electrostatic control, resulting in severe subthreshold leakage or complete device failure .
Substrate Erosion and Spacer Pull-Down
During the selective wet clean or dry etch of the dummy gate (DG), the chemistry must not attack the underlying thin gate oxide or the single-crystal channel material . If the etchant chemical selectivity is poor, or if the protective bottom oxide is too thin, the etchant will penetrate the substrate and erode the channel region (Engineering Practice). This erosion distorts the channel geometry, increases surface scattering, and degrades carrier mobility . Additionally, spacer pull-down (the vertical loss of the side spacer height) reduces the physical distance between the replacement metal gate and the heavily doped S/D regions, dramatically increasing parasitic gate-to-S/D capacitance and elevating the risk of dielectric breakdown .
Incomplete Etchant Diffusion ──► DG Residues at Trench Bottom ──► ALD Metal Blocking ──► Void Formation & Vt Shift
Technology Node Evolution
The engineering of the dummy gate (DG) has undergone dramatic shifts as CMOS architectures transitioned from planar to three-dimensional geometries .
28nm Node (Planar RMG)
At the 28nm Planar Flow, the RMG scheme using a dummy gate (DG) was first adopted at scale to implement stable high-k gate-last integration . The dummy gate was a relatively simple, flat polysilicon feature patterned on a planar silicon substrate . The primary engineering focus was maintaining gate CD uniformity across the wafer and preventing CMP dishing in open areas, which was resolved by inserting dummy fill patterns in the layout .
14nm Node (FinFET Transition)
With the transition to the 14nm FinFET node, the dummy gate (DG) had to wrap around three-dimensional silicon fins rather than sitting on a flat surface . This structural change introduced severe topological challenges (Engineering Practice). The sacrificial silicon had to be deposited conformally over the fins without leaving internal voids, and the subsequent anisotropic dry etching had to completely clear the dummy gate material from the narrow spaces between adjacent fins without eroding the fin structures themselves . Dual gate integration also matured here, requiring precise masking to selectively replace dummy gates for NMOS and PMOS separately while retaining mechanical integrity .
7nm Node and Beyond (Aggressive Pitch Scaling)
At the 7nm FinFET node, the contact poly pitch scaled down aggressively . This pitch scaling required extremely thin dummy gates with high aspect ratios, making them highly susceptible to mechanical bending or collapse during wet processing . The removal process shifted toward highly advanced dry chemical etching and specialized wet chemistries that could penetrate high-aspect-ratio trenches without capillary-force-induced pattern collapse (Engineering Practice).
Planar (28nm) ──────► FinFET (14nm) ──────► FinFET (7nm) ──────► Nanosheet (GAA)
(Flat DG) (3D Fin-Wrapping) (High Aspect Ratio) (Inner Spacer + Sacrificial SiGe)
Gate-All-Around (GAA) Nanosheets
In the latest gate-all-around (GAA) nanosheet architectures, dummy gate integration logic has evolved further . In GAA, the channel consists of suspended silicon nanosheets separated by sacrificial silicon-germanium (SiGe) layers (Engineering Practice). Here, the dummy gate (DG) is patterned around the epitaxial Si/SiGe superlattice . After dummy gate removal, an additional, highly selective lateral etch is performed to remove the sacrificial SiGe layers, releasing the silicon nanosheets so that the replacement metal gate can completely surround each channel .
Related Processes
The dummy gate (DG) module does not exist in isolation; it is highly dependent on and integrated with several adjacent FEOL process steps .
Lithography and Bottom Anti-Reflective Coating (BARC)
The initial patterning of the dummy gate (DG) requires advanced lithography to define the target gate length . To prevent light reflection from the substrate during exposure and to control critical dimensions, a bottom anti-reflective coating (BARC) is applied beneath the photoresist (Engineering Practice). This step is critical for minimizing line edge roughness, which otherwise transfers into the dummy gate and ultimately degrades the electrical uniformity of the final replacement metal gate .
Self-Aligned Silicide (Salicide)
Before the dummy gate (DG) is removed, the source and drain areas must undergo silicidation to form low-resistance electrical contacts, a process known as self-aligned silicide . The presence of the dummy gate (DG) is crucial during this step; it physically blocks the silicide metals (such as nickel or cobalt) from reacting with the channel area, confining the low-resistance silicide precisely to the S/D regions and preventing catastrophic gate-to-channel shorting .
Post-Etch Wet Clean
After the dummy gate (DG) is chemically removed, the open trench is highly vulnerable to trace organic and inorganic contaminants . Specialized wet clean chemistries are deployed to strip away polymer residues and native oxides from the trench sidewalls without etching the dielectric spacers or the exposed channel surface (Engineering Practice). This clean is essential to ensure that the subsequent ALD layers exhibit ideal adhesion and interface quality (Engineering Practice).
Work Function Metal Deposition
Immediately following the trench clean, ALD is used to deposit ultra-thin conformal layers of work function metals, such as titanium nitride (TiN) or tantalum carbide (TaC), into the cavity left by the dummy gate . The precise composition and thickness of these layers determine the final device work function . Because the dummy gate (DG) was successfully removed, these metals are deposited at temperatures low enough to preserve their exact atomic structures and prevent any interdiffusion into the high-k dielectric stack .